Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
0  structures 1  species 0  interactions 1  sequence 1  architecture

Protein: W1PI49_AMBTC (W1PI49)

Summary

This is the summary of UniProt entry W1PI49_AMBTC (W1PI49).

Description: Uncharacterized protein {ECO:0000313|EMBL:ERN07301.1}
Source organism: Amborella trichopoda (NCBI taxonomy ID 13333)
View Pfam proteome data.
Length: 510 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Download the data used to generate the domain graphic in JSON format.

Show or hide the data used to generate the graphic in JSON format.

Source Domain Start End
low_complexity n/a 9 21
Pfam ATG16 32 166
coiled_coil n/a 77 153
disorder n/a 93 96
disorder n/a 134 138
coiled_coil n/a 162 185
low_complexity n/a 184 195
Pfam WD40 214 251
low_complexity n/a 251 265
Pfam WD40 296 335
Pfam WD40 339 376
low_complexity n/a 394 412
Pfam WD40 431 467
Pfam WD40 472 510

Show or hide domain scores.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession W1PI49. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MDGPAVMAIR HALSALRRRH LLEEGAHIPA FSALSRQFVT QGSEWKEKAE
50
51
GLEVELQQCY KAQARLSEQL VVEVAECRDA KALLQEKETS LNELQKEVAE
100
101
GRDEISKLKE LLEENRKALD LAISENQELR SQLEEEILKA RNAESENKML
150
151
IDRWMMQKMQ DAERLNEANA IYKEMMDKLK ASNMQQLVQQ QVDGVVRLSE
200
201
TWTSEALTDL PSPTWKHRLQ AHEGACGSIL FEPNSDCLIT GGLDQIIRVW
250
251
DTRTGTNTNT LTGCLGSILD LAIARDKCII AATSSNKMFV FDQLSRRLSH
300
301
TLTGHLDKVC AVDASKVSGK SLVSAAYDRT IRTWDLVKGY CTATLMCHSN
350
351
CNALCYGIDG QTVFTGHVDG NLRLWDLRVS GGGKMVSEVA AHAYSVTSVS
400
401
LSRGGLLVLT SGRDNVHNLF DVRASLEVCG NFRGGSGSAT NWNRSCISPD
450
451
DNYVVAGSSD GMVHVWSRRE KGEIVSMLRG HEASVLTCAW SDTGGPLASA
500
501
DKKGAVYLWV                                            
510
 

Show the unformatted sequence.

Checksums:
CRC64:90423F741E9F4FAF
MD5:5bffc0ad8d491ae256138b26157393e2