Protein: A0A0D2SY52_GOSRA (A0A0D2SY52)

Summary

This is the summary of UniProt entry A0A0D2SY52_GOSRA (A0A0D2SY52).

Description: Uncharacterized protein {ECO:0000313|EMBL:KJB36260.1}
Source organism: Gossypium raimondii (New World cotton) (NCBI taxonomy ID 29730)
Length: 644 amino acids
Reference Proteome: ✓

Please note: at the start of each Pfam data release, we take a copy of the UniProt sequence database, and this snapshot forms the basis of the overview shown here. UniProt entries that are removed after a Pfam release therefore remain in Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End
disorder n/a 7 8
disorder n/a 15 16
disorder n/a 24 52
low_complexity n/a 33 38
disorder n/a 56 69
disorder n/a 90 143
low_complexity n/a 130 148
Pfam POX 153 294
disorder n/a 177 226
low_complexity n/a 184 195
disorder n/a 305 309
low_complexity n/a 324 334
Pfam Homeobox_KN 360 399
disorder n/a 372 381
disorder n/a 410 467
low_complexity n/a 440 463
disorder n/a 469 475
disorder n/a 477 509
disorder n/a 513 517
disorder n/a 586 587
disorder n/a 593 595
disorder n/a 598 614
disorder n/a 621 622
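
The rows above can be treated as simple intervals. The following is a minimal Python sketch, not part of Pfam itself, that parses a few of the rows and reports which disorder or low_complexity regions overlap each Pfam domain. It assumes that Start and End are 1-based and inclusive; the names `Region`, `parse_regions` and `overlaps` are hypothetical helpers introduced only for illustration.

```python
from typing import List, NamedTuple


class Region(NamedTuple):
    source: str  # "Pfam", "disorder" or "low_complexity"
    name: str    # domain name, or "n/a" for predicted regions
    start: int   # assumed 1-based, inclusive
    end: int     # assumed 1-based, inclusive


def parse_regions(text: str) -> List[Region]:
    """Parse whitespace-separated 'Source Domain Start End' rows."""
    regions = []
    for line in text.strip().splitlines():
        source, name, start, end = line.split()
        regions.append(Region(source, name, int(start), int(end)))
    return regions


def overlaps(a: Region, b: Region) -> bool:
    """True if two inclusive intervals share at least one residue."""
    return a.start <= b.end and b.start <= a.end


# A subset of the rows from the table above.
ROWS = """
low_complexity n/a 130 148
Pfam POX 153 294
disorder n/a 177 226
low_complexity n/a 184 195
disorder n/a 305 309
Pfam Homeobox_KN 360 399
disorder n/a 372 381
disorder n/a 410 467
"""

regions = parse_regions(ROWS)
domains = [r for r in regions if r.source == "Pfam"]
summary = {}
for d in domains:
    hits = [(r.source, r.start, r.end)
            for r in regions
            if r.source != "Pfam" and overlaps(d, r)]
    summary[d.name] = (d.end - d.start + 1, hits)
    print(d.name, "length", d.end - d.start + 1, "overlapping regions:", hits)
```

Under the inclusive-coordinate assumption, the POX domain spans 142 residues and Homeobox_KN spans 40.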

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession A0A0D2SY52. The sequence is stored in the Pfam database and is updated only with each new Pfam release, so the sequence stored here may differ from the one currently held by UniProt.

Sequence:
  1 MATYFHGNPE IQAADDLQTL VLMNPAYVHY SNTPPPPPPS NNLVFVNSLS 50
 51 PNAPSSHSQQ LVGIPLPAVT SGSNQDAISS LHGLVQRLHY NSYNPIDPSG 100
101 EPRDTPRAQQ GLSLTLSSQH QPGNYGSQPQ AVSGGSASSG SAVTNGVSGI 150
151 QSVLLSSKYL KAAQELLDEV VNVDNTGFTK TEMAKKGSGN DSNSSKATGE 200
201 LSAAAGDGSG GENAVGKRRT ELSTAERQEI QMKKAKLISM LDEVDQRYRQ 250
251 YHHQIQIVIS TFEQTAGIGS AKTYTALALK TISKQFRCLK DAIIGQIRAV 300
301 NKSLGEEDRL GGKTEGSRLK FVDHQLRQQR ALQQLGMIQH NAWRPQRGLP 350
351 ERSVSVLRAW LFEHFLHPYP KDSDKHMLAK QTGLTRSQVS NWFINARVRL 400
401 WKPMVEEMYL EEIKEQEQNH EGSASKSTGP TPAKNQAKSL SSSKQDNSAN 450
451 QNASSMSISM ASTSPLAGNG QNQSGFSFIG SSELEGITQG SPKKPRSTEV 500
501 LLQSPIDMDI KQREAADDVS IKFGKEGYSF MGTDTNFMGG FGQYPIAEMA 550
551 RFDAEHFAPR FPGNGVSLTL GLPHCENLSL PATHQTFLPN QTLQMGRRLD 600
601 IGEPNEYGAI NPSTPHSSVA YEIENIDVQN RKRFAAQLLP DFVA 644

Checksums:
CRC64:C2689871AB591FE8
MD5:dcf2df20cdd8755fdf3ac5c2d7e6c25d
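
To compare a locally held copy of the sequence with the length and MD5 checksum listed here, the formatted sequence first has to be stripped of position numbers and spacing. The following is a minimal Python sketch of that step. It assumes, without confirmation from this page, that the MD5 is computed over the upper-case residue string with no whitespace; `clean_sequence` and `md5_hex` are hypothetical helper names. The CRC64 checksum is not covered, since the Python standard library has no matching function.

```python
import hashlib
import re


def clean_sequence(formatted: str) -> str:
    """Drop position numbers and whitespace, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", formatted).upper()


def md5_hex(sequence: str) -> str:
    """Hex MD5 digest of the sequence (assumed checksum input: bare residues)."""
    return hashlib.md5(sequence.encode("ascii")).hexdigest()


# The final block of the formatted sequence, as displayed on the page.
block = """
601
IGEPNEYGAI NPSTPHSSVA YEIENIDVQN RKRFAAQLLP DFVA
644
"""

tail = clean_sequence(block)
print(len(tail))      # residues 601 to 644 inclusive
print(md5_hex(tail))  # digest of this fragment only, not of the full entry
```

Applying `clean_sequence` to the full formatted sequence and checking that the result has 644 residues is a quick sanity test before comparing `md5_hex` of it against the MD5 value listed above.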