Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
0  structures 1  species 0  interactions 1  sequence 1  architecture

Protein: CO4A1_DROME (P08120)

Summary

This is the summary of UniProt entry CO4A1_DROME (P08120).

Description: Collagen alpha-1(IV) chain {ECO:0000303|PubMed:3142875}
Source organism: Drosophila melanogaster (Fruit fly) (NCBI taxonomy ID 7227)
View Pfam proteome data.
Length: 1779 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Download the data used to generate the domain graphic in JSON format.

Show or hide the data used to generate the graphic in JSON format.

Source Domain Start End
sig_p n/a 1 23
disorder n/a 36 43
disorder n/a 45 46
disorder n/a 60 61
Pfam Collagen 86 151
disorder n/a 89 1552
low_complexity n/a 91 103
Pfam Collagen 142 208
low_complexity n/a 142 163
Pfam Collagen 207 266
Pfam Collagen 293 346
Pfam Collagen 337 396
low_complexity n/a 359 369
low_complexity n/a 377 409
Pfam Collagen 434 493
low_complexity n/a 439 457
low_complexity n/a 463 484
low_complexity n/a 530 548
Pfam Collagen 575 644
low_complexity n/a 593 608
Pfam Collagen 652 710
low_complexity n/a 676 692
Pfam Collagen 690 751
Pfam Collagen 759 820
low_complexity n/a 770 785
Pfam Collagen 795 853
Pfam Collagen 851 913
low_complexity n/a 852 865
low_complexity n/a 910 937
Pfam Collagen 935 1008
low_complexity n/a 1000 1018
low_complexity n/a 1021 1036
low_complexity n/a 1047 1058
Pfam Collagen 1147 1221
low_complexity n/a 1148 1170
low_complexity n/a 1320 1344
low_complexity n/a 1455 1473
low_complexity n/a 1472 1488
Pfam Collagen 1489 1548
Pfam C4 1556 1661
disorder n/a 1564 1565
disorder n/a 1568 1569
Pfam C4 1664 1776
disorder n/a 1753 1759

Show or hide domain scores.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession P08120. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MLPFWKRLLY AAVIAGALVG ADAQFWKTAG TAGSIQDSVK HYNRNEPKFP
50
51
IDDSYDIVDS AGVARGDLPP KNCTAGYAGC VPKCIAEKGN RGLPGPLGPT
100
101
GLKGEMGFPG MEGPSGDKGQ KGDPGPYGQR GDKGERGSPG LHGQAGVPGV
150
151
QGPAGNPGAP GINGKDGCDG QDGIPGLEGL SGMPGPRGYA GQLGSKGEKG
200
201
EPAKENGDYA KGEKGEPGWR GTAGLAGPQG FPGEKGERGD SGPYGAKGPR
250
251
GEHGLKGEKG ASCYGPMKPG APGIKGEKGE PASSFPVKPT HTVMGPRGDM
300
301
GQKGEPGLVG RKGEPGPEGD TGLDGQKGEK GLPGGPGDRG RQGNFGPPGS
350
351
TGQKGDRGEP GLNGLPGNPG QKGEPGRAGA TGKPGLLGPP GPPGGGRGTP
400
401
GPPGPKGPRG YVGAPGPQGL NGVDGLPGPQ GYNGQKGGAG LPGRPGNEGP
450
451
PGKKGEKGTA GLNGPKGSIG PIGHPGPPGP EGQKGDAGLP GYGIQGSKGD
500
501
AGIPGYPGLK GSKGERGFKG NAGAPGDSKL GRPGTPGAAG APGQKGDAGR
550
551
PGTPGQKGDM GIKGDVGGKC SSCRAGPKGD KGTSGLPGIP GKDGARGPPG
600
601
ERGYPGERGH DGINGQTGPP GEKGEDGRTG LPGATGEPGK PALCDLSLIE
650
651
PLKGDKGYPG APGAKGVQGF KGAEGLPGIP GPKGEFGFKG EKGLSGAPGN
700
701
DGTPGRAGRD GYPGIPGQSI KGEPGFHGRD GAKGDKGSFG RSGEKGEPGS
750
751
CALDEIKMPA KGNKGEPGQT GMPGPPGEDG SPGERGYTGL KGNTGPQGPP
800
801
GVEGPRGLNG PRGEKGNQGA VGVPGNPGKD GLRGIPGRNG QPGPRGEPGI
850
851
SRPGPMGPPG LNGLQGEKGD RGPTGPIGFP GADGSVGYPG DRGDAGLPGV
900
901
SGRPGIVGEK GDVGPIGPAG VAGPPGVPGI DGVRGRDGAK GEPGSPGLVG
950
951
MPGNKGDRGA PGNDGPKGFA GVTGAPGKRG PAGIPGVSGA KGDKGATGLT
1000
1001
GNDGPVGGRG PPGAPGLMGI KGDQGLAGAP GQQGLDGMPG EKGNQGFPGL
1050
1051
DGPPGLPGDA SEKGQKGEPG PSGLRGDTGP AGTPGWPGEK GLPGLAVHGR
1100
1101
AGPPGEKGDQ GRSGIDGRDG INGEKGEQGL QGVWGQPGEK GSVGAPGIPG
1150
1151
APGMDGLPGA AGAPGAVGYP GDRGDKGEPG LSGLPGLKGE TGPVGLQGFT
1200
1201
GAPGPKGERG IRGQPGLPAT VPDIRGDKGS QGERGYTGEK GEQGERGLTG
1250
1251
PAGVAGAKGD RGLQGPPGAS GLNGIPGAKG DIGPRGEIGY PGVTIKGEKG
1300
1301
LPGRPGRNGR QGLIGAPGLI GERGLPGLAG EPGLVGLPGP IGPAGSKGER
1350
1351
GLAGSPGQPG QDGFPGAPGL KGDTGPQGFK GERGLNGFEG QKGDKGDRGL
1400
1401
QGPSGLPGLV GQKGDTGYPG LNGNDGPVGA PGERGFTGPK GRDGRDGTPG
1450
1451
LPGQKGEPGM LPPPGPKGEP GQPGRNGPKG EPGRPGERGL IGIQGERGEK
1500
1501
GERGLIGETG NVGRPGPKGD RGEPGERGYE GAIGLIGQKG EPGAPAPAAL
1550
1551
DYLTGILITR HSQSETVPAC SAGHTELWTG YSLLYVDGND YAHNQDLGSP
1600
1601
GSCVPRFSTL PVLSCGQNNV CNYASRNDKT FWLTTNAAIP MMPVENIEIR
1650
1651
QYISRCVVCE APANVIAVHS QTIEVPDCPN GWEGLWIGYS FLMHTAVGNG
1700
1701
GGGQALQSPG SCLEDFRATP FIECNGAKGT CHFYETMTSF WMYNLESSQP
1750
1751
FERPQQQTIK AGERQSHVSR CQVCMKNSS                       
1779
 

Show the unformatted sequence.

Checksums:
CRC64:6770F18AE40A313E
MD5:90fd819d7eb397ce044fb52b4f3eca12