Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
0  structures 1  species 0  interactions 1  sequence 1  architecture

Protein: COMA1_HUMAN (Q8NFW1)

Summary

This is the summary of UniProt entry COMA1_HUMAN (Q8NFW1).

Description: Collagen alpha-1(XXII) chain
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
View Pfam proteome data.
Length: 1626 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Download the data used to generate the domain graphic in JSON format.

Show or hide the data used to generate the graphic in JSON format.

Source Domain Start End
sig_p n/a 1 27
low_complexity n/a 10 23
low_complexity n/a 20 31
Pfam VWA 38 212
low_complexity n/a 160 173
low_complexity n/a 441 462
disorder n/a 443 468
disorder n/a 482 1013
low_complexity n/a 498 513
Pfam Collagen 522 600
low_complexity n/a 550 563
Pfam Collagen 562 630
low_complexity n/a 568 586
low_complexity n/a 589 604
low_complexity n/a 636 649
low_complexity n/a 659 684
Pfam Collagen 695 773
low_complexity n/a 701 734
low_complexity n/a 733 773
Pfam Collagen 744 813
Pfam Collagen 798 856
low_complexity n/a 821 833
low_complexity n/a 836 851
Pfam Collagen 861 926
low_complexity n/a 906 939
low_complexity n/a 945 969
low_complexity n/a 966 990
low_complexity n/a 986 1001
disorder n/a 1015 1626
low_complexity n/a 1039 1065
Pfam Collagen 1043 1100
Pfam Collagen 1116 1174
low_complexity n/a 1117 1128
low_complexity n/a 1136 1167
Pfam Collagen 1156 1226
low_complexity n/a 1185 1221
low_complexity n/a 1218 1239
low_complexity n/a 1238 1275
Pfam Collagen 1249 1317
Pfam Collagen 1297 1364
low_complexity n/a 1302 1329
low_complexity n/a 1320 1344
low_complexity n/a 1355 1395
low_complexity n/a 1395 1419
Pfam Collagen 1401 1461
low_complexity n/a 1431 1452
Pfam Collagen 1494 1553
low_complexity n/a 1496 1511
low_complexity n/a 1558 1570
low_complexity n/a 1587 1602

Show or hide domain scores.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession Q8NFW1. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MAGLRGNAVA GLLWMLLLWS GGGGCQAQRA GCKSVHYDLV FLLDTSSSVG
50
51
KEDFEKVRQW VANLVDTFEV GPDRTRVGVV RYSDRPTTAF ELGLFGSQEE
100
101
VKAAARRLAY HGGNTNTGDA LRYITARSFS PHAGGRPRDR AYKQVAILLT
150
151
DGRSQDLVLD AAAAAHRAGI RIFAVGVGEA LKEELEEIAS EPKSAHVFHV
200
201
SDFNAIDKIR GKLRRRLCEN VLCPSVRVEG DRFKHTNGGT KEITGFDLMD
250
251
LFSVKEILGK RENGAQSSYV RMGSFPVVQS TEDVFPQGLP DEYAFVTTFR
300
301
FRKTSRKEDW YIWQVIDQYS IPQVSIRLDG ENKAVEYNAV GAMKDAVRVV
350
351
FRGSRVNDLF DRDWHKMALS IQAQNVSLHI DCALVQTLPI EERENIDIQG
400
401
KTVIGKRLYD SVPIDFDLQR IVIYCDSRHA ELETCCDIPS GPCQVTVVTE
450
451
PPPPPPPQRP PTPGSEQIGF LKTINCSCPA GEKGEMGVAG PMGLPGPKGD
500
501
IGAIGPVGAP GPKGEKGDVG IGPFGQGEKG EKGSLGLPGP PGRDGSKGMR
550
551
GEPGELGEPG LPGEVGMRGP QGPPGLPGPP GRVGAPGLQG ERGEKGTRGE
600
601
KGERGLDGFP GKPGDTGQQG RPGPSGVAGP QGEKGDVGPA GPPGVPGSVV
650
651
QQEGLKGEQG APGPRGHQGA PGPPGARGPI GPEGRDGPPG LQGLRGKKGD
700
701
MGPPGIPGLL GLQGPPGPPG VPGPPGPGGS PGLPGEIGFP GKPGPPGPTG
750
751
PPGKDGPNGP PGPPGTKGEP GERGEDGLPG KPGLRGEIGE QGLAGRPGEK
800
801
GEAGLPGAPG FPGVRGEKGD QGEKGELGLP GLKGDRGEKG EAGPAGPPGL
850
851
PGTTSLFTPH PRMPGEQGPK GEKGDPGLPG EPGLQGRPGE LGPQGPTGPP
900
901
GAKGQEGAHG APGAAGNPGA PGHVGAPGPS GPPGSVGAPG LRGTPGKDGE
950
951
RGEKGAAGEE GSPGPVGPRG DPGAPGLPGP PGKGKDGEPG LRGSPGLPGP
1000
1001
LGTKAACGKV RGSENCALGG QCVKGDRGAP GIPGSPGSRG DPGIGVAGPP
1050
1051
GPSGPPGDKG SPGSRGLPGF PGPQGPAGRD GAPGNPGERG PPGKPGLSSL
1100
1101
LSPGDINLLA KDVCNDCPPG PPGLPGLPGF KGDKGVPGKP GREGTEGKKG
1150
1151
EAGPPGLPGP PGIAGPQGSQ GERGADGEVG QKGDQGHPGV PGFMGPPGNP
1200
1201
GPPGADGIAG AAGPPGIQGS PGKEGPPGPQ GPSGLPGIPG EEGKEGRDGK
1250
1251
PGPPGEPGKA GEPGLPGPEG ARGPPGFKGH TGDSGAPGPR GESGAMGLPG
1300
1301
QEGLPGKDGD TGPTGPQGPQ GPRGPPGKNG SPGSPGEPGP SGTPGQKGSK
1350
1351
GENGSPGLPG FLGPRGPPGE PGEKGVPGKE GVPGKPGEPG FKGERGDPGI
1400
1401
KGDKGPPGGK GQPGDPGIPG HKGHTGLMGP QGLPGENGPV GPPGPPGQPG
1450
1451
FPGLRGESPS METLRRLIQE ELGKQLETRL AYLLAQMPPA YMKSSQGRPG
1500
1501
PPGPPGKDGL PGRAGPMGEP GRPGQGGLEG PSGPIGPKGE RGAKGDPGAP
1550
1551
GVGLRGEMGP PGIPGQPGEP GYAKDGLPGI PGPQGETGPA GHPGLPGPPG
1600
1601
PPGQCDPSQC AYFASLAARP GNVKGP                          
1626
 

Show the unformatted sequence.

Checksums:
CRC64:91018ABA2DD670EC
MD5:766d0912684514473948442014727b09

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.