0 structures, 1 species, 0 interactions, 1 sequence, 1 architecture

Protein: CO5A2_HUMAN (P05997)

Summary

This is the summary of UniProt entry CO5A2_HUMAN (P05997).

Description: Collagen alpha-2(V) chain
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
Length: 1499 amino acids
Reference Proteome: ✓

Please note: at the start of each new Pfam data release, we take a copy of the UniProt sequence database, and this snapshot forms the basis of the overview shown here. UniProt entries that are removed after a Pfam release therefore remain in Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End
sig_p n/a 1 26
low_complexity n/a 28 38
disorder n/a 35 37
Pfam VWC 41 96
disorder n/a 93 95
disorder n/a 97 98
disorder n/a 103 1300
Pfam Collagen 125 187
low_complexity n/a 126 144
low_complexity n/a 141 159
low_complexity n/a 162 186
Pfam Collagen 209 272
low_complexity n/a 212 252
low_complexity n/a 255 270
low_complexity n/a 321 349
low_complexity n/a 360 378
low_complexity n/a 408 420
low_complexity n/a 426 462
low_complexity n/a 473 489
low_complexity n/a 483 510
low_complexity n/a 503 525
low_complexity n/a 597 613
low_complexity n/a 615 630
low_complexity n/a 650 660
low_complexity n/a 666 709
low_complexity n/a 705 723
low_complexity n/a 735 759
low_complexity n/a 795 822
low_complexity n/a 825 841
Pfam Collagen 834 904
low_complexity n/a 911 945
low_complexity n/a 954 993
low_complexity n/a 1026 1056
low_complexity n/a 1083 1098
low_complexity n/a 1100 1119
Pfam Collagen 1110 1180
low_complexity n/a 1127 1146
low_complexity n/a 1149 1164
low_complexity n/a 1167 1185
Pfam Collagen 1170 1232
low_complexity n/a 1188 1200
low_complexity n/a 1206 1227
Pfam COLFI 1264 1498
disorder n/a 1310 1315
disorder n/a 1347 1350
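
The region table above can be read programmatically. The following is a minimal sketch, assuming the rows are copied as whitespace-separated text in the order Source, Domain, Start, End; the names `Region` and `parse_regions` are hypothetical and are not part of any Pfam tool. Only a few rows from the table are included as sample input.

```python
from typing import List, NamedTuple, Optional


class Region(NamedTuple):
    """One row of the region table (hypothetical helper type)."""
    source: str
    domain: Optional[str]  # None when the table shows "n/a"
    start: int             # 1-based, inclusive
    end: int               # 1-based, inclusive


def parse_regions(text: str) -> List[Region]:
    """Parse whitespace-separated 'Source Domain Start End' rows."""
    regions = []
    for line in text.strip().splitlines():
        fields = line.split()
        # Skip the header row and anything that is not a 4-column data row.
        if len(fields) != 4 or not (fields[2].isdigit() and fields[3].isdigit()):
            continue
        source, domain, start, end = fields
        regions.append(
            Region(source, None if domain == "n/a" else domain, int(start), int(end))
        )
    return regions


# A few rows copied from the table above.
SAMPLE = """\
Source Domain Start End
sig_p n/a 1 26
Pfam VWC 41 96
Pfam Collagen 125 187
Pfam COLFI 1264 1498
"""

regions = parse_regions(SAMPLE)
pfam_regions = [r for r in regions if r.source == "Pfam"]
for r in pfam_regions:
    # Length is end - start + 1 because both coordinates are inclusive.
    print(r.domain, r.start, r.end, r.end - r.start + 1)
```

Treating the coordinates as 1-based and inclusive matches how the table lists the signal peptide as positions 1 to 26.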

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession P05997. The sequence is stored in the Pfam database and is updated only with each new Pfam release, so the copy stored here may differ from the one currently held by UniProt.

Sequence:
1 MMANWAEARP LLILIVLLGQ FVSIKAQEED EDEGYGEEIA CTQNGQMYLN 50
51 RDIWKPAPCQ ICVCDNGAIL CDKIECQDVL DCADPVTPPG ECCPVCSQTP 100
101 GGGNTNFGRG RKGQKGEPGL VPVVTGIRGR PGPAGPPGSQ GPRGERGPKG 150
151 RPGPRGPQGI DGEPGVPGQP GAPGPPGHPS HPGPDGLSRP FSAQMAGLDE 200
201 KSGLGSQVGL MPGSVGPVGP RGPQGLQGQQ GGAGPTGPPG EPGDPGPMGP 250
251 IGSRGPEGPP GKPGEDGEPG RNGNPGEVGF AGSPGARGFP GAPGLPGLKG 300
301 HRGHKGLEGP KGEVGAPGSK GEAGPTGPMG AMGPLGPRGM PGERGRLGPQ 350
351 GAPGQRGAHG MPGKPGPMGP LGIPGSSGFP GNPGMKGEAG PTGARGPEGP 400
401 QGQRGETGPP GPVGSPGLPG AIGTDGTPGA KGPTGSPGTS GPPGSAGPPG 450
451 SPGPQGSTGP QGIRGQPGDP GVPGFKGEAG PKGEPGPHGI QGPIGPPGEE 500
501 GKRGPRGDPG TVGPPGPVGE RGAPGNRGFP GSDGLPGPKG AQGERGPVGS 550
551 SGPKGSQGDP GRPGEPGLPG ARGLTGNPGV QGPEGKLGPL GAPGEDGRPG 600
601 PPGSIGIRGQ PGSMGLPGPK GSSGDPGKPG EAGNAGVPGQ RGAPGKDGEV 650
651 GPSGPVGPPG LAGERGEQGP PGPTGFQGLP GPPGPPGEGG KPGDQGVPGD 700
701 PGAVGPLGPR GERGNPGERG EPGITGLPGE KGMAGGHGPD GPKGSPGPSG 750
751 TPGDTGPPGL QGMPGERGIA GTPGPKGDRG GIGEKGAEGT AGNDGARGLP 800
801 GPLGPPGPAG PTGEKGEPGP RGLVGPPGSR GNPGSRGENG PTGAVGFAGP 850
851 QGPDGQPGVK GEPGEPGQKG DAGSPGPQGL AGSPGPHGPN GVPGLKGGRG 900
901 TQGPPGATGF PGSAGRVGPP GPAGAPGPAG PLGEPGKEGP PGLRGDPGSH 950
951 GRVGDRGPAG PPGGPGDKGD PGEDGQPGPD GPPGPAGTTG QRGIVGMPGQ 1000
1001 RGERGMPGLP GPAGTPGKVG PTGATGDKGP PGPVGPPGSN GPVGEPGPEG 1050
1051 PAGNDGTPGR DGAVGERGDR GDPGPAGLPG SQGAPGTPGP VGAPGDAGQR 1100
1101 GDPGSRGPIG PPGRAGKRGL PGPQGPRGDK GDHGDRGDRG QKGHRGFTGL 1150
1151 QGLPGPPGPN GEQGSAGIPG PFGPRGPPGP VGPSGKEGNP GPLGPIGPPG 1200
1201 VRGSVGEAGP EGPPGEPGPP GPPGPPGHLT AALGDIMGHY DESMPDPLPE 1250
1251 FTEDQAAPDD KNKTDPGVHA TLKSLSSQIE TMRSPDGSKK HPARTCDDLK 1300
1301 LCHSAKQSGE YWIDPNQGSV EDAIKVYCNM ETGETCISAN PSSVPRKTWW 1350
1351 ASKSPDNKPV WYGLDMNRGS QFAYGDHQSP NTAITQMTFL RLLSKEASQN 1400
1401 ITYICKNSVG YMDDQAKNLK KAVVLKGAND LDIKAEGNIR FRYIVLQDTC 1450
1451 SKRNGNVGKT VFEYRTQNVA RLPIIDLAPV DVGGTDQEFG VEIGPVCFV 1499
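
The sequence above is displayed in groups of ten residues with position markers, so it needs cleaning before it can be used elsewhere. The following is a minimal sketch, assuming that everything other than letters (digits, spaces, line breaks) is layout; `clean_sequence` and `region` are hypothetical helper names, and only the first 50 residues are used as sample input.

```python
import re


def clean_sequence(block: str) -> str:
    """Drop position numbers and whitespace, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates."""
    return sequence[start - 1:end]


# First row of the sequence display, including its position markers.
SAMPLE_BLOCK = """
1
MMANWAEARP LLILIVLLGQ FVSIKAQEED EDEGYGEEIA CTQNGQMYLN
50
"""

seq = clean_sequence(SAMPLE_BLOCK)
print(len(seq))            # 50
print(region(seq, 1, 26))  # MMANWAEARPLLILIVLLGQFVSIKA
```

Applied to the whole display, the cleaned string should contain 1499 residues, the length stated in the summary. The second call extracts positions 1 to 26, which the region table lists as the signal peptide (sig_p).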
 

Checksums:
CRC64:E8C92BF5E749EC97
MD5:6de292cb70c9a38b804570dc7c658e6f
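
The MD5 value can be used to check a local copy of the sequence. The following is a minimal sketch using Python's standard `hashlib`. It assumes the checksum is computed over the uppercase residue letters with no whitespace or line breaks, which is not stated on this page; `sequence_md5` and `matches_listed_md5` are hypothetical helper names. The CRC64 value uses a UniProt-specific CRC variant and is not reproduced here.

```python
import hashlib


def sequence_md5(sequence: str) -> str:
    """MD5 hex digest of a sequence after removing whitespace and uppercasing.

    The normalization is an assumption about how the listed checksum
    was produced.
    """
    normalized = "".join(sequence.split()).upper()
    return hashlib.md5(normalized.encode("ascii")).hexdigest()


def matches_listed_md5(sequence: str, listed: str) -> bool:
    """Compare a computed digest with a listed value, ignoring case."""
    return sequence_md5(sequence) == listed.strip().lower()


# Formatting differences do not change the digest under this normalization.
print(sequence_md5("MMANWAEARP LLILIVLLGQ") == sequence_md5("mmanwaearplliLIVLLGQ\n"))
```

To check the full entry, pass the complete 1499-residue sequence and the listed MD5 value to `matches_listed_md5`. A mismatch may simply mean that the normalization assumed here differs from the one used to generate the listed checksum.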

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.