
Protein: CO6A2_HUMAN (P12110)

Summary

This is the summary of UniProt entry CO6A2_HUMAN (P12110).

Description: Collagen alpha-2(VI) chain
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
Length: 1019 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains


Source Domain Start End
sig_p n/a 1 20
disorder n/a 25 37
Pfam VWA 46 232
disorder n/a 171 173
disorder n/a 188 189
disorder n/a 198 204
disorder n/a 207 208
disorder n/a 212 219
disorder n/a 222 224
disorder n/a 228 231
Pfam Collagen 254 317
disorder n/a 257 593
low_complexity n/a 285 300
low_complexity n/a 291 312
Pfam Collagen 301 369
low_complexity n/a 362 387
low_complexity n/a 399 429
Pfam Collagen 409 468
low_complexity n/a 426 451
low_complexity n/a 486 501
low_complexity n/a 517 539
Pfam Collagen 531 590
low_complexity n/a 542 555
low_complexity n/a 550 587
Pfam VWA 615 798
disorder n/a 658 661
Pfam VWA 833 1011
low_complexity n/a 859 870
disorder n/a 957 962
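
The region table above is plain whitespace-separated text, so it can be loaded with a few lines of Python. The following is a minimal sketch, not part of Pfam: the `Region` type and the `parse_regions`, `pfam_domains` and `overlaps` helpers are hypothetical names introduced here, and the sample rows are copied from the table. It assumes coordinates are 1-based and inclusive, which the page does not state explicitly.

```python
from typing import List, NamedTuple


class Region(NamedTuple):
    """One row of the region table (hypothetical helper type)."""
    source: str  # e.g. "Pfam", "disorder", "low_complexity", "sig_p"
    name: str    # domain name, or "n/a" when the row has none
    start: int   # assumed 1-based, inclusive
    end: int     # assumed 1-based, inclusive


def parse_regions(text: str) -> List[Region]:
    """Parse 'Source Domain Start End' rows, skipping the header line."""
    regions = []
    for line in text.strip().splitlines():
        parts = line.split()
        if len(parts) != 4 or parts[0] == "Source":
            continue
        source, name, start, end = parts
        regions.append(Region(source, name, int(start), int(end)))
    return regions


def pfam_domains(regions: List[Region]) -> List[Region]:
    """Keep only rows whose source is Pfam."""
    return [r for r in regions if r.source == "Pfam"]


def overlaps(a: Region, b: Region) -> bool:
    """True when two inclusive intervals share at least one residue."""
    return a.start <= b.end and b.start <= a.end


# A few rows copied from the table above.
SAMPLE = """\
Source Domain Start End
sig_p n/a 1 20
Pfam VWA 46 232
Pfam Collagen 254 317
Pfam Collagen 301 369
"""

regions = parse_regions(SAMPLE)
domains = pfam_domains(regions)
for d in domains:
    # Length of an inclusive interval is end - start + 1.
    print(d.name, d.start, d.end, d.end - d.start + 1)
```

With the full table as input, `overlaps` can be used to spot pairs such as the first two Collagen matches (254 to 317 and 301 to 369), which share residues.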


Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession P12110. The sequence is stored in the Pfam database and updated only with each new Pfam release, so the copy shown here may differ from the one currently held by UniProt.

Sequence:
   1 MLQGTCSVLL LWGILGAIQA QQQEVISPDT TERNNNCPEK TDCPIHVYFV 50
  51 LDTSESVTMQ SPTDILLFHM KQFVPQFISQ LQNEFYLDQV ALSWRYGGLH 100
 101 FSDQVEVFSP PGSDRASFIK NLQGISSFRR GTFTDCALAN MTEQIRQDRS 150
 151 KGTVHFAVVI TDGHVTGSPC GGIKLQAERA REEGIRLFAV APNQNLKEQG 200
 201 LRDIASTPHE LYRNDYATML PDSTEIDQDT INRIIKVMKH EAYGECYKVS 250
 251 CLEIPGPSGP KGYRGQKGAK GNMGEPGEPG QKGRQGDPGI EGPIGFPGPK 300
 301 GVPGFKGEKG EFGADGRKGA PGLAGKNGTD GQKGKLGRIG PPGCKGDPGN 350
 351 RGPDGYPGEA GSPGERGDQG GKGDPGRPGR RGPPGEIGAK GSKGYQGNSG 400
 401 APGSPGVKGA KGGPGPRGPK GEPGRRGDPG TKGSPGSDGP KGEKGDPGPE 450
 451 GPRGLAGEVG NKGAKGDRGL PGPRGPQGAL GEPGKQGSRG DPGDAGPRGD 500
 501 SGQPGPKGDP GRPGFSYPGP RGAPGEKGEP GPRGPEGGRG DFGLKGEPGR 550
 551 KGEKGEPADP GPPGEPGPRG PRGVPGPEGE PGPPGDPGLT ECDVMTYVRE 600
 601 TCGCCDCEKR CGALDVVFVI DSSESIGYTN FTLEKNFVIN VVNRLGAIAK 650
 651 DPKSETGTRV GVVQYSHEGT FEAIQLDDER IDSLSSFKEA VKNLEWIAGG 700
 701 TWTPSALKFA YDRLIKESRR QKTRVFAVVI TDGRHDPRDD DLNLRALCDR 750
 751 DVTVTAIGIG DMFHEKHESE NLYSIACDKP QQVRNMTLFS DLVAEKFIDD 800
 801 MEDVLCPDPQ IVCPDLPCQT ELSVAQCTQR PVDIVFLLDG SERLGEQNFH 850
 851 KARRFVEQVA RRLTLARRDD DPLNARVALL QFGGPGEQQV AFPLSHNLTA 900
 901 IHEALETTQY LNSFSHVGAG VVHAINAIVR SPRGGARRHA ELSFVFLTDG 950
 951 VTGNDSLHES AHSMRKQNVV PTVLALGSDV DMDVLTTLSL GDRAAVFHEK 1000
1001 DYDSLAQPGF FDRFIRWIC 1019


Checksums:
CRC64:6C513ADE46C1D111
MD5:b6c5aff01cdae7408a87a17590773e35
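
A checksum like the MD5 above can be used to check that a local copy of the sequence matches the stored one. The sketch below is illustrative only: `clean_sequence` and `sequence_md5` are hypothetical helper names, and it assumes the MD5 is taken over the uppercase one-letter sequence with position numbers and whitespace removed, which this page does not state. The CRC64 value uses a UniProt-specific CRC variant and is not covered here.

```python
import hashlib


def clean_sequence(formatted: str) -> str:
    """Strip position numbers, spaces and newlines from a formatted block.

    Only alphabetic characters are kept, then upper-cased.
    """
    return "".join(ch for ch in formatted if ch.isalpha()).upper()


def sequence_md5(formatted: str) -> str:
    """MD5 hex digest of the cleaned sequence (assumed checksum input)."""
    return hashlib.md5(clean_sequence(formatted).encode("ascii")).hexdigest()


# First 50 residues as displayed on this page, with their position markers.
block = """
1
MLQGTCSVLL LWGILGAIQA QQQEVISPDT TERNNNCPEK TDCPIHVYFV
50
"""

residues = clean_sequence(block)
print(len(residues))        # number of residues left after cleaning
print(sequence_md5(block))  # digest of this 50-residue fragment only
```

To compare against the MD5 listed above, pass the complete 1019-residue sequence rather than the fragment used here; a mismatch may also simply mean that the assumption about the checksum input is wrong.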

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.