
Protein: CO4A3_HUMAN (Q01955)

Summary

This is the summary of UniProt entry CO4A3_HUMAN (Q01955).

Description: Collagen alpha-3(IV) chain
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
Length: 1670 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End
disorder n/a 1 5
sig_p n/a 1 28
low_complexity n/a 10 26
Pfam Collagen 41 104
low_complexity n/a 42 63
disorder n/a 45 1448
low_complexity n/a 93 118
Pfam Collagen 97 162
Pfam Collagen 169 223
low_complexity n/a 170 213
low_complexity n/a 232 250
low_complexity n/a 267 285
Pfam Collagen 282 344
Pfam Collagen 351 412
low_complexity n/a 358 394
Pfam Collagen 386 444
Pfam Collagen 413 478
low_complexity n/a 414 438
Pfam Collagen 482 546
low_complexity n/a 482 495
Pfam Collagen 588 648
low_complexity n/a 589 624
low_complexity n/a 632 646
low_complexity n/a 652 676
low_complexity n/a 678 714
Pfam Collagen 699 749
low_complexity n/a 722 744
Pfam Collagen 747 809
low_complexity n/a 748 761
low_complexity n/a 764 786
low_complexity n/a 787 807
Pfam Collagen 788 849
Pfam Collagen 847 905
Pfam Collagen 892 948
low_complexity n/a 894 912
Pfam Collagen 950 1009
low_complexity n/a 972 999
Pfam Collagen 997 1060
low_complexity n/a 1011 1027
low_complexity n/a 1060 1077
Pfam Collagen 1061 1122
low_complexity n/a 1087 1118
low_complexity n/a 1118 1151
Pfam Collagen 1119 1178
low_complexity n/a 1172 1186
Pfam Collagen 1176 1235
low_complexity n/a 1218 1235
low_complexity n/a 1238 1260
low_complexity n/a 1288 1309
Pfam Collagen 1292 1352
low_complexity n/a 1323 1342
low_complexity n/a 1356 1390
Pfam Collagen 1379 1441
low_complexity n/a 1384 1405
Pfam C4 1446 1553
disorder n/a 1452 1455
Pfam C4 1556 1667
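
Several Pfam Collagen matches in the table above share residues with their neighbours. The following is a minimal sketch, not part of Pfam's own tooling, of how rows in this four-column layout could be parsed and checked for overlaps. The names `Region`, `parse_regions` and `overlapping_pairs` are hypothetical, and the sample uses only a few rows copied from the table; coordinates are assumed to be inclusive.

```python
from typing import List, NamedTuple, Tuple


class Region(NamedTuple):
    source: str
    domain: str
    start: int
    end: int


def parse_regions(text: str) -> List[Region]:
    """Parse whitespace-separated 'Source Domain Start End' rows."""
    regions = []
    for line in text.strip().splitlines():
        fields = line.split()
        # Skip the header row and anything that is not a four-column data row.
        if len(fields) != 4 or not fields[2].isdigit() or not fields[3].isdigit():
            continue
        source, domain, start, end = fields
        regions.append(Region(source, domain, int(start), int(end)))
    return regions


def overlapping_pairs(regions: List[Region]) -> List[Tuple[Region, Region]]:
    """Return every pair of regions sharing at least one residue (inclusive coordinates)."""
    ordered = sorted(regions, key=lambda r: r.start)
    pairs = []
    for i, a in enumerate(ordered):
        for b in ordered[i + 1:]:
            if b.start > a.end:
                break  # sorted by start, so no later region can overlap a
            pairs.append((a, b))
    return pairs


# A few rows copied from the table above.
SAMPLE = """
Source Domain Start End
Pfam Collagen 351 412
Pfam Collagen 386 444
Pfam Collagen 413 478
Pfam C4 1446 1553
Pfam C4 1556 1667
"""

regions = parse_regions(SAMPLE)
pfam = [r for r in regions if r.source == "Pfam"]
for a, b in overlapping_pairs(pfam):
    print(f"{a.domain} {a.start}-{a.end} overlaps {b.domain} {b.start}-{b.end}")
```

Sorting by start position lets the inner loop stop at the first region that begins after the current one ends, which keeps the check cheap even for the full table.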

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession Q01955. The sequence is stored in the Pfam database and is updated only with each new Pfam release, so the copy stored here may differ from the one currently held by UniProt.

Sequence:
   1 MSARTAPRPQ VLLLPLLLVL LAAAPAASKG CVCKDKGQCF CDGAKGEKGE 50
  51 KGFPGPPGSP GQKGFTGPEG LPGPQGPKGF PGLPGLTGSK GVRGISGLPG 100
 101 FSGSPGLPGT PGNTGPYGLV GVPGCSGSKG EQGFPGLPGT LGYPGIPGAA 150
 151 GLKGQKGAPA KEEDIELDAK GDPGLPGAPG PQGLPGPPGF PGPVGPPGPP 200
 201 GFFGFPGAMG PRGPKGHMGE RVIGHKGERG VKGLTGPPGP PGTVIVTLTG 250
 251 PDNRTDLKGE KGDKGAMGEP GPPGPSGLPG ESYGSEKGAP GDPGLQGKPG 300
 301 KDGVPGFPGS EGVKGNRGFP GLMGEDGIKG QKGDIGPPGF RGPTEYYDTY 350
 351 QEKGDEGTPG PPGPRGARGP QGPSGPPGVP GSPGSSRPGL RGAPGWPGLK 400
 401 GSKGERGRPG KDAMGTPGSP GCAGSPGLPG SPGPPGPPGD IVFRKGPPGD 450
 451 HGLPGYLGSP GIPGVDGPKG EPGLLCTQCP YIPGPPGLPG LPGLHGVKGI 500
 501 PGRQGAAGLK GSPGSPGNTG LPGFPGFPGA QGDPGLKGEK GETLQPEGQV 550
 551 GVPGDPGLRG QPGRKGLDGI PGTPGVKGLP GPKGELALSG EKGDQGPPGD 600
 601 PGSPGSPGPA GPAGPPGYGP QGEPGLQGTQ GVPGAPGPPG EAGPRGELSV 650
 651 STPVPGPPGP PGPPGHPGPQ GPPGIPGSLG KCGDPGLPGP DGEPGIPGIG 700
 701 FPGPPGPKGD QGFPGTKGSL GCPGKMGEPG LPGKPGLPGA KGEPAVAMPG 750
 751 GPGTPGFPGE RGNSGEHGEI GLPGLPGLPG TPGNEGLDGP RGDPGQPGPP 800
 801 GEQGPPGRCI EGPRGAQGLP GLNGLKGQQG RRGKTGPKGD PGIPGLDRSG 850
 851 FPGETGSPGI PGHQGEMGPL GQRGYPGNPG ILGPPGEDGV IGMMGFPGAI 900
 901 GPPGPPGNPG TPGQRGSPGI PGVKGQRGTP GAKGEQGDKG NPGPSEISHV 950
 951 IGDKGEPGLK GFAGNPGEKG NRGVPGMPGL KGLKGLPGPA GPPGPRGDLG 1000
1001 STGNPGEPGL RGIPGSMGNM GMPGSKGKRG TLGFPGRAGR PGLPGIHGLQ 1050
1051 GDKGEPGYSE GTRPGPPGPT GDPGLPGDMG KKGEMGQPGP PGHLGPAGPE 1100
1101 GAPGSPGSPG LPGKPGPHGD LGFKGIKGLL GPPGIRGPPG LPGFPGSPGP 1150
1151 MGIRGDQGRD GIPGPAGEKG ETGLLRAPPG PRGNPGAQGA KGDRGAPGFP 1200
1201 GLPGRKGAMG DAGPRGPTGI EGFPGPPGLP GAIIPGQTGN RGPPGSRGSP 1250
1251 GAPGPPGPPG SHVIGIKGDK GSMGHPGPKG PPGTAGDMGP PGRLGAPGTP 1300
1301 GLPGPRGDPG FQGFPGVKGE KGNPGFLGSI GPPGPIGPKG PPGVRGDPGT 1350
1351 LKIISLPGSP GPPGTPGEPG MQGEPGPPGP PGNLGPCGPR GKPGKDGKPG 1400
1401 TPGPAGEKGN KGSKGEPGPA GSDGLPGLKG KRGDSGSPAT WTTRGFVFTR 1450
1451 HSQTTAIPSC PEGTVPLYSG FSFLFVQGNQ RAHGQDLGTL GSCLQRFTTM 1500
1501 PFLFCNVNDV CNFASRNDYS YWLSTPALMP MNMAPITGRA LEPYISRCTV 1550
1551 CEGPAIAIAV HSQTTDIPPC PHGWISLWKG FSFIMFTSAG SEGTGQALAS 1600
1601 PGSCLEEFRA SPFLECHGRG TCNYYSNSYS FWLASLNPER MFRKPIPSTV 1650
1651 KAGELEKIIS RCQVCMKKRH 1670

Checksums:
CRC64:AA65D50903D82B99
MD5:d51e1bfcf568b7524962d970452650c0
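
Because the stored sequence may differ from the current UniProt one, a checksum comparison is a quick way to detect a mismatch. The sketch below strips the position numbers and spacing from a formatted sequence and computes an MD5 digest with Python's standard `hashlib`. It assumes the MD5 above is taken over the plain, uppercase residue string, which this page does not state. The helper names `clean_sequence` and `md5_hex` are hypothetical. CRC64 is not covered because it is not in the Python standard library.

```python
import hashlib
import re


def clean_sequence(formatted: str) -> str:
    """Remove position numbers and whitespace, leaving only residue letters."""
    return re.sub(r"[\d\s]+", "", formatted).upper()


def md5_hex(sequence: str) -> str:
    """MD5 digest of the residue string (assumed to be what the page reports)."""
    return hashlib.md5(sequence.encode("ascii")).hexdigest()


# First 50 residues of the entry, with their position markers.
formatted = "1 MSARTAPRPQ VLLLPLLLVL LAAAPAASKG CVCKDKGQCF CDGAKGEKGE 50"

first_block = clean_sequence(formatted)
print(len(first_block))      # 50
print(md5_hex(first_block))  # digest of this fragment only, not of the full entry
```

To check the full entry, pass the complete formatted sequence to `clean_sequence` and compare the result of `md5_hex` with the MD5 value listed above.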

Structures

For sequences that have a structure in the Protein Data Bank (PDB), we use the mapping between the UniProt, PDB and Pfam coordinate systems from the PDBe SIFTS project to map Pfam domains onto three-dimensional structures. The table below shows the mapping between Pfam domains, this UniProt entry and the corresponding three-dimensional structure.

Pfam family UniProt residues PDB ID PDB chain ID PDB residues
C4 1446 - 1553 5NB0 A 6 - 113
C4 1446 - 1553 5NB0 B 6 - 113
C4 1446 - 1553 5NB0 C 6 - 113
C4 1446 - 1553 5NB0 D 6 - 113
C4 1446 - 1553 5NB0 E 6 - 113
C4 1446 - 1553 5NB0 F 6 - 113
C4 1446 - 1553 5NB0 G 6 - 113
C4 1446 - 1553 5NB0 H 6 - 113
C4 1556 - 1667 5NB0 A 116 - 227
C4 1556 - 1667 5NB0 B 116 - 227
C4 1556 - 1667 5NB0 C 116 - 227
C4 1556 - 1667 5NB0 D 116 - 227
C4 1556 - 1667 5NB0 E 116 - 227
C4 1556 - 1667 5NB0 F 116 - 227
C4 1556 - 1667 5NB0 G 116 - 227
C4 1556 - 1667 5NB0 H 116 - 227
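
In the table above, both C4 segments have the same length in UniProt and PDB numbering (1446 to 1553 maps to 6 to 113, and 1556 to 1667 maps to 116 to 227), so a constant offset per segment would convert one numbering to the other. The sketch below illustrates this under the assumption that numbering is contiguous within each segment, with no gaps or insertion codes. The real SIFTS mapping is defined per residue and should be preferred. `Segment` and `uniprot_to_pdb` are hypothetical names.

```python
from typing import List, NamedTuple, Optional


class Segment(NamedTuple):
    uniprot_start: int
    uniprot_end: int
    pdb_start: int


# Segments taken from the table above; they are the same for every 5NB0 chain (A to H).
SEGMENTS = [
    Segment(1446, 1553, 6),
    Segment(1556, 1667, 116),
]


def uniprot_to_pdb(pos: int, segments: List[Segment] = SEGMENTS) -> Optional[int]:
    """Map a UniProt residue number to a PDB residue number, or None if unmapped."""
    for seg in segments:
        if seg.uniprot_start <= pos <= seg.uniprot_end:
            # Assumes contiguous numbering inside the segment.
            return seg.pdb_start + (pos - seg.uniprot_start)
    return None


print(uniprot_to_pdb(1446))  # 6
print(uniprot_to_pdb(1667))  # 227
print(uniprot_to_pdb(1555))  # None (falls between the two mapped segments)
```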

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.