Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
0  structures 1  species 0  interactions 1  sequence 1  architecture

Protein: COPA1_HUMAN (Q9BXS0)

Summary

This is the summary of UniProt entry COPA1_HUMAN (Q9BXS0).

Description: Collagen alpha-1(XXV) chain
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
Length: 654 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Download the data used to generate the domain graphic in JSON format.

Show or hide the data used to generate the graphic in JSON format.

Source Domain Start End
disorder n/a 3 28
low_complexity n/a 33 49
transmembrane n/a 34 54
disorder n/a 62 66
disorder n/a 72 84
disorder n/a 88 90
disorder n/a 93 94
disorder n/a 104 106
disorder n/a 113 654
Pfam Collagen 118 165
low_complexity n/a 122 162
low_complexity n/a 187 218
low_complexity n/a 218 233
low_complexity n/a 229 245
low_complexity n/a 272 287
Pfam Collagen 310 372
low_complexity n/a 321 343
low_complexity n/a 346 358
low_complexity n/a 360 388
Pfam Collagen 371 427
Pfam Collagen 447 505
low_complexity n/a 448 466
low_complexity n/a 516 550
Pfam Collagen 574 647
low_complexity n/a 603 615

Show or hide domain scores.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession Q9BXS0. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MLLKKHAGKG GGREPRSEDP TPAEQHCART MPPCAVLAAL LSVVAVVSCL
50
51
YLGVKTNDLQ ARIAALESAK GAPSIHLLPD TLDHLKTMVQ EKVERLLAQK
100
101
SYEHMAKIRI AREAPSECNC PAGPPGKRGK RGRRGESGPP GQPGPQGPPG
150
151
PKGDKGEQGD QGPRMVFPKI NHGFLSADQQ LIKRRLIKGD QGQAGPPGPP
200
201
GPPGPRGPPG DTGKDGPRGM PGVPGEPGKP GEQGLMGPLG PPGQKGSIGA
250
251
PGIPGMNGQK GEPGLPGAVG QNGIPGPKGE PGEQGEKGDA GENGPKGDTG
300
301
EKGDPGSSAA GIKGEPGESG RPGQKGEPGL PGLPGLPGIK GEPGFIGPQG
350
351
EPGLPGLPGT KGERGEAGPP GRGERGEPGA PGPKGKQGES GTRGPKGSKG
400
401
DRGEKGDSGA QGPRGPPGQK GDQGATEIID YNGNLHEALQ RITTLTVTGP
450
451
PGPPGPQGLQ GPKGEQGSPG IPGMDGEQGL KGSKGDMGDP GMTGEKGGIG
500
501
LPGLPGANGM KGEKGDSGMP GPQGPSIIGP PGPPGPHGPP GPMGPHGLPG
550
551
PKGTDGPMGP HGPAGPKGER GEKGAMGEPG PRGPYGLPGK DGEPGLDGFP
600
601
GPRGEKGDLG EKGEKGFRGV KGEKGEPGQP GLDGLDAPCQ LGPDGLPMPG
650
651
CWQK                                                  
654
 

Show the unformatted sequence.

Checksums:
CRC64:D6DFB4FB157C05A2
MD5:411e92966dd1502fa4cbf6613ba5b7c1

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.