Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
0  structures 1  species 0  interactions 1  sequence 1  architecture

Protein: CLGN_HUMAN (O14967)

Summary

This is the summary of UniProt entry CLGN_HUMAN (O14967).

Description: Calmegin
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
Length: 610 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Download the data used to generate the domain graphic in JSON format.

Show or hide the data used to generate the graphic in JSON format.

Source Domain Start End
sig_p n/a 1 19
disorder n/a 30 39
Pfam Calreticulin 62 429
disorder n/a 205 208
disorder n/a 253 255
low_complexity n/a 255 269
disorder n/a 258 310
disorder n/a 312 346
transmembrane n/a 471 492
low_complexity n/a 494 506
disorder n/a 516 518
low_complexity n/a 518 531
disorder n/a 521 608
low_complexity n/a 548 572

Show or hide domain scores.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession O14967. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MHFQAFWLCL GLLFISINAE FMDDDVETED FEENSEEIDV NESELSSEIK
50
51
YKTPQPIGEV YFAETFDSGR LAGWVLSKAK KDDMDEEISI YDGRWEIEEL
100
101
KENQVPGDRG LVLKSRAKHH AISAVLAKPF IFADKPLIVQ YEVNFQDGID
150
151
CGGAYIKLLA DTDDLILENF YDKTSYIIMF GPDKCGEDYK LHFIFRHKHP
200
201
KTGVFEEKHA KPPDVDLKKF FTDRKTHLYT LVMNPDDTFE VLVDQTVVNK
250
251
GSLLEDVVPP IKPPKEIEDP NDKKPEEWDE RAKIPDPSAV KPEDWDESEP
300
301
AQIEDSSVVK PAGWLDDEPK FIPDPNAEKP DDWNEDTDGE WEAPQILNPA
350
351
CRIGCGEWKP PMIDNPKYKG VWRPPLVDNP NYQGIWSPRK IPNPDYFEDD
400
401
HPFLLTSFSA LGLELWSMTS DIYFDNFIIC SEKEVADHWA ADGWRWKIMI
450
451
ANANKPGVLK QLMAAAEGHP WLWLIYLVTA GVPIALITSF CWPRKVKKKH
500
501
KDTEYKKTDI CIPQTKGVLE QEEKEEKAAL EKPMDLEEEK KQNDGEMLEK
550
551
EEESEPEEKS EEEIEIIEGQ EESNQSNKSG SEDEMKEADE STGSGDGPIK
600
601
SVRKRRVRKD                                            
610
 

Show the unformatted sequence.

Checksums:
CRC64:F024FC4010D42D7E
MD5:656b11a3890b1f3dc94e9f8d1e58a2e8

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.