Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
0  structures 1  species 0  interactions 1  sequence 1  architecture

Protein: COOA1_MOUSE (Q30D77)

Summary

This is the summary of UniProt entry COOA1_MOUSE (Q30D77).

Description: Collagen alpha-1(XXIV) chain
Source organism: Mus musculus (Mouse) (NCBI taxonomy ID 10090)
Length: 1733 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Download the data used to generate the domain graphic in JSON format.

Show or hide the data used to generate the graphic in JSON format.

Source Domain Start End
transmembrane n/a 21 39
low_complexity n/a 38 52
low_complexity n/a 58 72
disorder n/a 247 342
disorder n/a 350 394
disorder n/a 402 424
disorder n/a 427 429
disorder n/a 463 480
disorder n/a 484 485
disorder n/a 503 1510
Pfam Collagen 506 565
low_complexity n/a 508 529
low_complexity n/a 535 563
Pfam Collagen 561 623
low_complexity n/a 571 586
low_complexity n/a 586 601
Pfam Collagen 604 678
low_complexity n/a 654 673
low_complexity n/a 682 707
low_complexity n/a 699 724
low_complexity n/a 769 799
Pfam Collagen 772 837
low_complexity n/a 808 826
Pfam Collagen 865 938
low_complexity n/a 907 937
low_complexity n/a 936 952
Pfam Collagen 967 1042
low_complexity n/a 1056 1075
Pfam Collagen 1107 1180
Pfam Collagen 1159 1218
low_complexity n/a 1174 1189
low_complexity n/a 1192 1210
low_complexity n/a 1201 1214
Pfam Collagen 1218 1279
low_complexity n/a 1220 1239
low_complexity n/a 1232 1250
low_complexity n/a 1256 1283
Pfam Collagen 1270 1334
low_complexity n/a 1295 1310
low_complexity n/a 1322 1340
low_complexity n/a 1348 1373
low_complexity n/a 1373 1391
Pfam Collagen 1378 1443
low_complexity n/a 1433 1445
Pfam Collagen 1439 1500
low_complexity n/a 1454 1470
low_complexity n/a 1478 1500
disorder n/a 1514 1525
Pfam COLFI 1532 1616
low_complexity n/a 1539 1549
Pfam COLFI 1611 1732
disorder n/a 1709 1712

Show or hide domain scores.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession Q30D77. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MHLGAYRTRH GKVSPTTETK LFLRFIVLCV VWISVHAQGQ GIDILQQLGL
50
51
GGRDVRYTSS VTAVPSSSWS TPLPQGVHLT DFGVILTDNA YIESPLVNIL
100
101
PISLRQPLTV LIGLQSFKVN NAFLFSIRNN NRLQFGVQLL PKKLIVHVGG
150
151
KQTVTFNYSA HDERWHSFAI TVDHHVISMF VECGKRHFSG ETTSDVQTFD
200
201
PHSVFTLGSI NNSSAHFEGT VCQLEIMPST AASAEYCRHL KQQCLRADAS
250
251
QAQRNLPHTA GMPTRHPAHT PLPRGFPGTD SPQKRFTEQD SLPKGFDGTE
300
301
LPRETFADGK SIPNNRSNGS ATVHESQEHQ TPRAQLTSFH SGNISAVTLP
350
351
NYRIQAKEIT TKEETNLTLS VAHHLPSEAR MNEEGRINPL FAGFDNITQH
400
401
EEAAGLPLPK KASSGFAHTN QDTMKNLEKA LTANLYTNEL IEMERILNST
450
451
LYRVMYGPSV DNHLELRKEG EFYPDATNPI EGSYEPQAYD YYSYEDYNAV
500
501
LDMEYLRGPK GDPGPPGPPG PMGIPGPSGK RGPRGIPGPH GNPGLPGLPG
550
551
PKGPKGDPGL SPGQAASGEK GDPGLLGLVG PPGLQGAKGL KGHPGLPGLR
600
601
GEHGLPGLAG NIGSPGYPGR QGLAGPEGNP GSKGVRGFIG SPGEVGQLGP
650
651
EGERGTPGVR GKKGPKGRQG FPGDFGDRGP AGLDGSPGLV GGTGPPGFPG
700
701
VTGSVGPAGP TGPPGAPGPM GLSGSRGPSG IKGDKGEQGV AGEPGEPGYP
750
751
GDKGNIGSPG PPGIRGKSGP SGQPGDPGPQ GPSGPPGPEG FPGDIGIPGQ
800
801
NGPEGPKGHL GNRGPPGPPG LKGTQGEEGP IGPFGELGSR GKPGRKGYMG
850
851
EPGPEGLKGE VGDQGDIGKT GETGPVGLPG EVGITGSIGE KGERGSPGPL
900
901
GPQGEKGVMG YPGPPGAPGP MGPLGLPGLV GARGAPGSPG PKGQRGPRGP
950
951
DGLAGDQGGH GAKGEKGNQG KRGLPGLPGK AGSPGERGVQ GKPGFQGLPG
1000
1001
SSGDVGPAGE PGPRGLPGIA GLPGEMGVEG PPGTEGDSGL QGEPGAKGDG
1050
1051
GPAGSAGATG EPGPRGEPGA PGEEGLQGKD GLKGAPGGSG LPGEDGDKGE
1100
1101
MGLPGTAGPV GRPGQMGLPG PEGIVGTPGQ RGRLGKKGDK GQVGPTGEAG
1150
1151
SRGPPGSVGE NGPKGARGTR GAVGPLGLMG PEGEPGIPGY RGHQGQPGPS
1200
1201
GLPGPKGEKG YPGEDSTVLG PPGPPGEPGP MGEQGETGEH GEEGYKGHMG
1250
1251
VPGLRGATGQ QGPPGEPGDQ GGQGPKGERG SEGPQGKRGV PGPSGKPGIP
1300
1301
GVPGFPGPKG LQGYPGVDGM SGYPGKPGLP GKQGLLGVPG SPGRTGVAGS
1350
1351
PGPQGGKGAS GPPGSPGAPG PKGEQGLPGQ PGVPGQRGHR GTPGDQGLRG
1400
1401
APGLKGQPGE HGDQGLAGFQ GFPGPRGPEG DAGIVGIVGP KGPIGQRGNT
1450
1451
GPLGREGIIG PTGGTGPRGE KGFRGETGPQ GPRGQPGPPG PPGAPGPRRQ
1500
1501
MDINAAIRAL IESNSAQQME SYQNTEGTLI SHSSDIFKTL TYLSSLLSSI
1550
1551
KNPLGTRENP ARICKDLLSC QYKVSDGKYW IDPNLGCSSD AFEVFCNFSA
1600
1601
GGQTCLSPVS VTKLEFGVSK VQMNFLHLLS SEATHTITIH CLNTPRWSST
1650
1651
WADGPELPIS FKGWNGQIFE ENTLLEPQVL SDDCKIQDGS WHKAKFLFHT
1700
1701
QNPNQLPVTE VQNLPHLGTE QKRYIESNSV CFL                  
1733
 

Show the unformatted sequence.

Checksums:
CRC64:4DAA1E630EB923CE
MD5:01c8a16a253d42ae58db52e1618fec99

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.

AlphaFold Structure Prediction

The protein structure below has been predicted by DeepMind with AlphaFold. For more information, please visit the AlphaFold page for this protein.

Model confidence scale

  Very High (pLDDT > 90)
  Confident (90 > pLDDT > 70)
  Low (70 > pLDDT > 50)
  Very Low (pLDDT < 50)
Highly accurate protein structure prediction with AlphaFold. John Jumper, Richard Evans, Alexander Pritzel, Tim Green, Michael Figurnov, Olaf Ronneberger, Kathryn Tunyasuvunakool, Russ Bates, Augustin Žídek, Anna Potapenko, Alex Bridgland, Clemens Meyer, Simon A. A. Kohl, Andrew J. Ballard, Andrew Cowie, Bernardino Romera-Paredes, Stanislav Nikolov, Rishub Jain, Jonas Adler, Trevor Back, Stig Petersen, David Reiman, Ellen Clancy, Michal Zielinski, Martin Steinegger, Michalina Pacholska, Tamas Berghammer, Sebastian Bodenstein, David Silver, Oriol Vinyals, Andrew W. Senior, Koray Kavukcuoglu, Pushmeet Kohli & Demis Hassabis Nature 2021-07-15; DOI: 10.1038/s41586-021-03819-2;