
Protein: CO5A1_MOUSE (O88207)

Summary

This is the summary of UniProt entry CO5A1_MOUSE (O88207).

Description: Collagen alpha-1(V) chain
Source organism: Mus musculus (Mouse) (NCBI taxonomy ID 10090)
Length: 1838 amino acids
Reference Proteome: ✓

Please note: at the start of each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview shown here. Some UniProt entries may be removed from UniProt after a Pfam release, but they will not be removed from Pfam until the next Pfam data release.

Pfam domains


Source Domain Start End
sig_p n/a 1 36
low_complexity n/a 9 34
disorder n/a 59 62
Pfam Laminin_G_2 110 228
disorder n/a 151 153
disorder n/a 242 1606
low_complexity n/a 259 288
low_complexity n/a 300 314
low_complexity n/a 335 352
low_complexity n/a 374 387
low_complexity n/a 412 428
low_complexity n/a 460 496
Pfam Collagen 467 519
low_complexity n/a 493 517
low_complexity n/a 511 523
low_complexity n/a 545 558
Pfam Collagen 557 619
low_complexity n/a 562 586
low_complexity n/a 589 602
low_complexity n/a 595 613
low_complexity n/a 607 619
low_complexity n/a 643 673
low_complexity n/a 667 698
low_complexity n/a 712 742
low_complexity n/a 739 757
low_complexity n/a 760 793
low_complexity n/a 826 862
low_complexity n/a 895 925
low_complexity n/a 949 979
low_complexity n/a 984 1015
low_complexity n/a 1009 1033
low_complexity n/a 1075 1117
low_complexity n/a 1134 1165
low_complexity n/a 1174 1193
low_complexity n/a 1215 1243
low_complexity n/a 1249 1282
low_complexity n/a 1285 1316
low_complexity n/a 1309 1340
low_complexity n/a 1336 1372
low_complexity n/a 1369 1406
low_complexity n/a 1405 1421
low_complexity n/a 1450 1480
Pfam Collagen 1460 1529
low_complexity n/a 1485 1495
Pfam Collagen 1513 1575
low_complexity n/a 1521 1555
low_complexity n/a 1555 1570
Pfam COLFI 1607 1836
disorder n/a 1615 1617
disorder n/a 1622 1637
disorder n/a 1640 1642
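
The region table above can be read programmatically. The following is a minimal sketch, assuming the rows are available as whitespace-separated text in the same four-column layout (Source, Domain, Start, End) and that coordinates are 1-based and inclusive. The names `Region`, `parse_regions`, and `covered_length` are hypothetical helpers introduced for illustration; they are not part of any Pfam tool.

```python
from typing import NamedTuple


class Region(NamedTuple):
    source: str   # e.g. "Pfam", "disorder", "low_complexity", "sig_p"
    domain: str   # Pfam family name, or "n/a" for non-Pfam rows
    start: int    # assumed 1-based, inclusive
    end: int      # assumed 1-based, inclusive


def parse_regions(text: str) -> list[Region]:
    """Parse 'Source Domain Start End' rows, skipping the header and malformed lines."""
    regions = []
    for line in text.strip().splitlines():
        parts = line.split()
        if len(parts) != 4 or not (parts[2].isdigit() and parts[3].isdigit()):
            continue
        source, domain, start, end = parts
        regions.append(Region(source, domain, int(start), int(end)))
    return regions


def covered_length(regions: list[Region]) -> int:
    """Count residues covered by at least one region, counting overlaps once."""
    total = 0
    current_end = 0  # highest residue position already counted
    for region in sorted(regions, key=lambda r: r.start):
        start = max(region.start, current_end + 1)
        if region.end >= start:
            total += region.end - start + 1
            current_end = region.end
    return total


if __name__ == "__main__":
    # A few rows copied from the table above; paste the full table to analyse all regions.
    sample = """
    Source Domain Start End
    sig_p n/a 1 36
    Pfam Laminin_G_2 110 228
    Pfam Collagen 1460 1529
    Pfam Collagen 1513 1575
    """
    pfam_only = [r for r in parse_regions(sample) if r.source == "Pfam"]
    print(covered_length(pfam_only))
```

Overlapping matches, such as the two Collagen matches at 1460-1529 and 1513-1575, are counted once, so the result is the number of distinct residues covered rather than the sum of match lengths.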


Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession O88207. The sequence is stored in the Pfam database and updated only with each new Pfam release, so the copy stored here may differ from the one currently stored by UniProt.

Sequence:
1 MDVHTRWKAA RPGALLLSSP LLLFLLLLWA PPSSRAAQPA DLLEMLDFHN 50
51 LPSGVTKTTG FCATRRSSSE PDVAYRVSKD AQLSMPTKQL YPESGFPEDF 100
101 SILTTVKAKK GSQAFLVSIY NEQGIQQLGL ELGRSPVFLY EDHTGKPGPE 150
151 EYPLFPGINL SDGKWHRIAL SVYKKNVTLI LDCKKKITKF LSRSDHPIID 200
201 TNGIVMFGSR ILDDEIFEGD IQQLLFVSDN RAAYDYCEHY SPDCDTAVPD 250
251 TPQSQDPNPD EYYPEGEGET YYYEYPYYED PEDPGKEPAP TQKPVEAARE 300
301 TTEVPEEQTQ PLPEAPTVPE TSDTADKEDS LGIGDYDYVP PDDYYTPPPY 350
351 EDFGYGEGVE NPDQPTNPDS GAEVPTSTTV TSNTSNPAPG EGKDDLGGEF 400
401 TEETIKNLEE NYYDPYFDPD SDSSVSPSEI GPGMPANQDT IFEGIGGPRG 450
451 EKGQKGEPAI IEPGMLIEGP PGPEGPAGLP GPPGTTGPTG QMGDPGERGP 500
501 PGRPGLPGAD GLPGPPGTML MLPFRFGGGG DAGSKGPMVS AQESQAQAIL 550
551 QQARLALRGP AGPMGLTGRP GPMGPPGSGG LKGEPGDMGP QGPRGVQGPP 600
601 GPTGKPGRRG RAGSDGARGM PGQTGPKGDR GFDGLAGLPG EKGHRGDPGP 650
651 SGPPGIPGDD GERGDDGEVG PRGLPGEPGP RGLLGPKGPP GPPGPPGVTG 700
701 MDGQPGPKGN VGPQGEPGPP GQQGNPGAQG LPGPQGAIGP PGEKGPLGKP 750
751 GLPGMPGADG PPGHPGKEGP PGEKGGQGPP GPQGPIGYPG PRGVKGADGI 800
801 RGLKGTKGEK GEDGFPGFKG DMGIKGDRGE IGPPGPRGED GPEGPKGRGG 850
851 PNGDPGPLGP TGEKGKLGVP GLPGYPGRQG PKGSIGFPGF PGANGEKGGR 900
901 GTPGKPGPRG QRGPTGPRGE RGPRGITGKP GPKGNSGGDG PAGPPGERGP 950
951 NGPQGPTGFP GPKGPPGPPG KDGLPGHPGQ RGETGFQGKT GPPGPPGVVG 1000
1001 PQGPTGETGP MGERGHPGPP GPPGEQGLPG AAGKEGTKGD PGPAGLPGKD 1050
1051 GPPGLRGFPG DRGLPGPVGA LGLKGSEGPP GPPGPAGSPG ERGPAGAAGP 1100
1101 IGIPGRPGPQ GPPGPAGEKG LPGEKGPQGP AGRDGLQGPV GLPGPAGPVG 1150
1151 PPGEDGDKGE IGEPGQKGSK GDKGEQGPPG PTGPQGPIGQ PGPSGADGEP 1200
1201 GPRGQQGLFG QKGDEGSRGF PGPPGPVGLQ GLPGPPGEKG ETGDVGQMGP 1250
1251 PGPPGPRGPS GAPGADGPQG PPGGIGNPGA VGEKGEPGEA GDPGLPGEGG 1300
1301 PLGPKGERGE KGEAGPSGAA GPPGPKGPPG DDGPKGSPGP VGFPGDPGPP 1350
1351 GEPGPAGQDG PPGDKGDDGE PGQTGSPGPT GEPGPSGPPG KRGPPGPAGP 1400
1401 EGRQGEKGAK GEAGLEGPPG KTGPIGPQGA PGKPGPDGLR GIPGPVGEQG 1450
1451 LPGSPGPDGP PGPMGPPGLP GLKGDSGPKG EKGHPGLIGL IGPPGEQGEK 1500
1501 GDRGLPGPQG SSGPKGDQGI TGPSGPLGPP GPPGLPGPPG PKGAKGSSGP 1550
1551 TGPKGEAGHP GLPGPPGPPG EVIQPLPIQA SRTRRNIDAS QLLDDGAGES 1600
1601 YVDYADGMEE IFGSLNSLKL EIEQMKRPLG TQQNPARTCK DLQLCHPDFP 1650
1651 DGEYWVDPNQ GCSRDSFKVY CNFTAGGSTC VFPDKKSEGA RITSWPKENP 1700
1701 GSWFSEFKRG KLLSYVDAEG NPVGVVQMTF LRLLSASAHQ NVTYNCYQSV 1750
1751 AWQDAATGSY DKAIRFLGSN DEEMSYDNNP YIRALVDGCA TKKGYQKTVL 1800
1801 EIDTPKVEQV PIVDIMFNDF GEASQKFGFE VGPACFLG 1838


Checksums:
CRC64:D20F4E5198A09ECF
MD5:f56550c40973b535a216088c0671ea85
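
To compare a local copy of the sequence with the MD5 checksum listed above, the formatted sequence first has to be reduced to residue letters only. The following is a minimal sketch, assuming the MD5 is computed over the uppercase one-letter sequence with no whitespace or position numbers; this page does not state how the checksum is derived, so treat that as an assumption. The function names are hypothetical. CRC64 is not covered because the Python standard library has no CRC64 implementation.

```python
import hashlib


def clean_sequence(formatted: str) -> str:
    """Drop position numbers, spaces, and line breaks, keeping only residue letters."""
    return "".join(ch for ch in formatted if ch.isalpha()).upper()


def md5_hex(sequence: str) -> str:
    """Return the lowercase hexadecimal MD5 digest of an ASCII sequence string."""
    return hashlib.md5(sequence.encode("ascii")).hexdigest()


def matches_entry(formatted: str, expected_md5: str, expected_length: int) -> bool:
    """Check both the residue count and the MD5 digest of a formatted sequence.

    Assumption: the checksum covers the cleaned, uppercase sequence only.
    """
    sequence = clean_sequence(formatted)
    return len(sequence) == expected_length and md5_hex(sequence) == expected_md5.lower()


if __name__ == "__main__":
    # Only the first formatted line is shown here; paste the full sequence block
    # to run a meaningful comparison against the values listed on this page.
    first_line = "1 MDVHTRWKAA RPGALLLSSP LLLFLLLLWA PPSSRAAQPA DLLEMLDFHN 50"
    print(len(clean_sequence(first_line)))
    print(matches_entry(first_line, "f56550c40973b535a216088c0671ea85", 1838))
```

With the full sequence block as input, `matches_entry` should be called with the length (1838) and the MD5 value given on this page; a mismatch could mean either a copy error or that the checksum assumption above does not hold.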

TreeFam

TreeFam provides a phylogenetic tree of animal genes, with ortholog and paralog assignments.

AlphaFold Structure Prediction

The structure of this protein has been predicted by DeepMind with AlphaFold. For more information, see the AlphaFold page for this protein.

Model confidence scale

  Very High (pLDDT > 90)
  Confident (90 > pLDDT > 70)
  Low (70 > pLDDT > 50)
  Very Low (pLDDT < 50)
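
The confidence scale above maps a per-residue pLDDT score to one of four bands. The following is a minimal sketch of that mapping. The scale uses strict inequalities, so it does not say where scores of exactly 90, 70, or 50 belong; placing them in the lower band is an assumption made here, as is the hypothetical function name `plddt_band`.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band named on the scale.

    Assumption: boundary values (exactly 90, 70, 50) fall into the lower band,
    because the scale itself leaves them unassigned.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very High"
    if score > 70:
        return "Confident"
    if score > 50:
        return "Low"
    return "Very Low"


if __name__ == "__main__":
    for value in (95.2, 81.0, 63.5, 32.1):
        print(value, plddt_band(value))
```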

Reference: John Jumper, Richard Evans, Alexander Pritzel, Tim Green, Michael Figurnov, Olaf Ronneberger, Kathryn Tunyasuvunakool, Russ Bates, Augustin Žídek, Anna Potapenko, Alex Bridgland, Clemens Meyer, Simon A. A. Kohl, Andrew J. Ballard, Andrew Cowie, Bernardino Romera-Paredes, Stanislav Nikolov, Rishub Jain, Jonas Adler, Trevor Back, Stig Petersen, David Reiman, Ellen Clancy, Michal Zielinski, Martin Steinegger, Michalina Pacholska, Tamas Berghammer, Sebastian Bodenstein, David Silver, Oriol Vinyals, Andrew W. Senior, Koray Kavukcuoglu, Pushmeet Kohli & Demis Hassabis. Highly accurate protein structure prediction with AlphaFold. Nature, 2021-07-15. DOI: 10.1038/s41586-021-03819-2