
Protein: CORA1_MOUSE (Q5QNQ9)

Summary

This is the summary of UniProt entry CORA1_MOUSE (Q5QNQ9).

Description: Collagen alpha-1(XXVII) chain
Source organism: Mus musculus (Mouse) (NCBI taxonomy ID 10090)
Length: 1845 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End
low_complexity n/a 2 21
disorder n/a 6 9
disorder n/a 185 186
disorder n/a 270 282
disorder n/a 284 1612
low_complexity n/a 325 343
low_complexity n/a 356 372
low_complexity n/a 428 443
low_complexity n/a 455 467
low_complexity n/a 584 597
Pfam Collagen 609 670
low_complexity n/a 619 664
Pfam Collagen 666 731
low_complexity n/a 676 691
low_complexity n/a 693 719
low_complexity n/a 790 808
low_complexity n/a 817 838
low_complexity n/a 858 880
low_complexity n/a 886 910
low_complexity n/a 912 935
low_complexity n/a 928 946
Pfam Collagen 1012 1080
low_complexity n/a 1015 1042
Pfam Collagen 1033 1103
Pfam Collagen 1063 1130
low_complexity n/a 1069 1090
low_complexity n/a 1111 1138
low_complexity n/a 1150 1168
Pfam Collagen 1207 1281
low_complexity n/a 1224 1240
low_complexity n/a 1255 1270
Pfam Collagen 1261 1324
low_complexity n/a 1299 1325
Pfam Collagen 1323 1384
low_complexity n/a 1325 1352
low_complexity n/a 1360 1403
low_complexity n/a 1400 1422
low_complexity n/a 1438 1466
Pfam Collagen 1544 1606
low_complexity n/a 1556 1574
low_complexity n/a 1586 1603
disorder n/a 1615 1616
disorder n/a 1618 1621
disorder n/a 1632 1635
Pfam COLFI 1643 1731
Pfam COLFI 1721 1844
disorder n/a 1821 1823
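
The rows above follow a simple whitespace-separated layout, so they are easy to process programmatically. The following is a minimal sketch, not part of any Pfam tooling: the `Region` tuple and the `parse_regions` helper are illustrative names, and only a handful of rows from the table are embedded as sample input.

```python
from collections import Counter
from typing import NamedTuple


class Region(NamedTuple):
    source: str   # "Pfam", "low_complexity" or "disorder"
    domain: str   # Pfam family name, or "n/a" for non-Pfam regions
    start: int    # first residue (1-based, inclusive)
    end: int      # last residue (1-based, inclusive)


def parse_regions(text: str) -> list[Region]:
    """Parse 'Source Domain Start End' rows; skip blanks and the header."""
    regions = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 4 or fields[0] == "Source":
            continue
        source, domain, start, end = fields
        regions.append(Region(source, domain, int(start), int(end)))
    return regions


# A few rows copied from the table above (not the full list).
SAMPLE = """\
Source Domain Start End
low_complexity n/a 2 21
Pfam Collagen 609 670
Pfam Collagen 666 731
Pfam COLFI 1643 1731
Pfam COLFI 1721 1844
"""

regions = parse_regions(SAMPLE)

# Count how many times each Pfam family appears in the sample rows.
pfam_counts = Counter(r.domain for r in regions if r.source == "Pfam")
print(pfam_counts)
```

Feeding the full table to `parse_regions` in place of `SAMPLE` would give the complete set of regions for this entry.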

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession Q5QNQ9. The sequence is stored in the Pfam database and updated only with each new Pfam release, so the copy stored here may differ from the one currently in UniProt.

Sequence:
   1 MGTGFARGAR GTAASGPGGG FLFAWILVSF TCHLASTQGA PEDVDVLQRL 50
  51 GLSWTKAGGG RSPTPPGVIP FPSGFIFTQR AKLQAPTANV LPTTLGRELA 100
 101 LVLSLCSHRV NHAFLFAIRS RKHKLQLGLQ FLPGRTIIHL GPRQSVAFDL 150
 151 DVHDGRWHHL ALELRGRTVT MVTACGQHRV PVPLPSRRDS MLDPQGSFLL 200
 201 GKVNPRAVQF EGALCQFSIH PVAQVAHNYC AHLRERCRQV DTYSPQVGTL 250
 251 FPWDSGPAFA LHPEPALLGL GNLTRTPATL GARPVSRALA VTLAPAMPTK 300
 301 PLRTVHPDVS EHSSSQTPLS PAKQSARKTP SPSSSASLAN STRVYRPAAA 350
 351 QPRQITTTSP TKRSPTKPSV SPLSVTPMKS PHATQKTGVP SFTKPVPPTQ 400
 401 KPAPFTSYLA PSKASSPTVR PVQKTFMTPR PPVPSPQPLR PTTGLSKKFT 450
 451 NPTVAKSKSK TTSWASKPVL ARSSVPKTLQ QTVLSQSPVS YLGSQTLAPA 500
 501 LPPLGVGNPR TMPPTRDSAL TPAGSKKFTG RETSKKTRQK SSPRKPEPLS 550
 551 PGKSARDASP RDLTTKPSRP STPALVLAPA YLLSSSPQPT SSSFPFFHLL 600
 601 GPTPFPMLMG PPGSKGDCGL PGPPGLPGLP GSPGARGPRG PPGPYGNPGP 650
 651 PGPPGAKGQK GDPGLSPGQA HDGAKGNMGL PGLSGNPGPL GRKGHKGHPG 700
 701 AAGHPGEQGQ PGPEGSPGAK GYPGRQGFPG PVGDPGPKGS RGYIGLPGLF 750
 751 GLPGSDGERG LPGVPGKRGE MGRPGFPGDF GERGPPGLDG NPGEIGLPGP 800
 801 PGVLGLIGDT GALGPVGYPG PKGMKGLMGG VGEPGLKGDK GEQGVPGVSG 850
 851 DPGFQGDKGS HGLPGLPGGR GKPGPLGKAG DKGSLGFPGP PGPEGFPGDI 900
 901 GPPGDNGPEG MKGKPGARGL PGPPGQLGPE GDEGPMGPPG VPGLEGQPGR 950
 951 KGFPGRPGLD GSKGEPGDPG RPGPVGEQGL MGFIGLVGEP GIVGEKGDRG 1000
1001 VMGPPGAPGP KGSMGHPGTP GGIGNPGEPG PWGPPGSRGL PGMRGAKGHR 1050
1051 GPRGPDGPAG EQGSKGLKGR VGPRGRPGQP GQQGAAGERG HSGAKGFLGI 1100
1101 PGPSGPPGAK GLPGEPGSQG PQGPVGPPGE MGPKGPPGAV GEPGLPGDSG 1150
1151 MKGDLGPLGP PGEQGLIGQR GEPGLEGDHG PVGPDGLKGD RGDPGPDGEH 1200
1201 GEKGQEGLKG EDGSPGPPGI TGVPGREGKP GKQGEKGQRG AKGAKGHQGY 1250
1251 LGEMGIPGEP GPPGTPGPKG SRGTLGPTGA PGRMGAQGEP GLAGYNGHKG 1300
1301 ITGPLGPPGP KGEKGDQGED GKTEGPPGPP GDRGPVGDRG DRGEPGDPGY 1350
1351 PGQEGVQGLR GEPGQQGQPG HPGPRGRPGP KGSKGEEGPK GKPGKAGPSG 1400
1401 RRGTQGLQGL PGPRGVVGRQ GPEGTAGSDG IPGRDGRPGY QGDQGNDGDP 1450
1451 GPVGPAGRRG NPGVAGLPGA QGPPGFKGES GLPGQLGPPG KRGTEGGTGL 1500
1501 PGNQGEPGSK GQPGDSGEMG FPGVAGLFGP KGPPGDIGFK GIQGPRGPPG 1550
1551 LMGKEGIIGP PGMLGPSGLP GPKGDRGSRG DLGLQGPRGP PGPRGRPGPP 1600
1601 GPPWHPIQFQ QDDLGAAFQT WMDAQGAVRS EGYSYPDQLA LDQGGEIFKT 1650
1651 LHYLSNLIQS IKTPLGTKEN PARVCRDLMD CEQRMADGTY WVDPNLGCSS 1700
1701 DTIEVSCNFT QGGQTCLKPI TASKAEFAVS RVQMNFLHLL SSEGTQHITI 1750
1751 HCLNMTVWQE GPGRSSARQA VRFRAWNGQV FEAGGQFRPE VSMDGCKVHD 1800
1801 GRWHQTLFTF RTQDPQQLPI VSVDNLPPVS SGKQYRLEVG PACFL 1845

Checksums:
CRC64:F5D7F43D834BB770
MD5:e96796bb9a6117732fc49bb187cf9049
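
To compare a local copy of the sequence against the checksums above, one option is to strip the position numbers and spaces from the formatted listing and hash the remaining residues. The sketch below assumes that the MD5 is computed over the plain uppercase residue string, which this page does not state explicitly. The `clean_sequence` and `sequence_md5` helpers are illustrative names. CRC64 is left out because the Python standard library does not provide it.

```python
import hashlib


def clean_sequence(formatted: str) -> str:
    """Keep only letters, dropping position numbers, spaces and newlines."""
    return "".join(ch for ch in formatted if ch.isalpha()).upper()


def sequence_md5(formatted: str) -> str:
    """MD5 hex digest of the cleaned residue string (assumed convention)."""
    return hashlib.md5(clean_sequence(formatted).encode("ascii")).hexdigest()


# The first 50 residues of the listing, with their position markers.
first_block = """1
MGTGFARGAR GTAASGPGGG FLFAWILVSF TCHLASTQGA PEDVDVLQRL
50"""

residues = clean_sequence(first_block)
print(len(residues))              # number of residues left after cleaning
print(sequence_md5(first_block))  # digest of this fragment only
```

Running `sequence_md5` over the full 1845-residue listing and comparing the result with the MD5 value above would indicate whether the copy is intact, provided the assumed convention holds.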

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.

AlphaFold Structure Prediction

The protein structure below has been predicted by DeepMind with AlphaFold. For more information, please visit the AlphaFold page for this protein.

Model confidence scale

  Very High (pLDDT > 90)
  Confident (90 > pLDDT > 70)
  Low (70 > pLDDT > 50)
  Very Low (pLDDT < 50)
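
The confidence scale above can be applied to per-residue pLDDT scores with a small lookup. This is only a sketch: `plddt_band` is an illustrative name, and because the scale is stated with strict inequalities, the handling of scores that fall exactly on 90, 70 or 50 is an assumption (they are placed in the lower band here).

```python
def plddt_band(plddt: float) -> str:
    """Map a per-residue pLDDT score (0-100) to a confidence band.

    Assumption: a score exactly on a boundary (90, 70 or 50) is assigned
    to the lower band, since the scale only gives strict inequalities.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very High"
    if plddt > 70:
        return "Confident"
    if plddt > 50:
        return "Low"
    return "Very Low"


# Example scores (made-up values, for illustration only).
for score in (95.2, 81.0, 63.5, 31.7):
    print(score, plddt_band(score))
```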
Highly accurate protein structure prediction with AlphaFold. John Jumper, Richard Evans, Alexander Pritzel, Tim Green, Michael Figurnov, Olaf Ronneberger, Kathryn Tunyasuvunakool, Russ Bates, Augustin Žídek, Anna Potapenko, Alex Bridgland, Clemens Meyer, Simon A. A. Kohl, Andrew J. Ballard, Andrew Cowie, Bernardino Romera-Paredes, Stanislav Nikolov, Rishub Jain, Jonas Adler, Trevor Back, Stig Petersen, David Reiman, Ellen Clancy, Michal Zielinski, Martin Steinegger, Michalina Pacholska, Tamas Berghammer, Sebastian Bodenstein, David Silver, Oriol Vinyals, Andrew W. Senior, Koray Kavukcuoglu, Pushmeet Kohli & Demis Hassabis. Nature, 2021-07-15. DOI: 10.1038/s41586-021-03819-2