Protein: Q2LDA1_DANRE (Q2LDA1)

Summary

This is the summary of UniProt entry Q2LDA1_DANRE (Q2LDA1).

Description: Collagen, type II, alpha 1a {ECO:0000313|Ensembl:ENSDARP00000091007}
Source organism: Danio rerio (Zebrafish) (Brachydanio rerio) (NCBI taxonomy ID 7955)
Length: 1491 amino acids
Reference Proteome: ✓

Please note: at the start of each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview shown here. Although some UniProt entries may be removed after a Pfam release, they will not be removed from Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End
sig_p n/a 1 26
low_complexity n/a 4 13
low_complexity n/a 9 24
disorder n/a 37 38
Pfam VWC 38 93
disorder n/a 100 1290
Pfam Collagen 120 185
low_complexity n/a 121 133
low_complexity n/a 157 182
low_complexity n/a 195 229
Pfam Collagen 203 263
low_complexity n/a 229 259
Pfam Collagen 262 321
low_complexity n/a 301 326
Pfam Collagen 316 381
low_complexity n/a 322 340
low_complexity n/a 334 352
low_complexity n/a 352 376
low_complexity n/a 397 416
low_complexity n/a 418 455
Pfam Collagen 472 547
low_complexity n/a 477 502
Pfam Collagen 574 643
low_complexity n/a 583 604
Pfam Collagen 628 695
low_complexity n/a 640 659
low_complexity n/a 682 706
low_complexity n/a 697 736
low_complexity n/a 733 757
Pfam Collagen 742 806
low_complexity n/a 790 808
Pfam Collagen 793 863
low_complexity n/a 811 829
low_complexity n/a 832 850
low_complexity n/a 862 889
low_complexity n/a 892 907
low_complexity n/a 901 934
low_complexity n/a 936 958
low_complexity n/a 952 982
Pfam Collagen 985 1062
low_complexity n/a 1012 1036
low_complexity n/a 1066 1087
low_complexity n/a 1093 1115
Pfam Collagen 1105 1172
low_complexity n/a 1144 1177
Pfam Collagen 1162 1222
low_complexity n/a 1180 1221
Pfam COLFI 1255 1490
disorder n/a 1338 1340
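
The region table above is easy to process programmatically. The following is a minimal sketch, assuming the rows are available as whitespace-separated text in the same four-column layout (Source, Domain, Start, End) and that coordinates are 1-based and inclusive. The names `Region`, `parse_regions` and `covered_residues` are illustrative and not part of any Pfam tool. It parses the rows and counts how many residues are covered by at least one Pfam domain, merging overlapping matches so that shared residues are counted once.

```python
from typing import NamedTuple


class Region(NamedTuple):
    source: str   # e.g. "Pfam", "low_complexity", "disorder", "sig_p"
    domain: str   # domain name, or "n/a" for non-Pfam regions
    start: int    # assumed 1-based, inclusive
    end: int      # assumed 1-based, inclusive


def parse_regions(table_text: str) -> list[Region]:
    """Parse whitespace-separated rows; the header and blank lines are skipped."""
    regions = []
    for line in table_text.splitlines():
        fields = line.split()
        # Data rows have exactly four fields, the last two being integers.
        if len(fields) != 4 or not (fields[2].isdigit() and fields[3].isdigit()):
            continue
        regions.append(Region(fields[0], fields[1], int(fields[2]), int(fields[3])))
    return regions


def covered_residues(regions: list[Region]) -> int:
    """Count residues covered by at least one region, merging overlaps."""
    total = 0
    cur_start = cur_end = None
    for start, end in sorted((r.start, r.end) for r in regions):
        if cur_end is None or start > cur_end:
            # Close the previous merged interval before starting a new one.
            if cur_end is not None:
                total += cur_end - cur_start + 1
            cur_start, cur_end = start, end
        else:
            cur_end = max(cur_end, end)
    if cur_end is not None:
        total += cur_end - cur_start + 1
    return total


# A few rows copied from the table above, used as sample input.
sample = """Source Domain Start End
Pfam VWC 38 93
disorder n/a 100 1290
Pfam Collagen 120 185
Pfam Collagen 1105 1172
Pfam Collagen 1162 1222
Pfam COLFI 1255 1490
"""

pfam_only = [r for r in parse_regions(sample) if r.source == "Pfam"]
print(covered_residues(pfam_only))
```

Merging before counting matters here because several Collagen matches in the table overlap, for example 1105-1172 and 1162-1222.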

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession Q2LDA1. This sequence is stored in the Pfam database and is updated only with each new Pfam release, so the sequence we store may differ from the one currently stored by UniProt.

Sequence:
   1 MFRLLDSRTL LLLVATHSVL LSLVRCQQED DQEEFGGCVQ DGQQYADRAV 50
  51 WKPEPCRVCV CDSGTVLCDE VICEDLNDCA NPIISPGECC PICPADTDDP 100
 101 IGSLGAKGQK GEPGDITDVV GPRGPAGPMG PPGEQGTRGE RGAKGEKGSP 150
 151 GPRGRDGEPG TPGNPGPPGP PGPNGPPGLG GNFAAQMAGG FDEKAGGAQM 200
 201 GVMQGPMGPM GPRGPPGPSG APGPQGFQGN PGETGEPGPA GALGPRGPPG 250
 251 PPGKPGSDGE AGKPGKAGER GPPGPQGARG FPGTPGLPGI KGHRGHPGLD 300
 301 GAKGEAGAAG AKGEAGSNGE SGAPGPMGPR GLPGERGRPG ATGAAGARGN 350
 351 DGLPGPAGPP GPVGPAGAPG FPGSPGSKGE AGPTGARGPE GAQGPRGEAG 400
 401 TPGSPGPAGA SGNPGTDGIP GAKGSAGASG IAGAPGFPGP RGPPGPQGAT 450
 451 GPLGPKGQSG DPGIPGFKGE AGPKGERGVL GPQGPPGPSG EEGKRGPRGE 500
 501 PGSAGPLGPP GERGAPGNRG FPGQDGLAGA KGAPGDRGVP GLSGPKGGTG 550
 551 DPGRPGEPGL PGARGLTGRP GDAGAQGKVG ATGAPGEDGR PGPPGPLGAR 600
 601 GQPGVMGFPG PKGANGEPGK PGEKGLVGRT GLRGLPGKDG ETGPSGPPGP 650
 651 VGAVGERGEQ GQPGPSGFQG LPGPTGAPGE PGKPGDQGVP GEGGAAGPTG 700
 701 PRGERGFPGE RGGAGPQGLQ GPRGLPGTPG TDGPKGAIGP AGAAGAQGPP 750
 751 GLQGMPGERG AVGISGAKGD RGDSGEKGPE GAPGKDGSRG LTGPIGPPGP 800
 801 SGPNGAKGET GPIGSIGAPG ARGAPGDRGE IGAPGPAGFA GPPGADGQPG 850
 851 NKGEQGESGQ KGDSGAPGPQ GPSGAPGPVG PTGVTGPKGA RGAQGAPGAT 900
 901 GFPGAAGRVG PPGPNGNPGA AGPAGPSGKD GPKGVRGDAG PPGRAGDAGL 950
 951 RGPPGAPGEK GEAGEDGPPG PDGPSGPAGL AGQRGIVGLP GQRGERGFPG 1000
1001 LPGPSGEPGK QGAPGGSGDR GPPGPVGPPG LTGPAGETGR EGNPGSDGPP 1050
1051 GRDGAAGVKG ERGNTGPIGA PGAPGAPGAP GSVGPIGKQG DRGENGPQGP 1100
1101 AGPPGPAGAR GMVGPQGPRG DKGEAGEAGE RGQKGHRGFT GLQGLPGPPG 1150
1151 SPGDQGAAGP AGPSGAKGPS GPVGPAGKDG SNGQPGPIGP PGPRGRSGES 1200
1201 GPVGPPGNPG PPGPPGPPGP GIDMSAFAGL SQPEKGPDPL RYMRADEASS 1250
1251 SLRQHDVEVD ATLKSINGQI EDIRSPDGSR KNPARSCRDL KLCHPEWKSG 1300
1301 DYWVDPNLGS AADAIKVFCN METGETCVKP STPKIPRKNW WTSKSKAQKH 1350
1351 VWFGESMNGG FHFSYADGSQ TPSTTTIQLN FLRLLSTEAT QTITYHCKNS 1400
1401 VAYMDQATGN LKKAILLQGS NDVEIRAEGN SRFTYGVLED GCKKHTGQWA 1450
1451 KTVIEYKTQK TSRLPIMDIA PMDIGGADQE FGVDIGAVCF L 1491

Checksums:
CRC64:B83A0D20091418C4
MD5:3606e0497a3df96b601e30dd963322ca
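
To check a local copy of the sequence against the MD5 value, the formatting first has to be stripped. The sketch below assumes that the checksum is computed over the unformatted, upper-case sequence with no position numbers, spaces or line breaks. The page does not state this explicitly, so treat it as an assumption. The helper names `clean_sequence` and `sequence_md5` are illustrative. The CRC64 value is not covered here because the exact CRC-64 variant would need to be confirmed first.

```python
import hashlib


def clean_sequence(raw: str) -> str:
    """Drop position numbers, spaces and line breaks, keeping only letters."""
    return "".join(ch for ch in raw if ch.isascii() and ch.isalpha()).upper()


def sequence_md5(raw: str) -> str:
    """MD5 hex digest of the cleaned sequence (assumed checksum input)."""
    return hashlib.md5(clean_sequence(raw).encode("ascii")).hexdigest()


# First block of the sequence as displayed, including its position numbers.
first_block = "1 MFRLLDSRTL LLLVATHSVL LSLVRCQQED DQEEFGGCVQ DGQQYADRAV 50"
print(len(clean_sequence(first_block)))   # number of residues in this block
print(sequence_md5(first_block))          # digest of this block only
```

Passing the full displayed sequence to `sequence_md5` and comparing the result with the MD5 listed above is a quick way to detect copy or truncation errors. A cleaned length other than 1491 would indicate the same kind of problem.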

TreeFam

TreeFam provides a phylogenetic tree of animal genes, with ortholog and paralog assignments, for this protein.

AlphaFold Structure Prediction

The structure of this protein has been predicted by DeepMind with AlphaFold. For more information, see the AlphaFold page for this protein.

Model confidence scale

  Very High (pLDDT > 90)
  Confident (90 > pLDDT > 70)
  Low (70 > pLDDT > 50)
  Very Low (pLDDT < 50)
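
The confidence scale above can be applied to per-residue pLDDT values with a small helper. This is a minimal sketch: the function name `plddt_band` is illustrative, and because the scale uses strict inequalities, the handling of the exact boundary values 90, 70 and 50 (placed in the lower band here) is an assumption.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named in the scale."""
    if not 0.0 <= plddt <= 100.0:
        raise ValueError(f"pLDDT must be between 0 and 100, got {plddt}")
    if plddt > 90:
        return "Very High"
    if plddt > 70:
        return "Confident"   # assumption: exactly 90 falls in this band
    if plddt > 50:
        return "Low"         # assumption: exactly 70 falls in this band
    return "Very Low"        # assumption: exactly 50 falls in this band


# Hypothetical scores for illustration only, not values from this model.
for score in (95.2, 81.0, 63.5, 32.1):
    print(score, plddt_band(score))
```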
Highly accurate protein structure prediction with AlphaFold. John Jumper, Richard Evans, Alexander Pritzel, Tim Green, Michael Figurnov, Olaf Ronneberger, Kathryn Tunyasuvunakool, Russ Bates, Augustin Žídek, Anna Potapenko, Alex Bridgland, Clemens Meyer, Simon A. A. Kohl, Andrew J. Ballard, Andrew Cowie, Bernardino Romera-Paredes, Stanislav Nikolov, Rishub Jain, Jonas Adler, Trevor Back, Stig Petersen, David Reiman, Ellen Clancy, Michal Zielinski, Martin Steinegger, Michalina Pacholska, Tamas Berghammer, Sebastian Bodenstein, David Silver, Oriol Vinyals, Andrew W. Senior, Koray Kavukcuoglu, Pushmeet Kohli & Demis Hassabis. Nature, 2021-07-15. DOI: 10.1038/s41586-021-03819-2