
Protein: A0A0R4IN07_DANRE (A0A0R4IN07)

Summary

This is the summary of UniProt entry A0A0R4IN07_DANRE (A0A0R4IN07).

Description: Collagen, type V, alpha 3b {ECO:0000313|Ensembl:ENSDARP00000136818}
Source organism: Danio rerio (Zebrafish) (Brachydanio rerio) (NCBI taxonomy ID 7955)
Length: 1723 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End
sig_p n/a 1 26
disorder n/a 245 246
disorder n/a 248 1499
low_complexity n/a 264 274
low_complexity n/a 296 313
Pfam Collagen 347 399
low_complexity n/a 357 385
low_complexity n/a 430 460
Pfam Collagen 433 490
low_complexity n/a 477 505
low_complexity n/a 523 541
low_complexity n/a 559 580
low_complexity n/a 586 634
low_complexity n/a 637 658
low_complexity n/a 765 806
low_complexity n/a 861 883
low_complexity n/a 946 973
low_complexity n/a 979 1003
low_complexity n/a 1015 1039
low_complexity n/a 1051 1069
low_complexity n/a 1099 1114
low_complexity n/a 1132 1171
Pfam Collagen 1180 1247
low_complexity n/a 1189 1211
low_complexity n/a 1246 1276
Pfam Collagen 1290 1359
low_complexity n/a 1309 1324
low_complexity n/a 1327 1351
low_complexity n/a 1348 1376
Pfam Collagen 1392 1454
low_complexity n/a 1396 1417
low_complexity n/a 1426 1457
coiled_coil n/a 1472 1492
low_complexity n/a 1478 1496
Pfam COLFI 1492 1721
disorder n/a 1508 1515
disorder n/a 1660 1664
disorder n/a 1670 1672
low_complexity n/a 1684 1700
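
The region table above can be read programmatically. The following is a minimal sketch, assuming the rows are whitespace-separated, the coordinates are 1-based and inclusive, and only a subset of the rows is needed; the names `Region` and `parse_regions` are hypothetical and not part of any Pfam tool.

```python
from typing import List, NamedTuple


class Region(NamedTuple):
    source: str   # e.g. "Pfam", "low_complexity", "disorder"
    name: str     # domain name, or "n/a" for unnamed regions
    start: int    # assumed 1-based, inclusive
    end: int      # assumed 1-based, inclusive

    @property
    def length(self) -> int:
        return self.end - self.start + 1


def parse_regions(text: str) -> List[Region]:
    """Parse 'Source Domain Start End' rows, skipping the header line."""
    regions = []
    for line in text.strip().splitlines():
        parts = line.split()
        if len(parts) != 4 or not (parts[2].isdigit() and parts[3].isdigit()):
            continue  # header or malformed row
        source, name, start, end = parts
        regions.append(Region(source, name, int(start), int(end)))
    return regions


# Excerpt of the table above: the signal peptide, the coiled coil,
# and the six Pfam matches.
TABLE = """\
Source Domain Start End
sig_p n/a 1 26
Pfam Collagen 347 399
Pfam Collagen 433 490
Pfam Collagen 1180 1247
Pfam Collagen 1290 1359
Pfam Collagen 1392 1454
coiled_coil n/a 1472 1492
Pfam COLFI 1492 1721
"""

pfam = [r for r in parse_regions(TABLE) if r.source == "Pfam"]
for r in pfam:
    print(f"{r.name:10s} {r.start:5d}-{r.end:<5d} length {r.length}")
```

Filtering on the `source` column separates the Pfam matches from the predicted features (signal peptide, disorder, low complexity, coiled coil), which overlap the Pfam matches in several places.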

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession A0A0R4IN07. The sequence is stored in the Pfam database and updated only with each new Pfam release, so the copy stored here may differ from the one currently stored by UniProt.

Sequence:
   1 MKMKSYRISG VSSLLMALFL QNTTLAANPI DVLKVLDLSE SMEGVSLEAG 50
  51 LCSSRRGSEE ADLAYKIDKK IQVSVPTSQL FPDSEFPVDF SVLLTVRARR 100
 101 GAQCFLLSVY DSEGVQQLGV ELGRSPVFLY EDQSGRPSPD LYPIFKKINL 150
 151 ADGKWHRVAY SVEGQKVTLY LDCQKVATLD LPRGPEPKVS TAGVTVFGTR 200
 201 LLDEEVFEGE IQQLLISPDP GAAADYCHTH IPDCDSALTY NSLSLDPVEV 250
 251 NRPQRKPVEE EFEDDLYSDL YSDLPNADQN STEYELLEYE DSENGTEYVK 300
 301 EYTEYDEYVE YEYRPAERQD QHFSSQSRGP EKGEKGEPAV LGEGTMVTGP 350
 351 PGLPGVEGPQ GERGPTGPPG RIGDPGDPGP EGRQGLAGAD GIPGPPGNLL 400
 401 MLPFQSGGDA RLGPVISAQE AQAQAILQHT KLSLKGPPGP LGLTGRPGPL 450
 451 GSPGPRGLKG DHGLTGPPGP RGVLGAPGQN GKPGKRGRGG MDGGRGEPGE 500
 501 TGVKGDRGFD GLPGLPGNKG HRGDTGKKGP VGPPGAPGEK GSDGQPGPRG 550
 551 QSGEPGLAGL TGQRGLPGPP GQQGIRGIDG VQGSKGNLGP PGEPGAPGQQ 600
 601 GNPGFQGFPG PQGPVGVPGE KGPQGKKGMK GLPGVDGPPG HPGREGPPGE 650
 651 KGLPGAAGVQ GPVGYPGARG VKGADGLRGL KGSKGEKGED GFPGAKGEMG 700
 701 AKGDNGDAGA QGMRGEDGPE GPKGQSGPLG EPGPAGIAGE KGKLGVPGLP 750
 751 GYPGRQGQKG GDGFPGAVGV PGEKGKKGPP GPAGAAGQRG PNGARGARGA 800
 801 RGPTGKSGDK GTSGHDGPPG STGDRGPQGP QGRVGEMGPK GPNGPAGKDG 850
 851 LPGHPGQRGE PGFQGKTGPP GPTGVVGPQG NTGETGPMGE RGHPGSPGPA 900
 901 GEQGLPGAAG KEGSKGDPGG QGTSGKNGPT GLKGFRGSRG APGAMGPVGL 950
 951 KGGTGPIGPP GPAGPTGERG PPGLAGAIGQ PGRPGTSGGP GPMGEKGEPG 1000
1001 DKGLIGPAGQ DGEQGPVGLP GAAGPPGPPG EDGDKGETGA PGQKGSKGDK 1050
1051 GESGPPGPVG SQGPEGQPGA PGVDGEVGPT GQQGMYGQKG DEGARGFKGS 1100
1101 RGPGGLQGMP GPPGEKGESG NSGLLGPPGQ FGPRGAQGPS GGQGPPGRPG 1150
1151 VRGQPGGVGE KGEDGESGDP GAVGVSGSAG EKGEQGEKGD TGPPGAAGSA 1200
1201 GARGASGEDG AKGNGGPIGL PGDMGAPGEP GVNGIDGTAG SKGDAGDPGK 1250
1251 PGPPGPFGEP GPPGRPGRRG HLGPPGKEGR PGLKGDKGAP GYEGIIGKPG 1300
1301 PVGGQGTSGK PGPQGLPGIP GPAGEQGLNG PPGQSGPPGP MGPAGLAGLK 1350
1351 GDPGKKGEKG HGGLIGLIGP PGEFGEKGDR GLPGNQGPQG AKGDEGPVGP 1400
1401 AGLTGPPGPP GLSGSMGQKG SKGNQGPIGA RGDPGPAGPP GPPGSAAVGM 1450
1451 AAPPALGKRR RHVEVSVDGA ALEEAGSQVE QQMEEVQTEE VQMEEVFASL 1500
1501 ASMRTDVEGL RTPLGSFHSP ARTCRELRLC HPEYPDGVYW IDPNQGCHRD 1550
1551 AFKVFCNFTA DGETCLQPHS SVQTVKMASW SKEKPGTWFS TFKKGSQFSY 1600
1601 VDVDGNPVHV VQLGFLKLLS ATARQSFTYV CQNSAGWLDG STRSYTHALR 1650
1651 FRGSNGDELT QRNTHYIQPT HDGCQWRSGQ ERTVLQLDAP LPDVLPLLDV 1700
1701 SVSDFGSLKQ KFGFSVGQVC FSG 1723

Checksums:
CRC64:0B66871DCB841FFB
MD5:c983186e36448884e95dad21f0e1f5ab
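
To compare a local copy of the sequence against the MD5 checksum above, the position markers and spacing in the formatted sequence have to be stripped first. The sketch below assumes that the MD5 is computed over the plain uppercase residue string with no whitespace or line breaks; that convention is an assumption, not something this page states. The function names `unformat_sequence` and `md5_hex` are hypothetical, and the CRC64 checksum is not covered.

```python
import hashlib
import re


def unformat_sequence(formatted: str) -> str:
    """Drop position markers, spaces and newlines; keep only residue letters."""
    return re.sub(r"[^A-Za-z]", "", formatted).upper()


def md5_hex(sequence: str) -> str:
    """Hex MD5 digest of the sequence, encoded as ASCII."""
    return hashlib.md5(sequence.encode("ascii")).hexdigest()


# First block of the formatted sequence, including its position markers.
first_block = """
1
MKMKSYRISG VSSLLMALFL QNTTLAANPI DVLKVLDLSE SMEGVSLEAG
50
"""

residues = unformat_sequence(first_block)
print(len(residues), residues)

# Passing the complete formatted sequence through the same two functions
# yields a digest that can be compared with the MD5 value listed above.
print(md5_hex(residues))
```

A length check is a useful first step: the stripped full sequence should contain 1723 residues, matching the length given in the summary.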

AlphaFold Structure Prediction

The structure of this protein has been predicted by DeepMind with AlphaFold. For more information, see the AlphaFold page for this protein.

Model confidence scale

  Very High (pLDDT > 90)
  Confident (90 > pLDDT > 70)
  Low (70 > pLDDT > 50)
  Very Low (pLDDT < 50)
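
The confidence scale above maps a per-residue pLDDT score to one of four bands. A minimal sketch of that mapping follows. The scale uses strict inequalities, so scores of exactly 90, 70 or 50 are not assigned to a band; placing them in the lower band is an assumption made here, and the function name `plddt_band` is hypothetical.

```python
def plddt_band(score: float) -> str:
    """Return the confidence band for a pLDDT score in the range 0-100."""
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    if score > 90:
        return "Very High"
    if score > 70:
        return "Confident"
    if score > 50:
        return "Low"
    # Assumption: boundary values fall into the lower band.
    return "Very Low"


for s in (95.2, 80.0, 61.5, 32.0):
    print(s, plddt_band(s))
```
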
Highly accurate protein structure prediction with AlphaFold. John Jumper, Richard Evans, Alexander Pritzel, Tim Green, Michael Figurnov, Olaf Ronneberger, Kathryn Tunyasuvunakool, Russ Bates, Augustin Žídek, Anna Potapenko, Alex Bridgland, Clemens Meyer, Simon A. A. Kohl, Andrew J. Ballard, Andrew Cowie, Bernardino Romera-Paredes, Stanislav Nikolov, Rishub Jain, Jonas Adler, Trevor Back, Stig Petersen, David Reiman, Ellen Clancy, Michal Zielinski, Martin Steinegger, Michalina Pacholska, Tamas Berghammer, Sebastian Bodenstein, David Silver, Oriol Vinyals, Andrew W. Senior, Koray Kavukcuoglu, Pushmeet Kohli & Demis Hassabis. Nature, 2021-07-15. DOI: 10.1038/s41586-021-03819-2