Protein: G5ED21_CAEEL (G5ED21)

Summary

This is the summary of UniProt entry G5ED21_CAEEL (G5ED21).

Description: DNA-directed RNA polymerase subunit {ECO:0000256|RuleBase:RU004279}
Source organism: Caenorhabditis elegans (NCBI taxonomy ID 6239)
Length: 1737 amino acids
Reference Proteome: ✓

Please note: at the start of each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview shown here. UniProt entries that are removed after a Pfam release remain in Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End
Pfam RNA_pol_Rpb1_1 13 471
low_complexity n/a 241 257
low_complexity n/a 263 274
disorder n/a 264 281
low_complexity n/a 267 287
Pfam RNA_pol_Rpb1_2 473 653
disorder n/a 518 525
disorder n/a 527 595
disorder n/a 598 602
Pfam RNA_pol_Rpb1_3 656 836
Pfam RNA_pol_Rpb1_4 893 987
Pfam RNA_pol_Rpb1_5 994 1665
low_complexity n/a 1104 1118
low_complexity n/a 1111 1127
disorder n/a 1239 1240
disorder n/a 1356 1357
disorder n/a 1382 1498
low_complexity n/a 1424 1439
low_complexity n/a 1446 1457
low_complexity n/a 1452 1466
low_complexity n/a 1463 1474
disorder n/a 1558 1561
disorder n/a 1656 1657
disorder n/a 1660 1661
disorder n/a 1691 1696
disorder n/a 1710 1737
coiled_coil n/a 1716 1736
low_complexity n/a 1719 1733
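
The rows above can be read programmatically. The following is a minimal sketch, assuming whitespace-separated columns and 1-based inclusive Start/End coordinates (an assumption, as the page does not state the convention). The names `Region`, `parse_regions` and `covered_residues` are hypothetical helpers, not part of any Pfam tool, and only a subset of the rows is reproduced.

```python
from typing import NamedTuple


class Region(NamedTuple):
    source: str
    name: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive


def parse_regions(text: str) -> list[Region]:
    """Parse 'Source Domain Start End' rows, skipping the header and blank lines."""
    regions = []
    for line in text.strip().splitlines():
        fields = line.split()
        if len(fields) != 4 or not fields[2].isdigit():
            continue  # header row or malformed line
        source, name, start, end = fields
        regions.append(Region(source, name, int(start), int(end)))
    return regions


def covered_residues(regions: list[Region], source: str) -> int:
    """Count distinct residue positions covered by regions from one source."""
    covered: set[int] = set()
    for region in regions:
        if region.source == source:
            covered.update(range(region.start, region.end + 1))
    return len(covered)


# A subset of the rows listed above, copied verbatim.
ROWS = """
Source Domain Start End
Pfam RNA_pol_Rpb1_1 13 471
Pfam RNA_pol_Rpb1_2 473 653
Pfam RNA_pol_Rpb1_3 656 836
Pfam RNA_pol_Rpb1_4 893 987
Pfam RNA_pol_Rpb1_5 994 1665
disorder n/a 1710 1737
"""

regions = parse_regions(ROWS)
pfam_cov = covered_residues(regions, "Pfam")
print(f"Pfam domains cover {pfam_cov} of 1737 residues")
```

Using a set of positions keeps the count correct even when regions overlap, as several of the low_complexity and disorder rows do.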

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession G5ED21. The sequence is stored in the Pfam database and updated only with each new Pfam release, so the copy stored here may differ from the one currently held by UniProt.

Sequence:
1
MDFFVRNGEE PYMQFSNFKL RSYFPHEIDK LSVLKITQTK TFDEVGHPIA
50
51
GGLYDPILGP DNTFDMCMTC NQYERHCPGH MGHIQLAVPV FNPLLFQFTY
100
101
NLLKGSCVHC HRLTCKGDGV NARMLLAQLR CFELGVEHVA FDLESILRDK
150
151
IANADLFGDD DSKTFNDVDA CIAQLANKPL SELTAFKSMP TKNSVQLKKD
200
201
IITEFLRSHL FKRLQKCPLC KNRNGVLRND GARSILIDFT SGARGGGKSK
250
251
KAIIIIGDNG YNEESSTDED EEKKGGGGAE VTGDVVDLDE GSLEMQMKNV
300
301
RTGECDKLAW RGAEVREHFR MLFKNDGKLL LKLFPMLVDE LNGEDMICPL
350
351
DGLFLERILV PPKKFRPIRM FKGAQYEDPQ TLNLRKVLEA TETISAISLI
400
401
MKGDTSAQLK ELIANRVRGK TINAQMHDAY LQLQLRANAI FDQDLNKGDR
450
451
DSIAGIKQIL EKKQGLFRMH MMGKRVNFAC RSVITPDPYL DIDEIGIPDI
500
501
FAKKLTFTEP VNAFNVNEMK GLLRQGPHQH PGANFFVEPS GKKTMLGDKP
550
551
EEKKRRMQMA KTLNAATTEN LRQTPKVLRH MKNGDMIMMN RQPSLHKPSI
600
601
LGHRARVLTG QRALRMNYAP CKAYNADFDG DEMNGHLVQS HIAQTEVREI
650
651
ANVGSNFLVP KDATPLLGLI QDHVVSGVLL SVRDRFLNKE DFMHLVLASF
700
701
AQYSKRIEIP PPTILYPKRL WTGKQVITTI VKNCIPDGKP LINLDGKAKT
750
751
PLSCWIVPGF DAPQFDMSES HVVFRQGELL VGVLDKAHFG ATQFGLAHCA
800
801
FELYGHRCGV QLLSCFSRLF TTYLQFHGFT LGVADILVVK DADGKRKEAV
850
851
MESRTIGNQV VKTAFGLPDT ATPAEIKRTL AATYCNPRGQ GTDVKMLDFG
900
901
MKQGIAKYND AITKSCVPTG LLRLFPQNAL QLMIQSGAKG SAVNAIQISG
950
951
CLGQIELEGK RMAVTIAGRT LPSFRCFDPS PRAGGYIDQR FLTGMNPQEL
1000
1001
FFHTMAGREG LIDTAVKTSR SGYLQRCIIK HLEGIRVHYD STVRDHDGSV
1050
1051
IQFRYGEDGM DTTKATFLNK KTMPFLEDNL EAVTLASKPE GVTDADFGIK
1100
1101
ETEKRYKKIV KWKKKAEKSG KSGTKKSYFS AFTNFSAEHI GMEKKRILAM
1150
1151
WFELSLEERE QYARGIPKKC PEAVDERFNP TCKLGALPEK MLDEIEGFCT
1200
1201
RRVKKVDDEE PPKEVLKRTL YWKGMRSLAD PGENVGLLAA QSIGEPSTQM
1250
1251
TLNTFHFAGR GEMNVTLGIP RLREILMTAS KSIATPSASI AVIAGTSRDR
1300
1301
IDSIKRELDR VYLKQLLKNF SLEEKITLTQ NQSCRRYHLR IDILAAEKRE
1350
1351
LGARHLKRSQ IMEEIEKRFI LRVAQAIKKK YHEITDYQQM SHRTMRQGNM
1400
1401
AAGIETGGTK NRGLQGPDNG DSSDEEADGG READAAEARL HRRHRDEGAD
1450
1451
YEGEDEERVE VREEEEPMDS DSEDVKKEGL DGEDQTTEPL LVNSSRIQSV
1500
1501
QRLSENISSY TYDVKSNKWC EVVFELPLRN KTKMDVSSIV EKEVELFIVH
1550
1551
QTPGIERCVE TTEQKNGKEM TILQTQGVNL AAFFKHADVL DVNSVYSNDL
1600
1601
NLILENYGVE ACSKAITTEM NNVFAVYGIE VSKRHLSLTA DYMTFTGQIQ
1650
1651
PFNRGAMSSS SSPLQKMTFE TTMAFLREAL LHGEEDNVNS PSARLVMGAL
1700
1701
PRGGTGSFDL LLDTKMQSER EEHEAARARK RNAKQKF              
1737
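
The listing above interleaves position numbers with rows of residues in blocks of ten. A minimal sketch of recovering the plain sequence from that layout is shown here, assuming that every alphabetic run is residue data and everything else (digits, spaces, line breaks) is formatting. `clean_sequence` is a hypothetical helper name, and only the final row is reproduced.

```python
import re


def clean_sequence(block: str) -> str:
    """Keep only letters from a formatted listing and return them in upper case."""
    return "".join(re.findall(r"[A-Za-z]+", block)).upper()


# The last row of the listing above, with its position markers.
LAST_ROW = """
1701
PRGGTGSFDL LLDTKMQSER EEHEAARARK RNAKQKF
1737
"""

tail = clean_sequence(LAST_ROW)
# The row starts at position 1701, so it should end at the stated length.
print(1700 + len(tail))
```

Checking the final position against the stated length of 1737 amino acids is a cheap way to catch a dropped or duplicated block.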
 


Checksums:
CRC64:4A87531868576733
MD5:89907fea7db114edfaa747edb4ca2da0
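
A checksum such as the MD5 above can be recomputed from a local copy of the sequence to confirm that it matches the stored one. This is a hedged sketch using Python's standard `hashlib`. It assumes the digest is taken over the upper-case residue string with all whitespace removed, which the page does not confirm, and `sequence_md5` is a hypothetical helper name. The CRC64 value uses a different algorithm and is not covered here.

```python
import hashlib


def sequence_md5(sequence: str) -> str:
    """Return the hex MD5 of a sequence after removing whitespace and upper-casing.

    The normalization step is an assumption about how the checksum was produced.
    """
    normalized = "".join(sequence.split()).upper()
    return hashlib.md5(normalized.encode("ascii")).hexdigest()


# Formatting differences should not change the digest.
digest_a = sequence_md5("MDFFVRNGEE PYMQFSNFKL")
digest_b = sequence_md5("mdffvrngeepymqfsnfkl")
print(digest_a == digest_b)
```

To compare against the value listed above, pass the full 1737-residue sequence and compare the result with the stored MD5 string.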

AlphaFold Structure Prediction

The protein structure below has been predicted by DeepMind with AlphaFold. For more information, please visit the AlphaFold page for this protein.

Model confidence scale

  Very High (pLDDT > 90)
  Confident (90 > pLDDT > 70)
  Low (70 > pLDDT > 50)
  Very Low (pLDDT < 50)
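
The confidence scale above maps a per-residue pLDDT score to one of four bands. A minimal sketch of that mapping follows. The scale uses strict inequalities and does not say where scores of exactly 90, 70 or 50 belong, so placing them in the lower band is an assumption made here. `plddt_band` is a hypothetical helper name.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band named in the scale above."""
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {plddt}")
    if plddt > 90:
        return "Very High"
    if plddt > 70:
        return "Confident"
    if plddt > 50:
        return "Low"
    # Assumption: boundary values (90, 70, 50) fall into the lower band.
    return "Very Low"


bands = [plddt_band(score) for score in (95.2, 81.0, 63.5, 32.1)]
print(bands)
```
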
Highly accurate protein structure prediction with AlphaFold. John Jumper, Richard Evans, Alexander Pritzel, Tim Green, Michael Figurnov, Olaf Ronneberger, Kathryn Tunyasuvunakool, Russ Bates, Augustin Žídek, Anna Potapenko, Alex Bridgland, Clemens Meyer, Simon A. A. Kohl, Andrew J. Ballard, Andrew Cowie, Bernardino Romera-Paredes, Stanislav Nikolov, Rishub Jain, Jonas Adler, Trevor Back, Stig Petersen, David Reiman, Ellen Clancy, Michal Zielinski, Martin Steinegger, Michalina Pacholska, Tamas Berghammer, Sebastian Bodenstein, David Silver, Oriol Vinyals, Andrew W. Senior, Koray Kavukcuoglu, Pushmeet Kohli & Demis Hassabis. Nature, 2021-07-15. DOI: 10.1038/s41586-021-03819-2