Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
0  structures 1  species 0  interactions 1  sequence 1  architecture

Protein: D4A5A6_RAT (D4A5A6)

Summary

This is the summary of UniProt entry D4A5A6_RAT (D4A5A6).

Description: DNA-directed RNA polymerase subunit {ECO:0000256|RuleBase:RU004279}
Source organism: Rattus norvegicus (Rat) (NCBI taxonomy ID 10116)
Length: 1970 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Download the data used to generate the domain graphic in JSON format.

Show or hide the data used to generate the graphic in JSON format.

Source Domain Start End
disorder n/a 1 11
Pfam RNA_pol_Rpb1_1 15 354
disorder n/a 37 59
disorder n/a 129 132
disorder n/a 165 182
disorder n/a 263 269
disorder n/a 323 331
disorder n/a 333 345
Pfam RNA_pol_Rpb1_2 356 521
disorder n/a 431 432
disorder n/a 442 443
Pfam RNA_pol_Rpb1_3 524 692
disorder n/a 606 609
disorder n/a 611 623
coiled_coil n/a 699 719
Pfam RNA_pol_Rpb1_4 716 823
disorder n/a 718 721
disorder n/a 724 733
disorder n/a 736 737
disorder n/a 744 746
disorder n/a 748 749
disorder n/a 756 757
Pfam RNA_pol_Rpb1_5 830 1427
Pfam RNA_pol_Rpb1_6 896 1079
coiled_coil n/a 949 969
low_complexity n/a 951 964
Pfam RNA_pol_Rpb1_7 1164 1299
coiled_coil n/a 1262 1282
disorder n/a 1269 1273
low_complexity n/a 1271 1283
disorder n/a 1497 1498
disorder n/a 1504 1505
disorder n/a 1507 1508
disorder n/a 1512 1519
disorder n/a 1524 1525
disorder n/a 1527 1534
disorder n/a 1539 1553
low_complexity n/a 1541 1560
disorder n/a 1557 1970
low_complexity n/a 1562 1587
low_complexity n/a 1608 1945
Pfam RNA_pol_Rpb1_R 1616 1629
Pfam RNA_pol_Rpb1_R 1630 1643
Pfam RNA_pol_Rpb1_R 1644 1657
Pfam RNA_pol_Rpb1_R 1658 1671
Pfam RNA_pol_Rpb1_R 1672 1685
Pfam RNA_pol_Rpb1_R 1686 1699
Pfam RNA_pol_Rpb1_R 1700 1713
Pfam RNA_pol_Rpb1_R 1714 1727
Pfam RNA_pol_Rpb1_R 1728 1741
Pfam RNA_pol_Rpb1_R 1735 1748
Pfam RNA_pol_Rpb1_R 1742 1755
Pfam RNA_pol_Rpb1_R 1749 1762
Pfam RNA_pol_Rpb1_R 1756 1769
Pfam RNA_pol_Rpb1_R 1763 1776
Pfam RNA_pol_Rpb1_R 1770 1783
Pfam RNA_pol_Rpb1_R 1778 1790
Pfam RNA_pol_Rpb1_R 1784 1797
Pfam RNA_pol_Rpb1_R 1791 1804
Pfam RNA_pol_Rpb1_R 1798 1811
Pfam RNA_pol_Rpb1_R 1805 1818
Pfam RNA_pol_Rpb1_R 1826 1839
Pfam RNA_pol_Rpb1_R 1833 1846
Pfam RNA_pol_Rpb1_R 1840 1853
Pfam RNA_pol_Rpb1_R 1847 1860
Pfam RNA_pol_Rpb1_R 1855 1867
Pfam RNA_pol_Rpb1_R 1861 1874
Pfam RNA_pol_Rpb1_R 1868 1881
Pfam RNA_pol_Rpb1_R 1889 1902
Pfam RNA_pol_Rpb1_R 1896 1909
Pfam RNA_pol_Rpb1_R 1903 1916
Pfam RNA_pol_Rpb1_R 1910 1923
Pfam RNA_pol_Rpb1_R 1917 1930
Pfam RNA_pol_Rpb1_R 1924 1936
Pfam RNA_pol_Rpb1_R 1931 1947
low_complexity n/a 1938 1959
Pfam RNA_pol_Rpb1_R 1941 1954
Pfam RNA_pol_Rpb1_R 1948 1960

Show or hide domain scores.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession D4A5A6. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MHGGGPPSGD SACPLRTIKR VQFGVLSPDE LKRMSVTEGG IKYPETTEGG
50
51
RPKLGGLMDP RQGVIERTGR CQTCAGNMTE CPGHFGHIEL AKPVFHVGFL
100
101
VKTMKVLRCV CFFCSKLLVD SNNPKIKDIL AKSKGQPKKR LTHVYDLCKG
150
151
KNICEGGEEM DNKFGVEQPE GDEDLTKEKG HGGCGRYQPR IRRSGLELYA
200
201
EWKHVNEDSQ EKKILLSPER VHEIFKRISD EECFVLGMEP RYARPEWMIV
250
251
TVLPVPPLSV RPAVVMQGSA RNQDDLTHKL ADIVKINNQL RRNEQNGAAA
300
301
HVIAEDVKLL QFHVATMVDN ELPGLPRAMQ KSGRPLKSLK QRLKGKEGRV
350
351
RGNLMGKRVD FSARTVITPD PNLSIDQVGV PRSIAANMTF AEIVTPFNID
400
401
RLQELVRRGN SQYPGAKYII RDNGDRIDLR FHPKPSDLHL QTGYKVERHM
450
451
CDGDIVIFNR QPTLHKMSMM GHRVRILPWS TFRLNLSVTT PYNADFDGDE
500
501
MNLHLPQSLE TRAEIQELAM VPRMIVTPQS NRPVMGIVQD TLTAVRKFTK
550
551
RDVFLERGEV MNLLMFLSTW DGKVPQPAIL KPRPLWTGKQ IFSLIIPGHI
600
601
NCIRTHSTHP DDEDSGPYKH ISPGDTKVVV ENGELIMGIL CKKSLGTSAG
650
651
SLVHISYLEM GHDVTRLFYS NIQTVINNWL LIEGHTIGIG DSIADSKTYQ
700
701
DIQNTIKKAK QDVIEVIEKA HNNELEPTPG NTLRQTFENQ VNRILNDARD
750
751
KTGSSAQKSL SEYNNFKSMV VSGAKGSKIN ISQVIAVVGQ QNVEGKRIPF
800
801
GFKHRTLPHF IKDDYGPESR GFVENSYLAG LTPTEFFFHA MGGREGLIDT
850
851
AVKTAETGYI QRRLIKSMES VMVKYDATVR NSINQVVQLR YGEDGLAGES
900
901
VEFQNLATLK PSNKAFEKKF RFDYTNERAL RRTLQEDLVK DVLSNAHIQN
950
951
ELEREFERMR EDREVLRVIF PTGDSKVVLP CNLLRMIWNA QKIFHINPRL
1000
1001
PSDLHPIKVV EGVKELSKKL VIVNGDDPLS RQAQENATLL FNIHLRSTLC
1050
1051
SRRMAEEFRL SGEAFDWLLG EIESKFNQAI AHPGEMVGAL AAQSLGEPAT
1100
1101
QMTLNTFHYA GVSAKNVTLG VPRLKELINI SKKPKTPSLT VFLLGQSARD
1150
1151
AERAKDILCR LEHTTLRKVT ANTAIYYDPN PQSTVVAEDQ EWVNVYYEMP
1200
1201
DFDVARISPW LLRVELDRKH MTDRKLTMEQ IAEKINAGFG DDLNCIFNDD
1250
1251
NAEKLVLRIR IMNSDENKMQ EEEEVVDKMD DDVFLRCIES NMLTDMTLQG
1300
1301
IEQISKVYMH LPQTDNKKKI IITEDGEFKA LQEWILETDG VSLMRVLSEK
1350
1351
DVDPVRTTSN DIVEIFTVLG IEAVRKALER ELYHVISFDG SYVNYRHLAL
1400
1401
LCDTMTCRGH LMAITRHGVN RQDTGPLMKC SFEETVDVLM EAAAHGESDP
1450
1451
MKGVSENIML GQLAPAGTGC FDLLLDAEKC KYGMEIPTNI PGLGAAGPTG
1500
1501
MFFGSAPSPM GGISPAMTPW NQGATPAYGA WSPSVGSGMT PGAAGFSPSA
1550
1551
ASDASGFSPG YSPAWSPTPG SPGSPGPSSP YIPSPGGAMS PSYSPTSPAY
1600
1601
EPRSPGGYTP QSPSYSPTSP SYSPTSPSYS PTSPNYSPTS PSYSPTSPSY
1650
1651
SPTSPSYSPT SPSYSPTSPS YSPTSPSYSP TSPSYSPTSP SYSPTSPSYS
1700
1701
PTSPSYSPTS PSYSPTSPSY SPTSPSYSPT SPSYSPTSPS YSPTSPNYSP
1750
1751
TSPNYTPTSP SYSPTSPSYS PTSPNYTPTS PNYSPTSPSY SPTSPSYSPT
1800
1801
SPSYSPSSPR YTPQSPTYTP SSPSYSPSSP SYSPTSPKYT PTSPSYSPSS
1850
1851
PEYTPTSPKY SPTSPKYSPT SPKYSPTSPT YSPTTPKYSP TSPTYSPTSP
1900
1901
VYTPTSPKYS PTSPTYSPTS PKYSPTSPTY SPTSPKGSTY SPTSPGYSPT
1950
1951
SPTYSLTSPA ISPEDSDEEN                                 
1970
 

Show the unformatted sequence.

Checksums:
CRC64:8B1DEFEA0A066208
MD5:c68b4b6a754a83f4a323995e6c643d8f

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.

AlphaFold Structure Prediction

The protein structure below has been predicted by DeepMind with AlphaFold. For more information, please visit the AlphaFold page for this protein.

Model confidence scale

  Very High (pLDDT > 90)
  Confident (90 > pLDDT > 70)
  Low (70 > pLDDT > 50)
  Very Low (pLDDT < 50)
Highly accurate protein structure prediction with AlphaFold. John Jumper, Richard Evans, Alexander Pritzel, Tim Green, Michael Figurnov, Olaf Ronneberger, Kathryn Tunyasuvunakool, Russ Bates, Augustin Žídek, Anna Potapenko, Alex Bridgland, Clemens Meyer, Simon A. A. Kohl, Andrew J. Ballard, Andrew Cowie, Bernardino Romera-Paredes, Stanislav Nikolov, Rishub Jain, Jonas Adler, Trevor Back, Stig Petersen, David Reiman, Ellen Clancy, Michal Zielinski, Martin Steinegger, Michalina Pacholska, Tamas Berghammer, Sebastian Bodenstein, David Silver, Oriol Vinyals, Andrew W. Senior, Koray Kavukcuoglu, Pushmeet Kohli & Demis Hassabis Nature 2021-07-15; DOI: 10.1038/s41586-021-03819-2;