Protein: A4I766_LEIIN (A4I766)

Summary

This is the summary of UniProt entry A4I766_LEIIN (A4I766).

Description: DNA-directed RNA polymerase subunit {ECO:0000256|RuleBase:RU004279}
Source organism: Leishmania infantum (NCBI taxonomy ID 5671)
Length: 1662 amino acids
Reference Proteome: ✓

Please note: at the start of each new Pfam data release, we take a copy of the UniProt sequence database, and this snapshot forms the basis of the overview shown here. A UniProt entry that is removed after a Pfam release therefore remains in Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End
disorder n/a 1 2
Pfam RNA_pol_Rpb1_1 14 324
disorder n/a 235 236
disorder n/a 238 239
Pfam RNA_pol_Rpb1_2 326 489
Pfam RNA_pol_Rpb1_3 492 648
disorder n/a 572 574
Pfam RNA_pol_Rpb1_4 672 779
disorder n/a 779 780
Pfam RNA_pol_Rpb1_5 786 1396
Pfam RNA_pol_Rpb1_6 852 1053
disorder n/a 858 861
disorder n/a 863 865
disorder n/a 874 875
Pfam RNA_pol_Rpb1_7 1137 1265
disorder n/a 1153 1154
disorder n/a 1160 1162
disorder n/a 1169 1187
low_complexity n/a 1175 1188
disorder n/a 1221 1222
disorder n/a 1233 1239
disorder n/a 1465 1476
disorder n/a 1478 1495
disorder n/a 1497 1504
disorder n/a 1542 1543
disorder n/a 1546 1548
disorder n/a 1551 1554
low_complexity n/a 1557 1572
disorder n/a 1577 1578
disorder n/a 1595 1596
disorder n/a 1601 1662
low_complexity n/a 1629 1641
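
The Pfam rows in the table above can be checked programmatically, for example to see which domains overlap and how much of the 1662-residue sequence they cover. The following is a minimal sketch, not part of the Pfam site: the names `Region`, `parse_rows`, `overlapping_pairs` and `covered_residues` are illustrative, the rows are copied by hand from the table, and coordinates are assumed to be 1-based and inclusive.

```python
from itertools import combinations
from typing import List, NamedTuple, Tuple


class Region(NamedTuple):
    source: str
    name: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive


# Pfam rows copied from the table above (disorder and low_complexity rows omitted).
PFAM_ROWS = """\
Pfam RNA_pol_Rpb1_1 14 324
Pfam RNA_pol_Rpb1_2 326 489
Pfam RNA_pol_Rpb1_3 492 648
Pfam RNA_pol_Rpb1_4 672 779
Pfam RNA_pol_Rpb1_5 786 1396
Pfam RNA_pol_Rpb1_6 852 1053
Pfam RNA_pol_Rpb1_7 1137 1265
"""


def parse_rows(text: str) -> List[Region]:
    """Parse whitespace-separated 'Source Domain Start End' rows."""
    regions = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 4:
            continue  # skip blank or malformed lines
        source, name, start, end = fields
        regions.append(Region(source, name, int(start), int(end)))
    return regions


def overlapping_pairs(regions: List[Region]) -> List[Tuple[str, str]]:
    """Return the names of every pair of regions sharing at least one residue."""
    return [
        (a.name, b.name)
        for a, b in combinations(regions, 2)
        if a.start <= b.end and b.start <= a.end
    ]


def covered_residues(regions: List[Region]) -> int:
    """Count distinct residue positions covered by at least one region."""
    covered = set()
    for region in regions:
        covered.update(range(region.start, region.end + 1))
    return len(covered)


if __name__ == "__main__":
    domains = parse_rows(PFAM_ROWS)
    print(overlapping_pairs(domains))
    print(covered_residues(domains), "of 1662 residues covered")
```

For the seven rows listed, the only overlaps are RNA_pol_Rpb1_6 and RNA_pol_Rpb1_7, both of which lie inside the span of RNA_pol_Rpb1_5, and the Pfam domains together cover 1351 of the 1662 residues.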

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession A4I766. The sequence is stored in the Pfam database and updated only with each new Pfam release, so the sequence we store may differ from the one currently held by UniProt.

Sequence:
   1 MSGGAPLPPS QMPLQKVHEV QFEVFKEAQI KAYAKCIIEH AKSYEHGQPV   50
  51 RGGINDLRMG TTDFEYSCET CGLKHPECPG HFGYVELAEP IFNINVFDVV  100
 101 LIALKCVCKY CGALLMDTND PSEMKKIAHL QGLNRLRMVA KLCGSVCKRS  150
 151 KDIQGCEGKG RQPRIGRFAG IYPGLQIKVT QEEQDYVWHA ENARQVLDRV  200
 201 SDSDALIMGF DPRFCHPRDL ILTVLPIPPP QVRPAVAFGS AKSDDELTHQ  250
 251 IMSIVKRNIQ LRKDKESGVK AAVDRSRALL QEHVATFFNN ASTYYKPAKV  300
 301 GDTKKLKSLT ERLKGKYGRL RGNLMGKRVD FSARTVITGD PNIDVDEVGV  350
 351 PFSVAMTLTF PERVNVINKK RLTEFVQRTT YPSANYIIRP NGNVTKLALV  400
 401 KDRNAVHLDV GDVVERHVID GDVVLFNRQP TLHRMSMMGH RVRVLNYNTF  450
 451 RLNLSCTTPY NADFDGDEMN LHVPQSLLTK AELIEMMMVP KNFVSPNKSA  500
 501 PCMGIVQDSL LGSYRLTDKD TFVDKYFIQS VALWLDLWEL PIPAILKPRP  550
 551 LWTGKQIFSL ILPEVNHPAS PYDKPPFPHN DKKIMIQRGQ LLVGAITKGV  600
 601 VGAAPGSLIH VIFNERGSDE VAKFINGVQR ITTYFNYCFA FSVGVQDTVA  650
 651 DATTLKEMNN VLHKTRQSVE KIGAAANNGK LTRKAGMSLL QSFEADVNSA  700
 701 LNKCREEAAK KALSNVRRTN SFKVMIEAGS KGSDLNICQI AVFVGQQNVA  750
 751 GSRIPFGFRR RTLPHFMLDD YGETSRGMAT RGYVEGLQPH EFYFHTMAGR  800
 801 EGLIDTAVKT SDTGYLQRKL VKALEDVHAS YDGTVRNANQ ELIQLAYGED  850
 851 GLDGARIEGN QAFPIPHMTN SEMADKYRYE YNDEGSFSEN MGGHYMDPFV  900
 901 RDSLLRDPQS VLKLQEEFEQ LMKDRAMSRL VIDMEDKNKL KMNLPVNVAR  950
 951 LIQNARTTMG KRSQVSNLNP ITVINRVREL QEDLAQLFPS YHKDYNGRFA 1000
1001 NVLSQQRVER ALTLFGIHLR QILGSKRVLK EYKLNDKAFE YLLKEIRTKY 1050
1051 QQSLITPGEI IGAIAAQSCG EPATQMTLNT FHNAGISSKN VTLGVPRLLE 1100
1101 LLNVSKNQRN ASVAVCLIRE YQKRNKAQEA QQFIEYCTLA NITTTVQIIY 1150
1151 DPDPRNTVVA EDEEMIRWEQ AVMNEEDEEP DAEQPPSPFI ARLILDNDLF 1200
1201 NDKRLNMKDV KSAIRQVDDT YMVQANMEND GQRIVRLRPR KCTGADSVPA 1250
1251 LTKAVAQLLQ SVHLRGIPGI KKTLLKEGNT FRVDPETGGI KNESSWMVDT 1300
1301 EGTALQRIFV GVVNNEGKNI IDFSKTSSNK IPEVVTVLGI EAARRKLLSE 1350
1351 LREAYLAYGL NINYRHYTIL VDTMCQRGYL MAVSRTGINR SETSGPLMRC 1400
1401 SFEETVKVLM TAAAFGEKDP VRGVSASLVL GNQARIGTGL FDLLLDMSKL 1450
1451 QHVVPLDKAT EARTSNVYHT DASVAPGSST LQGSHGELPP STVHENSSLA 1500
1501 AGASSVYPRH ERAGMYGGIP IEASEVQFSS ALPTVMGTVL AASNTTEYHS 1550
1551 SHHLSGASTY LASSALPSAS ALDTELSSYH LQSVARSTAY GYAPVTASGM 1600
1601 PMPGGAQLSV GEGGSMPYPY EASNGERFGS LGGAASSRAS APYDPTQQPS 1650
1651 QEFSPTEEQE EP                                          1662

Checksums:
CRC64:4943A244C2F248E7
MD5:004ffed81d2758d70d395ea0c65bca16
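
Because the stored sequence may differ from the current UniProt one, a copy of the sequence can be compared against the listed length and MD5 checksum. The sketch below is illustrative rather than the Pfam implementation: the function names are hypothetical, and it assumes the MD5 is computed over the plain upper-case one-letter sequence with position numbers and whitespace removed. The CRC64 value uses a separate algorithm that is not covered here.

```python
import hashlib


def clean_sequence(text: str) -> str:
    """Keep only ASCII letters, dropping position numbers, spaces and newlines."""
    return "".join(ch for ch in text if ch.isascii() and ch.isalpha()).upper()


def sequence_md5(text: str) -> str:
    """MD5 hex digest of the cleaned sequence (assumed checksum input)."""
    return hashlib.md5(clean_sequence(text).encode("ascii")).hexdigest()


def matches_listing(text: str, expected_length: int, expected_md5: str) -> bool:
    """True if the cleaned sequence has the listed length and MD5 checksum."""
    sequence = clean_sequence(text)
    return (
        len(sequence) == expected_length
        and sequence_md5(sequence) == expected_md5.lower()
    )
```

To check this entry, pass the full sequence text together with the values listed on this page, for example `matches_listing(sequence_text, 1662, "004ffed81d2758d70d395ea0c65bca16")`, where `sequence_text` is a placeholder for the pasted sequence.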

AlphaFold Structure Prediction

The structure of this protein has been predicted by DeepMind with AlphaFold. For more information, please visit the AlphaFold page for this protein.

Model confidence scale

  Very High (pLDDT > 90)
  Confident (90 > pLDDT > 70)
  Low (70 > pLDDT > 50)
  Very Low (pLDDT < 50)
Highly accurate protein structure prediction with AlphaFold. John Jumper, Richard Evans, Alexander Pritzel, Tim Green, Michael Figurnov, Olaf Ronneberger, Kathryn Tunyasuvunakool, Russ Bates, Augustin Žídek, Anna Potapenko, Alex Bridgland, Clemens Meyer, Simon A. A. Kohl, Andrew J. Ballard, Andrew Cowie, Bernardino Romera-Paredes, Stanislav Nikolov, Rishub Jain, Jonas Adler, Trevor Back, Stig Petersen, David Reiman, Ellen Clancy, Michal Zielinski, Martin Steinegger, Michalina Pacholska, Tamas Berghammer, Sebastian Bodenstein, David Silver, Oriol Vinyals, Andrew W. Senior, Koray Kavukcuoglu, Pushmeet Kohli & Demis Hassabis. Nature, 2021-07-15. DOI: 10.1038/s41586-021-03819-2
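
The model confidence scale above maps a per-residue pLDDT score to one of four bands. A minimal sketch of that mapping follows; the function name `plddt_band` is illustrative, and because the scale uses strict inequalities, the handling of scores exactly equal to 90, 70 or 50 is an assumption (they are placed in the lower band here).

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band named on this page."""
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {plddt}")
    if plddt > 90:
        return "Very High"
    if plddt > 70:
        return "Confident"
    if plddt > 50:
        return "Low"
    # Assumption: boundary values (exactly 90, 70 or 50) fall into the lower band.
    return "Very Low"
```

For example, `plddt_band(95.2)` falls in the "Very High" band and `plddt_band(42.0)` in the "Very Low" band.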