Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
0  structures 1  species 0  interactions 1  sequence 1  architecture

Protein: RPA1_HUMAN (O95602)

Summary

This is the summary of UniProt entry RPA1_HUMAN (O95602).

Description: DNA-directed RNA polymerase I subunit RPA1 EC=2.7.7.6
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
Length: 1720 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Download the data used to generate the domain graphic in JSON format.

Show or hide the data used to generate the graphic in JSON format.

Source Domain Start End
Pfam RNA_pol_Rpb1_1 9 357
disorder n/a 156 157
disorder n/a 241 243
disorder n/a 352 358
Pfam RNA_pol_Rpb1_2 434 614
disorder n/a 494 498
disorder n/a 509 510
disorder n/a 520 526
disorder n/a 533 535
disorder n/a 560 563
Pfam RNA_pol_Rpb1_3 617 802
Pfam RNA_pol_Rpb1_4 837 951
Pfam RNA_pol_Rpb1_5 958 1670
disorder n/a 1104 1112
low_complexity n/a 1187 1198
low_complexity n/a 1272 1283
disorder n/a 1360 1505
low_complexity n/a 1388 1398
low_complexity n/a 1402 1415
coiled_coil n/a 1413 1448
low_complexity n/a 1419 1444
low_complexity n/a 1563 1576
disorder n/a 1575 1576

Show or hide domain scores.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession O95602. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MLISKNMPWR RLQGISFGMY SAEELKKLSV KSITNPRYLD SLGNPSANGL
50
51
YDLALGPADS KEVCSTCVQD FSNCSGHLGH IELPLTVYNP LLFDKLYLLL
100
101
RGSCLNCHML TCPRAVIHLL LCQLRVLEVG ALQAVYELER ILNRFLEENP
150
151
DPSASEIREE LEQYTTEIVQ NNLLGSQGAH VKNVCESKSK LIALFWKAHM
200
201
NAKRCPHCKT GRSVVRKEHN SKLTITFPAM VHRTAGQKDS EPLGIEEAQI
250
251
GKRGYLTPTS AREHLSALWK NEGFFLNYLF SGMDDDGMES RFNPSVFFLD
300
301
FLVVPPSRYR PVSRLGDQMF TNGQTVNLQA VMKDVVLIRK LLALMAQEQK
350
351
LPEEVATPTT DEEKDSLIAI DRSFLSTLPG QSLIDKLYNI WIRLQSHVNI
400
401
VFDSEMDKLM MDKYPGIRQI LEKKEGLFRK HMMGKRVDYA ARSVICPDMY
450
451
INTNEIGIPM VFATKLTYPQ PVTPWNVQEL RQAVINGPNV HPGASMVINE
500
501
DGSRTALSAV DMTQREAVAK QLLTPATGAP KPQGTKIVCR HVKNGDILLL
550
551
NRQPTLHRPS IQAHRARILP EEKVLRLHYA NCKAYNADFD GDEMNAHFPQ
600
601
SELGRAEAYV LACTDQQYLV PKDGQPLAGL IQDHMVSGAS MTTRGCFFTR
650
651
EHYMELVYRG LTDKVGRVKL LSPSILKPFP LWTGKQVVST LLINIIPEDH
700
701
IPLNLSGKAK ITGKAWVKET PRSVPGFNPD SMCESQVIIR EGELLCGVLD
750
751
KAHYGSSAYG LVHCCYEIYG GETSGKVLTC LARLFTAYLQ LYRGFTLGVE
800
801
DILVKPKADV KRQRIIEEST HCGPQAVRAA LNLPEAASYD EVRGKWQDAH
850
851
LGKDQRDFNM IDLKFKEEVN HYSNEINKAC MPFGLHRQFP ENSLQMMVQS
900
901
GAKGSTVNTM QISCLLGQIE LEGRRPPLMA SGKSLPCFEP YEFTPRAGGF
950
951
VTGRFLTGIK PPEFFFHCMA GREGLVDTAV KTSRSGYLQR CIIKHLEGLV
1000
1001
VQYDLTVRDS DGSVVQFLYG EDGLDIPKTQ FLQPKQFPFL ASNYEVIMKS
1050
1051
QHLHEVLSRA DPKKALHHFR AIKKWQSKHP NTLLRRGAFL SYSQKIQEAV
1100
1101
KALKLESENR NGRSPGTQEM LRMWYELDEE SRRKYQKKAA ACPDPSLSVW
1150
1151
RPDIYFASVS ETFETKVDDY SQEWAAQTEK SYEKSELSLD RLRTLLQLKW
1200
1201
QRSLCEPGEA VGLLAAQSIG EPSTQMTLNT FHFAGRGEMN VTLGIPRLRE
1250
1251
ILMVASANIK TPMMSVPVLN TKKALKRVKS LKKQLTRVCL GEVLQKIDVQ
1300
1301
ESFCMEEKQN KFQVYQLRFQ FLPHAYYQQE KCLRPEDILR FMETRFFKLL
1350
1351
MESIKKKNNK ASAFRNVNTR RATQRDLDNA GELGRSRGEQ EGDEEEEGHI
1400
1401
VDAEAEEGDA DASDAKRKEK QEEEVDYESE EEEEREGEEN DDEDMQEERN
1450
1451
PHREGARKTQ EQDEEVGLGT EEDPSLPALL TQPRKPTHSQ EPQGPEAMER
1500
1501
RVQAVREIHP FIDDYQYDTE ESLWCQVTVK LPLMKINFDM SSLVVSLAHG
1550
1551
AVIYATKGIT RCLLNETTNN KNEKELVLNT EGINLPELFK YAEVLDLRRL
1600
1601
YSNDIHAIAN TYGIEAALRV IEKEIKDVFA VYGIAVDPRH LSLVADYMCF
1650
1651
EGVYKPLNRF GIRSNSSPLQ QMTFETSFQF LKQATMLGSH DELRSPSACL
1700
1701
VVGKVVRGGT GLFELKQPLR                                 
1720
 

Show the unformatted sequence.

Checksums:
CRC64:AD4BA543FFB98DC6
MD5:64e450e11bb5d2dac6ca4a278d83a214

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.