Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
22  structures 1  species 0  interactions 1  sequence 1  architecture

Protein: POLG_DEN3S (Q6YMS4)

Summary

This is the summary of UniProt entry POLG_DEN3S (Q6YMS4).

Description: Genome polyprotein EC=3.4.21.91
Source organism: Dengue virus type 3 (strain Sri Lanka/1266/2000) (DENV-3) (NCBI taxonomy ID )
Length: 3390 amino acids
Reference Proteome: x

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Download the data used to generate the domain graphic in JSON format.

Show or hide the data used to generate the graphic in JSON format.

Source Domain Start End
Pfam Flavi_capsid 5 116
Pfam Flavi_propep 120 204
Pfam Flavi_M 207 280
Pfam Flavi_glycoprot 282 574
Pfam Flavi_glycop_C 576 671
Pfam Flavi_NS1 775 1128
Pfam Flavi_NS2A 1136 1342
Pfam Flavi_NS2B 1347 1473
Pfam Peptidase_S7 1489 1641
Pfam Flavi_DEAD 1657 1804
Pfam Flavi_NS4A 2097 2240
Pfam Flavi_NS4B 2243 2483
Pfam FtsJ 2544 2714
Pfam Flavi_NS5 2740 3383

Show or hide domain scores.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession Q6YMS4. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MNNQRKKTGK PSINMLKRVR NRVSTGSQLA KRFSKGLLNG QGPMKLVMAF
50
51
IAFLRFLAIP PTAGVLARWG TFKKSGAIKV LKGFKKEISN MLSIINQRKK
100
101
TSLCLMMILP AALAFHLTSR DGEPRMIVGK NERGKSLLFK TASGINMCTL
150
151
IAMDLGEMCD DTVTYKCPHI TEVEPEDIDC WCNLTSTWVT YGTCNQAGEH
200
201
RRDKRSVALA PHVGMGLDTR TQTWMSAEGA WRQVEKVETW ALRHPGFTIL
250
251
ALFLAHYIGT SLTQKVVIFI LLMLVTPSMT MRCVGVGNRD FVEGLSGATW
300
301
VDVVLEHGGC VTTMAKNKPT LDIELQKTEA TQLATLRKLC IEGKITNITT
350
351
DSRCPTQGEA VLPEEQDQNY VCKHTYVDRG WGNGCGLFGK GSLVTCAKFQ
400
401
CLEPIEGKVV QYENLKYTVI ITVHTGDQHQ VGNETQGVTA EITPQASTTE
450
451
AILPEYGTLG LECSPRTGLD FNEMILLTMK NKAWMVHRQW FFDLPLPWAS
500
501
GATTETPTWN RKELLVTFKN AHAKKQEVVV LGSQEGAMHT ALTGATEIQN
550
551
SGGTSIFAGH LKCRLKMDKL ELKGMSYAMC TNTFVLKKEV SETQHGTILI
600
601
KVEYKGEDAP CKIPFSTEDG QGKAHNGRLI TANPVVTKKE EPVNIEAEPP
650
651
FGESNIVIGI GDNALKINWY KKGSSIGKMF EATERGARRM AILGDTAWDF
700
701
GSVGGVLNSL GKMVHQIFGS AYTALFSGVS WVMKIGIGVL LTWIGLNSKN
750
751
TSMSFSCIAI GIITLYLGAV VQADMGCVIN WKGKELKCGS GIFVTNEVHT
800
801
WTEQYKFQAD SPKRLATAIA GAWENGVCGI RSTTRMENLL WKQIANELNY
850
851
ILWENNIKLT VVVGDTLGVL EQGKRTLTPQ PMELKYSWKT WGKAKIVTAE
900
901
TQNSSFIIDG PNTPECPSAS RAWNVWEVED YGFGVFTTNI WLKLREVYTQ
950
951
LCDHRLMSAA VKDERAVHAD MGYWIESQKN GSWKLEKASL IEVKTCTWPK
1000
1001
SHTLWTNGVL ESDMIIPKSL AGPISQHNYR PGYHTQTAGP WHLGKLELDF
1050
1051
NYCEGTTVVI TESCGTRGPS LRTTTVSGKL IHEWCCRSCT LPPLRYMGED
1100
1101
GCWYGMEIRP ISEKEENMVK SLVSAGSGKV DNFTMGVLCL AILFEEVLRG
1150
1151
KFGKKHMIAG VFFTFVLLLS GQITWRDMAH TLIMIGSNAS DRMGMGVTYL
1200
1201
ALIATFKIQP FLALGFFLRK LTSRENLLLG VGLAMATTLQ LPEDIEQMAN
1250
1251
GVALGLMALK LITQFETYQL WTALVSLTCS NTIFTLTVAW RTATLILAGV
1300
1301
SLLPVCQSSS MRKTDWLPMT VAAMGVPPLP LFIFSLKDTL KRRSWPLNEG
1350
1351
VMAVGLVSIL ASSLLRNDVP MAGPLVAGGL LIACYVITGT SADLTVEKAP
1400
1401
DVTWEEEAEQ TGVSHNLMIT VDDDGTMRIK DDETENILTV LLKTALLIVS
1450
1451
GIFPYSIPAT LLVWHTWQKQ TQRSGVLWDV PSPPETQKAE LEEGVYRIKQ
1500
1501
QGIFGKTQVG VGVQKEGVFH TMWHVTRGAV LTHNGKRLEP NWASVKKDLI
1550
1551
SYGGGWRLSA QWQKGEEVQV IAVEPGKNPK NFQTTPGTFQ TTTGEIGAIA
1600
1601
LDFKPGTSGS PIINREGKVV GLYGNGVVTK NGGYVSGIAQ TNAEPDGPTP
1650
1651
ELEEEMFKKR NLTIMDLHPG SGKTRKYLPA IVREAIKRRL RTLILAPTRV
1700
1701
VAAEMEEALK GLPIRYQTTA TKSEHTGREI VDLMCHATFT MRLLSPVRVP
1750
1751
NYNLIIMDEA HFTDPASIAA RGYISTRVGM GEAAAIFMTA TPPGTADAFP
1800
1801
QSNAPIQDEE RDIPERSWNS GNEWITDFAG KTVWFVPSIK AGNDIANCLR
1850
1851
KNGKKVIQLS RKTFDTEYQK TKLNDWDFVV TTDISEMGAN FKADRVIDPR
1900
1901
RCLKPVILTD GPERVILAGP MPVTAASAAQ RRGRVGRNPQ KENDQYIFTG
1950
1951
QPLNNDEDHA HWTEAKMLLD NINTPEGIIP ALFEPEREKS AAIDGEYRLK
2000
2001
GESRKTFVEL MRRGDLPVWL AHKVASEGIK YTDRKWCFDG QRNNQILEEN
2050
2051
MDVEIWTKEG EKKKLRPRWL DARTYSDPLA LKEFKDFAAG RKSIALDLVT
2100
2101
EIGRVPSHLA HRTRNALDNL VMLHTSEDGG RAYRHAVEEL PETMETLLLL
2150
2151
GLMILLTGGA MLFLISGKGI GKTSIGLICV IASSGMLWMA EVPLQWIASA
2200
2201
IVLEFFMMVL LIPEPEKQRT PQDNQLAYVV IGILTLAATI AANEMGLLET
2250
2251
TKRDLGMSKE PGVVSPTSYL DVDLHPASAW TLYAVATTVI TPMLRHTIEN
2300
2301
STANVSLAAI ANQAVVLMGL DKGWPISKMD LGVPLLALGC YSQVNPLTLT
2350
2351
AAVLLLITHY AIIGPGLQAK ATREAQKRTA AGIMKNPTVD GIMTIDLDSV
2400
2401
IFDSKFEKQL GQVMLLVLCA VQLLLMRTSW ALCEALTLAT GPITTLWEGS
2450
2451
PGKFWNTTIA VSMANIFRGS YLAGAGLAFS IMKSVGTGKR GTGSQGETLG
2500
2501
EKWKKKLNQL SRKEFDLYKK SGITEVDRTE AKEGLKRGET THHAVSRGSA
2550
2551
KLQWFVERNM VVPEGRVIDL GCGRGGWSYY CAGLKKVTEV RGYTKGGPGH
2600
2601
EEPVPMSTYG WNIVKLMSGK DVFYLPPEKC DTLLCDIGES SPSPTVEESR
2650
2651
TIRVLKMVEP WLKNNQFCIK VLNPYMPTVI EHLERLQRKH GGMLVRNPLS
2700
2701
RNSTHEMYWI SNGTGNIVSS VNMVSRLLLN RFTMTHRRPT IEKDVDLGAG
2750
2751
TRHVNAEPET PNMDVIGERI KRIKEEHNST WHYDDENPYK TWAYHGSYEV
2800
2801
KATGSASSMI NGVVKLLTKP WDVVPMVTQM AMTDTTPFGQ QRVFKEKVDT
2850
2851
RTPRPMPGTR KAMEITAEWL WRTLGRNKRP RLCTREEFTK KVRTNAAMGA
2900
2901
VFTEENQWDS AKAAVEDEEF WKLVDREREL HKLGKCGSCV YNMMGKREKK
2950
2951
LGEFGKAKGS RAIWYMWLGA RYLEFEALGF LNEDHWFSRE NSYSGVEGEG
3000
3001
LHKLGYILRD ISKIPGGAMY ADDTAGWDTR ITEDDLHNEE KIIQQMDPEH
3050
3051
RQLANAIFKL TYQNKVVKVQ RPTPTGTVMD IISRKDQRGS GQLGTYGLNT
3100
3101
FTNMEAQLVR QMEGEGVLTK ADLENPHLLE KKITQWLETK GVERLKRMAI
3150
3151
SGDDCVVKPI DDRFANALLA LNDMGKVRKD IPQWQPSKGW HDWQQVPFCS
3200
3201
HHFHELIMKD GRKLVVPCRP QDELIGRARI SQGAGWSLRE TACLGKAYAQ
3250
3251
MWSLMYFHRR DLRLASNAIC SAVPVHWVPT SRTTWSIHAH HQWMTTEDML
3300
3301
TVWNRVWIEE NPWMEDKTPV TTWENVPYLG KREDQWCGSL IGLTSRATWA
3350
3351
QNIPTAIQQV RSLIGNEEFL DYMPSMKRFR KEEESEGAIW           
3390
 

Show the unformatted sequence.

Checksums:
CRC64:14C0E2C0C9189CCB
MD5:3138f96a6ca1c685f8d3899a8a5e9a80

Structures

For those sequences which have a structure in the Protein DataBank, we use the mapping between UniProt, PDB and Pfam coordinate systems from the PDBe SIFTS project, to allow us to map Pfam domains onto UniProt three-dimensional structures. The table below shows the mapping between Pfam domains, this UniProt entry and a corresponding three dimensional structure.

Pfam family UniProt residues PDB ID PDB chain ID PDB residues View
Flavi_NS5 2740 - 2752 5EIW A 250 - 262 Jmol OpenAstexViewer
C 250 - 262 Jmol OpenAstexViewer
2740 - 3373 5DTO A 250 - 883 Jmol OpenAstexViewer
5JJR A 250 - 883 Jmol OpenAstexViewer
2740 - 3374 5JJS A 250 - 884 Jmol OpenAstexViewer
2758 - 3378 4C11 A 268 - 888 Jmol OpenAstexViewer
2758 - 3379 4C11 B 268 - 889 Jmol OpenAstexViewer
2762 - 3373 3VWS A 272 - 883 Jmol OpenAstexViewer
4HHJ A 272 - 883 Jmol OpenAstexViewer
5F3T A 272 - 883 Jmol OpenAstexViewer
5F3Z A 272 - 883 Jmol OpenAstexViewer
5F41 A 272 - 883 Jmol OpenAstexViewer
5HMZ A 272 - 883 Jmol OpenAstexViewer
5HN0 A 272 - 883 Jmol OpenAstexViewer
5I3P A 272 - 883 Jmol OpenAstexViewer
5I3Q A 272 - 883 Jmol OpenAstexViewer
2762 - 3378 5HMX A 272 - 888 Jmol OpenAstexViewer
2762 - 3379 5HMW A 272 - 889 Jmol OpenAstexViewer
5HMY A 272 - 889 Jmol OpenAstexViewer
2763 - 3373 2J7U A 273 - 883 Jmol OpenAstexViewer
2J7W A 273 - 883 Jmol OpenAstexViewer
5IQ6 A 273 - 883 Jmol OpenAstexViewer
FtsJ 2544 - 2714 5DTO A 54 - 224 Jmol OpenAstexViewer
5EIW A 54 - 224 Jmol OpenAstexViewer
C 54 - 224 Jmol OpenAstexViewer
5JJR A 54 - 224 Jmol OpenAstexViewer
5JJS A 54 - 224 Jmol OpenAstexViewer
Peptidase_S7 1606 - 1615 5WJN C 1 - 10 Jmol OpenAstexViewer
F 1 - 10 Jmol OpenAstexViewer
I 1 - 10 Jmol OpenAstexViewer
5WKH C 1 - 10 Jmol OpenAstexViewer
H 1 - 10 Jmol OpenAstexViewer