Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
1  structure 1  species 0  interactions 1  sequence 1  architecture

Protein: KMT2D_HUMAN (O14686)

Summary

This is the summary of UniProt entry KMT2D_HUMAN (O14686).

Description: Histone-lysine N-methyltransferase 2D EC=2.1.1.43
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
Length: 5537 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Download the data used to generate the domain graphic in JSON format.

Show or hide the data used to generate the graphic in JSON format.

Source Domain Start End
disorder n/a 1 57
low_complexity n/a 94 104
disorder n/a 96 98
disorder n/a 100 117
disorder n/a 121 124
low_complexity n/a 133 147
Pfam zf-HC5HC2H 139 218
Pfam PHD 275 323
disorder n/a 364 1329
low_complexity n/a 373 385
low_complexity n/a 429 464
low_complexity n/a 458 482
low_complexity n/a 487 569
low_complexity n/a 563 597
low_complexity n/a 598 630
low_complexity n/a 626 648
low_complexity n/a 644 684
low_complexity n/a 680 692
low_complexity n/a 689 713
low_complexity n/a 707 735
low_complexity n/a 841 870
low_complexity n/a 874 926
low_complexity n/a 922 953
low_complexity n/a 992 1007
low_complexity n/a 1001 1013
low_complexity n/a 1033 1044
low_complexity n/a 1042 1056
low_complexity n/a 1103 1110
low_complexity n/a 1280 1291
low_complexity n/a 1302 1327
disorder n/a 1341 1361
low_complexity n/a 1350 1357
Pfam PHD 1379 1429
Pfam PHD 1429 1477
low_complexity n/a 1557 1573
disorder n/a 1570 1573
disorder n/a 1602 1606
disorder n/a 1610 1765
low_complexity n/a 1610 1619
low_complexity n/a 1632 1655
low_complexity n/a 1677 1689
low_complexity n/a 1750 1762
disorder n/a 1795 1950
disorder n/a 1955 2010
low_complexity n/a 1973 1992
disorder n/a 2056 2798
coiled_coil n/a 2061 2081
low_complexity n/a 2106 2121
low_complexity n/a 2189 2209
low_complexity n/a 2348 2365
low_complexity n/a 2376 2400
low_complexity n/a 2408 2430
low_complexity n/a 2444 2461
low_complexity n/a 2488 2505
low_complexity n/a 2544 2558
low_complexity n/a 2588 2603
low_complexity n/a 2606 2623
coiled_coil n/a 2672 2703
low_complexity n/a 2675 2696
low_complexity n/a 2706 2721
low_complexity n/a 2766 2777
disorder n/a 2800 3074
low_complexity n/a 2809 2821
low_complexity n/a 2861 2873
low_complexity n/a 2886 2905
low_complexity n/a 2929 2939
coiled_coil n/a 3063 3083
disorder n/a 3076 3241
low_complexity n/a 3084 3101
low_complexity n/a 3197 3209
disorder n/a 3244 3248
coiled_coil n/a 3249 3269
disorder n/a 3251 3404
low_complexity n/a 3251 3288
low_complexity n/a 3300 3319
low_complexity n/a 3366 3378
disorder n/a 3407 3409
disorder n/a 3417 3418
disorder n/a 3455 3464
disorder n/a 3466 3503
disorder n/a 3538 3624
coiled_coil n/a 3564 3612
low_complexity n/a 3598 3611
disorder n/a 3626 3835
low_complexity n/a 3630 3656
low_complexity n/a 3675 3693
low_complexity n/a 3686 3701
low_complexity n/a 3714 3758
coiled_coil n/a 3716 3743
low_complexity n/a 3782 3805
disorder n/a 3838 3897
low_complexity n/a 3855 3866
low_complexity n/a 3871 3885
low_complexity n/a 3896 3973
coiled_coil n/a 3897 3972
disorder n/a 3971 3972
disorder n/a 3976 3985
low_complexity n/a 3987 3993
disorder n/a 3988 4194
low_complexity n/a 4011 4022
low_complexity n/a 4072 4107
low_complexity n/a 4111 4124
low_complexity n/a 4119 4128
disorder n/a 4196 4455
low_complexity n/a 4208 4235
low_complexity n/a 4251 4274
low_complexity n/a 4276 4295
low_complexity n/a 4299 4320
low_complexity n/a 4319 4330
disorder n/a 4473 4475
disorder n/a 4477 4490
disorder n/a 4494 4541
disorder n/a 4543 4550
low_complexity n/a 4547 4563
disorder n/a 4587 4600
disorder n/a 4602 4731
low_complexity n/a 4609 4632
low_complexity n/a 4693 4703
disorder n/a 4733 4734
disorder n/a 4736 4737
disorder n/a 4762 4764
disorder n/a 4769 4774
disorder n/a 4818 4861
low_complexity n/a 4825 4847
disorder n/a 4874 4980
low_complexity n/a 4905 4932
low_complexity n/a 4926 4957
low_complexity n/a 4952 4969
disorder n/a 4998 5020
Pfam zf-HC5HC2H_2 5030 5144
low_complexity n/a 5048 5055
Pfam FYRN 5181 5232
Pfam FYRC 5238 5322
disorder n/a 5350 5351
disorder n/a 5360 5364
disorder n/a 5366 5368
disorder n/a 5375 5376
Pfam SET 5408 5513
low_complexity n/a 5493 5501

Show or hide domain scores.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession O14686. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MDSQKLAGED KDSEPAADGP AASEDPSATE SDLPNPHVGE VSVLSSGSPR
50
51
LQETPQDCSG GPVRRCALCN CGEPSLHGQR ELRRFELPFD WPRCPVVSPG
100
101
GSPGPNEAVL PSEDLSQIGF PEGLTPAHLG EPGGSCWAHH WCAAWSAGVW
150
151
GQEGPELCGV DKAIFSGISQ RCSHCTRLGA SIPCRSPGCP RLYHFPCATA
200
201
SGSFLSMKTL QLLCPEHSEG AAYLEEARCA VCEGPGELCD LFFCTSCGHH
250
251
YHGACLDTAL TARKRAGWQC PECKVCQACR KPGNDSKMLV CETCDKGYHT
300
301
FCLKPPMEEL PAHSWKCKAC RVCRACGAGS AELNPNSEWF ENYSLCHRCH
350
351
KAQGGQTIRS VAEQHTPVCS RFSPPEPGDT PTDEPDALYV ACQGQPKGGH
400
401
VTSMQPKEPG PLQCEAKPLG KAGVQLEPQL EAPLNEEMPL LPPPEESPLS
450
451
PPPEESPTSP PPEASRLSPP PEELPASPLP EALHLSRPLE ESPLSPPPEE
500
501
SPLSPPPESS PFSPLEESPL SPPEESPPSP ALETPLSPPP EASPLSPPFE
550
551
ESPLSPPPEE LPTSPPPEAS RLSPPPEESP MSPPPEESPM SPPPEASRLF
600
601
PPFEESPLSP PPEESPLSPP PEASRLSPPP EDSPMSPPPE ESPMSPPPEV
650
651
SRLSPLPVVS RLSPPPEESP LSPPPEESPT SPPPEASRLS PPPEDSPTSP
700
701
PPEDSPASPP PEDSLMSLPL EESPLLPLPE EPQLCPRSEG PHLSPRPEEP
750
751
HLSPRPEEPH LSPQAEEPHL SPQPEEPCLC AVPEEPHLSP QAEGPHLSPQ
800
801
PEELHLSPQT EEPHLSPVPE EPCLSPQPEE SHLSPQSEEP CLSPRPEESH
850
851
LSPELEKPPL SPRPEKPPEE PGQCPAPEEL PLFPPPGEPS LSPLLGEPAL
900
901
SEPGEPPLSP LPEELPLSPS GEPSLSPQLM PPDPLPPPLS PIITAAAPPA
950
951
LSPLGELEYP FGAKGDSDPE SPLAAPILET PISPPPEANC TDPEPVPPMI
1000
1001
LPPSPGSPVG PASPILMEPL PPQCSPLLQH SLVPQNSPPS QCSPPALPLS
1050
1051
VPSPLSPIGK VVGVSDEAEL HEMETEKVSE PECPALEPSA TSPLPSPMGD
1100
1101
LSCPAPSPAP ALDDFSGLGE DTAPLDGIDA PGSQPEPGQT PGSLASELKG
1150
1151
SPVLLDPEEL APVTPMEVYP ECKQTAGQGS PCEEQEEPRA PVAPTPPTLI
1200
1201
KSDIVNEISN LSQGDASASF PGSEPLLGSP DPEGGGSLSM ELGVSTDVSP
1250
1251
ARDEGSLRLC TDSLPETDDS LLCDAGTAIS GGKAEGEKGR RRSSPARSRI
1300
1301
KQGRSSSFPG RRRPRGGAHG GRGRGRARLK STASSIETLV VADIDSSPSK
1350
1351
EEEEEDDDTM QNTVVLFSNT DKFVLMQDMC VVCGSFGRGA EGHLLACSQC
1400
1401
SQCYHPYCVN SKITKVMLLK GWRCVECIVC EVCGQASDPS RLLLCDDCDI
1450
1451
SYHTYCLDPP LLTVPKGGWK CKWCVSCMQC GAASPGFHCE WQNSYTHCGP
1500
1501
CASLVTCPIC HAPYVEEDLL IQCRHCERWM HAGCESLFTE DDVEQAADEG
1550
1551
FDCVSCQPYV VKPVAPVAPP ELVPMKVKEP EPQYFRFEGV WLTETGMALL
1600
1601
RNLTMSPLHK RRQRRGRLGL PGEAGLEGSE PSDALGPDDK KDGDLDTDEL
1650
1651
LKGEGGVEHM ECEIKLEGPV SPDVEPGKEE TEESKKRKRK PYRPGIGGFM
1700
1701
VRQRKSHTRT KKGPAAQAEV LSGDGQPDEV IPADLPAEGA VEQSLAEGDE
1750
1751
KKKQQRRGRK KSKLEDMFPA YLQEAFFGKE LLDLSRKALF AVGVGRPSFG
1800
1801
LGTPKAKGDG GSERKELPTS QKGDDGPDIA DEESRGLEGK ADTPGPEDGG
1850
1851
VKASPVPSDP EKPGTPGEGM LSSDLDRIST EELPKMESKD LQQLFKDVLG
1900
1901
SEREQHLGCG TPGLEGSRTP LQRPFLQGGL PLGNLPSSSP MDSYPGLCQS
1950
1951
PFLDSRERGG FFSPEPGEPD SPWTGSGGTT PSTPTTPTTE GEGDGLSYNQ
2000
2001
RSLQRWEKDE ELGQLSTISP VLYANINFPN LKQDYPDWSS RCKQIMKLWR
2050
2051
KVPAADKAPY LQKAKDNRAA HRINKVQKQA ESQINKQTKV GDIARKTDRP
2100
2101
ALHLRIPPQP GALGSPPPAA APTIFIGSPT TPAGLSTSAD GFLKPPAGSV
2150
2151
PGPDSPGELF LKLPPQVPAQ VPSQDPFGLA PAYPLEPRFP TAPPTYPPYP
2200
2201
SPTGAPAQPP MLGASSRPGA GQPGEFHTTP PGTPRHQPST PDPFLKPRCP
2250
2251
SLDNLAVPES PGVGGGKASE PLLSPPPFGE SRKALEVKKE ELGASSPSYG
2300
2301
PPNLGFVDSP SSGTHLGGLE LKTPDVFKAP LTPRASQVEP QSPGLGLRPQ
2350
2351
EPPPAQALAP SPPSHPDIFR PGSYTDPYAQ PPLTPRPQPP PPESCCALPP
2400
2401
RSLPSDPFSR VPASPQSQSS SQSPLTPRPL SAEAFCPSPV TPRFQSPDPY
2450
2451
SRPPSRPQSR DPFAPLHKPP RPQPPEVAFK AGSLAHTSLG AGGFPAALPA
2500
2501
GPAGELHAKV PSGQPPNFVR SPGTGAFVGT PSPMRFTFPQ AVGEPSLKPP
2550
2551
VPQPGLPPPH GINSHFGPGP TLGKPQSTNY TVATGNFHPS GSPLGPSSGS
2600
2601
TGESYGLSPL RPPSVLPPPA PDGSLPYLSH GASQRSGITS PVEKREDPGT
2650
2651
GMGSSLATAE LPGTQDPGMS GLSQTELEKQ RQRQRLRELL IRQQIQRNTL
2700
2701
RQEKETAAAA AGAVGPPGSW GAEPSSPAFE QLSRGQTPFA GTQDKSSLVG
2750
2751
LPPSKLSGPI LGPGSFPSDD RLSRPPPPAT PSSMDVNSRQ LVGGSQAFYQ
2800
2801
RAPYPGSLPL QQQQQQLWQQ QQATAATSMR FAMSARFPST PGPELGRQAL
2850
2851
GSPLAGISTR LPGPGEPVPG PAGPAQFIEL RHNVQKGLGP GGTPFPGQGP
2900
2901
PQRPRFYPVS EDPHRLAPEG LRGLAVSGLP PQKPSAPPAP ELNNSLHPTP
2950
2951
HTKGPTLPTG LELVNRPPSS TELGRPNPLA LEAGKLPCED PELDDDFDAH
3000
3001
KALEDDEELA HLGLGVDVAK GDDELGTLEN LETNDPHLDD LLNGDEFDLL
3050
3051
AYTDPELDTG DKKDIFNEHL RLVESANEKA EREALLRGVE PGPLGPEERP
3100
3101
PPAADASEPR LASVLPEVKP KVEEGGRHPS PCQFTIATPK VEPAPAANSL
3150
3151
GLGLKPGQSM MGSRDTRMGT GPFSSSGHTA EKASFGATGG PPAHLLTPSP
3200
3201
LSGPGGSSLL EKFELESGAL TLPGGPAASG DELDKMESSL VASELPLLIE
3250
3251
DLLEHEKKEL QKKQQLSAQL QPAQQQQQQQ QQHSLLSAPG PAQAMSLPHE
3300
3301
GSSPSLAGSQ QQLSLGLAGA RQPGLPQPLM PTQPPAHALQ QRLAPSMAMV
3350
3351
SNQGHMLSGQ HGGQAGLVPQ QSSQPVLSQK PMGTMPPSMC MKPQQLAMQQ
3400
3401
QLANSFFPDT DLDKFAAEDI IDPIAKAKMV ALKGIKKVMA QGSIGVAPGM
3450
3451
NRQQVSLLAQ RLSGGPSSDL QNHVAAGSGQ ERSAGDPSQP RPNPPTFAQG
3500
3501
VINEADQRQY EEWLFHTQQL LQMQLKVLEE QIGVHRKSRK ALCAKQRTAK
3550
3551
KAGREFPEAD AEKLKLVTEQ QSKIQKQLDQ VRKQQKEHTN LMAEYRNKQQ
3600
3601
QQQQQQQQQQ QQHSAVLALS PSQSPRLLTK LPGQLLPGHG LQPPQGPPGG
3650
3651
QAGGLRLTPG GMALPGQPGG PFLNTALAQQ QQQQHSGGAG SLAGPSGGFF
3700
3701
PGNLALRSLG PDSRLLQERQ LQLQQQRMQL AQKLQQQQQQ QQQQQHLLGQ
3750
3751
VAIQQQQQQG PGVQTNQALG PKPQGLMPPS SHQGLLVQQL SPQPPQGPQG
3800
3801
MLGPAQVAVL QQQHPGALGP QGPHRQVLMT QSRVLSSPQL AQQGQGLMGH
3850
3851
RLVTAQQQQQ QQQHQQQGSM AGLSHLQQSL MSHSGQPKLS AQPMGSLQQL
3900
3901
QQQQQLQQQQ QLQQQQQQQL QQQQQLQQQQ LQQQQQQQQL QQQQQQQLQQ
3950
3951
QQQQLQQQQQ QQQQQFQQQQ QQQQMGLLNQ SRTLLSPQQQ QQQQVALGPG
4000
4001
MPAKPLQHFS SPGALGPTLL LTGKEQNTVD PAVSSEATEG PSTHQGGPLA
4050
4051
IGTTPESMAT EPGEVKPSLS GDSQLLLVQP QPQPQPSSLQ LQPPLRLPGQ
4100
4101
QQQQVSLLHT AGGGSHGQLG SGSSSEASSV PHLLAQPSVS LGDQPGSMTQ
4150
4151
NLLGPQQPML ERPMQNNTGP QPPKPGPVLQ SGQGLPGVGI MPTVGQLRAQ
4200
4201
LQGVLAKNPQ LRHLSPQQQQ QLQALLMQRQ LQQSQAVRQT PPYQEPGTQT
4250
4251
SPLQGLLGCQ PQLGGFPGPQ TGPLQELGAG PRPQGPPRLP APPGALSTGP
4300
4301
VLGPVHPTPP PSSPQEPKRP SQLPSPSSQL PTEAQLPPTH PGTPKPQGPT
4350
4351
LEPPPGRVSP AAAQLADTLF SKGLGPWDPP DNLAETQKPE QSSLVPGHLD
4400
4401
QVNGQVVPEA SQLSIKQEPR EEPCALGAQS VKREANGEPI GAPGTSNHLL
4450
4451
LAGPRSEAGH LLLQKLLRAK NVQLSTGRGS EGLRAEINGH IDSKLAGLEQ
4500
4501
KLQGTPSNKE DAAARKPLTP KPKRVQKASD RLVSSRKKLR KEDGVRASEA
4550
4551
LLKQLKQELS LLPLTEPAIT ANFSLFAPFG SGCPVNGQSQ LRGAFGSGAL
4600
4601
PTGPDYYSQL LTKNNLSNPP TPPSSLPPTP PPSVQQKMVN GVTPSEELGE
4650
4651
HPKDAASARD SERALRDTSE VKSLDLLAAL PTPPHNQTED VRMESDEDSD
4700
4701
SPDSIVPASS PESILGEEAP RFPHLGSGRW EQEDRALSPV IPLIPRASIP
4750
4751
VFPDTKPYGA LGLEVPGKLP VTTWEKGKGS EVSVMLTVSA AAAKNLNGVM
4800
4801
VAVAELLSMK IPNSYEVLFP ESPARAGTEP KKGEAEGPGG KEKGLEGKSP
4850
4851
DTGPDWLKQF DAVLPGYTLK SQLDILSLLK QESPAPEPPT QHSYTYNVSN
4900
4901
LDVRQLSAPP PEEPSPPPSP LAPSPASPPT EPLVELPTEP LAEPPVPSPL
4950
4951
PLASSPESAR PKPRARPPEE GEDSRPPRLK KWKGVRWKRL RLLLTIQKGS
5000
5001
GRQEDEREVA EFMEQLGTAL RPDKVPRDMR RCCFCHEEGD GATDGPARLL
5050
5051
NLDLDLWVHL NCALWSTEVY ETQGGALMNV EVALHRGLLT KCSLCQRTGA
5100
5101
TSSCNRMRCP NVYHFACAIR AKCMFFKDKT MLCPMHKIKG PCEQELSSFA
5150
5151
VFRRVYIERD EVKQIASIIQ RGERLHMFRV GGLVFHAIGQ LLPHQMADFH
5200
5201
SATALYPVGY EATRIYWSLR TNNRRCCYRC SIGENNGRPE FVIKVIEQGL
5250
5251
EDLVFTDASP QAVWNRIIEP VAAMRKEADM LRLFPEYLKG EELFGLTVHA
5300
5301
VLRIAESLPG VESCQNYLFR YGRHPLMELP LMINPTGCAR SEPKILTHYK
5350
5351
RPHTLNSTSM SKAYQSTFTG ETNTPYSKQF VHSKSSQYRR LRTEWKNNVY
5400
5401
LARSRIQGLG LYAAKDLEKH TMVIEYIGTI IRNEVANRRE KIYEEQNRGI
5450
5451
YMFRINNEHV IDATLTGGPA RYINHSCAPN CVAEVVTFDK EDKIIIISSR
5500
5501
RIPKGEELTY DYQFDFEDDQ HKIPCHCGAW NCRKWMN              
5537
 

Show the unformatted sequence.

Checksums:
CRC64:31C6DAB0A754F72A
MD5:a354bfe1de21c999e8b503585ab8bee0

Structures

For those sequences which have a structure in the Protein DataBank, we use the mapping between UniProt, PDB and Pfam coordinate systems from the PDBe SIFTS project, to allow us to map Pfam domains onto UniProt three-dimensional structures. The table below shows the mapping between Pfam domains, this UniProt entry and a corresponding three dimensional structure.

Pfam family UniProt residues PDB ID PDB chain ID PDB residues View
SET 5408 - 5513 4Z4P A 5408 - 5513 Jmol OpenAstexViewer

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.