This family contains a diverse range of mostly DNA-binding domains that contain a helix-turn-helix motif.
This clan contains 353 families and the total number of domains in the clan is 2638755. The clan was built by A Bateman.
- Brennan RG, Matthews BW; , J Biol Chem 1989;264:1903-1906.: The helix-turn-helix DNA binding motif. PUBMED:2644244 EPMC:2644244
- Brennan RG; , Cell 1993;74:773-776.: The winged-helix DNA-binding motif: another helix-turn-helix takeoff. PUBMED:8374950 EPMC:8374950
- Wintjens R, Rooman M; , J Mol Biol 1996;262:294-313.: Structural classification of HTH DNA-binding domains and protein-DNA interaction modes. PUBMED:8831795 EPMC:8831795
This clan contains the following 353 member families:AbiEi_3_N AbiEi_4 ANAPC2 AphA_like AraR_C Arg_repressor ARID ArsR B-block_TFIIIC B5 Bac_DnaA_C Baculo_PEP_N BetR BHD_3 BLACT_WH Bot1p BrkDBD BsuBI_PstI_RE_N C_LFY_FLO CaiF_GrlA CarD_CdnL_TRCF CDC27 Cdc6_C Cdh1_DBD_1 CDT1 CDT1_C CENP-B_N Costars CPSase_L_D3 Cro Crp CSN4_RPN5_eIF3a CSN8_PSD8_EIF3K CtsR Cullin_Nedd8 CUT CUTL CvfB_WH DBD_HTH DDRGK DEP Dimerisation Dimerisation2 DNA_meth_N DpnI_C DprA_WH DsrC DsrD DUF1016_N DUF1133 DUF1153 DUF1323 DUF134 DUF1441 DUF1492 DUF1495 DUF1670 DUF1804 DUF1819 DUF1836 DUF1870 DUF2089 DUF2250 DUF2316 DUF2513 DUF2582 DUF3116 DUF3253 DUF3853 DUF3860 DUF3908 DUF433 DUF4364 DUF4423 DUF4447 DUF480 DUF4817 DUF5635 DUF573 DUF6088 DUF6262 DUF6362 DUF6432 DUF6462 DUF6471 DUF722 DUF739 DUF742 DUF977 E2F_TDP EAP30 eIF-5_eIF-2B ELL ESCRT-II Ets EutK_C Exc F-112 FaeA Fe_dep_repr_C Fe_dep_repress FeoC FokI_C FokI_N Forkhead FtsK_gamma FUR GcrA GerE GntR GP3_package HARE-HTH HemN_C HNF-1_N Homeobox_KN Homeodomain Homez HPD HrcA_DNA-bdg HSF_DNA-bind HTH_1 HTH_10 HTH_11 HTH_12 HTH_13 HTH_15 HTH_16 HTH_17 HTH_18 HTH_19 HTH_20 HTH_21 HTH_22 HTH_23 HTH_24 HTH_25 HTH_26 HTH_27 HTH_28 HTH_29 HTH_3 HTH_30 HTH_31 HTH_32 HTH_33 HTH_34 HTH_35 HTH_36 HTH_37 HTH_38 HTH_39 HTH_40 HTH_41 HTH_42 HTH_43 HTH_45 HTH_46 HTH_47 HTH_48 HTH_49 HTH_5 HTH_50 HTH_51 HTH_52 HTH_53 HTH_54 HTH_55 HTH_56 HTH_57 HTH_58 HTH_59 HTH_6 HTH_7 HTH_8 HTH_9 HTH_ABP1_N HTH_AraC HTH_AsnC-type HTH_CodY HTH_Crp_2 HTH_DeoR HTH_IclR HTH_Mga HTH_micro HTH_OrfB_IS605 HTH_PafC HTH_ParB HTH_psq HTH_SUN2 HTH_Tnp_1 HTH_Tnp_1_2 HTH_Tnp_4 HTH_Tnp_IS1 HTH_Tnp_IS630 HTH_Tnp_ISL3 HTH_Tnp_Mu_1 HTH_Tnp_Mu_2 HTH_Tnp_Tc3_1 HTH_Tnp_Tc3_2 HTH_Tnp_Tc5 HTH_WhiA HxlR IBD IF2_N IRF KicB KilA-N Kin17_mid KORA KorB La LacI LexA_DNA_bind Linker_histone LZ_Tnp_IS481 MADF_DNA_bdg MAGE MARF1_LOTUS MarR MarR_2 MerR MerR-DNA-bind MerR_1 MerR_2 Mga Mnd1 MogR_DNAbind Mor MotA_activ MqsA_antitoxin MRP-L20 MukE Myb_DNA-bind_2 Myb_DNA-bind_3 Myb_DNA-bind_4 Myb_DNA-bind_5 Myb_DNA-bind_6 Myb_DNA-bind_7 Myb_DNA-binding Neugrin NFRKB_winged NOD2_WH NUMOD1 ORC_WH_C OST-HTH P22_Cro PaaX PadR PapB PAX PCI Penicillinase_R Phage_AlpA Phage_antitermQ Phage_CI_repr Phage_CII Phage_NinH Phage_Nu1 Phage_rep_O Phage_rep_org_N Phage_terminase PheRS_DBD1 PheRS_DBD2 PheRS_DBD3 Pou Pox_D5 PqqD PRC2_HTH_1 PUFD PuR_N Put_DNA-bind_N pXO2-72 Raf1_HTH Rap1-DNA-bind Rep_3 RepA_C RepA_N RepC RepL Replic_Relax RFX_DNA_binding Ribosomal_S18 Ribosomal_S19e Ribosomal_S25 Rio2_N RNA_pol_Rpc34 RNA_pol_Rpc82 RNase_H2-Ydr279 ROQ_II RP-C RPA RPA_C RQC Rrf2 RTP RuvB_C S10_plectin SAC3_GANP SANT_DAMP1_like SatD SelB-wing_1 SelB-wing_2 SelB-wing_3 SgrR_N Sigma54_CBD Sigma54_DBD Sigma70_ECF Sigma70_ner Sigma70_r2 Sigma70_r3 Sigma70_r4 Sigma70_r4_2 SinI Ski_Sno SLIDE Slx4 SMC_Nse1 SMC_ScpB SoPB_HTH SpoIIID SRP19 SRP_SPB STN1_2 Sulfolobus_pRN Suv3_N Swi6_N SWIRM Tau95 TBPIP TEA Terminase_5 TetR_N TFA2_Winged_2 TFIIE_alpha TFIIE_beta TFIIF_alpha TFIIF_beta Tn7_Tnp_TnsA_C Tn916-Xis TraI_2_C Trans_reg_C TrfA TrmB tRNA_bind_2 tRNA_bind_3 Trp_repressor UPF0122 UPF0175 Vir_act_alpha_C YdaS_antitoxin YjcQ YokU z-alpha
External database links
|SCOP:||46785 46894 88659 46689 48295|
Below is a listing of the unique domain organisations or architectures from this clan. More...
The graphic that is shown by default represents the longest sequence with a given architecture. Each row contains the following information:
- the number of sequences which exhibit this architecture
a textual description of the architecture, e.g. Gla, EGF x 2, Trypsin.
This example describes an architecture with one
Gladomain, followed by two consecutive
EGFdomains, and finally a single
- a link to the page in the Pfam site showing information about the sequence that the graphic describes
- the UniProt description of the protein sequence
- the number of residues in the sequence
- the Pfam graphic itself.
Note that you can see the family page for a particular domain by
clicking on the graphic. You can also choose to see all sequences which
have a given architecture by clicking on the
in each row.
Finally, because some families can be found in a very large number of architectures, we load only the first fifty architectures by default. If you want to see more architectures, click the button at the bottom of the page to load the next set.
Loading domain graphics...
The table below shows the number of occurrences of each domain throughout the sequence database. More...
In brackets beside each number is the percentage of the total number of sequence hits for the clan that are represented by this domain. The rightmost column provides a link to the alignments tab for each domain. Finally, the last row in the table provides a link to the HTML representation of the alignment for the seed alignments for all members of this clan.
Please note: the clan alignment can be extremely large and the resulting HTML file is often too large to be rendered by web-browsers. Please consider downloading the alignment (by right-clicking the link) rather than viewing it in your browser.
Please note: Clan alignments can be very large and can cause problems for some browsers. Read the note above before viewing.
This diagram shows the relationships between members of this clan. More...
Relationships between families in a clan are determined using HHsearch. Families are deemed to be closely related if their E-value is less than 10-3 and these relationships are shown with a solid line. Less closely related family pairs, with an E-value of between 10-3 and 10-1, are shown with a dashed line.
The E-value for each pair of closely or partially related families is shown next to the line linking the families. You can see the information regarding a Pfam family by clicking on the family box.
This tree shows the occurrence of the domains in this clan across different species. More...
For all of the domain matches in a full alignment we count the number of domains that are found on all sequences in the alignment. This total is shown in the purple box.
We also count the number of unique sequences on which each domain is found, which is shown in green. Note that a domain may appear multiple times on the same sequence, leading to the difference between these two numbers.
Finally, we group sequences from the same organism according to the NCBI code that is assigned by UniProt, allowing us to count the number of distinct sequences on which the domain is found. This value is shown in the pink boxes.
We use the NCBI species tree to group organisms according to their taxonomy and this forms the structure of the displayed tree. Note that in some cases the trees are too large (have too many nodes) to allow us to build an interactive tree, but in most cases you can still view the tree in a plain text, non-interactive representation.
You can use the tree controls to manipulate how the interactive tree is displayed:
- show/hide the summary boxes
- expand/collapse the tree or expand it to a given depth
- select a sub-tree or a set of species within the tree and view them graphically or as an alignment
- save a plain text representation of the tree
For those sequences which have a structure in the Protein DataBank, we use the mapping between UniProt, PDB and Pfam coordinate systems from the MSD group, to allow us to map Pfam domains onto UniProt three-dimensional structures. The table below shows the mapping between the Pfam families in this clan, the corresponding UniProt entries, and the region of the three-dimensional structures that are available for that sequence.
Loading structure mapping...