This family contains a diverse range of mostly DNA-binding domains that contain a helix-turn-helix motif.
This clan contains 254 families and the total number of domains in the clan is 685611. The clan was built by A Bateman.
- Brennan RG, Matthews BW; , J Biol Chem 1989;264:1903-1906.: The helix-turn-helix DNA binding motif. PUBMED:2644244 EPMC:2644244
- Brennan RG; , Cell 1993;74:773-776.: The winged-helix DNA-binding motif: another helix-turn-helix takeoff. PUBMED:8374950 EPMC:8374950
- Wintjens R, Rooman M; , J Mol Biol 1996;262:294-313.: Structural classification of HTH DNA-binding domains and protein-DNA interaction modes. PUBMED:8831795 EPMC:8831795
This clan contains the following 254 member families:AbiEi_3_N AbiEi_4 ANAPC2 AphA_like Arg_repressor ARID B-block_TFIIIC Bac_DnaA_C BetR Bot1p BrkDBD Cdc6_C CENP-B_N Cro Crp CSN8_PSD8_EIF3K Cullin_Nedd8 CUT DDRGK DEP Dimerisation Dimerisation2 DsrD DUF1133 DUF1153 DUF1323 DUF134 DUF1441 DUF1492 DUF1495 DUF1670 DUF1804 DUF1836 DUF1870 DUF2089 DUF2250 DUF2316 DUF2582 DUF3116 DUF3253 DUF3853 DUF3860 DUF3908 DUF433 DUF4364 DUF4447 DUF480 DUF739 DUF742 DUF977 E2F_TDP EAP30 ELL ESCRT-II Ets Exc F-112 FaeA Fe_dep_repr_C Fe_dep_repress FeoC FokI_C FokI_N Forkhead Ftsk_gamma FUR GcrA GerE GntR HARE-HTH HemN_C HNF-1_N Homeobox Homeobox_KN Homez HPD HrcA_DNA-bdg HSF_DNA-bind HTH_1 HTH_10 HTH_11 HTH_12 HTH_13 HTH_15 HTH_16 HTH_17 HTH_18 HTH_19 HTH_20 HTH_21 HTH_22 HTH_23 HTH_24 HTH_25 HTH_26 HTH_27 HTH_28 HTH_29 HTH_3 HTH_30 HTH_31 HTH_32 HTH_33 HTH_34 HTH_35 HTH_36 HTH_37 HTH_38 HTH_39 HTH_40 HTH_41 HTH_42 HTH_43 HTH_45 HTH_46 HTH_47 HTH_5 HTH_6 HTH_7 HTH_8 HTH_9 HTH_AraC HTH_AsnC-type HTH_CodY HTH_Crp_2 HTH_DeoR HTH_IclR HTH_Mga HTH_micro HTH_OrfB_IS605 HTH_psq HTH_Tnp_1 HTH_Tnp_1_2 HTH_Tnp_4 HTH_Tnp_IS1 HTH_Tnp_IS630 HTH_Tnp_ISL3 HTH_Tnp_Mu_1 HTH_Tnp_Mu_2 HTH_Tnp_Tc3_1 HTH_Tnp_Tc3_2 HTH_Tnp_Tc5 HTH_WhiA HxlR IBD IF2_N IRF KicB KORA KorB La LacI LexA_DNA_bind Linker_histone LZ_Tnp_IS481 MADF_DNA_bdg MarR MarR_2 MerR MerR-DNA-bind MerR_1 MerR_2 Mga Mnd1 Mor MotA_activ MqsA_antitoxin MRP-L20 Myb_DNA-bind_2 Myb_DNA-bind_3 Myb_DNA-bind_4 Myb_DNA-bind_5 Myb_DNA-bind_6 Myb_DNA-bind_7 Myb_DNA-binding Neugrin NUMOD1 OST-HTH P22_Cro PaaX PadR PAX PCI Penicillinase_R Phage_AlpA Phage_antitermQ Phage_CI_repr Phage_CII Phage_rep_org_N Phage_terminase Pou Pox_D5 PuR_N Put_DNA-bind_N Rap1-DNA-bind Rep_3 RepA_C RepA_N RepC RepL Replic_Relax RFX_DNA_binding Ribosomal_S19e Ribosomal_S25 Rio2_N RNA_pol_Rpc34 RP-C RPA RPA_C RQC Rrf2 RTP RuvB_C SAC3_GANP SANT_DAMP1_like SatD SelB-wing_1 SelB-wing_2 SelB-wing_3 SgrR_N Sigma54_CBD Sigma54_DBD Sigma70_ECF Sigma70_ner Sigma70_r2 Sigma70_r3 Sigma70_r4 Sigma70_r4_2 SLIDE SMC_ScpB SpoIIID STN1_2 Sulfolobus_pRN SWIRM TBPIP Terminase_5 TetR_N TFIIE_alpha TFIIE_beta TFIIF_alpha TFIIF_beta Tn7_Tnp_TnsA_C Tn916-Xis TraI_2_C Trans_reg_C TrfA TrmB Trp_repressor UPF0122 Vir_act_alpha_C YdaS_antitoxin YjcQ YokU z-alpha
External database links
|SCOP:||46785 46894 88659 46689 48295|
Below is a listing of the unique domain organisations or architectures from this clan. More...
The graphic that is shown by default represents the longest sequence with a given architecture. Each row contains the following information:
- the number of sequences which exhibit this architecture
a textual description of the architecture, e.g. Gla, EGF x 2, Trypsin.
This example describes an architecture with one
Gladomain, followed by two consecutive
EGFdomains, and finally a single
- a link to the page in the Pfam site showing information about the sequence that the graphic describes
- the UniProt description of the protein sequence
- the number of residues in the sequence
- the Pfam graphic itself.
Note that you can see the family page for a particular domain by
clicking on the graphic. You can also choose to see all sequences which
have a given architecture by clicking on the
in each row.
Finally, because some families can be found in a very large number of architectures, we load only the first fifty architectures by default. If you want to see more architectures, click the button at the bottom of the page to load the next set.
Loading domain graphics...
The table below shows the number of occurrences of each domain throughout the sequence database. More...
In brackets beside each number is the percentage of the total number of sequence hits for the clan that are represented by this domain. The rightmost column provides a link to the alignments tab for each domain. Finally, the last row in the table provides a link to the HTML representation of the alignment for the seed alignments for all members of this clan.
Please note: the clan alignment can be extremely large and the resulting HTML file is often too large to be rendered by web-browsers. Please consider downloading the alignment (by right-clicking the link) rather than viewing it in your browser.
Please note: Clan alignments can be very large and can cause problems for some browsers. Read the note above before viewing.
This diagram shows the relationships between members of this clan. More...
Relationships between families in a clan are determined using HHsearch. Families are deemed to be closely related if their E-value is less than 10-3 and these relationships are shown with a solid line. Less closely related family pairs, with an E-value of between 10-3 and 10-1, are shown with a dashed line.
The E-value for each pair of closely or partially related families is shown next to the line linking the families. You can see the information regarding a Pfam family by clicking on the family box.
This tree shows the occurrence of the domains in this clan across different species. More...
For all of the domain matches in a full alignment we count the number of domains that are found on all sequences in the alignment. This total is shown in the purple box.
We also count the number of unique sequences on which each domain is found, which is shown in green. Note that a domain may appear multiple times on the same sequence, leading to the difference between these two numbers.
Finally, we group sequences from the same organism according to the NCBI code that is assigned by UniProt, allowing us to count the number of distinct sequences on which the domain is found. This value is shown in the pink boxes.
We use the NCBI species tree to group organisms according to their taxonomy and this forms the structure of the displayed tree. Note that in some cases the trees are too large (have too many nodes) to allow us to build an interactive tree, but in most cases you can still view the tree in a plain text, non-interactive representation.
You can use the tree controls to manipulate how the interactive tree is displayed:
- show/hide the summary boxes
- expand/collapse the tree or expand it to a given depth
- select a sub-tree or a set of species within the tree and view them graphically or as an alignment
- save a plain text representation of the tree
There are 312 interactions for this clan. More...
We determine these interactions using iPfam, which considers the interactions between residues in three-dimensional protein structures and maps those interactions back to Pfam families. You can find more information about the iPfam algorithm in journal article that accompanies the website.
For those sequences which have a structure in the Protein DataBank, we use the mapping between UniProt, PDB and Pfam coordinate systems from the MSD group, to allow us to map Pfam domains onto UniProt three-dimensional structures. The table below shows the mapping between the Pfam families in this clan, the corresponding UniProt entries, and the region of the three-dimensional structures that are available for that sequence.
Loading structure mapping...