Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
106  structures 7008  species 0  interactions 19033  sequences 44  architectures

Family: Bac_DNA_binding (PF00216)

Summary: Bacterial DNA-binding protein

Pfam includes annotations and additional family information from a range of different sources. These sources can be accessed via the tabs below.

This is the Wikipedia entry entitled "Bacterial DNA binding protein". More...

Bacterial DNA binding protein Edit Wikipedia article

PDB 1p51 EBI.jpg
anabaena hu-dna cocrystal structure (ahu6)

In molecular biology, bacterial DNA binding proteins are a family of small, usually basic proteins of about 90 residues that bind DNA and are known as histone-like proteins.[1][2] Since bacterial binding proteins have a diversity of functions, it has been difficult to develop a common function for all of them. They are commonly referred to as histone-like and have many similar traits with the eukaryotic histone proteins. Eukaryotic histones package DNA to help it to fit in the nucleus, and they are known to be the most conserved proteins in nature.[3] Examples include the HU protein in Escherichia coli, a dimer of closely related alpha and beta chains and in other bacteria can be a dimer of identical chains. HU-type proteins have been found in a variety of eubacteria (including cyanobacteria) and archaebacteria, and are also encoded in the chloroplast genome of some algae.[4] The integration host factor (IHF), a dimer of closely related chains which is suggested to function in genetic recombination as well as in translational and transcriptional control[5] is found in Enterobacteria and viral proteins including the African swine fever virus protein A104R (or LMW5-AR).[6]


Histone-like proteins are present in many Eubacteria, Cyanobacteria, and Archaebacteria. These proteins participate in all DNA-dependent functions; in these processes, bacterial DNA binding proteins have an architectural role, maintaining structural integrity as transcription, recombination, replication, or any other DNA-dependent process proceeds. Eukaryotic histones were first discovered through experiments in 0.4M NaCl. In these high salt concentrations, the eukaryotic histone protein is eluted from a DNA solution in which single stranded DNA is bound covalently to cellulose. Following elution, the protein readily binds DNA, indicating the protein's high affinity for DNA. Histone-like proteins were unknown to be present in bacteria until similarities between eukaryotic histones and the HU-protein were noted, particularly because of the abundancy, basicity, and small size of both of the proteins.[7] Upon further investigation, it was discovered that the amino acid composition of HU resembles that of eukaryotic histones, thus prompting further research into the exact function of bacterial DNA binding proteins and discoveries of other related proteins in bacteria.

Role in DNA replication

Research suggests that bacterial DNA binding protein has an important role during DNA replication; the protein is involved in stabilizing the lagging strand as well as interacting with DNA polymerase III. The role of single-stranded DNA binding (SSB) protein during DNA replication in Escherichia coli cells has been studied, specifically the interactions between SSB and the χ subunit of DNA polymerase III in environments of varying salt concentrations.[8]

In DNA replication at the lagging strand site, DNA polymerase III removes nucleotides individually from the DNA binding protein. An unstable SSB/DNA system would result in rapid disintegration of the SSB, which stalls DNA replication. Research has shown that the ssDNA is stabilized by the interaction of SSB and the χ subunit of DNA polymerase III in E. coli, thus preparing for replication by maintaining the correct conformation that increases the binding affinity of enzymes to ssDNA. Furthermore, binding of SSB to DNA polymerase III at the replication fork prevents dissociation of SSB, consequently increasing the efficiency of DNA polymerase III to synthesize a new DNA strand.



[9](i) RNA polymerase at the promoter is surrounded by curved DNA. (ii) This curved DNA wraps around the polymerase. (iii) H-NS binds to the curved DNA to lock the RNA polymerase at the promoter and prevents transcription from occurring. (iv) Environmental signals and transcription factors release the DNA bacterial binding protein and allows transcription to proceed.

Initially, bacterial DNA binding proteins were thought to help stabilize bacterial DNA. Currently, many more functions of bacteria DNA binding proteins have been discovered, including the regulation of gene expression by histone-like nucleoid-structuring protein, H-NS.

H-NS is about 15.6 kDa and assists in the regulation of bacterial transcription in bacteria by repressing and activating certain genes. H-NS binds to DNA with an intrinsic curvature. In E. coli, H-NS binds to a P1 promoter decreasing rRNA production during stationary and slow growth periods. RNA polymerase and H-NS DNA binding protein have overlapping binding sites; it is thought that H-NS regulates rRNA production by acting on the transcription initiation site. It has been found that H-NS and RNA polymerase both bind to the P1 promoter and form a complex. When H-NS is bound with RNA Polymerase to the promoter region, there are structural differences in the DNA that are accessible.[10] It has also been found that H-NS can affect translation as well by binding to mRNA and causing its degradation.


HU is a small (10 kDa[11]) bacterial histone-like protein that resembles the eukaryotic Histone H2B. HU acts similarly to a histone by inducing negative supercoiling into circular DNA with the assistance of topoisomerase. The protein has been implicated in DNA replication, recombination, and repair. With an α-helical hydrophobic core and two positively charged β-ribbon arms, HU binds non-specifically to dsDNA with low affinity but binds to altered DNA—such as junctions, nicks, gaps, forks, and overhangs—with high affinity. The arms bind to the minor groove of DNA in low affinity states; in high affinity states, a component of the α-helical core interacts with the DNA as well. However, this protein’s function is not solely confined to DNA; HU also binds to RNA and DNA-RNA hybrids with the same affinity as supercoiled DNA.[12]

Recent research has revealed that HU binds with high specificity to the mRNA of rpoS,[13] a transcript for the stress sigma factor of RNA polymerase, and stimulates translation of the protein. Additional to this RNA function, it was also demonstrated that HU binds DsrA, a small non-coding RNA that regulates transcription through repressing H-NS and stimulates translation through increasing expression of rpoS. These interactions suggest that HU has multiple influences on transcription and translation in bacterial cells.


Integration host factor, IHF, is a nucleoid-associated protein only found in gram negative bacteria.[14] It is a 20 kDa heterodimer, composed of α and β subunits that bind to the sequence 5' - WATCAANNNNTTR - 3' and bends the DNA approximately 160 degrees.[15] The β arms of IHF have Proline residues that help stabilize the DNA kinks. These kinks can help compact DNA and allow for supercoiling. The mode of binding to DNA depends on environmental factors, such as the concentration of ions present. With a high concentration of KCl, there is weak DNA bending. It has been found that sharper DNA bending occurs when the concentration of KCl is less than 100 mM, and IHF is not concentrated.[16]

IHF was discovered as a necessary co-factor for recombination of λ phage in to E.coli. In 2016 it was discovered that IHF also plays a key role in CRISPR type I and type II systems. It has a major role in allowing the Cas1-Cas2 complex to integrate new spacers into the CRISPR sequence. The bending of the DNA by IHF is thought to alter spacing in the DNA major and minor grooves, allowing the Cas1-Cas2 complex to make contact with the DNA bases.[17] This is a key function in the CRISPR system as it ensures that new spacers area always added at the beginning of the CRISPE sequence next to the leader sequence. This directing of integration by IHF ensures that spacers are added chronologically, allowing better protection against the most recent viral infection.[18]


Table 1. Comparison of some DNA Binding Proteins
DNA Binding Protein Size Structure Binding Site Effect
H-NS 15.6 kDa exists in dimers to physically prevent RNA polymerase from binding to promoter binds to bent DNA, binds to P1 promoter in E. coli regulation of gene expression
HU 10 kDa α-helical core and two positively charged β-ribbon arms binds non-specifically to dsDNA, binds to DsrA, a small non-coding RNA that regulates transcription induces negative supercoiling into circular DNA
IHF 20 kDa αβαβ hetrodimer binds to specific sequences of DNAcreates kinks in DNA

Implications and further research

The functions of bacterial DNA-binding proteins are not limited to DNA replication. Researchers have been investigating other pathways these proteins affect. The DNA-binding protein H-NS has been known to play roles in chromosome organization and gene regulation; however, recent studies have also confirmed their role in indirectly regulating flagella functions.[19] Some motility regulatory linkages that H-NS influences include the messenger molecule Cyclic di-GMP, the bio-film regulatory protein CsgD, and the sigma factors, σ(S) and σ(F). Further studies are aiming to characterize the ways this nucleoid-organizing protein affects the motility of the cell through other regulatory pathways.

Other researchers have used bacterial DNA-binding proteins to research Salmonella enterica serovar Typhimurium, in which the T6SS genes are activated from a macrophage infection. When S. Typhimurium infects, their efficiency can be improved through a sense-and-kill mechanism with T6SS H-NS silencing.[20] Assays are created that combine reporter fusions, electrophoretic mobility shift assays, DNase footprinting, and fluorescence microscopy to silence the T6SS gene cluster by the histone-like nucleoid structuring H-NS protein.

See also


  1. ^ Drlica K, Rouviere-Yaniv J (September 1987). "Histonelike proteins of bacteria". Microbiological Reviews. 51 (3): 301–19. PMC 373113. PMID 3118156.
  2. ^ Pettijohn DE (September 1988). "Histone-like proteins and bacterial chromosome structure". The Journal of Biological Chemistry. 263 (26): 12793–6. PMID 3047111.
  3. ^ Griffiths, Anthony; Wessler, Susan; Carroll, Sean; Doebly, John. Introduction to Genetic Analysis (10 ed.). New York: W. H. Freeman and Company. pp. 428–429.
  4. ^ Wang SL, Liu XQ (December 1991). "The plastid genome of Cryptomonas phi encodes an hsp70-like protein, a histone-like protein, and an acyl carrier protein". Proceedings of the National Academy of Sciences of the United States of America. 88 (23): 10783–7. doi:10.1073/pnas.88.23.10783. PMC 53015. PMID 1961745.
  5. ^ Friedman DI (November 1988). "Integration host factor: a protein for all reasons" (PDF). Cell. 55 (4): 545–54. doi:10.1016/0092-8674(88)90213-9. hdl:2027.42/27063. PMID 2972385.
  6. ^ Neilan JG, Lu Z, Kutish GF, Sussman MD, Roberts PC, Yozawa T, Rock DL (March 1993). "An African swine fever virus gene with similarity to bacterial DNA binding proteins, bacterial integration host factors, and the Bacillus phage SPO1 transcription factor, TF1". Nucleic Acids Research. 21 (6): 1496. doi:10.1093/nar/21.6.1496. PMC 309344. PMID 8464748.
  7. ^ Drlica K, Rouviere-Yaniv J (September 1987). "Histonelike proteins of bacteria". Microbiological Reviews. 51 (3): 301–19. PMC 373113. PMID 3118156.
  8. ^ Witte G, Urbanke C, Curth U (August 2003). "DNA polymerase III chi subunit ties single-stranded DNA binding protein to the bacterial replication machinery". Nucleic Acids Research. 31 (15): 4434–40. doi:10.1093/nar/gkg498. PMC 169888. PMID 12888503.
  9. ^ Dorman, Charles J; Deighan, Padraig (2003-04-01). "Regulation of gene expression by histone-like proteins in bacteria". Current Opinion in Genetics & Development. 13 (2): 179–184. doi:10.1016/S0959-437X(03)00025-X.
  10. ^ Schröder O, Wagner R (May 2000). "The bacterial DNA-binding protein H-NS represses ribosomal RNA transcription by trapping RNA polymerase in the initiation complex". Journal of Molecular Biology. 298 (5): 737–48. doi:10.1006/jmbi.2000.3708. PMID 10801345.
  11. ^ Serban D, Arcineigas SF, Vorgias CE, Thomas GJ (April 2003). "Structure and dynamics of the DNA-binding protein HU of B. stearothermophilus investigated by Raman and ultraviolet-resonance Raman spectroscopy". Protein Science. 12 (4): 861–70. doi:10.1110/ps.0234103. PMC 2323852. PMID 12649443.
  12. ^ Balandina A, Kamashev D, Rouviere-Yaniv J (August 2002). "The bacterial histone-like protein HU specifically recognizes similar structures in all nucleic acids. DNA, RNA, and their hybrids". The Journal of Biological Chemistry. 277 (31): 27622–8. doi:10.1074/jbc.M201978200. PMID 12006568.
  13. ^ Balandina A, Claret L, Hengge-Aronis R, Rouviere-Yaniv J (February 2001). "The Escherichia coli histone-like protein HU regulates rpoS translation". Molecular Microbiology. 39 (4): 1069–79. doi:10.1046/j.1365-2958.2001.02305.x. PMID 11251825.
  14. ^ Dillon SC, Dorman CJ (March 2010). "Bacterial nucleoid-associated proteins, nucleoid structure and gene expression". Nature Reviews. Microbiology. 8 (3): 185–95. doi:10.1038/nrmicro2261. PMID 20140026.
  15. ^ Nuñez JK, Bai L, Harrington LB, Hinder TL, Doudna JA (June 2016). "CRISPR Immunological Memory Requires a Host Factor for Specificity". Molecular Cell. 62 (6): 824–833. doi:10.1016/j.molcel.2016.04.027. PMID 27211867.
  16. ^ Lin J, Chen H, Dröge P, Yan J (2012). "Physical organization of DNA by multiple non-specific DNA-binding modes of integration host factor (IHF)". PLOS ONE. 7 (11): e49885. doi:10.1371/journal.pone.0049885. PMC 3498176. PMID 23166787.
  17. ^ Nuñez JK, Bai L, Harrington LB, Hinder TL, Doudna JA (June 2016). "CRISPR Immunological Memory Requires a Host Factor for Specificity". Molecular Cell. 62 (6): 824–833. doi:10.1016/j.molcel.2016.04.027. PMID 27211867.
  18. ^ Sorek R, Lawrence CM, Wiedenheft B (2013). "CRISPR-mediated adaptive immune systems in bacteria and archaea". Annual Review of Biochemistry. 82 (1): 237–66. doi:10.1146/annurev-biochem-072911-172315. PMID 23495939.
  19. ^ Kim EA, Blair DF (October 2015). "Function of the Histone-Like Protein H-NS in Motility of Escherichia coli: Multiple Regulatory Roles Rather than Direct Action at the Flagellar Motor". Journal of Bacteriology. 197 (19): 3110–20. doi:10.1128/JB.00309-15. PMC 4560294. PMID 26195595.
  20. ^ Brunet YR, Khodr A, Logger L, Aussel L, Mignot T, Rimsky S, Cascales E (July 2015). "H-NS Silencing of the Salmonella Pathogenicity Island 6-Encoded Type VI Secretion System Limits Salmonella enterica Serovar Typhimurium Interbacterial Killing". Infection and Immunity. 83 (7): 2738–50. doi:10.1128/IAI.00198-15. PMC 4468533. PMID 25916986.
This article incorporates text from the public domain Pfam and InterPro: IPR000119

This page is based on a Wikipedia article. The text is available under the Creative Commons Attribution/Share-Alike License.

This tab holds the annotation information that is stored in the Pfam database. As we move to using Wikipedia as our main source of annotation, the contents of this tab will be gradually replaced by the Wikipedia tab.

Bacterial DNA-binding protein Provide feedback

No Pfam abstract.

Literature references

  1. Vis H, Mariani M, Vorgias CE, Wilson KS, Kaptein R, Boelens R; , J Mol Biol 1995;254:692-703.: Solution structure of the HU protein from Bacillus stearothermophilus. PUBMED:7500343 EPMC:7500343

Internal database links

External database links

This tab holds annotation information from the InterPro database.

InterPro entry IPR000119

Bacteria synthesise a set of small, usually basic proteins of about 90 residues that bind DNA and are known as histone-like proteins [ PUBMED:3118156 , PUBMED:3047111 ]. Examples include the HU protein in Escherichia coli which is a dimer of closely related alpha and beta chains and in other bacteria can be a dimer of identical chains. HU-type proteins have been found in a variety of eubacteria, cyanobacteria and archaebacteria, and are also encoded in the chloroplast genome of some algae [ PUBMED:1961745 ]. The integration host factor (IHF), a dimer of closely related chains which seem to function in genetic recombination as well as in translational and transcriptional control [ PUBMED:2972385 ] is found in enterobacteria and viral proteins include the African Swine fever virus protein Pret-047 (also known as A104R or LMW5-AR) [ PUBMED:8464748 ].

The exact function of these proteins is not yet clear but they are capable of wrapping DNA and stabilising it from denaturation under extreme environmental conditions. The structure is known for one of these proteins [ PUBMED:6540370 ]. The protein exists as a dimer and two "beta-arms" function as the non-specific binding site for bacterial DNA.

Gene Ontology

The mapping between Pfam and Gene Ontology is provided by InterPro. If you use this data please cite InterPro.

Domain organisation

Below is a listing of the unique domain organisations or architectures in which this domain is found. More...

Loading domain graphics...

Pfam Clan

This family is a member of clan IHF-likeDNA-bdg (CL0548), which has the following description:

This superfamily is characterised by being a dimer of identical subunits of a core of four helices in a bundle, partly opened, capped with a beta-sheet. All members appear to be prokaryotic DNA-binding domains.

The clan contains the following 8 members:

Bac_DNA_binding HU-CCDC81_bac_1 HU-CCDC81_bac_2 HU-CCDC81_euk_1 HU-CCDC81_euk_2 HU-DNA_bdg HU-HIG Tra_M


We store a range of different sequence alignments for families. As well as the seed alignment from which the family is built, we provide the full alignment, generated by searching the sequence database (reference proteomes) using the family HMM. We also generate alignments using four representative proteomes (RP) sets and the UniProtKB sequence database. More...

View options

We make a range of alignments for each Pfam-A family. You can see a description of each above. You can view these alignments in various ways but please note that some types of alignment are never generated while others may not be available for all families, most commonly because the alignments are too large to handle.

Representative proteomes UniProt
Jalview View  View  View  View  View  View  View 
HTML View             
PP/heatmap 1            

1Cannot generate PP/Heatmap alignments for seeds; no PP data available

Key: ✓ available, x not generated, not available.

Format an alignment

Representative proteomes UniProt

Download options

We make all of our alignments available in Stockholm format. You can download them here as raw, plain text files or as gzip-compressed files.

Representative proteomes UniProt
Raw Stockholm Download   Download   Download   Download   Download   Download   Download  
Gzipped Download   Download   Download   Download   Download   Download   Download  

You can also download a FASTA format file containing the full-length sequences for all sequences in the full alignment.

HMM logo

HMM logos is one way of visualising profile HMMs. Logos provide a quick overview of the properties of an HMM in a graphical form. You can see a more detailed description of HMM logos and find out how you can interpret them here. More...


This page displays the phylogenetic tree for this family's seed alignment. We use FastTree to calculate neighbour join trees with a local bootstrap based on 100 resamples (shown next to the tree nodes). FastTree calculates approximately-maximum-likelihood phylogenetic trees from our seed alignment.

Note: You can also download the data file for the tree.

Curation and family details

This section shows the detailed information about the Pfam family. You can see the definitions of many of the terms in this section in the glossary and a fuller explanation of the scoring system that we use in the scores section of the help pages.

Curation View help on the curation process

Seed source: Prosite
Previous IDs: none
Type: Domain
Sequence Ontology: SO:0000417
Author: Finn RD
Number in seed: 102
Number in full: 19033
Average length of the domain: 89.70 aa
Average identity of full alignment: 35 %
Average coverage of the sequence by the domain: 83.25 %

HMM information View help on HMM parameters

HMM build commands:
build method: hmmbuild -o /dev/null HMM SEED
search method: hmmsearch -Z 57096847 -E 1000 --cpu 4 HMM pfamseq
Model details:
Parameter Sequence Domain
Gathering cut-off 22.4 22.4
Trusted cut-off 22.4 22.4
Noise cut-off 22.3 22.3
Model length: 90
Family (HMM) version: 23
Download: download the raw HMM for this family

Species distribution

Sunburst controls


Weight segments by...

Change the size of the sunburst


Colour assignments

Archea Archea Eukaryota Eukaryota
Bacteria Bacteria Other sequences Other sequences
Viruses Viruses Unclassified Unclassified
Viroids Viroids Unclassified sequence Unclassified sequence


Align selected sequences to HMM

Generate a FASTA-format file

Clear selection

This visualisation provides a simple graphical representation of the distribution of this family across species. You can find the original interactive tree in the adjacent tab. More...

Loading sunburst data...

Tree controls


The tree shows the occurrence of this domain across different species. More...


Please note: for large trees this can take some time. While the tree is loading, you can safely switch away from this tab but if you browse away from the family page entirely, the tree will not be loaded.


For those sequences which have a structure in the Protein DataBank, we use the mapping between UniProt, PDB and Pfam coordinate systems from the PDBe group, to allow us to map Pfam domains onto UniProt sequences and three-dimensional protein structures. The table below shows the structures on which the Bac_DNA_binding domain has been found. There are 106 instances of this domain found in the PDB. Note that there may be multiple copies of the domain in a single PDB structure, since many structures contain multiple copies of the same protein sequence.

Loading structure mapping...

AlphaFold Structure Predictions