Summary: Aconitase family (aconitate hydratase)
Pfam includes annotations and additional family information from a range of different sources. These sources can be accessed via the tabs below.
This is the Wikipedia entry entitled "Aconitase". More...
The Wikipedia text that you see displayed here is a download from Wikipedia. This means that the information we display is a copy of the information from the Wikipedia database. The button next to the article title ("Edit Wikipedia article") takes you to the edit page for the article directly within Wikipedia. You should be aware you are not editing our local copy of this information. Any changes that you make to the Wikipedia article will not be displayed here until we next download the article from Wikipedia. We currently download new content on a nightly basis.
Does Pfam agree with the content of the Wikipedia entry ?
Pfam has chosen to link families to Wikipedia articles. In some case we have created or edited these articles but in many other cases we have not made any direct contribution to the content of the article. The Wikipedia community does monitor edits to try to ensure that (a) the quality of article annotation increases, and (b) vandalism is very quickly dealt with. However, we would like to emphasise that Pfam does not curate the Wikipedia entries and we cannot guarantee the accuracy of the information on the Wikipedia page.
Editing Wikipedia articles
Before you edit for the first time
Wikipedia is a free, online encyclopedia. Although anyone can edit or contribute to an article, Wikipedia has some strong editing guidelines and policies, which promote the Wikipedia standard of style and etiquette. Your edits and contributions are more likely to be accepted (and remain) if they are in accordance with this policy.
You should take a few minutes to view the following pages:
How your contribution will be recorded
Anyone can edit a Wikipedia entry. You can do this either as a new user or you can register with Wikipedia and log on. When you click on the "Edit Wikipedia article" button, your browser will direct you to the edit page for this entry in Wikipedia. If you are a registered user and currently logged in, your changes will be recorded under your Wikipedia user name. However, if you are not a registered user or are not logged on, your changes will be logged under your computer's IP address. This has two main implications. Firstly, as a registered Wikipedia user your edits are more likely seen as valuable contribution (although all edits are open to community scrutiny regardless). Secondly, if you edit under an IP address you may be sharing this IP address with other users. If your IP address has previously been blocked (due to being flagged as a source of 'vandalism') your edits will also be blocked. You can find more information on this and creating a user account at Wikipedia.
If you have problems editing a particular page, contact us at email@example.com and we will try to help.
The community annotation is a new facility of the Pfam web site. If you have problems editing or experience problems with these pages please contact us.
Aconitase Edit Wikipedia article
Illustration of pig aconitase in complex with the [Fe4S4] cluster. The protein is colored by secondary structure, and iron atoms are blue and the sulfur red.
|PDB structures||RCSB PDB PDBe PDBsum|
|Gene Ontology||AmiGO / EGO|
Structure of aconitase.
Aconitase (aconitate hydratase; EC 126.96.36.199) is an enzyme that catalyses the stereo-specific isomerization of citrate to isocitrate via cis-aconitate in the tricarboxylic acid cycle, a non-redox-active process.
In contrast with the majority of iron-sulfur proteins that function as electron carriers, the iron-sulfur cluster of aconitase reacts directly with an enzyme substrate. Aconitase has an active [Fe4S4]2+ cluster, which may convert to an inactive [Fe3S4]+ form. Three cysteine (Cys) residues have been shown to be ligands of the [Fe4S4] centre. In the active state, the labile iron ion of the [Fe4S4] cluster is not coordinated by Cys but by water molecules.
The iron-responsive element-binding protein (IRE-BP) and 3-isopropylmalate dehydratase (α-isopropylmalate isomerase; EC 188.8.131.52), an enzyme catalysing the second step in the biosynthesis of leucine, are known aconitase homologues. Iron regulatory elements (IREs) constitute a family of 28-nucleotide, non-coding, stem-loop structures that regulate iron storage, heme synthesis and iron uptake. They also participate in ribosome binding and control the mRNA turnover (degradation). The specific regulator protein, the IRE-BP, binds to IREs in both 5' and 3' regions, but only to RNA in the apo form, without the Fe-S cluster. Expression of IRE-BP in cultured cells has revealed that the protein functions either as an active aconitase, when cells are iron-replete, or as an active RNA-binding protein, when cells are iron-depleted. Mutant IRE-BPs, in which any or all of the three Cys residues involved in Fe-S formation are replaced by serine, have no aconitase activity, but retain RNA-binding properties.
Aconitase, displayed in the structures in the right margin of this page, has two slightly different structures, depending on whether it is activated or inactivated. In the inactive form, its structure is divided into four domains. Counting from the N-terminus, only the first three of these domains are involved in close interactions with the [3Fe-4S] cluster, but the active site consists of residues from all four domains, including the larger C-terminal domain. The Fe-S cluster and a SO42- anion also reside in the active site. When the enzyme is activated, it gains an additional iron atom, creating a [4Fe-4S] cluster. However, the structure of the rest of the enzyme is nearly unchanged; the conserved atoms between the two forms are in essentially the same positions, up to a difference of 0.1 angstroms.
Aconitase employs a dehydration-hydration mechanism. The catalytic residues involved are His-101 and Ser-642. His-101 protonates the hydroxyl group on C3 of citrate, allowing it to leave as water, and Ser-642 concurrently abstracts the proton on C2, forming a double bond between C2 and C3, forming a cis-aconitate intermediate. At this point, the intermediate is rotated 180°. This rotation is referred to as a "flip." Because of this flip, the intermediate is said to move from a "citrate mode" to a "isocitrate mode."
How exactly this flip occurs is debatable. One theory is that, in the rate-limiting step of the mechanism, the cis-aconitate is released from the enzyme, then reattached in the isocitrate mode to complete the reaction. This rate-liming step ensures that the right stereochemistry, specifically (2R,3S), is formed in the final product. Another hypothesis is that cis-aconitate stays bound to the enzyme while it flips from the citrate to the isocitrate mode.
In either case, flipping cis-aconitate allows the dehydration and hydration steps to occur on opposite faces of the intermediate. Aconitase catalyzes trans elimination/addition of water, and the flip guarantees that the correct stereochemistry is formed in the product. To complete the reaction, the serine and histidine residues reverse their original catalytic actions: the histidine, now basic, abstracts a proton from water, priming it as a nucleophile to attack at C2, and the protonated serine is deprotonated by the cis-aconitate double bond to complete the hydration, producing isocitrate.
A serious ailment associated with aconitase is known as aconitase deficiency. It is caused by a mutation in the gene for iron-sulfur cluster scaffold protein (ISCU), which helps build the Fe-S cluster on which the activity of aconitase depends. The main symptoms are myopathy and exercise intolerance; physical strain is lethal for some patients because it can lead to circulatory shock. There are no known treatments for aconitase deficiency.
Another disease associated with aconitase is Friedreich's ataxia (FRDA), which is caused when the Fe-S proteins in aconitase and succinate dehydrogenase have decreased activity. A proposed mechanism for this connection is that decreased Fe-S activity in aconitase and succinate dehydrogenase is correlated with excess iron concentration in the mitochondria and insufficient iron in the cytoplasm, disrupting iron homeostasis. This deviance from homeostasis causes FRDA, a neurodegenerative disease for which no effective treatments have been found.
Finally, aconitase is thought to be associated with diabetes. Although the exact connection is still being determined, multiple theories exist. In a study of organs from mice with alloxan diabetes (experimentally induced diabetes) and genetic diabetes, lower aconitase activity was found to decrease the rates of metabolic reactions involving citrate, pyruvate, and malate. In addition, citrate concentration was observed to be unusually high. Since these abnormal data were found in diabetic mice, the study concluded that low aconitase activity is likely correlated with genetic and alloxan diabetes. Another theory is that, in diabetic hearts, accelerated phosphorylation of heart aconitase by protein kinase C causes aconitase to speed up the final step of its reverse reaction relative to its forward reaction. That is, it converts isocitrate back to cis-aconitate more rapidly than usual, but the forward reaction proceeds at the usual rate. This imbalance may contribute to disrupted metabolism in diabetics.
Aconitases are expressed in bacteria to humans. Humans express the following two aconitase isozymes:
Interactive pathway map
Click on genes, proteins and metabolites below to link to respective articles. [§ 1]
- The interactive pathway map can be edited at WikiPathways: "TCACycle_WP78".
- PDB 7ACN; Lauble, H.; Kennedy, M. C.; Beinert, H.; Stout, C. D. (1992). "Crystal structures of aconitase with isocitrate and nitroisocitrate bound". Biochemistry 31 (10): 2735–48. doi:10.1021/bi00125a014. PMID 1547214.
- PDB 1ACO; Lauble, H; Kennedy, MC; Beinert, H; Stout, CD (1994). "Crystal Structures of Aconitase with Trans-aconitate and Nitrocitrate Bound". Journal of Molecular Biology 237 (4): 437–51. doi:10.1006/jmbi.1994.1246. PMID 8151704.
- Beinert, H; Kennedy, MC (1993). "Aconitase, a two-faced protein: Enzyme and iron regulatory factor". The FASEB Journal 7 (15): 1442–9. PMID 8262329.
- Flint, Dennis H.; Allen, Ronda M. (1996). "Iron−Sulfur Proteins with Nonredox Functions". Chemical Reviews 96 (7): 2315–34. doi:10.1021/cr950041r.
- Beinert, Helmut; Kennedy, Mary Claire; Stout, C. David (1996). "Aconitase as Iron−Sulfur Protein, Enzyme, and Iron-Regulatory Protein". Chemical Reviews 96 (7): 2335–74. doi:10.1021/cr950040z. PMID 11848830.
- Gardner, Paul R. (2002). "Aconitase: Sensitive target and measure of superoxide". Superoxide Dismutase. Methods in Enzymology 349. pp. 9–23. doi:10.1016/S0076-6879(02)49317-2. ISBN 978-0-12-182252-1.
- Robbins AH, Stout CD (1989). "The structure of aconitase". Proteins 5 (4): 289–312. doi:10.1002/prot.340050406. PMID 2798408.
- Robbins AH, Stout CD (May 1989). "Structure of activated aconitase: formation of the [4Fe-4S] cluster in the crystal". Proc. Natl. Acad. Sci. U.S.A. 86 (10): 3639–43. doi:10.1073/pnas.86.10.3639. PMC 287193. PMID 2726740.
- Lauble H, Kennedy MC, Beinert H, Stout CD (March 1992). "Crystal structures of aconitase with isocitrate and nitroisocitrate bound". Biochemistry 31 (10): 2735–48. doi:10.1021/bi00125a014. PMID 1547214.
- Takusagawa F. "Chapter 16: Citric Acid Cycle". Takusagawa’s Note. The University of Kansas. Retrieved 2011-07-10.
- Beinert H, Kennedy MC, Stout CD (November 1996). "Aconitase as Ironminus signSulfur Protein, Enzyme, and Iron-Regulatory Protein". Chem. Rev. 96 (7): 2335–2374. doi:10.1021/cr950040z. PMID 11848830.
- PDB 1C96; Lloyd SJ, Lauble H, Prasad GS, Stout CD (December 1999). "The mechanism of aconitase: 1.8 A resolution crystal structure of the S642a:citrate complex". Protein Sci. 8 (12): 2655–62. doi:10.1110/ps.8.12.2655. PMC 2144235. PMID 10631981.
- Han D, Canali R, Garcia J, Aguilera R, Gallaher TK, Cadenas E (September 2005). "Sites and mechanisms of aconitase inactivation by peroxynitrite: modulation by citrate and glutathione". Biochemistry 44 (36): 11986–96. doi:10.1021/bi0509393. PMID 16142896.
- Lauble H, Stout CD (May 1995). "Steric and conformational features of the aconitase mechanism". Proteins 22 (1): 1–11. doi:10.1002/prot.340220102. PMID 7675781.
- "Aconitase family". The Prosthetic groups and Metal Ions in Protein Active Sites Database Version 2.0. The University of Leeds. 1999-02-02. Archived from the original on 8 June 2011. Retrieved 2011-07-10.
- Orphanet, "Aconitase deficiency," April 2008, http://www.orpha.net/consor/cgi-bin/OC_Exp.php?lng=EN&Expert=43115
- Hall, R E; Henriksson, K G; Lewis, S F; Haller, R G; Kennaway, N G (1993). "Mitochondrial myopathy with succinate dehydrogenase and aconitase deficiency. Abnormalities of several iron-sulfur proteins". Journal of Clinical Investigation 92 (6): 2660–6. doi:10.1172/JCI116882. PMC 288463. PMID 8254022.
- Ye, Hong; Rouault, Tracey A. (2010). "Human Iron−Sulfur Cluster Assembly, Cellular Iron Homeostasis, and Disease". Biochemistry 49 (24): 4945–56. doi:10.1021/bi1004798. PMC 2885827. PMID 20481466.
- Boquist, L.; Ericsson, I.; Lorentzon, R.; Nelson, L. (1985). "Alterations in mitochondrial aconitase activity and respiration, and in concentration of citrate in some organs of mice with experimental or genetic diabetes". FEBS Letters 183 (1): 173–6. doi:10.1016/0014-5793(85)80979-0. PMID 3884379.
- Lin, G.; Brownsey, R. W.; MacLeod, K. M. (2009). "Regulation of mitochondrial aconitase by phosphorylation in diabetic rat heart". Cellular and Molecular Life Sciences 66 (5): 919–32. doi:10.1007/s00018-009-8696-3. PMID 19153662.
- "Alloxan Diabetes - Medical Definition," Stedman's Medical Dictionary, 2006 Lippincott Williams & Wilkins, http://www.medilexicon.com/medicaldictionary.php?t=24313
- Aconitase at the US National Library of Medicine Medical Subject Headings (MeSH)
- Proteopedia Aconitase - the Aconitase structure in interactive 3D
This tab holds the annotation information that is stored in the Pfam database. As we move to using Wikipedia as our main source of annotation, the contents of this tab will be gradually replaced by the Wikipedia tab.
Aconitase family (aconitate hydratase) Provide feedback
No Pfam abstract.
External database links
This tab holds annotation information from the InterPro database.
InterPro entry IPR001030
Aconitase (aconitate hydratase; EC) is an iron-sulphur protein that contains a [4Fe-4S]-cluster and catalyses the interconversion of isocitrate and citrate via a cis-aconitate intermediate. Aconitase functions in both the TCA and glyoxylate cycles, however unlike the majority of iron-sulphur proteins that function as electron carriers, the [4Fe-4S]-cluster of aconitase reacts directly with an enzyme substrate. In eukaryotes there is a cytosolic form (cAcn) and a mitochondrial form (mAcn) of the enzyme. In bacteria there are also 2 forms, aconitase A (AcnA) and B (AcnB). Several aconitases are known to be multi-functional enzymes with a second non-catalytic, but essential function that arises when the cellular environment changes, such as when iron levels drop [PUBMED:10087914, PUBMED:15877277]. Eukaryotic cAcn and mAcn, and bacterial AcnA have the same domain organisation, consisting of three N-terminal alpha/beta/alpha domains, a linker region, followed by a C-terminal 'swivel' domain with a beta/beta/alpha structure (1-2-3-linker-4), although mAcn is small than cAcn. However, bacterial AcnB has a different organisation: it contains an N-terminal HEAT-like domain, followed by the 'swivel' domain, then the three alpha/beta/alpha domains (HEAT-4-1-2-3) [PUBMED:9020582]. Below is a description of some of the multi-functional activities associated with different aconitases.
- Eukaryotic mAcn catalyses the second step of the mitochondrial TCA cycle, which is important for energy production, providing high energy electrons in the form of NADH and FADH2 to the mitochondrial oxidative phosphorylation pathway [PUBMED:15543948]. The TCA cycle also provides precursors for haem and amino acid production. This enzyme has a second, non-catalytic but essential role in mitochondrial DNA (mtDNA) maintenance: mAcn acts to stabilise mtDNA, forming part of mtDNA protein-DNA complexes known as nucleoids. mAcn is thought to reversibly model nucleoids to directly influence mitochondrial gene expression in response to changes in the cellular environment. Therefore, mAcn can influence the expression of components of the oxidative phosphorylation pathway encoded in mtDNA.
- Eukaryotic cAcn enzyme balances the amount of citrate and isocitrate in the cytoplasm, which in turn creates a balance between the amount of NADPH generated from isocitrate by isocitrate dehydrogenase with the amount of acetyl-CoA generated from citrate by citrate lyase. Fatty acid synthesis requires both NADPH and acetyl-CoA, as do other metabolic processes, including the need for NADPH to combat oxidative stress. The enzymatic form of cAcn predominates when iron levels are normal, but if they drop sufficiently to cause the disassembly of the [4Fe-4S]-cluster, then cAcn undergoes a conformational change from a compact enzyme to a more open L-shaped protein known as iron regulatory protein 1 (IRP1; or IRE-binding protein 1, IREBP1) [PUBMED:17185597, PUBMED:16407072]. As IRP1, the catalytic site and the [4Fe-4S]-cluster are lost, and two new RNA-binding sites appear. IRP1 functions in the post-transcriptional regulation of genes involved in iron metabolism - it binds to mRNA iron-responsive elements (IRE), 30-nucleotide stem-loop structures at the 3' or 5' end of specific transcripts. Transcripts containing an IRE include ferritin L and H subunits (iron storage), transferrin (iron plasma chaperone), transferrin receptor (iron uptake into cells), ferroportin (iron exporter), mAcn, succinate dehydrogenase, erythroid aminolevulinic acid synthetase (tetrapyrrole biosynthesis), among others. If the IRE is in the 5'-UTR of the transcript (e.g. in ferritin mRNA), then IRP1-binding prevents its translation by blocking the transcript from binding to the ribosome. If the IRE is in the 3'-UTR of the transcript (e.g. transferrin receptor), then IRP1-binding protects it from endonuclease degradation, thereby prolonging the half-life of the transcript and enabling it to be translated [PUBMED:15604397].
- IRP2 is another IRE-binding protein that binds to the same transcripts as IRP1. However, since IRP1 is predominantly in the enzymatic cAcn form, it is IRP2 that acts as the major metabolic regulator that maintains iron homeostasis [PUBMED:16850017]. Although IRP2 is homologous to IRP1, IRP2 lacks aconitase activity, and is known only to have a single function in the post-transcriptional regulation of iron metabolism genes [PUBMED:17513696]. In iron-replete cells, IRP2 activity is regulated primarily by iron-dependent degradation through the ubiquitin-proteasomal system.
- Bacterial AcnB is also known to be multi-functional. In addition to its role in the TCA cycle, AcnB was shown to be a post-transcriptional regulator of gene expression in Escherichia coli and Salmonella enterica [PUBMED:15882410, PUBMED:15009904]. In S. enterica, AcnB initiates a regulatory cascade controlling flagella biosynthesis through an interaction with the ftsH transcript, an alternative RNA polymerase sigma factor. This binding lowers the intracellular concentration of FtsH protease, which in turn enhances the amount of RNA polymerase sigma32 factor (normally degraded by FtsH protease), and sigma32 then increases the synthesis of chaperone DnaK, which in turn promotes the synthesis of the flagellar protein FliC. AcnB regulates the synthesis of other proteins as well, such as superoxide dismutase (SodA) and other enzymes involved in oxidative stress.
3-isopropylmalate dehydratase (or isopropylmalate isomerase; EC) catalyses the stereo-specific isomerisation of 2-isopropylmalate and 3-isopropylmalate, via the formation of 2-isopropylmaleate. This enzyme performs the second step in the biosynthesis of leucine, and is present in most prokaryotes and many fungal species. The prokaryotic enzyme is a heterodimer composed of a large (LeuC) and small (LeuD) subunit, while the fungal form is a monomeric enzyme. Both forms of isopropylmalate are related and are part of the larger aconitase family [PUBMED:9020582]. Aconitases are mostly monomeric proteins which share four domains in common and contain a single, labile [4Fe-4S] cluster. Three structural domains (1, 2 and 3) are tightly packed around the iron-sulphur cluster, while a fourth domain (4) forms a deep active-site cleft. The prokaryotic enzyme is encoded by two adjacent genes, leuC and leuD, corresponding to aconitase domains 1-3 and 4 respectively [PUBMED:1400210, PUBMED:9813279]. LeuC does not bind an iron-sulphur cluster. It is thought that some prokaryotic isopropylamalate dehydrogenases can also function as homoaconitase EC, converting cis-homoaconitate to homoisocitric acid in lysine biosynthesis [PUBMED:15522288]. Homoaconitase has been identified in higher fungi (mitochondria) and several archaea and one thermophilic species of bacteria, Thermus thermophilus [PUBMED:16524361].
This entry represents a region containing 3 domains, each with a 3-layer alpha/beta/alpha topology. This regions represents the [4Fe-4S] cluster-binding region found at the N-terminal of eukaryotic mAcn, cAcn/IPR1 and IRP2, and bacterial AcnA, but in the C-terminal of bacterial AcnB. This domain is also found in the large subunit of isopropylmalate dehydratase (LeuC).
More information about these proteins can be found at Protein of the Month: Aconitase [PUBMED:].
The mapping between Pfam and Gene Ontology is provided by InterPro. If you use this data please cite InterPro.
|Biological process||metabolic process (GO:0008152)|
Below is a listing of the unique domain organisations or architectures in which this domain is found. More...
The graphic that is shown by default represents the longest sequence with a given architecture. Each row contains the following information:
- the number of sequences which exhibit this architecture
a textual description of the architecture, e.g. Gla, EGF x 2, Trypsin.
This example describes an architecture with one
Gladomain, followed by two consecutive
EGFdomains, and finally a single
- a link to the page in the Pfam site showing information about the sequence that the graphic describes
- the UniProt description of the protein sequence
- the number of residues in the sequence
- the Pfam graphic itself.
Note that you can see the family page for a particular domain by clicking on the graphic. You can also choose to see all sequences which have a given architecture by clicking on the Show link in each row.
Finally, because some families can be found in a very large number of architectures, we load only the first fifty architectures by default. If you want to see more architectures, click the button at the bottom of the page to load the next set.
Loading domain graphics...
We store a range of different sequence alignments for families. As well as the seed alignment from which the family is built, we provide the full alignment, generated by searching the sequence database using the family HMM. We also generate alignments using four representative proteomes (RP) sets, the NCBI sequence database, and our metagenomics sequence database. More...
There are various ways to view or download the sequence alignments that we store. We provide several sequence viewers and a plain-text Stockholm-format file for download.
We make a range of alignments for each Pfam-A family:
- the curated alignment from which the HMM for the family is built
- the alignment generated by searching the sequence database using the HMM
- Representative Proteomes (RPs) at 15%, 35%, 55% and 75% co-membership thresholds
- alignment generated by searching the NCBI sequence database using the family HMM
- alignment generated by searching the metagenomics sequence database using the family HMM
You can see the alignments as HTML or in three different sequence viewers:
- a Java applet developed at the University of Dundee. You will need Java installed before running jalview
- an HTML page showing the whole alignment.Please note: full Pfam alignments can be very large. These HTML views are extremely large and often cause problems for browsers. Please use either jalview or the Pfam viewer if you have trouble viewing the HTML version
- an HTML-based representation of the alignment, coloured according to the posterior-probability (PP) values from the HMM. As for the standard HTML view, heatmap alignments can also be very large and slow to render.
- Pfam viewer
- an HTML-based viewer that uses DAS to retrieve alignment fragments on request
You can download (or view in your browser) a text representation of a Pfam alignment in various formats:
You can also change the order in which sequences are listed in the alignment, change how insertions are represented, alter the characters that are used to represent gaps in sequences and, finally, choose whether to download the alignment or to view it in your browser directly.
You may find that large alignments cause problems for the viewers and the reformatting tool, so we also provide all alignments in Stockholm format. You can download either the plain text alignment, or a gzipped version of it.
We make a range of alignments for each Pfam-A family. You can see a description of each above. You can view these alignments in various ways but please note that some types of alignment are never generated while others may not be available for all families, most commonly because the alignments are too large to handle.
1Cannot generate PP/Heatmap alignments for seeds; no PP data available
Key: available, not generated, — not available.
Format an alignment
We make all of our alignments available in Stockholm format. You can download them here as raw, plain text files or as gzip-compressed files.
You can also download a FASTA format file containing the full-length sequences for all sequences in the full alignment.
MyHits provides a collection of tools to handle multiple sequence alignments. For example, one can refine a seed alignment (sequence addition or removal, re-alignment or manual edition) and then search databases for remote homologs using HMMER3.
HMM logos is one way of visualising profile HMMs. Logos provide a quick overview of the properties of an HMM in a graphical form. You can see a more detailed description of HMM logos and find out how you can interpret them here. More...
If you find these logos useful in your own work, please consider citing the following article:
This page displays the phylogenetic tree for this family's seed alignment. We use FastTree to calculate neighbour join trees with a local bootstrap based on 100 resamples (shown next to the tree nodes). FastTree calculates approximately-maximum-likelihood phylogenetic trees from our seed alignment.
Note: You can also download the data file for the tree.
Curation and family details
This section shows the detailed information about the Pfam family. You can see the definitions of many of the terms in this section in the glossary and a fuller explanation of the scoring system that we use in the scores section of the help pages.
|Number in seed:||11|
|Number in full:||14106|
|Average length of the domain:||338.30 aa|
|Average identity of full alignment:||28 %|
|Average coverage of the sequence by the domain:||64.53 %|
|HMM build commands:||
build method: hmmbuild -o /dev/null HMM SEED
search method: hmmsearch -Z 23193494 -E 1000 --cpu 4 HMM pfamseq
|Family (HMM) version:||15|
|Download:||download the raw HMM for this family|
Weight segments by...
Change the size of the sunburst
selected sequences to HMM
a FASTA-format file
- 0 sequences
- 0 species
This visualisation provides a simple graphical representation of the distribution of this family across species. You can find the original interactive tree in the More....
This chart is a modified "sunburst" visualisation of the species tree for this family. It shows each node in the tree as a separate arc, arranged radially with the superkingdoms at the centre and the species arrayed around the outermost ring.
How the sunburst is generated
The tree is built by considering the taxonomic lineage of each sequence that has a match to this family. For each node in the resulting tree, we draw an arc in the sunburst. The radius of the arc, its distance from the root node at the centre of the sunburst, shows the taxonomic level ("superkingdom", "kingdom", etc). The length of the arc represents either the number of sequences represented at a given level, or the number of species that are found beneath the node in the tree. The weighting scheme can be changed using the sunburst controls.
In order to reduce the complexity of the representation, we reduce the number of taxonomic levels that we show. We consider only the following eight major taxonomic levels:
Colouring and labels
Segments of the tree are coloured approximately according to their superkingdom. For example, archeal branches are coloured with shades of orange, eukaryotes in shades of purple, etc. The colour assignments are shown under the sunburst controls. Where space allows, the name of the taxonomic level will be written on the arc itself.
As you move your mouse across the sunburst, the current node will be highlighted. In the top section of the controls panel we show a summary of the lineage of the currently highlighed node. If you pause over an arc, a tooltip will be shown, giving the name of the taxonomic level in the title and a summary of the number of sequences and species below that node in the tree.
Anomalies in the taxonomy tree
There are some situations that the sunburst tree cannot easily handle and for which we have work-arounds in place.
Missing taxonomic levels
Some species in the taxonomic tree may not have one or more of the main eight levels that we display. For example, Bos taurus is not assigned an order in the NCBI taxonomic tree. In such cases we mark the omitted level with, for example, "No order", in both the tooltip and the lineage summary.
Unmapped species names
The tree is built by looking at each sequence in the full alignment for the family. We take the name of the species given by UniProt and try to map that to the full taxonomic tree from NCBI. In some cases, the name chosen by UniProt does not map to any node in the NCBI tree, perhaps because the chosen name is listed as a synonym or a misspelling in the NCBI taxonomy.
So that these nodes are not simply omitted from the sunburst tree, we group them together in a separate branch (or segment of the sunburst tree). Since we cannot determine the lineage for these unmapped species, we show all levels between the superkingdom and the species as "uncategorised".
Since we reduce the species tree to only the eight main taxonomic levels, sequences that are mapped to the sub-species level in the tree would not normally be shown. Rather than leave out these species, we map them instead to their parent species. So, for example, for sequences belonging to one of the Vibrio cholerae sub-species in the NCBI taxonomy, we show them instead as belonging to the species Vibrio cholerae.
Too many species/sequences
For large species trees, you may see blank regions in the outer layers of the sunburst. These occur when there are large numbers of arcs to be drawn in a small space. If an arc is less than approximately one pixel wide, it will not be drawn and the space will be left blank. You may still be able to get some information about the species in that region by moving your mouse across the area, but since each arc will be very small, it will be difficult to accurately locate a particular species.
The tree shows the occurrence of this domain across different species. More...
We show the species tree in one of two ways. For smaller trees we try to show an interactive representation, which allows you to select specific nodes in the tree and view them as an alignment or as a set of Pfam domain graphics.
Unfortunately we have found that there are problems viewing the interactive tree when the it becomes larger than a certain limit. Furthermore, we have found that Internet Explorer can become unresponsive when viewing some trees, regardless of their size. We therefore show a text representation of the species tree when the size is above a certain limit or if you are using Internet Explorer to view the site.
If you are using IE you can still load the interactive tree by clicking the "Generate interactive tree" button, but please be aware of the potential problems that the interactive species tree can cause.
For all of the domain matches in a full alignment, we count the number that are found on all sequences in the alignment. This total is shown in the purple box.
We also count the number of unique sequences on which each domain is found, which is shown in green. Note that a domain may appear multiple times on the same sequence, leading to the difference between these two numbers.
Finally, we group sequences from the same organism according to the NCBI code that is assigned by UniProt, allowing us to count the number of distinct sequences on which the domain is found. This value is shown in the pink boxes.
We use the NCBI species tree to group organisms according to their taxonomy and this forms the structure of the displayed tree. Note that in some cases the trees are too large (have too many nodes) to allow us to build an interactive tree, but in most cases you can still view the tree in a plain text, non-interactive representation. Those species which are represented in the seed alignment for this domain are highlighted.
You can use the tree controls to manipulate how the interactive tree is displayed:
- show/hide the summary boxes
- highlight species that are represented in the seed alignment
- expand/collapse the tree or expand it to a given depth
- select a sub-tree or a set of species within the tree and view them graphically or as an alignment
- save a plain text representation of the tree
Please note: for large trees this can take some time. While the tree is loading, you can safely switch away from this tab but if you browse away from the family page entirely, the tree will not be loaded.
There are 3 interactions for this family. More...
We determine these interactions using iPfam, which considers the interactions between residues in three-dimensional protein structures and maps those interactions back to Pfam families. You can find more information about the iPfam algorithm in the journal article that accompanies the website.
For those sequences which have a structure in the Protein DataBank, we use the mapping between UniProt, PDB and Pfam coordinate systems from the PDBe group, to allow us to map Pfam domains onto UniProt sequences and three-dimensional protein structures. The table below shows the structures on which the Aconitase domain has been found. There are 25 instances of this domain found in the PDB. Note that there may be multiple copies of the domain in a single PDB structure, since many structures contain multiple copies of the same protein seqence.
Loading structure mapping...