Summary: FTO catalytic domain
Pfam includes annotations and additional family information from a range of different sources. These sources can be accessed via the tabs below.
This is the Wikipedia entry entitled "FTO gene". More...
The Wikipedia text that you see displayed here is a download from Wikipedia. This means that the information we display is a copy of the information from the Wikipedia database. The button next to the article title ("Edit Wikipedia article") takes you to the edit page for the article directly within Wikipedia. You should be aware you are not editing our local copy of this information. Any changes that you make to the Wikipedia article will not be displayed here until we next download the article from Wikipedia. We currently download new content on a nightly basis.
Does Pfam agree with the content of the Wikipedia entry ?
Pfam has chosen to link families to Wikipedia articles. In some case we have created or edited these articles but in many other cases we have not made any direct contribution to the content of the article. The Wikipedia community does monitor edits to try to ensure that (a) the quality of article annotation increases, and (b) vandalism is very quickly dealt with. However, we would like to emphasise that Pfam does not curate the Wikipedia entries and we cannot guarantee the accuracy of the information on the Wikipedia page.
Editing Wikipedia articles
Before you edit for the first time
Wikipedia is a free, online encyclopedia. Although anyone can edit or contribute to an article, Wikipedia has some strong editing guidelines and policies, which promote the Wikipedia standard of style and etiquette. Your edits and contributions are more likely to be accepted (and remain) if they are in accordance with this policy.
You should take a few minutes to view the following pages:
How your contribution will be recorded
Anyone can edit a Wikipedia entry. You can do this either as a new user or you can register with Wikipedia and log on. When you click on the "Edit Wikipedia article" button, your browser will direct you to the edit page for this entry in Wikipedia. If you are a registered user and currently logged in, your changes will be recorded under your Wikipedia user name. However, if you are not a registered user or are not logged on, your changes will be logged under your computer's IP address. This has two main implications. Firstly, as a registered Wikipedia user your edits are more likely seen as valuable contribution (although all edits are open to community scrutiny regardless). Secondly, if you edit under an IP address you may be sharing this IP address with other users. If your IP address has previously been blocked (due to being flagged as a source of 'vandalism') your edits will also be blocked. You can find more information on this and creating a user account at Wikipedia.
If you have problems editing a particular page, contact us at firstname.lastname@example.org and we will try to help.
The community annotation is a new facility of the Pfam web site. If you have problems editing or experience problems with these pages please contact us.
FTO gene Edit Wikipedia article
|Fat mass and obesity associated|
|RNA expression pattern|
Fat mass and obesity-associated protein also known as alpha-ketoglutarate-dependent dioxygenase FTO is an enzyme that in humans is encoded by the FTO gene located on chromosome 16. As one homolog in the AlkB family proteins, it is the first mRNA demethylase that has been identified. Certain variants of the FTO gene appear to be correlated with obesity in humans.
The amino acid sequence of the transcribed FTO protein shows high similarity with the enzyme AlkB which oxidatively demethylates DNA. Recombinant FTO protein was first discovered to catalyze demethylation of 3-methylthymine in single-stranded DNA, and 3-methyluridine in single-stranded RNA, with low efficiency. The nucleoside N6-methyladenosine, an abundant modification in RNA, was then found to be a major substrate of FTO. The FTO gene expression was also found to be significantly upregulated in the hypothalamus of rats after food deprivation and strongly negatively correlated with the expression of orexigenic galanin like peptide which is involved in the stimulation of food intake.
Increases in hypothalamic expression of FTO are associated with the regulation of energy intake but not feeding reward.
FTO demethylates m6A in mRNA
N6-methyladenosine (m6A) is an abundant modification in mRNA and is found within some viruses, and most eukaryotes including mammals, insects, plants,and yeast. It is also found in tRNA, rRNA, and small nuclear RNA (snRNA) as well as several long non-coding RNA, such as Xist. Adenosine methylation is directed by a large m6A methyltransferase complex containing METTL3 as the SAM-binding sub-unit. In vitro, this methyltransferase complex preferentially methylates RNA oligonucleotides containing GGACU and a similar preference was identified in vivo in mapped m6A sites in Rous sarcoma virus genomic RNA and in bovine prolactin mRNA. In plants, the majority of the m6A is found within 150 nucleotides before the start of the poly(A) tail.
Mapping of m6A in human and mouse RNA has identified over 18,000 m6A sites in the transcripts of more than 7,000 human genes with a consensus sequence of [G/A/U][G>A]m6AC[U>A/C]< consistent with the previously identified motif. Sites preferentially appear in two distinct landmarksâ€”around stop codons and within long internal exonsâ€”and are highly conserved between human and mouse. A subset of stimulus-dependent, dynamically modulated sites has been identified. Silencing the m6A methyltransferase significantly affects gene expression and alternative RNA splicing patterns, resulting in modulation of the p53 (also known as TP53) signalling pathway and apoptosis.
FTO demethylates m6A containing RNA efficiently in vitro. FTO knockdown with siRNA led to increased amounts of m6A in polyA-RNA, whereas overexpression of FTO resulted in decreased amounts of m6A in human cells. FTO partially co-localizes with nuclear speckles, which supports the notion that m6A in nuclear RNA is a major physiological substrate of FTO. Function of FTO likely affects the processing of pre-mRNA, other nuclear RNAs, or both. The discovery of the FTO-mediated oxidative demethylation of m6A in nuclear RNA may initiate further investigations on biological regulation based on reversible chemical modification of RNA.
The FTO gene is widely expressed in both fetal and adult tissues.
Association with obesity
A study of 38,759 Europeans for variants of FTO identified an obesity risk allele. In particular, carriers of one copy of the allele weighed on average 1.2 kilograms (2.6 lb) more than people with no copies. Carriers of two copies (16% of the subjects) weighed 3 kilograms (6.6 lb) more and had a 1.67-fold higher rate of obesity than those with no copies. The association was observed in ages 7 and upwards. This gene is not directly associated with diabetes however increased body-fat also increases the risk of developing Type 2 Diabetes.
Simultaneously, a study in 2,900 affected individuals and 5,100 controls of French descent, together with 500 trios (confirming an association independent of population stratification) found association of SNPs in the very same region of FTO (rs1421085). The authors found that this variation, or a variation in strong LD with this variation explains 1% of the population BMI variance and 22% of the population attributable risk of obesity. The authors of this study claim that while obesity was already known to have a genetic component (from twin studies), no replicated previous study has ever identified an obesity risk allele that was so common in the human population. The risk allele is a cluster of 10 single nucleotide polymorphism in the first intron of FTO called rs9939609. According to HapMap, it has population frequencies of 45% in the West/Central Europeans, 52% in Yorubans (West African natives) and 14% in Chinese/Japanese. Furthermore morbid obesity is associated with a combination of FTO and INSIG2 single nucleotide polymorphisms.
In adult humans it was shown that adults bearing the at risk AT and AA alleles at rs9939609 consumed between 500 and 1250 kJ more each day than those carrying the protective TT genotype (equivalent to between 125 and 280 kcal per day more intake). The same study showed that there was no impact of the polymorphism on energy expenditure. This finding of an effect of the rs9939609 polymorphism on food intake or satiety has been independently replicated in five subsequent studies (in order of publication). Three of these subsequent studies also measured resting energy expenditure and confirmed the original finding that there is no impact of the polymorphic variation at the rs9939609 locus on energy expenditure. A different study explored the effects of variation in two different SNPs in the FTO gene (rs17817449 and rs1421085) and suggested there might be an effect on circulating leptin levels and energy expenditure, but this latter effect disappeared when the expenditure was normalised for differences in body composition. The accumulated data across seven independent studies therefore clearly implicates the FTO gene in humans as having a direct impact on food intake but no effect on energy expenditure.
The obesity-associated noncoding region within the FTO gene interacts directly with the promoter of IRX3, a homeobox gene. This noncoding region of FTO interacts with the promoters of IRX3 and FTO in human, mouse and zebrafish. Results suggest that IRX3 is linked with obesity and determines body mass and composition. This is further supported by the fact that obesity-associated single nucleotide polymorphisms are involved in the expression of IRX3 (not FTO) in human brains.
Association with Alzheimer's disease
Recent studies revealed that carriers of common FTO gene polymorphisms show both a reduction in frontal lobe volume of the brain and an impaired verbal fluency performance. Fittingly, a population-based study from Sweden found that carriers of the FTO rs9939609 A allele have an increased risk for incident Alzheimer disease.
Association with other diseases
The presence of the FTO rs9939609 A allele was also found to be positively correlated with other symptoms of the metabolic syndrome, including higher fasting insulin, glucose, and triglycerides, and lower HDL cholesterol. However all these effects appear to be secondary to weight increase since no association was found after correcting for increases in body mass index. Similarly, the association of rs11076008 G allele with the increased risk for degenerative disc disease was reported. 
Model organisms have been used in the study of FTO function. In contrast to the findings in humans deletion, analysis of the Fto gene in mice showed loss of function is associated with no differences in energy intake but greater energy expenditure and this results in a reduction of body weight and fatness.
|Glucose tolerance test||Normal|
|Auditory brainstem response||Normal|
|All tests and analysis from|
Another conditional knockout mouse line, called Ftotm1a(EUCOMM)Wtsi was generated as part of the International Knockout Mouse Consortium program â€” a high-throughput mutagenesis project to generate and distribute animal models of disease to interested scientists. Male and female animals from this line underwent a standardized phenotypic screen to determine the effects of deletion. Twenty five tests were carried out on mutant mice and only significant skeletal abnormalities were observed, including kyphosis and abnormal vertebral transverse processes, and only in female homozygous mutant animals. 
The reasons for the differences in FTO phenotype between humans and different lines of mice is presently uncertain. However, many other genes involved in regulation of energy balance exert effects on both intake and expenditure.
Origin of name
- Jia G, Fu Y, Zhao X, Dai Q, Zheng G, Yang Y et al. (December 2011). "N6-Methyladenosine in nuclear RNA is a major substrate of the obesity-associated FTO". Nat. Chem. Biol. 7 (12): 885â€“7. doi:10.1038/nchembio.687. PMC 3218240. PMID 22002720.
- Loos RJ, Yeo GS (2014). "The bigger picture of FTO: the first GWAS-identified obesity gene". Nat Rev Endocrinol 10 (1): 51â€“61. doi:10.1038/nrendo.2013.227. PMC 4188449. PMID 24247219.
- Gerken T, Girard CA, Tung YC, Webby CJ, Saudek V, Hewitson KS et al. (2007). "The obesity-associated FTO gene encodes a 2-oxoglutarate-dependent nucleic acid demethylase". Science 318 (5855): 1469â€“72. doi:10.1126/science.1151710. PMC 2668859. PMID 17991826.
- Sanchez-Pulido L, Andrade-Navarro MA (2007). "The FTO (fat mass and obesity associated) gene codes for a novel member of the non-heme dioxygenase superfamily". BMC Biochem. 8: 23. doi:10.1186/1471-2091-8-23. PMC 2241624. PMID 17996046.
- Meyer KD, Saletore Y, Zumbo P, Elemento O, Mason CE, Jaffrey SR (May 2012). "Comprehensive Analysis of mRNA Methylation Reveals Enrichment in 3' UTRs and near Stop Codons". Cell 149 (7): 1635â€“46. doi:10.1016/j.cell.2012.05.003. PMC 3383396. PMID 22608085.
- Fredriksson R, HÃ¤gglund M, Olszewski PK, Stephansson O, Jacobsson JA, Olszewska AM et al. (2008). "The obesity gene, FTO, is of ancient origin, upregulated during food deprivation and expressed in neurons of feeding-related nuclei of the brain". Endocrinology 149 (5): 2062â€“71. doi:10.1210/en.2007-1457. PMID 18218688.
- Olszewski PK, Fredriksson R, Olszewska AM, Stephansson O, AlsiÃ¶ J, Radomska KJ et al. (October 2009). "Hypothalamic FTO is associated with the regulation of energy intake not feeding reward". BMC Neurosci 10: 129. doi:10.1186/1471-2202-10-129. PMC 2774323. PMID 19860904.
- Aloni Y, Dhar R, Khoury G (October 1979). "Methylation of nuclear simian virus 40 RNAs". J. Virol. 32 (1): 52â€“60. PMC 353526. PMID 232187.
- Beemon K, Keith J (June 1977). "Localization of N6-methyladenosine in the Rous sarcoma virus genome". J. Mol. Biol. 113 (1): 165â€“79. doi:10.1016/0022-2836(77)90047-X. PMID 196091.
- Desrosiers R, Friderici K, Rottman F (October 1974). "Identification of methylated nucleosides in messenger RNA from Novikoff hepatoma cells". Proc. Natl. Acad. Sci. U.S.A. 71 (10): 3971â€“5. Bibcode:1974PNAS...71.3971D. doi:10.1073/pnas.71.10.3971. PMC 434308. PMID 4372599.
- Adams JM, Cory S (May 1975). "Modified nucleosides and bizarre 5'-termini in mouse myeloma mRNA". Nature 255 (5503): 28â€“33. Bibcode:1975Natur.255...28A. doi:10.1038/255028a0. PMID 1128665.
- Wei CM, Gershowitz A, Moss B (January 1976). "5'-Terminal and internal methylated nucleotide sequences in HeLa cell mRNA". Biochemistry 15 (2): 397â€“401. doi:10.1021/bi00647a024. PMID 174715.
- Perry RP, Kelley DE, Friderici K, Rottman F (April 1975). "The methylated constituents of L cell messenger RNA: evidence for an unusual cluster at the 5' terminus". Cell 4 (4): 387â€“94. doi:10.1016/0092-8674(75)90159-2. PMID 1168101.
- Levis R, Penman S (April 1978). "5'-terminal structures of poly(A)+ cytoplasmic messenger RNA and of poly(A)+ and poly(A)- heterogeneous nuclear RNA of cells of the dipteran Drosophila melanogaster". J. Mol. Biol. 120 (4): 487â€“515. doi:10.1016/0022-2836(78)90350-9. PMID 418182.
- Nichols JL (August 1979). "N6-methyladenosine in maize poly(A)-containing RNA". Plant Science Letters 15 (4): 357â€“361. doi:10.1016/0304-4211(79)90141-X.
- Kennedy TD, Lane BG (June 1979). "Wheat embryo ribonucleates. XIII. Methyl-substituted nucleoside constituents and 5'-terminal dinucleotide sequences in bulk poly(AR)-rich RNA from imbibing wheat embryos". Can. J. Biochem. 57 (6): 927â€“31. doi:10.1139/o79-112. PMID 476526.
- Zhong S, Li H, Bodi Z, Button J, Vespa L, Herzog M et al. (May 2008). "MTA is an Arabidopsis messenger RNA adenosine methylase and interacts with a homolog of a sex-specific splicing factor". Plant Cell 20 (5): 1278â€“88. doi:10.1105/tpc.108.058883. PMC 2438467. PMID 18505803.
- Clancy MJ, Shambaugh ME, Timpte CS, Bokar JA (October 2002). "Induction of sporulation in Saccharomyces cerevisiae leads to the formation of N6-methyladenosine in mRNA: a potential mechanism for the activity of the IME4 gene". Nucleic Acids Res. 30 (20): 4509â€“18. doi:10.1093/nar/gkf573. PMC 137137. PMID 12384598.
- Bodi Z, Button JD, Grierson D, Fray RG (September 2010). "Yeast targets for mRNA methylation". Nucleic Acids Res. 38 (16): 5327â€“35. doi:10.1093/nar/gkq266. PMC 2938207. PMID 20421205.
- Dominissini D, Moshitch-Moshkovitz S, Schwartz S, Salmon-Divon M, Ungar L, Osenberg S et al. (May 2012). "Topology of the human and mouse m6A RNA methylomes revealed by m6A-seq". Nature 485 (7397): 201â€“6. Bibcode:2012Natur.485..201D. doi:10.1038/nature11112. PMID 22575960.
- Bokar JA, Shambaugh ME, Polayes D, Matera AG, Rottman FM (November 1997). "Purification and cDNA cloning of the AdoMet-binding subunit of the human mRNA (N6-adenosine)-methyltransferase". RNA 3 (11): 1233â€“47. PMC 1369564. PMID 9409616.
- Harper JE, Miceli SM, Roberts RJ, Manley JL (October 1990). "Sequence specificity of the human mRNA N6-adenosine methylase in vitro". Nucleic Acids Res. 18 (19): 5735â€“41. doi:10.1093/nar/18.19.5735. PMC 332308. PMID 2216767.
- Kane SE, Beemon K (September 1985). "Precise localization of m6A in Rous sarcoma virus RNA reveals clustering of methylation sites: implications for RNA processing". Mol. Cell. Biol. 5 (9): 2298â€“306. PMC 366956. PMID 3016525.
- Horowitz S, Horowitz A, Nilsen TW, Munns TW, Rottman FM (September 1984). "Mapping of N6-methyladenosine residues in bovine prolactin mRNA". Proc. Natl. Acad. Sci. U.S.A. 81 (18): 5667â€“71. Bibcode:1984PNAS...81.5667H. doi:10.1073/pnas.81.18.5667. PMC 391771. PMID 6592581.
- Bodi Z, Zhong S, Mehra S, Song J, Graham N, Li H et al. (2012). "Adenosine Methylation in Arabidopsis mRNA is Associated with the 3' End and Reduced Levels Cause Developmental Defects". Front Plant Sci 3: 48. doi:10.3389/fpls.2012.00048. PMC 3355605. PMID 22639649.
- Frayling TM, Timpson NJ, Weedon MN, Zeggini E, Freathy RM, Lindgren CM et al. (2007). "A common variant in the FTO gene is associated with body mass index and predisposes to childhood and adult obesity". Science 316 (5826): 889â€“94. Bibcode:2007Sci...316..889F. doi:10.1126/science.1141634. PMC 2646098. PMID 17434869.
- Sandholt CH, Hansen T, Pedersen O (2012). "Beyond the fourth wave of genome-wide obesity association studies". Nutr Diabetes 2: e37. doi:10.1038/nutd.2012.9. PMC 3408643. PMID 23168490.
- Dina C, Meyre D, Gallina S, Durand E, KÃ¶rner A, Jacobson P et al. (2007). "Variation in FTO contributes to childhood obesity and severe adult obesity". Nature Genetics 39 (6): 724â€“6. doi:10.1038/ng2048. PMID 17496892.
- Chu X, Erdman R, Susek M, Gerst H, Derr K, Al-Agha M et al. (2008). "Association of morbid obesity with FTO and INSIG2 allelic variants". Arch Surg 143 (3): 235â€“40; discussion 241. doi:10.1001/archsurg.2007.77. PMID 18347269.
- Thorleifsson G, Walters GB, Gudbjartsson DF, Steinthorsdottir V, Sulem P, Helgadottir A et al. (January 2009). "Genome-wide association yields new sequence variants at seven loci that associate with measures of obesity". Nat. Genet. 41 (1): 18â€“24. doi:10.1038/ng.274. PMID 19079260.
- Willer CJ, Speliotes EK, Loos RJ, Li S, Lindgren CM, Heid IM et al. (January 2009). "Six new loci associated with body mass index highlight a neuronal influence on body weight regulation". Nat. Genet. 41 (1): 25â€“34. doi:10.1038/ng.287. PMC 2695662. PMID 19079261.
- Speakman JR, Rance KA, Johnstone AM (August 2008). "Polymorphisms of the FTO gene are associated with variation in energy intake, but not energy expenditure". Obesity (Silver Spring) 16 (8): 1961â€“5. doi:10.1038/oby.2008.318. PMID 18551109.
- Wardle J, Carnell S, Haworth CM, Farooqi IS, O'Rahilly S, Plomin R (September 2008). "Obesity associated genetic variation in FTO is associated with diminished satiety". J. Clin. Endocrinol. Metab. 93 (9): 3640â€“3. doi:10.1210/jc.2008-0472. PMID 18583465.
- Timpson NJ, Emmett PM, Frayling TM, Rogers I, Hattersley AT, McCarthy MI et al. (October 2008). "The fat mass- and obesity-associated locus and dietary intake in children". Am. J. Clin. Nutr. 88 (4): 971â€“8. PMID 18842783.
- Haupt A, Thamer C, Staiger H, Tschritter O, Kirchhoff K, Machicao F et al. (April 2009). "Variation in the FTO gene influences food intake but not energy expenditure". Exp. Clin. Endocrinol. Diabetes 117 (4): 194â€“7. doi:10.1055/s-0028-1087176. PMID 19053021.
- Wardle J, Llewellyn C, Sanderson S, Plomin R (January 2009). "The FTO gene and measured food intake in children". Int J Obes (Lond) 33 (1): 42â€“5. doi:10.1038/ijo.2008.174. PMID 18838977.
- Cecil JE, Tavendale R, Watt P, Hetherington MM, Palmer CN (December 2008). "An obesity-associated FTO gene variant and increased energy intake in children". N. Engl. J. Med. 359 (24): 2558â€“66. doi:10.1056/NEJMoa0803839. PMID 19073975.
- Do R, Bailey SD, Desbiens K, Belisle A, Montpetit A, Bouchard C et al. (April 2008). "Genetic variants of FTO influence adiposity, insulin sensitivity, leptin levels, and resting metabolic rate in the Quebec Family Study". Diabetes 57 (4): 1147â€“50. doi:10.2337/db07-1267. PMID 18316358.
- Smemo S, Tena JJ, Kim KH, Gamazon ER, Sakabe NJ, GÃ³mez-MarÃn C et al. (12 March 2014). "Obesity-associated variants within FTO form long-range functional connections with IRX3". Nature 507 (7492): 371â€“375. doi:10.1038/nature13138. PMID 24646999.
- Ho AJ, Stein JL, Hua X, Lee S, Hibar DP, Leow AD et al. (May 2010). "A commonly carried allele of the obesity-related FTO gene is associated with reduced brain volume in the healthy elderly". Proc. Natl. Acad. Sci. U.S.A. 107 (18): 8404â€“9. Bibcode:2010PNAS..107.8404H. doi:10.1073/pnas.0910878107. PMC 2889537. PMID 20404173.
- Benedict C, Jacobsson JA, RÃ¶nnemaa E, SÃ¤llman-AlmÃ©n M, Brooks S, Schultes B et al. (June 2011). "The fat mass and obesity gene is linked to reduced verbal fluency in overweight and obese elderly men". Neurobiol. Aging 32 (6): 1159.e1â€“5. doi:10.1016/j.neurobiolaging.2011.02.006. PMID 21458110.
- Keller L, Xu W, Wang HX, Winblad B, Fratiglioni L, Graff C (2011). "The obesity related gene, FTO, interacts with APOE, and is associated with Alzheimer's disease risk: a prospective cohort study". J. Alzheimers Dis. 23 (3): 461â€“9. doi:10.3233/JAD-2010-101068. PMID 21098976.
- Freathy RM, Timpson NJ, Lawlor DA, Pouta A, Ben-Shlomo Y, Ruokonen A et al. (2008). "Common variation in the FTO gene alters diabetes-related metabolic traits to the extent expected, given its effect on BMI". Diabetes 57 (5): 1419â€“26. doi:10.2337/db07-1466. PMC 3073395. PMID 18346983.
- Lao L, Zhong G, Li X, Liu Z (Feb 2014). "A preliminary association study of fat mass and obesity associated gene polymorphisms and degenerative disc disease in a Chinese Han population.". J Int Med Res 42 (1): 205â€“12. doi:10.1177/0300060513503761. PMID 24304927.
- Fischer J, Koch L, Emmerling C, Vierkotten J, Peters T, BrÃ¼ning JC et al. (April 2009). "Inactivation of the Fto gene protects from obesity". Nature 458 (7240): 894â€“8. Bibcode:2009Natur.458..894F. doi:10.1038/nature07848. PMID 19234441.
- "Radiography data for Fto". Wellcome Trust Sanger Institute.
- "Salmonella infection data for Fto". Wellcome Trust Sanger Institute.
- "Citrobacter infection data for Fto". Wellcome Trust Sanger Institute.
- Gerdin AK (2010). "The Sanger Mouse Genetics Programme: High throughput characterisation of knockout mice". Acta Ophthalmologica 88: 925â€“7. doi:10.1111/j.1755-3768.2010.4142.x.
- Mouse Resources Portal, Wellcome Trust Sanger Institute.
- "International Knockout Mouse Consortium".
- "Mouse Genome Informatics".
- Skarnes WC, Rosen B, West AP, Koutsourakis M, Bushell W, Iyer V et al. (2011). "A conditional knockout resource for the genome-wide study of mouse gene function". Nature 474 (7351): 337â€“342. doi:10.1038/nature10163. PMC 3572410. PMID 21677750.
- Dolgin E (2011). "Mouse library set to be knockout". Nature 474 (7351): 262â€“3. doi:10.1038/474262a. PMID 21677718.
- Collins FS, Rossant J, Wurst W (2007). "A Mouse for All Reasons". Cell 128 (1): 9â€“13. doi:10.1016/j.cell.2006.12.018. PMID 17218247.
- van der Weyden L, White JK, Adams DJ, Logan DW (2011). "The mouse genetics toolkit: revealing function and mechanism". Genome Biol. 12 (6): 224. doi:10.1186/gb-2011-12-6-224. PMC 3218837. PMID 21722353.
- Peters T, Ausmeier K, RÃ¼ther U (October 1999). "Cloning of Fatso (Fto), a novel gene deleted by the Fused toes (Ft) mouse mutation". Mamm. Genome 10 (10): 983â€“6. doi:10.1007/s003359901144. PMID 10501967.
- Kim B, Kim Y, Cooke PS, RÃ¼ther U, Jorgensen JS (2011). "The fused toes locus is essential for somatic-germ cell interactions that foster germ cell maturation in developing gonads in mice". Biol Reprod 84 (5): 1024â€“32. doi:10.1095/biolreprod.110.088559. PMID 21293032.
This tab holds the annotation information that is stored in the Pfam database. As we move to using Wikipedia as our main source of annotation, the contents of this tab will be gradually replaced by the Wikipedia tab.
FTO catalytic domain Provide feedback
This domain is the catalytic AlkB-like domain from the FTO protein . This domain catalyses a demethylase activity with a preference for 3-methylthymidine.
Han Z, Niu T, Chang J, Lei X, Zhao M, Wang Q, Cheng W, Wang J, Feng Y, Chai J;, Nature. 2010; [Epub ahead of print]: Crystal structure of the FTO protein reveals basis for its substrate specificity. PUBMED:20376003 EPMC:20376003
External database links
This tab holds annotation information from the InterPro database.
InterPro entry IPR024367Alpha-ketoglutarate-dependent dioxygenase FTO, also known as Fat mass and obesity-associated protein, is a nucleus protein which belongs to the FTO family. This enzyme is a dioxygenase that repairs alkylated DNA and RNA by oxidative demethylation [PUBMED:17991826]. FTO activity is highest towards single-stranded RNA containing 3-methyluracil, followed by single-stranded DNA containing 3-methylthymine. FTO has low demethylase activity towards single-stranded DNA containing 1-methyladenine or 3-methylcytosine [PUBMED:18775698]. FTO has no activity towards 1-methylguanine. It has no detectable activity towards double-stranded DNA. FTO requires molecular oxygen, alpha-ketoglutarate and iron. FTO contributes to the regulation of the global metabolic rate, energy expenditure and energy homeostasis. It contributes to the regulation of body size and body fat accumulation as well [PUBMED:19234441].
This domain is the catalytic AlkB-like domain from the FTO protein [PUBMED:20376003]. This domain catalyses a demethylase activity with a preference for 3-methylthymidine.
Below is a listing of the unique domain organisations or architectures in which this domain is found. More...
The graphic that is shown by default represents the longest sequence with a given architecture. Each row contains the following information:
- the number of sequences which exhibit this architecture
a textual description of the architecture, e.g. Gla, EGF x 2, Trypsin.
This example describes an architecture with one
Gladomain, followed by two consecutive
EGFdomains, and finally a single
- a link to the page in the Pfam site showing information about the sequence that the graphic describes
- the UniProt description of the protein sequence
- the number of residues in the sequence
- the Pfam graphic itself.
Note that you can see the family page for a particular domain by clicking on the graphic. You can also choose to see all sequences which have a given architecture by clicking on the Show link in each row.
Finally, because some families can be found in a very large number of architectures, we load only the first fifty architectures by default. If you want to see more architectures, click the button at the bottom of the page to load the next set.
Loading domain graphics...
This clan represents the conserved barrel domain of the 'cupin' superfamily  ('cupa' is the Latin term for a small barrel). The cupin fold is found in a wide variety of enzymes, but notably contains the non-enzymatic seed storage proteins also.
The clan contains the following 57 members:2OG-Fe_Oxy_2 2OG-FeII_Oxy 2OG-FeII_Oxy_2 2OG-FeII_Oxy_3 2OG-FeII_Oxy_4 2OG-FeII_Oxy_5 3-HAO AraC_binding AraC_binding_2 AraC_N ARD Asp_Arg_Hydrox Auxin_BP CDO_I CENP-C_C CsiD Cupin_1 Cupin_2 Cupin_3 Cupin_4 Cupin_5 Cupin_6 Cupin_7 Cupin_8 DMSP_lyase dTDP_sugar_isom DUF1255 DUF1479 DUF1498 DUF1637 DUF1971 DUF386 DUF4437 DUF4867 Ectoine_synth EutQ FdtA FTO_NTD GPI HgmA HutD JmjC KduI MannoseP_isomer Ofd1_CTDD Oxygenase-NA PhyH Pirin Pirin_C PMI_typeI Popeye Pox_C4_C10 TauD Tet_JBP Ureidogly_hydro VIT VIT_2
We store a range of different sequence alignments for families. As well as the seed alignment from which the family is built, we provide the full alignment, generated by searching the sequence database using the family HMM. We also generate alignments using four representative proteomes (RP) sets, the NCBI sequence database, and our metagenomics sequence database. More...
There are various ways to view or download the sequence alignments that we store. We provide several sequence viewers and a plain-text Stockholm-format file for download.
We make a range of alignments for each Pfam-A family:
- the curated alignment from which the HMM for the family is built
- the alignment generated by searching the sequence database using the HMM
- Representative Proteomes (RPs) at 15%, 35%, 55% and 75% co-membership thresholds
- alignment generated by searching the NCBI sequence database using the family HMM
- alignment generated by searching the metagenomics sequence database using the family HMM
You can see the alignments as HTML or in three different sequence viewers:
- a Java applet developed at the University of Dundee. You will need Java installed before running jalview
- an HTML page showing the whole alignment.Please note: full Pfam alignments can be very large. These HTML views are extremely large and often cause problems for browsers. Please use either jalview or the Pfam viewer if you have trouble viewing the HTML version
- an HTML-based representation of the alignment, coloured according to the posterior-probability (PP) values from the HMM. As for the standard HTML view, heatmap alignments can also be very large and slow to render.
- Pfam viewer
- an HTML-based viewer that uses DAS to retrieve alignment fragments on request
You can download (or view in your browser) a text representation of a Pfam alignment in various formats:
You can also change the order in which sequences are listed in the alignment, change how insertions are represented, alter the characters that are used to represent gaps in sequences and, finally, choose whether to download the alignment or to view it in your browser directly.
You may find that large alignments cause problems for the viewers and the reformatting tool, so we also provide all alignments in Stockholm format. You can download either the plain text alignment, or a gzipped version of it.
We make a range of alignments for each Pfam-A family. You can see a description of each above. You can view these alignments in various ways but please note that some types of alignment are never generated while others may not be available for all families, most commonly because the alignments are too large to handle.
1Cannot generate PP/Heatmap alignments for seeds; no PP data available
Key: available, not generated, — not available.
Format an alignment
We make all of our alignments available in Stockholm format. You can download them here as raw, plain text files or as gzip-compressed files.
You can also download a FASTA format file containing the full-length sequences for all sequences in the full alignment.
MyHits provides a collection of tools to handle multiple sequence alignments. For example, one can refine a seed alignment (sequence addition or removal, re-alignment or manual edition) and then search databases for remote homologs using HMMER3.
HMM logos is one way of visualising profile HMMs. Logos provide a quick overview of the properties of an HMM in a graphical form. You can see a more detailed description of HMM logos and find out how you can interpret them here. More...
If you find these logos useful in your own work, please consider citing the following article:
This page displays the phylogenetic tree for this family's seed alignment. We use FastTree to calculate neighbour join trees with a local bootstrap based on 100 resamples (shown next to the tree nodes). FastTree calculates approximately-maximum-likelihood phylogenetic trees from our seed alignment.
Note: You can also download the data file for the tree.
Curation and family details
This section shows the detailed information about the Pfam family. You can see the definitions of many of the terms in this section in the glossary and a fuller explanation of the scoring system that we use in the scores section of the help pages.
|Number in seed:||20|
|Number in full:||138|
|Average length of the domain:||244.00 aa|
|Average identity of full alignment:||55 %|
|Average coverage of the sequence by the domain:||59.64 %|
|HMM build commands:||
build method: hmmbuild -o /dev/null HMM SEED
search method: hmmsearch -Z 80369284 -E 1000 --cpu 4 HMM pfamseq
|Family (HMM) version:||3|
|Download:||download the raw HMM for this family|
Weight segments by...
Change the size of the sunburst
selected sequences to HMM
a FASTA-format file
- 0 sequences
- 0 species
This visualisation provides a simple graphical representation of the distribution of this family across species. You can find the original interactive tree in the More....
This chart is a modified "sunburst" visualisation of the species tree for this family. It shows each node in the tree as a separate arc, arranged radially with the superkingdoms at the centre and the species arrayed around the outermost ring.
How the sunburst is generated
The tree is built by considering the taxonomic lineage of each sequence that has a match to this family. For each node in the resulting tree, we draw an arc in the sunburst. The radius of the arc, its distance from the root node at the centre of the sunburst, shows the taxonomic level ("superkingdom", "kingdom", etc). The length of the arc represents either the number of sequences represented at a given level, or the number of species that are found beneath the node in the tree. The weighting scheme can be changed using the sunburst controls.
In order to reduce the complexity of the representation, we reduce the number of taxonomic levels that we show. We consider only the following eight major taxonomic levels:
Colouring and labels
Segments of the tree are coloured approximately according to their superkingdom. For example, archeal branches are coloured with shades of orange, eukaryotes in shades of purple, etc. The colour assignments are shown under the sunburst controls. Where space allows, the name of the taxonomic level will be written on the arc itself.
As you move your mouse across the sunburst, the current node will be highlighted. In the top section of the controls panel we show a summary of the lineage of the currently highlighed node. If you pause over an arc, a tooltip will be shown, giving the name of the taxonomic level in the title and a summary of the number of sequences and species below that node in the tree.
Anomalies in the taxonomy tree
There are some situations that the sunburst tree cannot easily handle and for which we have work-arounds in place.
Missing taxonomic levels
Some species in the taxonomic tree may not have one or more of the main eight levels that we display. For example, Bos taurus is not assigned an order in the NCBI taxonomic tree. In such cases we mark the omitted level with, for example, "No order", in both the tooltip and the lineage summary.
Unmapped species names
The tree is built by looking at each sequence in the full alignment for the family. We take the name of the species given by UniProt and try to map that to the full taxonomic tree from NCBI. In some cases, the name chosen by UniProt does not map to any node in the NCBI tree, perhaps because the chosen name is listed as a synonym or a misspelling in the NCBI taxonomy.
So that these nodes are not simply omitted from the sunburst tree, we group them together in a separate branch (or segment of the sunburst tree). Since we cannot determine the lineage for these unmapped species, we show all levels between the superkingdom and the species as "uncategorised".
Since we reduce the species tree to only the eight main taxonomic levels, sequences that are mapped to the sub-species level in the tree would not normally be shown. Rather than leave out these species, we map them instead to their parent species. So, for example, for sequences belonging to one of the Vibrio cholerae sub-species in the NCBI taxonomy, we show them instead as belonging to the species Vibrio cholerae.
Too many species/sequences
For large species trees, you may see blank regions in the outer layers of the sunburst. These occur when there are large numbers of arcs to be drawn in a small space. If an arc is less than approximately one pixel wide, it will not be drawn and the space will be left blank. You may still be able to get some information about the species in that region by moving your mouse across the area, but since each arc will be very small, it will be difficult to accurately locate a particular species.
The tree shows the occurrence of this domain across different species. More...
We show the species tree in one of two ways. For smaller trees we try to show an interactive representation, which allows you to select specific nodes in the tree and view them as an alignment or as a set of Pfam domain graphics.
Unfortunately we have found that there are problems viewing the interactive tree when the it becomes larger than a certain limit. Furthermore, we have found that Internet Explorer can become unresponsive when viewing some trees, regardless of their size. We therefore show a text representation of the species tree when the size is above a certain limit or if you are using Internet Explorer to view the site.
If you are using IE you can still load the interactive tree by clicking the "Generate interactive tree" button, but please be aware of the potential problems that the interactive species tree can cause.
For all of the domain matches in a full alignment, we count the number that are found on all sequences in the alignment. This total is shown in the purple box.
We also count the number of unique sequences on which each domain is found, which is shown in green. Note that a domain may appear multiple times on the same sequence, leading to the difference between these two numbers.
Finally, we group sequences from the same organism according to the NCBI code that is assigned by UniProt, allowing us to count the number of distinct sequences on which the domain is found. This value is shown in the pink boxes.
We use the NCBI species tree to group organisms according to their taxonomy and this forms the structure of the displayed tree. Note that in some cases the trees are too large (have too many nodes) to allow us to build an interactive tree, but in most cases you can still view the tree in a plain text, non-interactive representation. Those species which are represented in the seed alignment for this domain are highlighted.
You can use the tree controls to manipulate how the interactive tree is displayed:
- show/hide the summary boxes
- highlight species that are represented in the seed alignment
- expand/collapse the tree or expand it to a given depth
- select a sub-tree or a set of species within the tree and view them graphically or as an alignment
- save a plain text representation of the tree
Please note: for large trees this can take some time. While the tree is loading, you can safely switch away from this tab but if you browse away from the family page entirely, the tree will not be loaded.
There is 1 interaction for this family. More...
We determine these interactions using iPfam, which considers the interactions between residues in three-dimensional protein structures and maps those interactions back to Pfam families. You can find more information about the iPfam algorithm in the journal article that accompanies the website.
For those sequences which have a structure in the Protein DataBank, we use the mapping between UniProt, PDB and Pfam coordinate systems from the PDBe group, to allow us to map Pfam domains onto UniProt sequences and three-dimensional protein structures. The table below shows the structures on which the FTO_NTD domain has been found. There are 11 instances of this domain found in the PDB. Note that there may be multiple copies of the domain in a single PDB structure, since many structures contain multiple copies of the same protein seqence.
Loading structure mapping...