Protein: FOXA2_HUMAN (Q9Y261)

Summary

This is the summary of UniProt entry FOXA2_HUMAN (Q9Y261).

Description: Hepatocyte nuclear factor 3-beta
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
Length: 457 amino acids
Reference Proteome: ✓

Please note: at the start of each Pfam data release, we take a copy of the UniProt sequence database, and this snapshot forms the basis of the overview shown here. UniProt entries that are removed after the snapshot is taken remain in Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End
disorder n/a 2 3
Pfam Forkhead_N 16 158
low_complexity n/a 29 40
low_complexity n/a 43 66
low_complexity n/a 73 103
low_complexity n/a 112 124
disorder n/a 148 152
Pfam Forkhead 158 244
disorder n/a 228 235
low_complexity n/a 257 291
disorder n/a 266 411
low_complexity n/a 325 346
low_complexity n/a 340 346
low_complexity n/a 344 363
Pfam HNF_C 373 446
low_complexity n/a 382 397
disorder n/a 415 437
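
The region table above can be read programmatically. The following is a minimal sketch, assuming the rows are whitespace-separated and that Start and End are 1-based inclusive positions; the names `Region` and `parse_regions` are hypothetical and not part of any Pfam tool.

```python
from typing import List, NamedTuple


class Region(NamedTuple):
    """One row of the region table (hypothetical helper type)."""
    source: str
    name: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based inclusive coordinates.
        return self.end - self.start + 1


# Rows copied from the table above.
TABLE = """\
disorder n/a 2 3
Pfam Forkhead_N 16 158
low_complexity n/a 29 40
low_complexity n/a 43 66
low_complexity n/a 73 103
low_complexity n/a 112 124
disorder n/a 148 152
Pfam Forkhead 158 244
disorder n/a 228 235
low_complexity n/a 257 291
disorder n/a 266 411
low_complexity n/a 325 346
low_complexity n/a 340 346
low_complexity n/a 344 363
Pfam HNF_C 373 446
low_complexity n/a 382 397
disorder n/a 415 437
"""


def parse_regions(text: str) -> List[Region]:
    """Parse whitespace-separated 'source name start end' rows."""
    regions = []
    for line in text.splitlines():
        if not line.strip():
            continue
        source, name, start, end = line.split()
        regions.append(Region(source, name, int(start), int(end)))
    return regions


regions = parse_regions(TABLE)
pfam_domains = [r for r in regions if r.source == "Pfam"]

for domain in pfam_domains:
    print(f"{domain.name}: {domain.start}-{domain.end} ({domain.length} residues)")
```

Filtering on the Source column separates the three Pfam domains from the disorder and low-complexity annotations, which overlap them freely.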

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession Q9Y261. Pfam stores this sequence and updates it only with each new Pfam release, so the stored sequence may differ from the one currently held by UniProt.

Sequence:
  1 MLGAVKMEGH EPSDWSSYYA EPEGYSSVSN MNAGLGMNGM NTYMSMSAAA 50
 51 MGSGSGNMSA GSMNMSSYVG AGMSPSLAGM SPGAGAMAGM GGSAGAAGVA 100
101 GMGPHLSPSL SPLGGQAAGA MGGLAPYANM NSMSPMYGQA GLSRARDPKT 150
151 YRRSYTHAKP PYSYISLITM AIQQSPNKML TLSEIYQWIM DLFPFYRQNQ 200
201 QRWQNSIRHS LSFNDCFLKV PRSPDKPGKG SFWTLHPDSG NMFENGCYLR 250
251 RQKRFKCEKQ LALKEAAGAA GSGKKAAAGA QASQAQLGEA AGPASETPAG 300
301 TESPHSSASP CQEHKRGGLG ELKGTPAAAL SPPEPAPSPG QQQQAAAHLL 350
351 GPPHHPGLPP EAHLKPEHHY AFNHPFSINN LMSSEQQHHH SHHHHQPHKM 400
401 DLKAYEQVMH YPGYGSPMPG SLAMGPVTNK TGLDASPLAA DTSYYQGVYS 450
451 RPIMNSS 457

Checksums:
CRC64:61DDE4C75C70680A
MD5:a68bf8fc94c271c0f7564e465ff6871e
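
A checksum such as the MD5 value above can be recomputed locally to check that a copy of the sequence is intact. The following is a minimal sketch, assuming the checksum is taken over the uppercase one-letter sequence with no whitespace or line breaks; that convention is an assumption and is not stated on this page. The helper name `md5_hex` is hypothetical.

```python
import hashlib

# Sequence blocks copied from this entry; whitespace is removed below.
FORMATTED_SEQUENCE = """
MLGAVKMEGH EPSDWSSYYA EPEGYSSVSN MNAGLGMNGM NTYMSMSAAA
MGSGSGNMSA GSMNMSSYVG AGMSPSLAGM SPGAGAMAGM GGSAGAAGVA
GMGPHLSPSL SPLGGQAAGA MGGLAPYANM NSMSPMYGQA GLSRARDPKT
YRRSYTHAKP PYSYISLITM AIQQSPNKML TLSEIYQWIM DLFPFYRQNQ
QRWQNSIRHS LSFNDCFLKV PRSPDKPGKG SFWTLHPDSG NMFENGCYLR
RQKRFKCEKQ LALKEAAGAA GSGKKAAAGA QASQAQLGEA AGPASETPAG
TESPHSSASP CQEHKRGGLG ELKGTPAAAL SPPEPAPSPG QQQQAAAHLL
GPPHHPGLPP EAHLKPEHHY AFNHPFSINN LMSSEQQHHH SHHHHQPHKM
DLKAYEQVMH YPGYGSPMPG SLAMGPVTNK TGLDASPLAA DTSYYQGVYS
RPIMNSS
"""

# Join the blocks into one continuous string of residues.
SEQUENCE = "".join(FORMATTED_SEQUENCE.split())

# MD5 value as listed in this entry.
LISTED_MD5 = "a68bf8fc94c271c0f7564e465ff6871e"


def md5_hex(sequence: str) -> str:
    """Return the lowercase hexadecimal MD5 digest of an ASCII sequence."""
    return hashlib.md5(sequence.encode("ascii")).hexdigest()


print("length:", len(SEQUENCE))
print("computed MD5:", md5_hex(SEQUENCE))
print("matches listed MD5:", md5_hex(SEQUENCE) == LISTED_MD5)
```

If the comparison fails, the likely causes are a copy error in the sequence or a different checksum convention than the one assumed here.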

Structures

For sequences that have a structure in the Protein Data Bank (PDB), we use the mapping between the UniProt, PDB and Pfam coordinate systems from the PDBe SIFTS project to map Pfam domains onto three-dimensional structures. The table below shows the mapping between Pfam domains, this UniProt entry and the corresponding three-dimensional structure.

Pfam family UniProt residues PDB ID PDB chain ID PDB residues
Forkhead 158-238 5X07 C 158-238
Forkhead 158-238 5X07 I 158-238
Forkhead 158-239 5X07 F 158-239
Forkhead 158-239 5X07 L 158-239
Forkhead_N 157-158 5X07 C 157-158
Forkhead_N 157-158 5X07 F 157-158
Forkhead_N 157-158 5X07 I 157-158
Forkhead_N 157-158 5X07 L 157-158
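
The residue mapping in the table above can be represented as simple records. The following is a minimal sketch, assuming each mapped segment is contiguous with no insertions or gaps, so that a UniProt position converts to a PDB position by a constant offset; real SIFTS mappings can be more complex. The `Mapping` class and its `to_pdb` method are hypothetical and not part of any Pfam or PDBe API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Mapping:
    """One row of the structure mapping table (hypothetical helper type)."""
    family: str
    pdb_id: str
    chain: str
    uniprot_start: int
    uniprot_end: int
    pdb_start: int
    pdb_end: int

    def to_pdb(self, uniprot_pos: int) -> int:
        """Convert a UniProt position to a PDB residue number.

        Assumes a contiguous segment, so the offset is constant.
        """
        if not self.uniprot_start <= uniprot_pos <= self.uniprot_end:
            raise ValueError(f"position {uniprot_pos} is outside the mapped segment")
        return self.pdb_start + (uniprot_pos - self.uniprot_start)


# Rows copied from the table above (PDB entry 5X07).
MAPPINGS = [
    Mapping("Forkhead", "5X07", "C", 158, 238, 158, 238),
    Mapping("Forkhead", "5X07", "I", 158, 238, 158, 238),
    Mapping("Forkhead", "5X07", "F", 158, 239, 158, 239),
    Mapping("Forkhead", "5X07", "L", 158, 239, 158, 239),
    Mapping("Forkhead_N", "5X07", "C", 157, 158, 157, 158),
    Mapping("Forkhead_N", "5X07", "F", 157, 158, 157, 158),
    Mapping("Forkhead_N", "5X07", "I", 157, 158, 157, 158),
    Mapping("Forkhead_N", "5X07", "L", 157, 158, 157, 158),
]

# Number of UniProt residues covered per (family, chain) pair.
covered = {
    (m.family, m.chain): m.uniprot_end - m.uniprot_start + 1 for m in MAPPINGS
}

for (family, chain), count in covered.items():
    print(f"{family} chain {chain}: {count} residues")
```

In this entry the UniProt and PDB numbering coincide, so the offset is zero for every row; the conversion step matters for structures whose residue numbering is shifted.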

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.