Protein: FOXG1_HUMAN (P55316)

Summary

This is the summary of UniProt entry FOXG1_HUMAN (P55316).

Description: Forkhead box protein G1
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
Length: 489 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Source          Domain    Start  End
disorder        n/a           1  181
low_complexity  n/a          43   86
low_complexity  n/a          94  126
low_complexity  n/a         123  142
low_complexity  n/a         144  181
Pfam            Forkhead    180  266
disorder        n/a         268  272
disorder        n/a         277  280
disorder        n/a         327  329
low_complexity  n/a         375  386
low_complexity  n/a         425  450
disorder        n/a         427  455
disorder        n/a         474  475
disorder        n/a         485  489
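
The coordinates in the table above are 1-based and inclusive, so a region's length is `end - start + 1`. The following is a minimal Python sketch, not part of Pfam itself, that parses the rows as plain text and computes the Forkhead domain length and the number of residues covered by each annotation source. The helper names (`parse_rows`, `covered_residues`) are hypothetical and chosen only for illustration.

```python
# Rows copied from the table above: source, domain, start, end (1-based, inclusive).
ROWS = """\
disorder n/a 1 181
low_complexity n/a 43 86
low_complexity n/a 94 126
low_complexity n/a 123 142
low_complexity n/a 144 181
Pfam Forkhead 180 266
disorder n/a 268 272
disorder n/a 277 280
disorder n/a 327 329
low_complexity n/a 375 386
low_complexity n/a 425 450
disorder n/a 427 455
disorder n/a 474 475
disorder n/a 485 489
"""


def parse_rows(text):
    """Turn whitespace-separated rows into (source, domain, start, end) tuples."""
    regions = []
    for line in text.strip().splitlines():
        source, domain, start, end = line.split()
        regions.append((source, domain, int(start), int(end)))
    return regions


def covered_residues(intervals):
    """Count residues covered by inclusive intervals, merging any overlaps."""
    total = 0
    current_start = current_end = None
    for start, end in sorted(intervals):
        if current_end is not None and start <= current_end:
            # Overlaps the interval being built: extend it.
            current_end = max(current_end, end)
        else:
            if current_end is not None:
                total += current_end - current_start + 1
            current_start, current_end = start, end
    if current_end is not None:
        total += current_end - current_start + 1
    return total


regions = parse_rows(ROWS)

# Length of the single Pfam domain listed in the table.
forkhead_length = next(e - s + 1 for src, dom, s, e in regions if dom == "Forkhead")

# Residues covered per annotation source (overlapping rows counted once).
coverage = {
    source: covered_residues([(s, e) for src, _, s, e in regions if src == source])
    for source in {r[0] for r in regions}
}

print("Forkhead length:", forkhead_length)
for source, count in sorted(coverage.items()):
    print(source, count)
```

Overlapping rows are merged before counting because two of the low_complexity rows (94 to 126 and 123 to 142) share residues.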

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession P55316. Pfam stores its own copy of this sequence and updates it only with each new Pfam release, so the stored sequence may differ from the one currently held by UniProt.

Sequence:
  1 MLDMGDRKEV KMIPKSSFSI NSLVPEAVQN DNHHASHGHH NSHHPQHHHH  50
 51 HHHHHHHPPP PAPQPPPPPQ QQQPPPPPPP APQPPQTRGA PAADDDKGPQ 100
101 QLLLPPPPPP PPAAALDGAK ADGLGGKGEP GGGPGELAPV GPDEKEKGAG 150
151 AGGEEKKGAG EGGKDGEGGK EGEKKNGKYE KPPFSYNALI MMAIRQSPEK 200
201 RLTLNGIYEF IMKNFPYYRE NKQGWQNSIR HNLSLNKCFV KVPRHYDDPG 250
251 KGNYWMLDPS SDDVFIGGTT GKLRRRSTTS RAKLAFKRGA RLTSTGLTFM 300
301 DRAGSLYWPM SPFLSLHHPR ASSTLSYNGT TSAYPSHPMP YSSVLTQNSL 350
351 GNNHSFSTAN GLSVDRLVNG EIPYATHHLT AAALAASVPC GLSVPCSGTY 400
401 SLNPCSVNLL AGQTSYFFPH VPHPSMTSQS STSMSARAAS SSTSPQAPST 450
451 LPCESLRPSL PSFTTGLSGG LSDYFTHQNQ GSSSNPLIH             489

Checksums:
CRC64:897945F9CE4F2A71
MD5:aa66076240d1f7d27d4bd9374595f9f9
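
As an illustration of how the sequence and checksum listed here might be checked locally, the sketch below strips the formatting from the sequence, confirms the stated length of 489 residues, computes an MD5 digest with Python's standard `hashlib`, and slices out the Forkhead region (residues 180 to 266, 1-based inclusive). It assumes the listed MD5 was computed over the plain uppercase sequence with no whitespace; that is an assumption, not something this page states, so the comparison result is reported rather than taken for granted. The helper `region` is a hypothetical name.

```python
import hashlib

# Sequence blocks copied from this page; whitespace is removed below.
FORMATTED = """
MLDMGDRKEV KMIPKSSFSI NSLVPEAVQN DNHHASHGHH NSHHPQHHHH
HHHHHHHPPP PAPQPPPPPQ QQQPPPPPPP APQPPQTRGA PAADDDKGPQ
QLLLPPPPPP PPAAALDGAK ADGLGGKGEP GGGPGELAPV GPDEKEKGAG
AGGEEKKGAG EGGKDGEGGK EGEKKNGKYE KPPFSYNALI MMAIRQSPEK
RLTLNGIYEF IMKNFPYYRE NKQGWQNSIR HNLSLNKCFV KVPRHYDDPG
KGNYWMLDPS SDDVFIGGTT GKLRRRSTTS RAKLAFKRGA RLTSTGLTFM
DRAGSLYWPM SPFLSLHHPR ASSTLSYNGT TSAYPSHPMP YSSVLTQNSL
GNNHSFSTAN GLSVDRLVNG EIPYATHHLT AAALAASVPC GLSVPCSGTY
SLNPCSVNLL AGQTSYFFPH VPHPSMTSQS STSMSARAAS SSTSPQAPST
LPCESLRPSL PSFTTGLSGG LSDYFTHQNQ GSSSNPLIH
"""

# Join the 10-residue blocks into one continuous string.
SEQ = "".join(FORMATTED.split())

# MD5 value as listed on this page.
LISTED_MD5 = "aa66076240d1f7d27d4bd9374595f9f9"

# Assumption: the checksum covers the bare uppercase sequence.
md5_hex = hashlib.md5(SEQ.encode("ascii")).hexdigest()
md5_matches = md5_hex == LISTED_MD5


def region(seq, start, end):
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


forkhead = region(SEQ, 180, 266)

print("length:", len(SEQ))
print("computed MD5:", md5_hex, "matches listed value:", md5_matches)
print("Forkhead region:", forkhead)
```

The slice uses `start - 1` because Python strings are 0-based while the coordinates on this page are 1-based.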

Structures

For sequences that have a structure in the Protein Data Bank, we use the mapping between the UniProt, PDB and Pfam coordinate systems from the PDBe SIFTS project to map Pfam domains onto three-dimensional structures. The table below shows the mapping between Pfam domains, this UniProt entry and a corresponding three-dimensional structure.

Pfam family   UniProt residues   PDB ID   PDB chain ID   PDB residues
Forkhead      180 - 266          7CBY     C              180 - 266
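
To make the idea of mapping between coordinate systems concrete, here is a small Python sketch that converts a UniProt residue number to the corresponding PDB residue number for the single segment listed in the table. It assumes the numbering inside the segment is linear and gapless, which holds trivially for this row (both ranges are 180 to 266) but is not guaranteed for SIFTS mappings in general. The `Segment` type and `unp_to_pdb` function are hypothetical names and not part of any SIFTS or Pfam API.

```python
from typing import NamedTuple, Optional


class Segment(NamedTuple):
    """One row of the mapping table (all coordinates 1-based, inclusive)."""
    pdb_id: str
    chain: str
    unp_start: int
    unp_end: int
    pdb_start: int
    pdb_end: int


# The single mapping row shown in the table.
FORKHEAD_7CBY = Segment("7CBY", "C", 180, 266, 180, 266)


def unp_to_pdb(seg: Segment, unp_pos: int) -> Optional[int]:
    """Map a UniProt position to a PDB residue number, or None if outside the segment.

    Assumes a constant offset across the whole segment (no gaps or insertions).
    """
    if not seg.unp_start <= unp_pos <= seg.unp_end:
        return None
    return seg.pdb_start + (unp_pos - seg.unp_start)


print(unp_to_pdb(FORKHEAD_7CBY, 200))
print(unp_to_pdb(FORKHEAD_7CBY, 100))
```

Returning `None` for positions outside the segment keeps residues that have no structural coverage distinct from residues that are mapped.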

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.