Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
0  structures 1  species 0  interactions 1  sequence 1  architecture

Protein: HME2_HUMAN (P19622)

Summary

This is the summary of UniProt entry HME2_HUMAN (P19622).

Description: Homeobox protein engrailed-2
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
Length: 333 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Download the data used to generate the domain graphic in JSON format.

Show or hide the data used to generate the graphic in JSON format.

Source Domain Start End
disorder n/a 1 77
low_complexity n/a 22 37
disorder n/a 81 207
low_complexity n/a 89 127
low_complexity n/a 132 146
low_complexity n/a 146 157
low_complexity n/a 164 178
low_complexity n/a 184 199
low_complexity n/a 222 247
disorder n/a 223 257
Pfam Homeodomain 245 301
disorder n/a 262 279
disorder n/a 301 307
Pfam Engrail_1_C_sig 302 332
disorder n/a 309 333

Show or hide domain scores.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession P19622. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MEENDPKPGE AAAAVEGQRQ PESSPGGGSG GGGGSSPGEA DTGRRRALML
50
51
PAVLQAPGNH QHPHRITNFF IDNILRPEFG RRKDAGTCCA GAGGGRGGGA
100
101
GGEGGASGAE GGGGAGGSEQ LLGSGSREPR QNPPCAPGAG GPLPAAGSDS
150
151
PGDGEGGSKT LSLHGGAKKG GDPGGPLDGS LKARGLGGGD LSVSSDSDSS
200
201
QAGANLGAQP MLWPAWVYCT RYSDRPSSGP RSRKPKKKNP NKEDKRPRTA
250
251
FTAEQLQRLK AEFQTNRYLT EQRRQSLAQE LSLNESQIKI WFQNKRAKIK
300
301
KATGNKNTLA VHLMAQGLYN HSTTAKEGKS DSE                  
333
 

Show the unformatted sequence.

Checksums:
CRC64:ACF5399E383D6257
MD5:dab257ab2a98838faad9b0fb8e91e801

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.