Protein: TF2H2_HUMAN (Q13888)

Summary

This is the summary of UniProt entry TF2H2_HUMAN (Q13888).

Description: General transcription factor IIH subunit 2
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
Length: 395 amino acids
Reference Proteome: ✓

Please note: at the start of each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview shown here. Although some UniProt entries may be removed after a Pfam release, they will not be removed from Pfam until the next Pfam data release.

Pfam domains

Source          Domain  Start  End
disorder        n/a         1    7
Pfam            Ssl1       64  255
disorder        n/a       234  235
low_complexity  n/a       236  244
Pfam            C1_4      344  387
low_complexity  n/a       365  374
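
The start and end columns can be used to compute region lengths and to see which disorder or low-complexity regions fall inside a Pfam domain. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and the names `Region`, `overlaps` and `features_within` are hypothetical helpers, not part of any Pfam tool.

```python
from typing import List, NamedTuple


class Region(NamedTuple):
    source: str
    name: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive

    @property
    def length(self) -> int:
        # Inclusive coordinates, so add 1.
        return self.end - self.start + 1


# Rows copied from the domain table above.
REGIONS = [
    Region("disorder", "n/a", 1, 7),
    Region("Pfam", "Ssl1", 64, 255),
    Region("disorder", "n/a", 234, 235),
    Region("low_complexity", "n/a", 236, 244),
    Region("Pfam", "C1_4", 344, 387),
    Region("low_complexity", "n/a", 365, 374),
]


def overlaps(a: Region, b: Region) -> bool:
    """True if two inclusive intervals share at least one residue."""
    return a.start <= b.end and b.start <= a.end


def features_within(domain: Region, regions: List[Region]) -> List[Region]:
    """Non-Pfam regions that overlap the given Pfam domain."""
    return [r for r in regions if r.source != "Pfam" and overlaps(domain, r)]


for domain in (r for r in REGIONS if r.source == "Pfam"):
    inside = features_within(domain, REGIONS)
    spans = ", ".join(f"{r.source} {r.start}-{r.end}" for r in inside)
    print(f"{domain.name}: {domain.length} residues; overlapping: {spans}")
```

Under these assumptions, Ssl1 spans 192 residues and C1_4 spans 44 residues.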

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession Q13888. This sequence is stored in the Pfam database and is updated only with each new Pfam release, so the sequence we store may differ from that stored by UniProt.

Sequence:
  1 MDEEPERTKR WEGGYERTWE ILKEDESGSL KATIEDILFK AKRKRVFEHH 50
 51 GQVRLGMMRH LYVVVDGSRT MEDQDLKPNR LTCTLKLLEY FVEEYFDQNP 100
101 ISQIGIIVTK SKRAEKLTEL SGNPRKHITS LKKAVDMTCH GEPSLYNSLS 150
151 IAMQTLKHMP GHTSREVLII FSSLTTCDPS NIYDLIKTLK AAKIRVSVIG 200
201 LSAEVRVCTV LARETGGTYH VILDESHYKE LLTHHVSPPP ASSSSECSLI 250
251 RMGFPQHTIA SLSDQDAKPS FSMAHLDGNT EPGLTLGGYF CPQCRAKYCE 300
301 LPVECKICGL TLVSAPHLAR SYHHLFPLDA FQEIPLEEYN GERFCYGCQG 350
351 ELKDQHVYVC AVCQNVFCVD CDVFVHDSLH CCPGCIHKIP APSGV 395

Checksums:
CRC64:56D1BD8841288739
MD5:ff1049a93a450853791e023ddcd133f2
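
A checksum like the MD5 above can be recomputed locally to see whether a copy of the sequence matches the stored one. The sketch below assumes the MD5 is taken over the uppercase one-letter sequence with all whitespace and position numbers removed; that convention is an assumption here, and the helper names `clean_sequence` and `sequence_md5` are hypothetical. The CRC64 value is not recomputed.

```python
import hashlib

REPORTED_MD5 = "ff1049a93a450853791e023ddcd133f2"  # value listed on this page

# Sequence as displayed on this page, in blocks of ten residues.
FORMATTED_SEQUENCE = """
MDEEPERTKR WEGGYERTWE ILKEDESGSL KATIEDILFK AKRKRVFEHH
GQVRLGMMRH LYVVVDGSRT MEDQDLKPNR LTCTLKLLEY FVEEYFDQNP
ISQIGIIVTK SKRAEKLTEL SGNPRKHITS LKKAVDMTCH GEPSLYNSLS
IAMQTLKHMP GHTSREVLII FSSLTTCDPS NIYDLIKTLK AAKIRVSVIG
LSAEVRVCTV LARETGGTYH VILDESHYKE LLTHHVSPPP ASSSSECSLI
RMGFPQHTIA SLSDQDAKPS FSMAHLDGNT EPGLTLGGYF CPQCRAKYCE
LPVECKICGL TLVSAPHLAR SYHHLFPLDA FQEIPLEEYN GERFCYGCQG
ELKDQHVYVC AVCQNVFCVD CDVFVHDSLH CCPGCIHKIP APSGV
"""


def clean_sequence(text: str) -> str:
    """Keep letters only (drops spaces, newlines, digits) and uppercase them."""
    return "".join(ch for ch in text if ch.isalpha()).upper()


def sequence_md5(sequence: str) -> str:
    """Hex MD5 digest of the sequence string (assumed checksum convention)."""
    return hashlib.md5(sequence.encode("ascii")).hexdigest()


sequence = clean_sequence(FORMATTED_SEQUENCE)
digest = sequence_md5(sequence)
print(f"length: {len(sequence)}")
print(f"computed MD5: {digest}")
print(f"matches reported MD5: {digest == REPORTED_MD5}")
```

A mismatch would suggest either a different checksum convention or a sequence that differs from the stored snapshot.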

Structures

For sequences that have a structure in the Protein Data Bank, we use the mapping between the UniProt, PDB and Pfam coordinate systems from the PDBe SIFTS project to map Pfam domains onto three-dimensional structures. The table below shows the mapping between Pfam domains, this UniProt entry and the corresponding three-dimensional structures.

Pfam family  UniProt residues  PDB ID  PDB chain ID  PDB residues
C1_4         344 - 386         1Z60    A             344 - 386
C1_4         344 - 387         5O85    B             344 - 387
C1_4         344 - 387         5O85    D             344 - 387
C1_4         344 - 387         6NMI    E             344 - 387
C1_4         344 - 387         6O9L    6             344 - 387
C1_4         344 - 387         6O9M    6             344 - 387
C1_4         344 - 387         6RO4    D             344 - 387
Ssl1         64 - 240          5OF4    E             64 - 240
Ssl1         64 - 240          6RO4    D             64 - 240
Ssl1         64 - 241          5IVW    n/a           64 - 241
Ssl1         64 - 241          5IY6    n/a           64 - 241
Ssl1         64 - 241          5IY7    n/a           64 - 241
Ssl1         64 - 241          5IY8    n/a           64 - 241
Ssl1         64 - 241          5IY9    n/a           64 - 241
Ssl1         64 - 255          6NMI    E             64 - 255
Ssl1         64 - 255          6O9L    6             64 - 255
Ssl1         64 - 255          6O9M    6             64 - 255
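
Each row of the structure table pairs a UniProt residue range with a PDB residue range. As a rough illustration of how such a row can be used to translate a UniProt position into a PDB residue number, the sketch below treats a row as one contiguous segment with a constant offset. This is a simplifying assumption: real SIFTS mappings are per residue and may contain gaps or insertion codes. `Segment` and `uniprot_to_pdb` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Segment:
    pfam_family: str
    pdb_id: str
    chain: str
    uniprot_start: int  # inclusive
    uniprot_end: int    # inclusive
    pdb_start: int      # inclusive
    pdb_end: int        # inclusive


def uniprot_to_pdb(segment: Segment, position: int) -> Optional[int]:
    """Map a UniProt position to a PDB residue number within one segment.

    Assumes the segment is contiguous with a constant offset; returns None
    if the position lies outside the segment.
    """
    if not segment.uniprot_start <= position <= segment.uniprot_end:
        return None
    return segment.pdb_start + (position - segment.uniprot_start)


# Two rows taken from the table above.
c1_4_1z60 = Segment("C1_4", "1Z60", "A", 344, 386, 344, 386)
ssl1_5of4 = Segment("Ssl1", "5OF4", "E", 64, 240, 64, 240)

print(uniprot_to_pdb(c1_4_1z60, 350))  # inside the C1_4 segment of 1Z60
print(uniprot_to_pdb(c1_4_1z60, 387))  # outside: 1Z60 covers 344 - 386 only
print(uniprot_to_pdb(ssl1_5of4, 64))   # first residue of the Ssl1 segment
```

For the rows on this page the UniProt and PDB numbering coincide, so the offset is zero; the function still handles the general case of a shifted PDB numbering.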

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.