
Protein: TCF20_HUMAN (Q9UGU0)

Summary

This is the summary of UniProt entry TCF20_HUMAN (Q9UGU0).

Description: Transcription factor 20
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
Length: 1960 amino acids
Reference Proteome: ✓

Please note: at the start of each new Pfam data release, we take a copy of the UniProt sequence database, and this snapshot forms the basis of the overview shown here. If a UniProt entry is removed after a Pfam release, it will remain in Pfam until the next Pfam data release.

Pfam domains


Source Domain Start End
disorder n/a 1 798
low_complexity n/a 41 74
low_complexity n/a 97 116
low_complexity n/a 163 191
low_complexity n/a 235 258
low_complexity n/a 305 321
low_complexity n/a 367 387
low_complexity n/a 414 427
low_complexity n/a 482 493
low_complexity n/a 652 685
disorder n/a 803 805
disorder n/a 815 865
disorder n/a 871 873
disorder n/a 878 1349
low_complexity n/a 983 1003
low_complexity n/a 1018 1027
disorder n/a 1351 1620
low_complexity n/a 1503 1515
low_complexity n/a 1548 1564
low_complexity n/a 1573 1588
disorder n/a 1654 1687
disorder n/a 1726 1799
low_complexity n/a 1764 1776
disorder n/a 1806 1810
disorder n/a 1813 1841
low_complexity n/a 1832 1846
Pfam zf-HC5HC2H 1855 1933
low_complexity n/a 1928 1942
disorder n/a 1949 1960
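
The rows above can be processed programmatically. The following is a minimal sketch in Python, assuming the table is available as whitespace-separated text in the four-column layout shown (Source, Domain, Start, End) and that coordinates are 1-based and inclusive. The names `Region`, `parse_regions`, and `covered_residues` are illustrative only and are not part of any Pfam tooling.

```python
from typing import List, NamedTuple


class Region(NamedTuple):
    source: str  # e.g. "Pfam", "disorder", "low_complexity"
    domain: str  # domain name, or "n/a" for non-domain regions
    start: int   # assumed 1-based, inclusive
    end: int     # assumed 1-based, inclusive

    @property
    def length(self) -> int:
        return self.end - self.start + 1


def parse_regions(text: str) -> List[Region]:
    """Parse 'Source Domain Start End' rows, skipping the header and blanks."""
    regions = []
    for line in text.splitlines():
        fields = line.split()
        # Keep only rows with four fields whose last two are integers.
        if len(fields) != 4 or not (fields[2].isdigit() and fields[3].isdigit()):
            continue
        regions.append(Region(fields[0], fields[1], int(fields[2]), int(fields[3])))
    return regions


def covered_residues(regions: List[Region], source: str) -> int:
    """Count distinct residue positions covered by regions from one source."""
    covered = set()
    for region in regions:
        if region.source == source:
            covered.update(range(region.start, region.end + 1))
    return len(covered)


# A few rows copied from the table above.
rows = """Source Domain Start End
disorder n/a 1 798
disorder n/a 803 805
Pfam zf-HC5HC2H 1855 1933
"""
regions = parse_regions(rows)
pfam = [r for r in regions if r.source == "Pfam"]
print(pfam[0].domain, pfam[0].length)          # zf-HC5HC2H 79
print(covered_residues(regions, "disorder"))   # 801
```

Using a set of positions keeps the coverage count correct even when regions from the same source overlap.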


Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession Q9UGU0. This sequence is stored in the Pfam database and is updated only with each new Pfam release, so the sequence we store may differ from the one currently stored by UniProt.

Sequence:
   1 MQSFREQSSY HGNQQSYPQE VHGSSRLEEF SPRQAQMFQN FGGTGGSSGS 50
  51 SGSGSGGGRR GAAAAAAAMA SETSGHQGYQ GFRKEAGDFY YMAGNKDPVT 100
 101 TGTPQPPQRR PSGPVQSYGP PQGSSFGNQY GSEGHVGQFQ AQHSGLGGVS 150
 151 HYQQDYTGPF SPGSAQYQQQ ASSQQQQQQV QQLRQQLYQS HQPLPQATGQ 200
 201 PASSSSHLQP MQRPSTLPSS AAGYQLRVGQ FGQHYQSSAS SSSSSSFPSP 250
 251 QRFSQSGQSY DGSYNVNAGS QYEGHNVGSN AQAYGTQSNY SYQPQSMKNF 300
 301 EQAKIPQGTQ QGQQQQQPQQ QQHPSQHVMQ YTNAATKLPL QSQVGQYNQP 350
 351 EVPVRSPMQF HQNFSPISNP SPAASVVQSP SCSSTPSPLM QTGENLQCGQ 400
 401 GSVPMGSRNR ILQLMPQLSP TPSMMPSPNS HAAGFKGFGL EGVPEKRLTD 450
 451 PGLSSLSALS TQVANLPNTV QHMLLSDALT PQKKTSKRPS SSKKADSCTN 500
 501 SEGSSQPEEQ LKSPMAESLD GGCSSSSEDQ GERVRQLSGQ STSSDTTYKG 550
 551 GASEKAGSSP AQGAQNEPPR LNASPAAREE ATSPGAKDMP LSSDGNPKVN 600
 601 EKTVGVIVSR EAMTGRVEKP GGQDKGSQED DPAATQRPPS NGGAKETSHA 650
 651 SLPQPEPPGG GGSKGNKNGD NNSNHNGEGN GQSGHSAAGP GFTSRTEPSK 700
 701 SPGSLRYSYK DSFGSAVPRN VSGFPQYPTG QEKGDFTGHG ERKGRNEKFP 750
 751 SLLQEVLQGY HHHPDRRYSR STQEHQGMAG SLEGTTRPNV LVSQTNELAS 800
 801 RGLLNKSIGS LLENPHWGPW ERKSSSTAPE MKQINLTDYP IPRKFEIEPQ 850
 851 SSAHEPGGSL SERRSVICDI SPLRQIVRDP GAHSLGHMSA DTRIGRNDRL 900
 901 NPTLSQSVIL PGGLVSMETK LKSQSGQIKE EDFEQSKSQA SFNNKKSGDH 950
 951 CHPPSIKHES YRGNASPGAA THDSLSDYGP QDSRPTPMRR VPGRVGGREG 1000
1001 MRGRSPSQYH DFAEKLKMSP GRSRGPGGDP HHMNPHMTFS ERANRSSLHT 1050
1051 PFSPNSETLA SAYHANTRAH AYGDPNAGLN SQLHYKRQMY QQQPEEYKDW 1100
1101 SSGSAQGVIA AAQHRQEGPR KSPRQQQFLD RVRSPLKNDK DGMMYGPPVG 1150
1151 TYHDPSAQEA GRCLMSSDGL PNKGMELKHG SQKLQESCWD LSRQTSPAKS 1200
1201 SGPPGMSSQK RYGPPHETDG HGLAEATQSS KPGSVMLRLP GQEDHSSQNP 1250
1251 LIMRRRVRSF ISPIPSKRQS QDVKNSSTED KGRLLHSSKE GADKAFNSYA 1300
1301 HLSHSQDIKS IPKRDSSKDL PSPDSRNCPA VTLTSPAKTK ILPPRKGRGL 1350
1351 KLEAIVQKIT SPNIRRSASS NSAEAGGDTV TLDDILSLKS GPPEGGSVAV 1400
1401 QDADIEKRKG EVASDLVSPA NQELHVEKPL PRSSEEWRGS VDDKVKTETH 1450
1451 AETVTAGKEP PGAMTSTTSQ KPGSNQGRPD GSLGGTAPLI FPDSKNVPPV 1500
1501 GILAPEANPK AEEKENDTVT ISPKQEGFPP KGYFPSGKKK GRPIGSVNKQ 1550
1551 KKQQQPPPPP PQPPQIPEGS ADGEPKPKKQ RQRRERRKPG AQPRKRKTKQ 1600
1601 AVPIVEPQEP EIKLKYATQP LDKTDAKNKS FYPYIHVVNK CELGAVCTII 1650
1651 NAEEEEQTKL VRGRKGQRSL TPPPSSTESK ALPASSFMLQ GPVVTESSVM 1700
1701 GHLVCCLCGK WASYRNMGDL FGPFYPQDYA ATLPKNPPPK RATEMQSKVK 1750
1751 VRHKSASNGS KTDTEEEEEQ QQQQKEQRSL AAHPRFKRRH RSEDCGGGPR 1800
1801 SLSRGLPCKK AATEGSSEKT VLDSKPSVPT TSEGGPELEL QIPELPLDSN 1850
1851 EFWVHEGCIL WANGIYLVCG RLYGLQEALE IAREMKCSHC QEAGATLGCY 1900
1901 NKGCSFRYHY PCAIDADCLL HEENFSVRCP KHKPPLPCPL PPLQNKTAKG 1950
1951 SLSTEQSERG 1960


Checksums:
CRC64:D3E34B1FAA6ED06F
MD5:efdbe822fe3daf7808e1a8bee10f83b4
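
The MD5 value can be compared against a local copy of the sequence. The following is a minimal sketch in Python, assuming the digest is computed over the bare uppercase residue string with position numbers and whitespace removed. That is an assumption about how the checksum was produced, not something stated on this page. The function names `clean_sequence` and `md5_of_sequence` are illustrative, and the CRC64 value is not covered here.

```python
import hashlib


def clean_sequence(formatted: str) -> str:
    """Drop position numbers and whitespace from a formatted sequence block."""
    return "".join(ch for ch in formatted if ch.isalpha()).upper()


def md5_of_sequence(formatted: str) -> str:
    """Hex MD5 digest of the cleaned residue string (assumed checksum input)."""
    return hashlib.md5(clean_sequence(formatted).encode("ascii")).hexdigest()


# First 50 residues only, with position markers as they appear in the listing.
block = "1 MQSFREQSSY HGNQQSYPQE VHGSSRLEEF SPRQAQMFQN FGGTGGSSGS 50"
seq = clean_sequence(block)
print(len(seq))                # 50
print(md5_of_sequence(block))  # digest of this 50-residue fragment only
```

To check the full entry, pass the complete sequence listing to `md5_of_sequence` and compare the result with the MD5 value above; the cleaned string should also have the 1960 residues given under Length.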

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.