Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
0  structures 1  species 0  interactions 1  sequence 1  architecture

Protein: THYG_HUMAN (P01266)

Summary

This is the summary of UniProt entry THYG_HUMAN (P01266).

Description: Thyroglobulin
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
View Pfam proteome data.
Length: 2768 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Download the data used to generate the domain graphic in JSON format.

Show or hide the data used to generate the graphic in JSON format.

Source Domain Start End
sig_p n/a 1 19
Pfam Thyroglobulin_1 34 92
Pfam Thyroglobulin_1 96 160
low_complexity n/a 128 141
Pfam Thyroglobulin_1 171 251
Pfam Thyroglobulin_1 301 358
disorder n/a 348 360
disorder n/a 521 527
disorder n/a 529 544
low_complexity n/a 566 580
Pfam Thyroglobulin_1 597 658
Pfam Thyroglobulin_1 662 726
disorder n/a 669 670
disorder n/a 717 718
Pfam Thyroglobulin_1 730 801
disorder n/a 848 850
disorder n/a 911 919
Pfam Thyroglobulin_1 1006 1073
Pfam Thyroglobulin_1 1077 1149
disorder n/a 1088 1092
Pfam Thyroglobulin_1 1149 1210
disorder n/a 1201 1203
Pfam Ephrin_rec_like 1465 1510
low_complexity n/a 1721 1734
disorder n/a 1831 1844
disorder n/a 1846 1847
Pfam COesterase 2197 2718
disorder n/a 2262 2270
low_complexity n/a 2336 2351
disorder n/a 2731 2746
disorder n/a 2748 2754
disorder n/a 2762 2763
disorder n/a 2766 2768

Show or hide domain scores.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession P01266. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MALVLEIFTL LASICWVSAN IFEYQVDAQP LRPCELQRET AFLKQADYVP
50
51
QCAEDGSFQT VQCQNDGRSC WCVGANGSEV LGSRQPGRPV ACLSFCQLQK
100
101
QQILLSGYIN STDTSYLPQC QDSGDYAPVQ CDVQQVQCWC VDAEGMEVYG
150
151
TRQLGRPKRC PRSCEIRNRR LLHGVGDKSP PQCSAEGEFM PVQCKFVNTT
200
201
DMMIFDLVHS YNRFPDAFVT FSSFQRRFPE VSGYCHCADS QGRELAETGL
250
251
ELLLDEIYDT IFAGLDLPST FTETTLYRIL QRRFLAVQSV ISGRFRCPTK
300
301
CEVERFTATS FGHPYVPSCR RNGDYQAVQC QTEGPCWCVD AQGKEMHGTR
350
351
QQGEPPSCAE GQSCASERQQ ALSRLYFGTS GYFSQHDLFS SPEKRWASPR
400
401
VARFATSCPP TIKELFVDSG LLRPMVEGQS QQFSVSENLL KEAIRAIFPS
450
451
RGLARLALQF TTNPKRLQQN LFGGKFLVNV GQFNLSGALG TRGTFNFSQF
500
501
FQQLGLASFL NGGRQEDLAK PLSVGLDSNS STGTPEAAKK DGTMNKPTVG
550
551
SFGFEINLQE NQNALKFLAS LLELPEFLLF LQHAISVPED VARDLGDVME
600
601
TVLSSQTCEQ TPERLFVPSC TTEGSYEDVQ CFSGECWCVN SWGKELPGSR
650
651
VRGGQPRCPT DCEKQRARMQ SLMGSQPAGS TLFVPACTSE GHFLPVQCFN
700
701
SECYCVDAEG QAIPGTRSAI GKPKKCPTPC QLQSEQAFLR TVQALLSNSS
750
751
MLPTLSDTYI PQCSTDGQWR QVQCNGPPEQ VFELYQRWEA QNKGQDLTPA
800
801
KLLVKIMSYR EAASGNFSLF IQSLYEAGQQ DVFPVLSQYP SLQDVPLAAL
850
851
EGKRPQPREN ILLEPYLFWQ ILNGQLSQYP GSYSDFSTPL AHFDLRNCWC
900
901
VDEAGQELEG MRSEPSKLPT CPGSCEEAKL RVLQFIRETE EIVSASNSSR
950
951
FPLGESFLVA KGIRLRNEDL GLPPLFPPRE AFAEQFLRGS DYAIRLAAQS
1000
1001
TLSFYQRRRF SPDDSAGASA LLRSGPYMPQ CDAFGSWEPV QCHAGTGHCW
1050
1051
CVDEKGGFIP GSLTARSLQI PQCPTTCEKS RTSGLLSSWK QARSQENPSP
1100
1101
KDLFVPACLE TGEYARLQAS GAGTWCVDPA SGEELRPGSS SSAQCPSLCN
1150
1151
VLKSGVLSRR VSPGYVPACR AEDGGFSPVQ CDQAQGSCWC VMDSGEEVPG
1200
1201
TRVTGGQPAC ESPRCPLPFN ASEVVGGTIL CETISGPTGS AMQQCQLLCR
1250
1251
QGSWSVFPPG PLICSLESGR WESQLPQPRA CQRPQLWQTI QTQGHFQLQL
1300
1301
PPGKMCSADY ADLLQTFQVF ILDELTARGF CQIQVKTFGT LVSIPVCNNS
1350
1351
SVQVGCLTRE RLGVNVTWKS RLEDIPVASL PDLHDIERAL VGKDLLGRFT
1400
1401
DLIQSGSFQL HLDSKTFPAE TIRFLQGDHF GTSPRTWFGC SEGFYQVLTS
1450
1451
EASQDGLGCV KCPEGSYSQD EECIPCPVGF YQEQAGSLAC VPCPVGRTTI
1500
1501
SAGAFSQTHC VTDCQRNEAG LQCDQNGQYR ASQKDRGSGK AFCVDGEGRR
1550
1551
LPWWETEAPL EDSQCLMMQK FEKVPESKVI FDANAPVAVR SKVPDSEFPV
1600
1601
MQCLTDCTED EACSFFTVST TEPEISCDFY AWTSDNVACM TSDQKRDALG
1650
1651
NSKATSFGSL RCQVKVRSHG QDSPAVYLKK GQGSTTTLQK RFEPTGFQNM
1700
1701
LSGLYNPIVF SASGANLTDA HLFCLLACDR DLCCDGFVLT QVQGGAIICG
1750
1751
LLSSPSVLLC NVKDWMDPSE AWANATCPGV TYDQESHQVI LRLGDQEFIK
1800
1801
SLTPLEGTQD TFTNFQQVYL WKDSDMGSRP ESMGCRKDTV PRPASPTEAG
1850
1851
LTTELFSPVD LNQVIVNGNQ SLSSQKHWLF KHLFSAQQAN LWCLSRCVQE
1900
1901
HSFCQLAEIT ESASLYFTCT LYPEAQVCDD IMESNAQGCR LILPQMPKAL
1950
1951
FRKKVILEDK VKNFYTRLPF QKLMGISIRN KVPMSEKSIS NGFFECERRC
2000
2001
DADPCCTGFG FLNVSQLKGG EVTCLTLNSL GIQMCSEENG GAWRILDCGS
2050
2051
PDIEVHTYPF GWYQKPIAQN NAPSFCPLVV LPSLTEKVSL DSWQSLALSS
2100
2101
VVVDPSIRHF DVAHVSTAAT SNFSAVRDLC LSECSQHEAC LITTLQTQPG
2150
2151
AVRCMFYADT QSCTHSLQGQ NCRLLLREEA THIYRKPGIS LLSYEASVPS
2200
2201
VPISTHGRLL GRSQAIQVGT SWKQVDQFLG VPYAAPPLAE RRFQAPEPLN
2250
2251
WTGSWDASKP RASCWQPGTR TSTSPGVSED CLYLNVFIPQ NVAPNASVLV
2300
2301
FFHNTMDREE SEGWPAIDGS FLAAVGNLIV VTASYRVGVF GFLSSGSGEV
2350
2351
SGNWGLLDQV AALTWVQTHI RGFGGDPRRV SLAADRGGAD VASIHLLTAR
2400
2401
ATNSQLFRRA VLMGGSALSP AAVISHERAQ QQAIALAKEV SCPMSSSQEV
2450
2451
VSCLRQKPAN VLNDAQTKLL AVSGPFHYWG PVIDGHFLRE PPARALKRSL
2500
2501
WVEVDLLIGS SQDDGLINRA KAVKQFEESR GRTSSKTAFY QALQNSLGGE
2550
2551
DSDARVEAAA TWYYSLEHST DDYASFSRAL ENATRDYFII CPIIDMASAW
2600
2601
AKRARGNVFM YHAPENYGHG SLELLADVQF ALGLPFYPAY EGQFSLEEKS
2650
2651
LSLKIMQYFS HFIRSGNPNY PYEFSRKVPT FATPWPDFVP RAGGENYKEF
2700
2701
SELLPNRQGL KKADCSFWSK YISSLKTSAD GAKGGQSAES EEEELTAGSG
2750
2751
LREDLLSLQE PGSKTYSK                                   
2768
 

Show the unformatted sequence.

Checksums:
CRC64:69A87D935F1BAA72
MD5:ef0ede8f025be4dc8b407c6d030e82e7

TreeFam

Below is a phylogenetic tree of animal genes, with ortholog and paralog assignments, from TreeFam.