
Protein: D4CJ49_9FIRM (D4CJ49)

Summary

This is the summary of UniProt entry D4CJ49_9FIRM (D4CJ49).

Description: Glycosyl hydrolase family 25 {ECO:0000313|EMBL:EFE92475.1}
Source organism: Oribacterium sp. oral taxon 078 str. F0262 (NCBI taxonomy ID 608534)
Length: 510 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains


Source          Domain          Start  End
disorder        n/a                 1  105
low_complexity  n/a                15   41
low_complexity  n/a                43   60
disorder        n/a               118  121
Pfam            Glyco_hydro_25    122  301
disorder        n/a               131  132
disorder        n/a               320  368
low_complexity  n/a               323  333
low_complexity  n/a               338  362
Pfam            CW_binding_1      389  407
Pfam            CW_binding_1      409  427
Pfam            CW_binding_1      429  447
Pfam            CW_binding_1      449  467
Pfam            CW_binding_1      469  487
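
The region table above can be read programmatically. The following is a minimal sketch, assuming the rows are whitespace-separated with the four columns Source, Domain, Start and End, and that coordinates are 1-based and inclusive. The names `Region` and `parse_regions` are hypothetical helpers for illustration, not part of any Pfam tool.

```python
from typing import NamedTuple


class Region(NamedTuple):
    source: str   # e.g. "Pfam", "disorder", "low_complexity"
    domain: str   # domain name, or "n/a" for unnamed regions
    start: int    # assumed 1-based, inclusive
    end: int      # assumed 1-based, inclusive


def parse_regions(text: str) -> list[Region]:
    """Parse whitespace-separated 'Source Domain Start End' rows."""
    regions = []
    for line in text.strip().splitlines():
        fields = line.split()
        # Skip the header row and anything that is not a 4-column data row.
        if len(fields) != 4 or not (fields[2].isdigit() and fields[3].isdigit()):
            continue
        source, domain, start, end = fields
        regions.append(Region(source, domain, int(start), int(end)))
    return regions


TABLE = """
Source Domain Start End
disorder n/a 1 105
low_complexity n/a 15 41
low_complexity n/a 43 60
disorder n/a 118 121
Pfam Glyco_hydro_25 122 301
disorder n/a 131 132
disorder n/a 320 368
low_complexity n/a 323 333
low_complexity n/a 338 362
Pfam CW_binding_1 389 407
Pfam CW_binding_1 409 427
Pfam CW_binding_1 429 447
Pfam CW_binding_1 449 467
Pfam CW_binding_1 469 487
"""

regions = parse_regions(TABLE)
pfam_domains = [r for r in regions if r.source == "Pfam"]

# Report each Pfam domain with its length in residues (inclusive coordinates).
for r in pfam_domains:
    print(r.domain, r.start, r.end, r.end - r.start + 1)
```

Under the inclusive-coordinate assumption, the length of a region is `end - start + 1`, which is why the loop adds one.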


Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession D4CJ49. The sequence is stored in the Pfam database and is updated only with each new Pfam release, so the sequence we store may differ from the one currently stored by UniProt.

Sequence:
  1 MGPTQGSENA AIPAEGGAAS DQSSASPPQS GASVQSSASP PQDGASVQSS  50
 51 VSPPQSQGAA SDQSGAISAQ GTPDAAAPSA EGVPSGTNGG GSTKSDADAF 100
101 SNPWLGYSNE DGSDLIYLEK GTEVSKFQND NGAIDWEAAK ADGLDFVIVR 150
151 LAYGLKEDPY FDQNVKGAQA AGLKVGAYLC STAKNMDEAV AEANLTIRKI 200
201 KKYKLQYPVA YDAEVNDMLS EGATPEILTA MANKYCAMVK AAGYTPIVYA 250
251 NRTWLTEYMN IADIPYDIWF AAYPQDRVYR PVKGSNTTIW QSGEAGTVKG 300
301 IKGNVTTEFS WKAYGGGSPS RKNAGSGGSA KKGSLPATGS NSGSGKVSGN 350
351 SGGGSAANMN NKADGNGVSD GWTQKNGKWY YYEKRKKVSG WKQVSKLWYY 400
401 MDGDGVMQTG WVRDGGNWYF LKPNGVMATN WANVGGRWFY LGSDGRMVSG 450
451 WTKLDKKWYY LGDDGAMATG WKTISGNWYY LGDDGVMAAD GTKKIDGANY 500
501 RFDKSGVWLA                                             510
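
The sequence above is displayed in blocks of ten residues with position markers. Below is a minimal sketch of how that display could be turned back into a plain residue string and how a region could be cut out by its coordinates. It assumes the position markers are purely numeric and that region coordinates are 1-based and inclusive; `clean_sequence` and `extract_region` are hypothetical helper names.

```python
def clean_sequence(formatted: str) -> str:
    """Keep only letters, dropping position markers, spaces and newlines."""
    return "".join(ch for ch in formatted if ch.isalpha()).upper()


def extract_region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"region {start}-{end} is outside 1-{len(sequence)}")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


# First display row of the sequence, with its position markers.
first_row = """1
MGPTQGSENA AIPAEGGAAS DQSSASPPQS GASVQSSASP PQDGASVQSS
50"""

seq = clean_sequence(first_row)
print(len(seq))

# The low_complexity region 15-41 lies entirely within this first row.
print(extract_region(seq, 15, 41))
```

The same two functions apply unchanged to the full 510-residue display; only the first row is used here to keep the example short.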
 


Checksums:
CRC64: 3D87F400A7232950
MD5: 1d1300aa0e7c8a4ce807ac3b7fd1d630
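
A checksum such as the MD5 value above can be used to check that a locally held copy of the sequence matches the stored one. The sketch below assumes the MD5 is computed over the upper-case residue string with all whitespace removed; that convention is an assumption and is not stated on this page. `sequence_md5` and `matches_checksum` are hypothetical helper names. The CRC64 value is not covered here.

```python
import hashlib


def sequence_md5(sequence: str) -> str:
    """MD5 hex digest of a sequence, ignoring whitespace and letter case.

    Assumption: the checksum is taken over the upper-case,
    whitespace-free residue string encoded as ASCII.
    """
    residues = "".join(sequence.split()).upper()
    return hashlib.md5(residues.encode("ascii")).hexdigest()


def matches_checksum(sequence: str, expected_md5: str) -> bool:
    """Compare a sequence against an expected MD5 hex string."""
    return sequence_md5(sequence) == expected_md5.strip().lower()


# Usage: pass the full residue string and the MD5 listed for the entry.
# my_sequence = "..."  # the 510-residue sequence, without position markers
# print(matches_checksum(my_sequence, "1d1300aa0e7c8a4ce807ac3b7fd1d630"))
```

If the comparison fails, the local copy may differ from the stored sequence, or the normalisation assumed here may not match the one used to produce the checksum.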