Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
0  structures 1  species 0  interactions 1  sequence 1  architecture

Protein: Q97K42_CLOAB (Q97K42)

Summary

This is the summary of UniProt entry Q97K42_CLOAB (Q97K42).

Description: Uncharacterized protein, related to enterotoxins of other Clostridiales {ECO:0000313|EMBL:AAK79053.1}
Source organism: Clostridium acetobutylicum (strain ATCC 824 / DSM 792 / JCM 1419 / LMG 5710 / VKM B-1787) (NCBI taxonomy ID 272562)
View Pfam proteome data.
Length: 2817 amino acids
Reference Proteome: ✓

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Download the data used to generate the domain graphic in JSON format.

Show or hide the data used to generate the graphic in JSON format.

Source Domain Start End
sig_p n/a 1 25
low_complexity n/a 5 19
disorder n/a 27 43
disorder n/a 45 258
low_complexity n/a 145 159
low_complexity n/a 161 184
low_complexity n/a 207 225
Pfam CW_binding_1 278 296
Pfam CW_binding_1 298 316
Pfam CW_binding_1 338 356
Pfam CW_binding_1 358 376
Pfam CW_binding_1 398 416
Pfam CW_binding_1 418 436
low_complexity n/a 438 448
Pfam CW_binding_1 479 497
Pfam CW_binding_1 518 536
Pfam CW_binding_1 538 555
Pfam CW_binding_1 558 576
Pfam CW_binding_1 607 624
Pfam CW_binding_1 627 642
Pfam CW_binding_1 666 684
Pfam CW_binding_1 686 704
Pfam CW_binding_1 706 724
Pfam CW_binding_1 765 783
Pfam CW_binding_1 806 824
low_complexity n/a 807 820
Pfam CW_binding_1 826 843
Pfam CW_binding_1 846 864
low_complexity n/a 892 905
Pfam CW_binding_1 934 951
Pfam CW_binding_1 953 971
Pfam CW_binding_1 973 991
Pfam CW_binding_1 1051 1069
Pfam CW_binding_1 1071 1089
Pfam CW_binding_1 1091 1109
low_complexity n/a 1141 1152
Pfam CW_binding_1 1180 1198
Pfam CW_binding_1 1200 1218
Pfam CW_binding_1 1299 1317
Pfam CW_binding_1 1339 1354
Pfam CW_binding_1 1378 1396
Pfam CW_binding_1 1469 1487
Pfam CW_binding_1 1529 1547
Pfam CW_binding_1 1550 1567
Pfam CW_binding_1 1586 1604
Pfam CW_binding_1 1627 1643
Pfam CW_binding_1 1646 1660
Pfam CW_binding_1 1807 1825
Pfam CW_binding_1 1827 1844
Pfam CW_binding_1 1867 1885
low_complexity n/a 1971 1982
Pfam CW_binding_1 1988 2004
Pfam CW_binding_1 2024 2042
Pfam CW_binding_1 2106 2124
Pfam CW_binding_1 2151 2166
Pfam CW_binding_1 2188 2201
Pfam CW_binding_1 2208 2226
Pfam CW_binding_1 2289 2307
Pfam CW_binding_1 2309 2326
Pfam CW_binding_1 2329 2347
Pfam CW_binding_1 2349 2367
Pfam CW_binding_1 2371 2386
Pfam CW_binding_1 2389 2407
Pfam CW_binding_1 2470 2488
Pfam CW_binding_1 2490 2508
Pfam CW_binding_1 2510 2528
Pfam CW_binding_1 2552 2568
Pfam CW_binding_1 2570 2589
Pfam CW_binding_1 2591 2609
Pfam CW_binding_1 2611 2629
Pfam CW_binding_1 2631 2649
Pfam CW_binding_1 2671 2689
Pfam CW_binding_1 2691 2706
Pfam CW_binding_1 2712 2730
Pfam CW_binding_1 2752 2770
Pfam CW_binding_1 2772 2790

Show or hide domain scores.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession Q97K42. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MLKNKITLLL SSIYILGSIS TPTLASELTK NSSALTKRSS SNNFSLNKNH
50
51
VFTPITSNVN GSNAKNNLNT KVQTNTASSS MPNTNPKQAT NNSKILVNPK
100
101
LNQASSPNEG ITPKKQASIP YTNVTDNKNT FKNESSINNE APIIPKDTSK
150
151
TKSTSSAQTK GSNDNNIPSN NTSTNTSKNE NPSNTDIKTT EAPANAPIKD
200
201
TPNNQSDSAL AKNKALSNNN LAADSSQTSK VTSSNNDAPK VNTTSTDKKA
250
251
SNLNNDSQDG WVTKDGKKYY YVNGVQQKGF QSINKSIYYF NDDGSMQTGW
300
301
LKYNSNSYYF DASGVMLTGL QNINGTYYGF NDDGKLLTGL QAINNNYYYF
350
351
NNDGVMQTGW ITCNDSKYYF DNNGVMQTGL VHINNKYYGF GNDGKLLTGL
400
401
QNINNYTYYF DSNGVMQTDW ITIDGSKYYF SVNGVMQTGI IYISGYYYGF
450
451
ANDGKLLTGL QVINGNSYYF DTNGIRLVSR WITIDGKDYY FNQDGILTDN
500
501
WINYDGKYYF YISGVKQTGL QNIDGNYYYF DSSGIMQTGL QKIDGKTYYF
550
551
GDNGIRQIGW ITYQNNKYYF NSDGSMQTDL KIYSYSTSPY NYHYQYYGFD
600
601
NDGKLLTGLQ TIKGNTYYFD SNGISQMGWV NIDGKDFYFN SNSIMTENWV
650
651
INDEKYYFYI NNVKQTGFQY INGKYYYFDP DGIMQTGFQT ISGNTYYLDD
700
701
NGVKQTGWVT IKGKDYYFDG NGVMINYWVF DNDKTYYYIN GNMQTGAISI
750
751
NNHYYGFDDN GIMQTGWQRI NGRTYYFDNN GAAKTGLVTY EGKTYYFNTY
800
801
YAYLDTGFIY FNNNYYFLDN NGVVRTGWIN YSNNRYYLDS TGVRVTGFQT
850
851
IDGNKYYFDS SGAMCTSFIT VNGNTYGFSK DGIMLTGWQT ILSSNYSSYN
900
901
IYYFNSDGSA QKGFFTYLGK TYYFEPNYGY MLLGYNYING KYYYFDNDGV
950
951
IQTGWVTDRS SKYYLDPSGA AVTGFQNING DKYYFNSSGI MQTGLVYVNP
1000
1001
DYYGFDDNGH ILTGMHSING YIYYFDSTGK AQKGFVTYLG KTYYFNTNMY
1050
1051
TGFVNANNNL YYFDNEGVMQ TGWINYNSNR YYFSATGASV TGFQTIDGNK
1100
1101
YCFDSNGAIY TDVVTINGST YGFNTDGIML TGWQTIRYNR GYSSYFNTYY
1150
1151
FNSDGTAKTG FFTYLNKTYY FNPSDGRMLQ GYQYINGNHY YFAPDGTMQT
1200
1201
GWITNGSSKY YLDPSGAAVT GLQTINGNKY CFDSNGILQH NGIFYIGNTY
1250
1251
YGSDNNGIML TGLQLINGYL YCFNSDGSVK TGLVTYLGKT YYFDSYSVSG
1300
1301
FQNINNNTYY FGNDGTMQTG WVNYGYYRYY LNDSGIKVTG WQTIDGNKYY
1350
1351
FDYYGAKTGI VNIDGNYYGF NNSGVMLTGW QHINGSTYYF NSNGIANTGF
1400
1401
ITYLGKTYYF DSYGRMQIGS MTINGTSYYF YANGVMKTST DSPNTLAVGW
1450
1451
VRDSYYYQYY LNAAGTKLTG LQTIDGNTYY FDSNGIMQTG IITINGNRYG
1500
1501
FGVNGVMLYG LQFINNNTYY SNSYGISQTG FVTLSGNTYY FDSYGEMRIG
1550
1551
LTYINNNYYY FNSKGIMETG WISYLRYANP NGILLTGFQT INGKTYYFNS
1600
1601
DGSLLYDLQY INGSYYGFDK NGVMLYGLQT IGGNTYYLNS NGISQSGFIT
1650
1651
LNGKTYYFDS YYGMRTGIQN INNNYYFFGD NGTLQTGWIS QDNLRYYANS
1700
1701
SGVCLTGLQT IDGKKYYFNS YARMETGLVY INNTYYGFDN DGTLLYSWHN
1750
1751
INGRMYCFNT DGTVKTGWIN YLGRSCYLDS SQGFLSTGLL TIGHNIYYFG
1800
1801
SDYSMKTGWV TSGSSKYYFN ESGIMLTGFQ TIDGNTYYFD SYGNSTTGTR
1850
1851
SINGNCYGFN DDGIMLTGWQ TISGNNYYFN PDGTAKIGLN TYEGKTYYFS
1900
1901
TGGYTQTGII NINSNTYYFG YDGALKTGWI RNNYIYYADN NGIIQTGLKT
1950
1951
IDGKNYLFNY SGIMMNGIQS INNNYYGFDN DGGMLFGVHS INGQTYYFNA
2000
2001
DGSVKTGWIP YEGKMYYANP SYCTGFATIN NNTYYFDNNG AMETGIITVD
2050
2051
NSKYYINSYG IRETGYKIID GKTYYFNPNY SGIMQTGVIS LNGNYYGFDD
2100
2101
NGVMQIGLQK LNGNTYYFNS NGTAITGWTT LGSNKYYFNP NSYGAAEVGI
2150
2151
ESINGHYYYF NNNGIMQTGV QRINGYVYCF NNDGTAMNGF QSINGNTYYL
2200
2201
NIYGCTYTGW QTINSNRYYF NSDGVMLTGA QNITGYIYGF DSNGIMLTGV
2250
2251
QTIAGNTYDF SSNGTATTGW VILSNKTYYF SPSLGYKLTG FITVSGDNYY
2300
2301
LDADGVLQTG WVTVDGNKYY LNSNGVRQTG FLTLNNNKYY FDTNGVMQTG
2350
2351
FTTINNNVYY FNDDGIMQTL WITIDYNKYY FDSTGIRLIG FQIIDGKKYY
2400
2401
FNSNGIMLTG VQTIGNKFYG FDNYGVMLTG VQDIDNHTYN FNQDGSASTG
2450
2451
WTKLSDKTYY FSPSLGYKLT GFQTIDDKTY YFDTDGAILS GWITANSNKY
2500
2501
YLDSNGVVQT GFLTLDNNKY YLNSSGVMQT GIVPINGTYY GFGQDGIMLT
2550
2551
GIQKLNDQTY YFNSDGTMFT GWFKDSSSSQ HYFDSNGMML TGLKDVNGNK
2600
2601
YYFNENGVMQ TGLITFSNNK YYFDQNGIMQ TGWQNLSGCN YYFNNDGVMQ
2650
2651
TGVNTINDSI YGFDSNGIML TGWQTINSKT YYFAPNGIAK NGWLNYRGKK
2700
2701
YYFDPQYAYM ITGFKTITGC TYYFDQDGVM QTGVVSIDGG TYGFNSLGFL
2750
2751
LKGWQNIDSK TYYFDSNGLA PKGLKTIDGS IYYFSSDGYL EKDTKIIVDG
2800
2801
ITYKINDNGT ATEVKNI                                    
2817
 

Show the unformatted sequence.

Checksums:
CRC64:1851D0D4FFBEE921
MD5:5e502ed0a07d5bbd45de62b6551dcad0