Summary
This is the summary of UniProt entry Q97K42_CLOAB (Q97K42).
| Description: | Uncharacterized protein, related to enterotoxins of other Clostridiales {ECO:0000313|EMBL:AAK79053.1} |
| Source organism: |
Clostridium acetobutylicum (strain ATCC 824 / DSM 792 / JCM 1419 / LMG 5710 / VKM B-1787)
(NCBI taxonomy ID
272562)
View Pfam proteome data. |
| Length: | 2817 amino acids |
| Reference Proteome: |
|
Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.
Pfam domains
Download the data used to generate the domain graphic in JSON format.
Show or hide the data used to generate the graphic in JSON format.
| Source | Domain | Start | End |
|---|---|---|---|
| sig_p | n/a | 1 | 25 |
| low_complexity | n/a | 5 | 19 |
| disorder | n/a | 27 | 43 |
| disorder | n/a | 45 | 258 |
| low_complexity | n/a | 145 | 159 |
| low_complexity | n/a | 161 | 184 |
| low_complexity | n/a | 207 | 225 |
| Pfam | CW_binding_1 | 278 | 296 |
| Pfam | CW_binding_1 | 298 | 316 |
| Pfam | CW_binding_1 | 338 | 356 |
| Pfam | CW_binding_1 | 358 | 376 |
| Pfam | CW_binding_1 | 398 | 416 |
| Pfam | CW_binding_1 | 418 | 436 |
| low_complexity | n/a | 438 | 448 |
| Pfam | CW_binding_1 | 479 | 497 |
| Pfam | CW_binding_1 | 518 | 536 |
| Pfam | CW_binding_1 | 538 | 555 |
| Pfam | CW_binding_1 | 558 | 576 |
| Pfam | CW_binding_1 | 607 | 624 |
| Pfam | CW_binding_1 | 627 | 642 |
| Pfam | CW_binding_1 | 666 | 684 |
| Pfam | CW_binding_1 | 686 | 704 |
| Pfam | CW_binding_1 | 706 | 724 |
| Pfam | CW_binding_1 | 765 | 783 |
| Pfam | CW_binding_1 | 806 | 824 |
| low_complexity | n/a | 807 | 820 |
| Pfam | CW_binding_1 | 826 | 843 |
| Pfam | CW_binding_1 | 846 | 864 |
| low_complexity | n/a | 892 | 905 |
| Pfam | CW_binding_1 | 934 | 951 |
| Pfam | CW_binding_1 | 953 | 971 |
| Pfam | CW_binding_1 | 973 | 991 |
| Pfam | CW_binding_1 | 1051 | 1069 |
| Pfam | CW_binding_1 | 1071 | 1089 |
| Pfam | CW_binding_1 | 1091 | 1109 |
| low_complexity | n/a | 1141 | 1152 |
| Pfam | CW_binding_1 | 1180 | 1198 |
| Pfam | CW_binding_1 | 1200 | 1218 |
| Pfam | CW_binding_1 | 1299 | 1317 |
| Pfam | CW_binding_1 | 1339 | 1354 |
| Pfam | CW_binding_1 | 1378 | 1396 |
| Pfam | CW_binding_1 | 1469 | 1487 |
| Pfam | CW_binding_1 | 1529 | 1547 |
| Pfam | CW_binding_1 | 1550 | 1567 |
| Pfam | CW_binding_1 | 1586 | 1604 |
| Pfam | CW_binding_1 | 1627 | 1643 |
| Pfam | CW_binding_1 | 1646 | 1660 |
| Pfam | CW_binding_1 | 1807 | 1825 |
| Pfam | CW_binding_1 | 1827 | 1844 |
| Pfam | CW_binding_1 | 1867 | 1885 |
| low_complexity | n/a | 1971 | 1982 |
| Pfam | CW_binding_1 | 1988 | 2004 |
| Pfam | CW_binding_1 | 2024 | 2042 |
| Pfam | CW_binding_1 | 2106 | 2124 |
| Pfam | CW_binding_1 | 2151 | 2166 |
| Pfam | CW_binding_1 | 2188 | 2201 |
| Pfam | CW_binding_1 | 2208 | 2226 |
| Pfam | CW_binding_1 | 2289 | 2307 |
| Pfam | CW_binding_1 | 2309 | 2326 |
| Pfam | CW_binding_1 | 2329 | 2347 |
| Pfam | CW_binding_1 | 2349 | 2367 |
| Pfam | CW_binding_1 | 2371 | 2386 |
| Pfam | CW_binding_1 | 2389 | 2407 |
| Pfam | CW_binding_1 | 2470 | 2488 |
| Pfam | CW_binding_1 | 2490 | 2508 |
| Pfam | CW_binding_1 | 2510 | 2528 |
| Pfam | CW_binding_1 | 2552 | 2568 |
| Pfam | CW_binding_1 | 2570 | 2589 |
| Pfam | CW_binding_1 | 2591 | 2609 |
| Pfam | CW_binding_1 | 2611 | 2629 |
| Pfam | CW_binding_1 | 2631 | 2649 |
| Pfam | CW_binding_1 | 2671 | 2689 |
| Pfam | CW_binding_1 | 2691 | 2706 |
| Pfam | CW_binding_1 | 2712 | 2730 |
| Pfam | CW_binding_1 | 2752 | 2770 |
| Pfam | CW_binding_1 | 2772 | 2790 |
Show or hide domain scores.
Sequence information
This is the amino acid sequence of the UniProt sequence database entry with the accession Q97K42. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.
| Sequence: | 1
MLKNKITLLL SSIYILGSIS TPTLASELTK NSSALTKRSS SNNFSLNKNH
50 51
VFTPITSNVN GSNAKNNLNT KVQTNTASSS MPNTNPKQAT NNSKILVNPK
100 101
LNQASSPNEG ITPKKQASIP YTNVTDNKNT FKNESSINNE APIIPKDTSK
150 151
TKSTSSAQTK GSNDNNIPSN NTSTNTSKNE NPSNTDIKTT EAPANAPIKD
200 201
TPNNQSDSAL AKNKALSNNN LAADSSQTSK VTSSNNDAPK VNTTSTDKKA
250 251
SNLNNDSQDG WVTKDGKKYY YVNGVQQKGF QSINKSIYYF NDDGSMQTGW
300 301
LKYNSNSYYF DASGVMLTGL QNINGTYYGF NDDGKLLTGL QAINNNYYYF
350 351
NNDGVMQTGW ITCNDSKYYF DNNGVMQTGL VHINNKYYGF GNDGKLLTGL
400 401
QNINNYTYYF DSNGVMQTDW ITIDGSKYYF SVNGVMQTGI IYISGYYYGF
450 451
ANDGKLLTGL QVINGNSYYF DTNGIRLVSR WITIDGKDYY FNQDGILTDN
500 501
WINYDGKYYF YISGVKQTGL QNIDGNYYYF DSSGIMQTGL QKIDGKTYYF
550 551
GDNGIRQIGW ITYQNNKYYF NSDGSMQTDL KIYSYSTSPY NYHYQYYGFD
600 601
NDGKLLTGLQ TIKGNTYYFD SNGISQMGWV NIDGKDFYFN SNSIMTENWV
650 651
INDEKYYFYI NNVKQTGFQY INGKYYYFDP DGIMQTGFQT ISGNTYYLDD
700 701
NGVKQTGWVT IKGKDYYFDG NGVMINYWVF DNDKTYYYIN GNMQTGAISI
750 751
NNHYYGFDDN GIMQTGWQRI NGRTYYFDNN GAAKTGLVTY EGKTYYFNTY
800 801
YAYLDTGFIY FNNNYYFLDN NGVVRTGWIN YSNNRYYLDS TGVRVTGFQT
850 851
IDGNKYYFDS SGAMCTSFIT VNGNTYGFSK DGIMLTGWQT ILSSNYSSYN
900 901
IYYFNSDGSA QKGFFTYLGK TYYFEPNYGY MLLGYNYING KYYYFDNDGV
950 951
IQTGWVTDRS SKYYLDPSGA AVTGFQNING DKYYFNSSGI MQTGLVYVNP
1000 1001
DYYGFDDNGH ILTGMHSING YIYYFDSTGK AQKGFVTYLG KTYYFNTNMY
1050 1051
TGFVNANNNL YYFDNEGVMQ TGWINYNSNR YYFSATGASV TGFQTIDGNK
1100 1101
YCFDSNGAIY TDVVTINGST YGFNTDGIML TGWQTIRYNR GYSSYFNTYY
1150 1151
FNSDGTAKTG FFTYLNKTYY FNPSDGRMLQ GYQYINGNHY YFAPDGTMQT
1200 1201
GWITNGSSKY YLDPSGAAVT GLQTINGNKY CFDSNGILQH NGIFYIGNTY
1250 1251
YGSDNNGIML TGLQLINGYL YCFNSDGSVK TGLVTYLGKT YYFDSYSVSG
1300 1301
FQNINNNTYY FGNDGTMQTG WVNYGYYRYY LNDSGIKVTG WQTIDGNKYY
1350 1351
FDYYGAKTGI VNIDGNYYGF NNSGVMLTGW QHINGSTYYF NSNGIANTGF
1400 1401
ITYLGKTYYF DSYGRMQIGS MTINGTSYYF YANGVMKTST DSPNTLAVGW
1450 1451
VRDSYYYQYY LNAAGTKLTG LQTIDGNTYY FDSNGIMQTG IITINGNRYG
1500 1501
FGVNGVMLYG LQFINNNTYY SNSYGISQTG FVTLSGNTYY FDSYGEMRIG
1550 1551
LTYINNNYYY FNSKGIMETG WISYLRYANP NGILLTGFQT INGKTYYFNS
1600 1601
DGSLLYDLQY INGSYYGFDK NGVMLYGLQT IGGNTYYLNS NGISQSGFIT
1650 1651
LNGKTYYFDS YYGMRTGIQN INNNYYFFGD NGTLQTGWIS QDNLRYYANS
1700 1701
SGVCLTGLQT IDGKKYYFNS YARMETGLVY INNTYYGFDN DGTLLYSWHN
1750 1751
INGRMYCFNT DGTVKTGWIN YLGRSCYLDS SQGFLSTGLL TIGHNIYYFG
1800 1801
SDYSMKTGWV TSGSSKYYFN ESGIMLTGFQ TIDGNTYYFD SYGNSTTGTR
1850 1851
SINGNCYGFN DDGIMLTGWQ TISGNNYYFN PDGTAKIGLN TYEGKTYYFS
1900 1901
TGGYTQTGII NINSNTYYFG YDGALKTGWI RNNYIYYADN NGIIQTGLKT
1950 1951
IDGKNYLFNY SGIMMNGIQS INNNYYGFDN DGGMLFGVHS INGQTYYFNA
2000 2001
DGSVKTGWIP YEGKMYYANP SYCTGFATIN NNTYYFDNNG AMETGIITVD
2050 2051
NSKYYINSYG IRETGYKIID GKTYYFNPNY SGIMQTGVIS LNGNYYGFDD
2100 2101
NGVMQIGLQK LNGNTYYFNS NGTAITGWTT LGSNKYYFNP NSYGAAEVGI
2150 2151
ESINGHYYYF NNNGIMQTGV QRINGYVYCF NNDGTAMNGF QSINGNTYYL
2200 2201
NIYGCTYTGW QTINSNRYYF NSDGVMLTGA QNITGYIYGF DSNGIMLTGV
2250 2251
QTIAGNTYDF SSNGTATTGW VILSNKTYYF SPSLGYKLTG FITVSGDNYY
2300 2301
LDADGVLQTG WVTVDGNKYY LNSNGVRQTG FLTLNNNKYY FDTNGVMQTG
2350 2351
FTTINNNVYY FNDDGIMQTL WITIDYNKYY FDSTGIRLIG FQIIDGKKYY
2400 2401
FNSNGIMLTG VQTIGNKFYG FDNYGVMLTG VQDIDNHTYN FNQDGSASTG
2450 2451
WTKLSDKTYY FSPSLGYKLT GFQTIDDKTY YFDTDGAILS GWITANSNKY
2500 2501
YLDSNGVVQT GFLTLDNNKY YLNSSGVMQT GIVPINGTYY GFGQDGIMLT
2550 2551
GIQKLNDQTY YFNSDGTMFT GWFKDSSSSQ HYFDSNGMML TGLKDVNGNK
2600 2601
YYFNENGVMQ TGLITFSNNK YYFDQNGIMQ TGWQNLSGCN YYFNNDGVMQ
2650 2651
TGVNTINDSI YGFDSNGIML TGWQTINSKT YYFAPNGIAK NGWLNYRGKK
2700 2701
YYFDPQYAYM ITGFKTITGC TYYFDQDGVM QTGVVSIDGG TYGFNSLGFL
2750 2751
LKGWQNIDSK TYYFDSNGLA PKGLKTIDGS IYYFSSDGYL EKDTKIIVDG
2800 2801
ITYKINDNGT ATEVKNI
2817
Show the unformatted sequence. |
| Checksums: |
CRC64:1851D0D4FFBEE921
MD5:5e502ed0a07d5bbd45de62b6551dcad0
|

