# STOCKHOLM 1.0
#=GF ID PA14
#=GF AC PF07691.13
#=GF DE PA14 domain
#=GF AU Rigden DJ;0000-0002-7565-8937
#=GF AU Mello LV;0000-0002-8632-1678
#=GF AU Galperin MY;0000-0002-2265-5572
#=GF SE Rigden DJ, Mello LV, Galperin MY
#=GF GA 24.00 24.00;
#=GF TC 24.00 24.00;
#=GF NC 23.90 23.90;
#=GF BM hmmbuild HMM.ann SEED.ann
#=GF SM hmmsearch -Z 47079205 -E 1000 --cpu 4 HMM pfamseq
#=GF TP Domain
#=GF RN [1]
#=GF RM 15236739
#=GF RT The PA14 domain, a conserved all-beta domain in bacterial
#=GF RT toxins, enzymes, adhesins and signaling molecules.
#=GF RA Rigden DJ, Mello LV, Galperin MY;
#=GF RL Trends Biochem Sci. 2004;29:335-339.
#=GF DR INTERPRO; IPR011658;
#=GF DR SO; 0000417; polypeptide_domain;
#=GF CC This domain forms an insert in bacterial beta-glucosidases and
#=GF CC is found in other glycosidases, glycosyltransferases, proteases,
#=GF CC amidases, yeast adhesins, and bacterial toxins, including
#=GF CC anthrax protective antigen (PA). The domain also occurs in a
#=GF CC Dictyostelium prespore-cell-inducing factor Psi and in
#=GF CC fibrocystin, the mammalian protein whose mutation leads to
#=GF CC polycystic kidney and hepatic disease. The crystal structure of
#=GF CC PA shows that this domain (named PA14 after its location in the
#=GF CC PA20 pro-peptide) has a beta-barrel structure. The PA14 domain
#=GF CC sequence suggests a binding function, rather than a catalytic
#=GF CC role. The PA14 domain distribution is compatible with
#=GF CC carbohydrate binding.
#=GF SQ 22
#=GS Q8IKW0_PLAF7/216-383 AC Q8IKW0.2
#=GS Q7RP37_PLAYO/216-383 AC Q7RP37.1
#=GS PAG_BACAN/44-179 AC P13423.2
#=GS PAG_BACAN/44-179 DR PDB; 4H2A A; 15-150;
#=GS PAG_BACAN/44-179 DR PDB; 4NAM A; 16-150;
#=GS PAG_BACAN/44-179 DR PDB; 3TEW A; 15-150;
#=GS PAG_BACAN/44-179 DR PDB; 3Q8A A; 15-150;
#=GS PAG_BACAN/44-179 DR PDB; 3Q8F A; 16-150;
#=GS PAG_BACAN/44-179 DR PDB; 3Q8B A; 15-150;
#=GS PAG_BACAN/44-179 DR PDB; 1ACC A; 15-150;
#=GS PAG_BACAN/44-179 DR PDB; 3MHZ A; 15-150;
#=GS PAG_BACAN/44-179 DR PDB; 4EE2 A; 15-150;
#=GS PAG_BACAN/44-179 DR PDB; 3TEY A; 15-150;
#=GS PAG_BACAN/44-179 DR PDB; 3TEX A; 15-150;
#=GS PAG_BACAN/44-179 DR PDB; 3TEZ A; 15-150;
#=GS PAG_BACAN/44-179 DR PDB; 1T6B X; 16-150;
#=GS PAG_BACAN/44-179 DR PDB; 5FR3 A; 15-150;
#=GS PAG_BACAN/44-179 DR PDB; 3Q8E A; 15-150;
#=GS PAG_BACAN/44-179 DR PDB; 3Q8C A; 15-150;
#=GS Q9PF33_XYLFA/452-607 AC Q9PF33.1
#=GS Q8CK54_STRCO/42-184 AC Q8CK54.1
#=GS Q8AAF9_BACTN/630-763 AC Q8AAF9.1
#=GS Q7URI2_RHOBA/387-524 AC Q7URI2.1
#=GS PSIA_DICDI/117-261 AC Q968Z6.1
#=GS BGLS_SCHPO/398-544 AC Q9P6J6.1
#=GS G0RWC1_HYPJQ/397-546 AC G0RWC1.1
#=GS HEXA_PORGI/626-766 AC P49008.2
#=GS Q7UF11_RHOBA/396-557 AC Q7UF11.1
#=GS FLO1_YEAST/100-249 AC P32768.4
#=GS FLO1_YEAST/100-249 DR PDB; 4LHN A; 88-249;
#=GS FLO1_YEAST/100-249 DR PDB; 4LHL A; 88-249;
#=GS FLO10_YEAST/121-271 AC P36170.1
#=GS Q8A341_BACTN/847-985 AC Q8A341.1
#=GS Q7SG54_NEUCR/449-597 AC Q7SG54.3
#=GS Q8A3E5_BACTN/456-595 AC Q8A3E5.1
#=GS Q87YU8_PSESM/308-470 AC Q87YU8.1
#=GS Q880I7_PSESM/435-593 AC Q880I7.1
#=GS Q8A3K6_BACTN/262-401 AC Q8A3K6.1
#=GS Q7UKT3_RHOBA/181-327 AC Q7UKT3.1
#=GS Q8P3G5_XANCP/475-625 AC Q8P3G5.1
Q8IKW0_PLAF7/216-383       KADGL.MASYYNNAYFSGYPTA............IHNDKYINFIWDTGIPIENIPYQHFSIRWDGYLKIPESDNYIF..SVD......HDCGIRIF..LDNSPIIVDNMPFPKEEESEEIRPISIQSFDKMNSKVHKTNSEKLGLIGGKKYKIRIEYFHLSTMKFANPH.ISHIILYWKSNNIMEEIIPSNY
Q7RP37_PLAYO/216-383       KADGL.TGSYYDNAYFSGYPLS............INNDKYINFIWDSGVPIEMIPYQHFSVRWDGYIKIPQTDNYII..SIE......HDCGVRIF..LDNSPIIVNNMPDPKEEESEEIKPIYILPINKINSNVQKISSEKLGLIGGKKYKFRIEYFHLSTIKHENPD.TAHIILYWKSDKIIDEEIIPSQ
PAG_BACAN/44-179           SSQGL.LGYYFSDLNFQAPMVV...........TSSTTGDLSIPSSELENIPSENQYFQSAIWSGFIKVKKSDEYTF..ATS......ADNHVTMW..VDDQEVINKA.............................SNSNKIRLEKGRLYQIKIQYQRENPTEKGL.....DFKLYWTDSQNKKEVISSDN
#=GR PAG_BACAN/44-179 SS   CT-SE.EEEEESSTTSSSEEEE...........EEESSSB--BEGGGGTTS-GGGGB-SEEEEEEEEEBSS-EEEEE..EET......TGGGEEEE..ETTEEEEESS.............................----EEEE-TT-EEEEEEEEE-SS-SSSBB.....--BEEEE-TTS-EEE--GGG
Q9PF33_XYLFA/452-607       GHPGL.KGEYFDTIDFAGPPHL............VRQDRIIAFNWDHVAPAPGMNPHRYAVRWTGELLPPGPGTYTFAVHVARCFDCNGHDPVRLS..IDDRQIIPDNATAAQAT.............TAPQQTDNTHLEATLHFTDTRPHHIRLDMEHRGEDQGV........RLEWLAPAAPQLAEAERA
Q8CK54_STRCO/42-184        QEPGVTLRVFDVQTPLNELCTLKP.......GQTPNHDKLMPTVDWSTTADFGGIADNFVSQASGYLVAPRDGTYVF..RLV......SDDGSRLA..LDGATVIDHD...GLHGA..........EP...........KDGTVELTAGAHPLRIDHFDRGGGQQV........QLSWMPPGESGFTVVPTE
Q8AAF9_BACTN/630-763       LNSGLKAVWHKFRGNLCAD...............IDAAPVNGEYVVESVSIPEEVKGDIGLIITGYLEVPADGIYTF..ALL......SDDGSTLK..LDGELLGDND...GAHS.....................PVEIIVQKALKAGLHPIEVRYFDCNGGVL.........QMELVNEKGEKEVLPKEW
Q7URI2_RHOBA/387-524       LEPGLDYRMYLGKFKKLPEFET............MEVERTGQVDSLNLEEIQGDQRDDFALSINAMFRVPEEGLYRF..QIT......SDDGSRVF..LHDKLFLDHD...GNHP.....................PMTVSRLVRAGAGLHPIRIDYFEGGGAQTL........TAALTRLDVAGDDDGDDS
PSIA_DICDI/117-261         LRTASGNYIYDNDFFFPIDYEG............FDTDPANRIYKDDESTNKTYHNYHFCFQFDNRFLF..KGNETF..KFT......GDDDVWVF..INKQLVVDLG....................GTHPAASSSVDLSTLGLTVGKVYPFNFFYCERHTSRSTIR...IETSLELYCDKYDYCGVCNGD
BGLS_SCHPO/398-544         GKHGYVAKFYLEPATSENRTLI............DDYDLEDGVVRFYDYCNDKMKDGYFYIDIEGYLIPDEDAVYEF..GIS......VFGTALLF..IDDVLLIDNK...TKQTP...........TNHTFEFGTIEERNSIYLKKGRKYNVRVEYGSAATYTLS.........TNLSPSTGGRYSIGCVK
G0RWC1_HYPJQ/397-546       GAPGMRWRVFNEPPGTPNRQHID.........ELFFTKTDMHLVDYY....HPKAADTWYADMEGTYTADEDCTYEL..GLV......VCGTAKAY..VDDQLVVDNA...TKQVP...........GDAFFGSATREETGRINLVKGNTYKFKIEFGSAPTYTLKG....DTIVPGHGSLRVGGCKVIDD.
HEXA_PORGI/626-766         PKPGL..TIRTAYGDLYDVPDLQQVA....SWEVGTVSSLEEIMHGKEKITSPEVLERRVVEATGYVLIPEDGVYEF...........STENNEFW..IDNVKLIDNV...GEVK....................KFSRRNSSRALQKGYHPIKTIWVGAIQGGWP.........TYWNYSRVMIRLKGEEK
Q7UF11_RHOBA/396-557       LRSGVLVETFEPVAPSPPLSSIAEL.....RDDETFVANTPSTASDISSFSYSAAPGAKVSRIRGIVSPPVSGHYTF..HLA......ASDEAELW..LSQGVTSDTSRLIASVATP.........TLMEGFTDANAGLSASVYLVAGQDYYVETLQVHDDPVAKN......HLSVAWTRPDQLASGPQLIG
FLO1_YEAST/100-249         DSYG.NWGCKGMG.....................ACSNSQGIAYWSTDLFGFYTTPTNVTLEMTGYFLPPQTGSYTF..KFA.....TVDDSAILS..VGGATAFNCC...AQQQPPITSTNFTIDGIKPWGGSLPPNIEGTVYMYAGYYYPMKVVYSNAVSWGTL........PISVTLPDGTTVSDDFEG
#=GR FLO1_YEAST/100-249 SS GEET.TTEETTTE.....................E--S-TTS---BSTTTT--B-TTSEEEEEEEEE--SSSEEEEE..EES.....--BSEEEEE..ESTTTS--TT...-TTS--------SEEEE--TT----S-EEEEEEE-TT-BEEEEEEEEE-SS-EEE........-EEEE-TTS-EEESB-TT
FLO10_YEAST/121-271        DNTTLSSKTEKRE.....................NDDCDQGAAYWSSDLFGFYTTPTNVTVEMTGYFLPPKTGTYTF..GFA.....TVDDSAILS..VGGNVAFECC...KQEQPPITSTDFTINGIKPWNADAPTDIKGSTYMYAGYYYPIKIVYSNAVSWGTL........PVSVVLPDGTEVNDDFEG
Q8A341_BACTN/847-985       GKSVL.ELSLYKDDDLRTLAGV...........TQDNKIDRTFADGAQPDPLLPANQSFSAIWDGKLKAPQSGTYMI..GVT......SDQGMRLS..VNGQRIVDEW...RNNK....................ELTVVRPFLLKAGDEVAVRVEYSQRNPTGSV........QLVWSLPDHAAIAPQELL
Q7SG54_NEUCR/449-597       GKPGV.TIEWFKGDKFKGEPVV.............IQRRTNTDLFLWDSAPLAQTGPEWSAIATTYLTPKHSGKHTI..SYM......SVGPGKLY..INGKLSLDLW...DWTE............EGEAMFDGSVDYLVELEMVANRPVELRVEMTNELRPLSKQK..QMGMTHRYGGCRIGYKEADQ..
Q8A3E5_BACTN/456-595       GRPGL.TATYWNNMNLSGDVAAT..........SQITSPINLSNGGNTVFATGVGLYNFTAVYEGTFRPKESGAYEL..LIE......GDDGYRVY..VNGEKVIDYW...GEHA....................SAKREYTLKAIAGTDYKIRIEYMQAGAE...A.....LLRFDLGIYRHISPEMVVDR
Q87YU8_PSESM/308-470       SNSGV.KAEYYANTTLSGDPVANRIEPGVNLDWT..TNTNVTDNGSTAVSGYTPAAGSFSARFTGKIKPTITGAHVF..KVR......ADGAYKLW..INDELVLEDE...GAQV.............SFDLIPVIPRTVKTPTLKAGTEYNVRLEYRRLKGNFIPVLGGLNGVQMSWASLRPPKNLADYDA
Q880I7_PSESM/435-593       SNSGV.KAEYFSNTSFSGDPALTRVEPGVNLNWA..TGTNVTNAGSTAVSGFSPSAGAFSARFTATIKPTVSGAQVF..KVR......ADGPYKLW..VNDELVLQSD...GVPY.............SGDVVNALTTSGKTAELSAGKTYTVKLEYQRIQGNFIPV....LGGLTGVQMSWASLRPPKDLS
Q8A3K6_BACTN/262-401       WVNGL.KGIYTQDSKGVGHLRS............EKIDPVIDFDWDWYKPADDFSFNDYQVTWSGKLKAPSTGEYTL..GIQ......ADDGARLY..INGELLIDDW.....................KSHSFSYQPTQKKISLEAGKMYDIKLEYYQHEWSSRI........KLSWIRPDKKSSTSLLTG
Q7UKT3_RHOBA/181-327       SKHGLSVSIYEGKGWKKENRKI............ERVDSVVDFDFGKEGPGEGVPWDEFHAHWDGGLRVEQTGRYEI..VVR......SKTSFTMDFGHDSKQLIDNH...VQSE...................GRNEFRRTLLLTGGRVYPLEIDLNQRKRKGETPP...ASISLSWVPPGGIETIIPASA
Q8P3G5_XANCP/475-625       SQNGL.TGDYFRGRALAGQPVLTRIDPRIAFRWDRNAPTDDAVGRGELQPGQALGKDDFSVRWHGQLLPPVSGNYEL..QIA......ADDGVRLY..LDGKPLIDQW......................SDAPRMRSSTATVALQAGKAYDLRVEYYEATRDAGV........RLAWRMPGAKPPLQEAVD
#=GC SS_cons               CCETETCEECCTCTTSSSEEEE...........EEESSSCTSBEGCCCTTT-GCGCCSCEEEEEEEEEBSSSEEEEE..EET.....-TCCCEEEE..ETTCCCEETT...-TTS--------SEEEE--TT----S-EEEEEEE-TT-EEEEEEEEEESSSSCCEB.......BEEEE-TTS-EEESBGCC
#=GC seq_cons              spsGl.hthahpssthts..hh.............psss.hshshspthss.shshspassphsGhlpsspoGsYpF..tlt......uDssscla..lssphllDss...stpt......................ppptslthhuGphYsl+l-Yhptssstth........phsas.sctttpt.stpt
//