# STOCKHOLM 1.0 #=GF ID SOBP #=GF AC PF15279.7 #=GF DE Sine oculis-binding protein #=GF AU Eberhardt R;0000-0001-6152-1369 #=GF AU Coggill P;0000-0001-5731-1588 #=GF AU Hetherington K; #=GF SE Jackhmmer:Q9Y5P3 #=GF GA 27.10 27.10; #=GF TC 27.10 27.10; #=GF NC 26.70 26.70; #=GF BM hmmbuild HMM.ann SEED.ann #=GF SM hmmsearch -Z 47079205 -E 1000 --cpu 4 HMM pfamseq #=GF TP Family #=GF RN [1] #=GF RM 17618476 #=GF RT Autosomal recessive mental retardation syndrome with anterior #=GF RT maxillary protrusion and strabismus: MRAMS syndrome. #=GF RA Basel-Vanagaite L, Rainshtein L, Inbar D, Gothelf D, Hennekam R, #=GF RA Straussberg R; #=GF RL Am J Med Genet A. 2007;143:1687-1691. #=GF RN [2] #=GF RM 21035105 #=GF RT SOBP is mutated in syndromic and nonsyndromic intellectual #=GF RT disability and is highly expressed in the brain limbic system. #=GF RA Birk E, Har-Zahav A, Manzini CM, Pasmanik-Chor M, Kornreich L, #=GF RA Walsh CA, Noben-Trauth K, Albin A, Simon AJ, Colleaux L, Morad #=GF RA Y, Rainshtein L, Tischfield DJ, Wang P, Magal N, Maya I, #=GF RA Shoshani N, Rechavi G, Gothelf D, Maydan G, Shohat M, #=GF RA Basel-Vanagaite L; #=GF RL Am J Hum Genet. 2010;87:694-700. #=GF DR INTERPRO; IPR026092; #=GF DR SO; 0100021; polypeptide_conserved_region; #=GF CC SOBP is associated with syndromic and nonsyndromic intellectual #=GF CC disability. It carries a zinc-finger of the zf-C2H2 type at the #=GF CC N-terminus, and a highly characteristic C-terminal PhPhPhPhPhPh #=GF CC motif. The deduced 873-amino acid protein contains an N-terminal #=GF CC nuclear localisation signal (NLS), followed by 2 FCS-type zinc #=GF CC finger motifs, a proline-rich region (PR1), a putative #=GF CC RNA-binding motif region, and a C-terminal NLS embedded in a #=GF CC second proline-rich motif. SOBP is expressed in various human #=GF CC tissues, including developing mouse brain at embryonic day 14. #=GF CC In postnatal and adult mouse brain SOBP is expressed in all #=GF CC neurons, with intense staining in the limbic system. Highest #=GF CC expression is in layer V cortical neurons, hippocampus, pyriform #=GF CC cortex, dorsomedial nucleus of thalamus, amygdala, and #=GF CC hypothalamus. Postnatal expression of SOBP in the limbic system #=GF CC corresponds to a time of active synaptogenesis [2]. the family #=GF CC is also referred to as Jackson circler, JXC1. In seven affected #=GF CC siblings from a consanguineous Israeli Arab family with mental #=GF CC retardation, anterior maxillary protrusion, and strabismus #=GF CC mutations were found in this protein [1,2]. #=GF SQ 15 #=GS F6U6M7_MACMU/51-370 AC F6U6M7.2 #=GS H3AXM2_LATCH/1-318 AC H3AXM2.1 #=GS E0VHI9_PEDHC/37-370 AC E0VHI9.1 #=GS B4N5Q1_DROWI/245-568 AC B4N5Q1.2 #=GS B4QBP7_DROSI/273-602 AC B4QBP7.1 #=GS B4LIW1_DROVI/240-582 AC B4LIW1.1 #=GS Q7PZA5_ANOGA/218-535 AC Q7PZA5.4 #=GS B0WBD5_CULQU/105-432 AC B0WBD5.1 #=GS Q16NQ8_AEDAE/54-382 AC Q16NQ8.1 #=GS E9GN80_DAPPU/141-450 AC E9GN80.1 #=GS K1PCW3_CRAGI/167-508 AC K1PCW3.1 #=GS W4XN74_STRPU/158-433 AC W4XN74.1 #=GS W5NIN6_LEPOC/228-538 AC W5NIN6.1 #=GS I3KTY8_ORENI/226-537 AC I3KTY8.1 #=GS SOBPA_DANRE/255-571 AC A5X7A0.1 F6U6M7_MACMU/51-370 VCDWCKHIRHTKEYLDFGDGERRLQFCSAKCLNQYKMDIFYKETQANLPA..GLCSTLHPPMENKA.....EGTG........VQLLTPDSWNIPLTDAR.RKAPSPVATAGQSQGPGPS........ASTTVSPSDT....ANCSVTKIPTPVPKSIPISETPNIPPVSVQPPASI.......................GPPLGVPPRSPPMVMTNRGP.......................VPLPIFMEQQIMQQIRPPFIRGPPHHASNPNSPLSNPMLPGIGPPPG.................GPRNLGPTSSPM.HRPMLSPHIHPPSTPTMPGNPPGLLPPPPPG....A..........P...LPSLPFPPVSMMPNGPMPVPQMMNFGLPSLAPLVPPPTLLVPYPV..IVPLPVPIPIPIPI H3AXM2_LATCH/1-318 VCDWCKHIRHTKEYLDFGDGERRLQFCSAKCLNQYKMDIFYKETQANLPAGLCNTMHPPIEHKSEG.....TGV..........QLLTPDSWNVPLADAR.RKAPSPAATAGLNQGPGPS........ASA..TASPS....DANSSAAKIPTPVPKPLPANESPNVAPVPIQP..P....................ANIVHPVGVPPRSPPMVMTNRGP.......................VPLPIFMEQQMMQQIRPPFIRGPH.APS.PNSPLSNPMIPGIGPPPV.................VPRTIGPSSSPM.HRPMLSPHIHPPSTPTMPGNAPGLLPPPPPGA...P..........F...PPGFPFPPVNMMPNGPVPLPQMVNFGVPSLAPLVPPPTLLVPYPV..IVPLPVPIPFPIPI E0VHI9_PEDHC/37-370 TCDWCRHIRHTVNYVDFQDGEHQLQFCSDKCLNQYKMNIFCKETEAHL......QLHPHLQGLGSS.....GGSNGKG..NASVRLITPELW...LRDC...RSESPISDRSLSPQ............DNNNSSSSPI.......EIKVVMRPPNENSQLTVKPSPPPSLTPSTLEI...TKVDRK..........ERGTHREKSRKTGSILSKSKRQGRYGSGN.............NERRGSSLSPRNRENL...TPPKPNIFPPSFFPLNGTSLPTMHPPHPAPPP..................PPHNGMPPDHGL.LKPPFLPLGCFPPPPPPPPPPPPPPPTGPRNF...QRPGGQIHHRPP...LISPPSPSGPTV.....INKPIRNPLLPPPRSLLPPVTVLVPYPVALAIPIPLPIPIPIPL B4N5Q1_DROWI/245-568 TCDWCKHVRHAVSYVDFQDGASQLQFCSEKCLNQYKMQIFCNETQAHL......DMNPHLRDQGLD.....AGNE........AALITPDLW...LRNCR.SRSASPAS..TVSVSPGPI........KRNSPCATP.....PSNPTKPLISVAPVSKLLALRSPQAAQASQPPPPP...QTSLHQQRQQHQNLTSKSGRRKRLHRG.GLSAGSETVASLLQKQQ.............QHHQPSPISADFAGSS...VPLPPLSPMPM....PEQSAPPPPAAAPPTPTQSLGR.......GCATAIPPSYFATPPNGM.LGHPFGVR....PPHHFLPPPP...GMLPRGF...I..........P...PFGPPPPFATP.......QMPE..FGLL...GTAPPVTVMVPYPI..IIPLPIPIPVPLPI B4QBP7_DROSI/273-602 TCDWCKHVRHAVSYVDFQDGASQLQFCSEKCLNQYKMQIFCHETQAHL......DMHPHLRDQAMD.....AGSE........AGLITPDLW...LRNCR.SRSASPASTLSVSPGPPPAP......AVSRCPDSPP.....SHHPSKPLISVAPVSKLLAQKCPGPGPGVPPPPPP...PVQRHL..........AGKLGRRKRRGISSSSGSETVASLLQKQQQHHQQ...QQAPQQHQAPPTLTADFGGSS...VPLPPLSPMPSMQEAPQQQQAPQVPPTQVPPRGP.............TAIPASYFATPSNGVPMGHPFAGR.PPPPPHHFMHPPP..HGMMPRGF...G..........P...HFGPPLPPQ..............MPELGSLLGAVPPVTVMVPYPI..IIPLPIPIPVPLPV B4LIW1_DROVI/240-582 TCDWCKHVRHAVSYVDFQDGASQLQFCSEKCLNQYKMQIFCNETQAHL......DMNPHLRDQGLEA..AAAGDG........AGLITPDLW...LRNCR.SRSASPAS..TISVSPGPAAVCIANASKRSSPCNTP.....PSHPNKPLISVAPVSKLLAQKSPASSSITTGAPPPPLPQLQRHHVH.....AGPKPGRRKRPHRAAG.AAGSETVASLLQKQQ.............QQQQPAPLSADFAGSS...VPLPPLSPMPDQPQ.PQQQHQQ.HPPPQLPQPMGRP..........PAGMPGGYFATPPNGM.LGHPFGAR.AAPPPPHPFLPPP..PGMLPRGF...G..........A...HFGPPPPPQLQP.....QLQPELQGALGALLGAAPPVTVMVPYPI..IIPLPIPIPVPLPI Q7PZA5_ANOGA/218-535 TCDWCRHVRHTVSYVDFQDGASQLQFCSDKCLNQYKMQIFCNETQAHL......EMNPHLKEKSTS.....AGKVT.....GLRTLITPELW...MKNCK.SCSISPVSDRSESVSPVPSL......PVRSSPEPSPVLLRSPTPAKKPLISLAPASKLLS.KSLQPSTLPTRPS.P...KSSRKR..........RTGHRPPPGTNASMSSGKRST..........................VTHQAEYGGSA.........GASSSINNNNNVTIPNNNLLTAVLNI..................PPQFLPPPLNL..LRAPFFPL..NPAQLRFPAGLPNLPNAQPPPAP.PP..........P..PPMGPPPSPLAGGPLGGPTGSRPPLPNLLGFGGAAPPVTILVPYPI..IIPLPLPIPVPIPV B0WBD5_CULQU/105-432 TCDWCRHVRHAVSYVDFQDGVTQLQFCSDKCLNQYKMQIFCNETQAHL......DMNPHLKEKSTS.....AG...........SLITPDLW...LKNCK.SRSASPLSDRSESVSPAPSL......PMRPSPEPSPVM..QSSPAKKPMISVAHPSKLLS.KNLQPSAAVNARTVI...KSTRKR..........RPGLRPLQQNTLHNRKNAKVDYGQLTSNN.............NNNNASVLNNNLPASS...MPKPATVTSANIQD.LRAGISLQNLTHSLTPTKFESRESPTTPRPPPLNIPPQFLQLPPNPA.MRPPFFPL....NPAFRFGSPPNPQAMPPP.....P..........P...PNMDPNNPQNR.......HQPP.LLGFP.....APPVTILVPYPI..VIPLPLPIPIPIPL Q16NQ8_AEDAE/54-382 TCDWCRHVRHAVSYVDFQDGVTQLQFCSDKCLNQYKMQIFCNETQAHL......DMNPHLKEKSAS.....TG...........SLITPDLW...LKNCK.SRSMSPASDRSESVSPTPS........MHQSPEPSPTM..RSPPSKKPMISVAPASKLLSKTVQIPISRNTAK.......ANRKR..........RPGLRPMQQTVLQNRRSSNLKLDFSQTNN..............NNNSNVLNNNLTSTSSGMLTKPATVTSGSVQD.LRNAIPMQKLPHPLTPTKFENRESP.TTPRPPLNIPHKFLQMPPNSA.MRPPFFPM....NPMFRFGSFPNQPNTMPP.....P..........P...PIGTPLPNIDLGS...ANRPPGSILGLP.....VPPVTILVPCPI..VIPLPLPIPIPIPL E9GN80_DAPPU/141-450 VCDWCKHVRHTVSYVDLHDGQRQLQFCTDKCRNQYKMRLFVAEAS.SLGV..QSCATPGSKGET.S.....PGTSIDLKKQSTGILITPDLW...LSDC...QLDNPINVKLECNEET..........ELASIDSEPE....EQNCEIPLNLVHSSRRSVQPVKRKPNLV.EIPLS....KKGKSH..........RKHHSSS.....HQHKTSIYTRNLIPPP................PPTLPCPATTWPS....FPYPPHMVPSYILQ.SQFLALNRSLK.....P..................PKNPCVRSTDRE.DQISFTPV..IESNKMPSEPVI..EQSSTKVA...T..........PPDERTAEAINELTE.......NKQEINH.TKLLTPEIPPNFVVLRIPY..LIPLPIPVPIPIPL K1PCW3_CRAGI/167-508 ICDWCKHVRHTVNYVDFQDGETQLQFCSSKCLNQYKMNIFCKETQEHL.....QQISPTTETKSDT.....EGSSD......QQILITPDLW...LSNAK.HKNAKLRRKEAGDTDRELENAEKMESKSDHSRRQSPT....SASSSSDKLPVTNR..SILERMHGRDKSKRSSLRE...SLHERS..........VPRSKSPDSAQ.........PAQNLGIPFVPPHMWGAPTFGMGIPPMGAVPPWFYPGF...MPPGFMPPNMPMEG.LSGVLPSRTETPSATNNKRF............ALSPNYDNESSKNAN.EERGHGRS.SSRPISSARSNTPRSSTPRNTSFSTPT..........P...QSGAPLPNGVG.......MFGQNGMNFS.SYGGFPPLTMILPVPV..PLPLPVPIPLPLPI W4XN74_STRPU/158-433 VCDWCRHTRHLQDYINLIDGERRLQFCSPKCMNQYKMESYYKDGP.SL........KRKGTKAGASSGSKSNGS..........APVSPTLQ...QQSL..SRGNSPDTVSQDASYHD..........SQTSPSNIPL........SGVFQTNSSSSQSLMGMPQLRPSGVFGVQTV...................MPQAMSFRQVVPQMPVMQVVGSGGVP.....................LNMVQGLNQSSMAGSRKATGRLNPQQWGNIQGIQGVTSFKKIRPKPQGS...........SATLPSSTVTSASNSA....PFSK.......HKPRQNTPSSTSAAAAG.................................................SSVLPPVTVMVPHPV..FIPIPVPIPIPIPV W5NIN6_LEPOC/228-538 VCDWCKHIRHTKEYLDFGAGERRLQFCSAKCLNQYKMDIFYKETQANLPA..GLCNPGHPPMESKP.....ESSN........LQLLTPDSWNTPLGDLR.RKAPSPGTSMAGH.NQAPG........PSGSASASPS.......EATAICSSA.KIPTPGPKPHESPTLPPPVPIP.......................APPMGVPSGSPPMVMTHRGP.......................IPLPFFMDHPMLPQIRSPFLRAPHT.PS.PNSPLPNPIIPGIGPPP..................PPRNLGPTSSPM.HRPMLSPHIHPPTTPTMPGNPPGLMPPHPGAP...M..........P.....GLPFPPVNMMPNGHMPMPQMMNFGVPSLAPLVPPPTLLVPYPV..IVPLPVPIPIPVPI I3KTY8_ORENI/226-537 VCDWCKHIRHTKEYLDFGAGERRLQFCSAKCLNQYKMDIFYKETQAALPG..ALCNPGHGAGGEGKPE.CSSGV..........QLLTPESWGTPLTDLR.RKAPSPGGPSSTSVL.APST......SSAASPSDTA.........AVCSPSSSSSAKIPTPRPHESPSLPPPPV......................PTLHPPVGVPSGSPPMVMTPRGP.......................MPLPLFMEHQMMQQIRPPFLRTSAHPGG.PNSPLSNPIIPGIGPPP..................PPRTLGPASSPM.HRPLLSPH.....VHPSSNPNPGMIPPHPGI...............P..MPGLPPFPPVNMMPNGPIPLPPMMNFGMPSLAPLVPPPTLLVPYPV..IVPLPVPIPIPIPI SOBPA_DANRE/255-571 VCDWCKHIRHTKEYLDFGAGERRLQFCSAKCLNQYKMDIFYKETQAALPG..GLCNPPLPTSDTKSE..SGAGV..........QLLTPESWSAPLSELRSRKAPSPVGATIAGPSG..S........TSGSPSEAGT....VCSSSSSSSSSSSSTKIPTPRPHESPSLPPPHPPP...................ISGLHPALGMPPGSPPMVMTPRGP.......................VPFPIFMEHQMMQQMRPPFLRPPG.....PNSPHSNPMIPGIGPPP..................PPRTLCPPSSPM.HRPLLSPHLHPSSTPTLSGNPPGIMPPHPAAH...M..........P.....GLPFPPVNMMPSGPIPVPPIMNIGMPSLAPLVPPPTLLVPYPV..IVPLPVPIPIPVPI #=GC seq_cons sCDWCKHlRHTVsYVDFpDGppQLQFCSsKCLNQYKMpIFs+ETQApL......shsPHhcpcutu.....sGs..........pLITPDLW...Lpss+.p+usSPsosputSsussPu........spsSPssoP.....ssssspshhSsusso+llu.+sptssshsssss.........p...........hssh+s.htssstssshshostu........................ssLssthtts....h..Phhpsssph.s.spsslss.hlPs.sPPP..................PPpslssssssh.h+ssFuPp....ss.ph.sssPs..sshPsuh...s..........P.....GsPhPPhsh.......h....thulsshtshlPPlTlLVPYPl..IIPLPlPIPIPIPl //