Sequence alignment for CL0451

PF00093

TSP2_MOUSE/320-374-----------------CV.QEGRIFAENETWVVDS........CT.TCTCKK........FKTVCHQIT..CSP.....
VWF_HUMAN/2431-2494-----------------CV.HRSTIYPVGQFWEEG.........CD.VCTCTDMEDAVMGLRVAQCSQKP..CE......
VWF_HUMAN/2257-2325-----------------CIGEDGVQHQFLEAWVPDHQ......PCQ.ICTCLS.......GRKVNCTTQP..CPTAKAPT
VWF_HUMAN/2582-2644-----------------CM.LNGTVIGPGKTVMIDV........CT.TCRCMVQVG.VISGFKLECRKTT..C.......
CO1A1_HUMAN/40-95-----------------CV.QNGLRYHDRDVWKPEP........CR.ICVCDN........GKVLCDDVI..CDE.....
CO5A2_HUMAN/41-96-----------------CT.QNGQMYLNRDIWKPAP........CQ.ICVCDN........GAILCDKIE..CQD.....
CO1A1_CHICK/33-88-----------------CV.QDGLTYNDKDVWKPEP........CQ.ICVCDS........GNILCDEVI..CED.....
CO3A1_MOUSE/33-89-----------------CS.HLGQSYESRDVWKPEP........CQ.ICVCDS........GSVLCDDII..CDEE....
SOG_DROME/941-1019-----------------CK.VVNKVYENGQEWHPILMSH.GEQKCI.KCRCKD........SKVNCDRKR..CSR.....
TSP1_HUMAN/318-372-----------------CY.HNGVQYRNNEEWTVDS........CT.ECHCQN........SVTICKKVS..CPI.....
TSP2_CHICK/326-380-----------------CW.QDGRVFADSESWIVDS........CT.KCTCQD........SKIVCHQIT..CPP.....
CYR61_CHICK/100-163-----------------CE.YNSKIYQNGESFQPN.........CKHQCTCID........GAVGCIPL...CPQELSLP
NOV_CHICK/106-169-----------------CV.FDGMIYRNGETFQPS.........CKYQCTCRD........GQIGCLPR...CNLGLLLP
SOG_DROME/832-898-----------------CR.LGEQFHPAGASWHPFLPPN.GFDTCT.TCSCDPL......TLEIRCPRLV..CPP.....
CO2A1_MOUSE/34-88-----------------CL.QNGQRYKDKDVWKPSS........CR.ICVCDT........GNVLCDDII..CED.....
SOG_DROME/744-803-----------------CF.HSGRFYNESEQWRSAQD......SCQ.MCACLR........GQSSCEVIK..CPA.....
CTGF_HUMAN/103-166-----------------CI.FGGTVYRSGESFQSS.........CKYQCTCLD........GAVGCMPL...CSMDVRLP
SOG_DROME/102-174-----------------CQ.FGKVLRELGSTWYADLGPPFGVMYCI.KCECVAIPKKRRIVARVQCRNIKNECPP.....
F6XC33_XENTR/326-380-----------------CL.HNGVLHKNRDEWTVDS........CT.ECTCQN........SATICRKVS..CPL.....

PF12714

C3YHS2_BRAFL/245-297----------------PPCYFHGNAYPSGEWWTY...-------ACKECTC.E-------CGQVVCQPVQ--CPP-----
C3YHS2_BRAFL/348-403----------------PPCYVNGNVIPSGEWWTNTIT-------ACTECTC.L-------CGQVQCQPIK--CPP-----
ZAN_MOUSE/1221-1275----------------NCP.YNNKHYKPGEEWFTPN.-------CTERCRCLP-------GSLMECQISQ--CGT-----
ZAN_MOUSE/1609-1664----------------GCTSFQGRYFKLQEQWFNPD.-------CKEICTCES-------HNHILCKPWK--CKA-----
ZAN_MOUSE/2399-2454----------------GCKDAQGVFIPAGKTWISED.-------CTQSCTC.M-------KGSMRCWDFQ--CPP-----
ZAN_MOUSE/2519-2574----------------GCQHTQGGFIPAGKNWTSRG.-------CSQSCDC.M-------EGVIRCQNFQ--CPS-----
ZAN_MOUSE/2639-2694----------------GCKDAQGVLIPADKIWINKG.-------CTQTCAC.V-------TGTIHCRDFQ--CPS-----
ZAN_MOUSE/2759-2814----------------GCKDAQGVLIPAGKTWINRG.-------CTQSCSC.M-------GGAIQCQNFK--CPS-----
ZAN_MOUSE/2879-2934----------------GCKDTQGVLIPADKTWINRG.-------CTQSCTC.R-------GGAIQCQKYH--CSS-----
ZAN_MOUSE/3119-3174----------------GCKDAQGALIPEGKTWITSG.-------CTQSCNC.T-------GGAIQCQNFQ--CPL-----
ZAN_MOUSE/3239-3294----------------GCKNTQGAFISADKTWISRG.-------CTQSCTC.P-------AGAIHCRNFK--CPS-----
ZAN_MOUSE/3356-3411----------------GCKDARGAIIPAGKTWTSKG.-------CTQSCAC.V-------EGNIQCQNFQ--CPP-----
ZAN_MOUSE/3476-3531----------------DCKDAQGALIPAGKTWTSPG.-------CTQSCAC.M-------GGAVQCQSSQ--CPP-----
ZAN_MOUSE/3596-3651----------------GCKDAHGALIPEDKTWVSRG.-------CTQSCVC.T-------GGSIQCLSSQ--CPP-----
ZAN_MOUSE/3836-3887----------------GCRDTQGAVIPAGKTWLSTG.-------CIQSCAC.V-------EGTIQCQNFQ--CPP-----
ZAN_MOUSE/3952-4007----------------DCKDAEGGLVPAGKTWTSKD.-------CTQSCAC.T-------GGAVQCQNFQ--CPL-----
ZAN_MOUSE/4088-4143----------------GCKDTQGGLIPAGRTWISSD.-------CTKSCSC.M-------GGIIQCRDFQ--CPP-----
ZAN_MOUSE/4208-4258----------------GCKDAQGGFVPAGKTWISRG.-------CTQSCAC.V-------GGAVQCHNFT--CPT-----
ZAN_MOUSE/4323-4378----------------GCKDAQGDLIPANKTWLTRG.-------CAQKCTC.K-------GGNIHCWNFK--CPL-----
ZAN_MOUSE/4443-4498----------------GCKDSQGTLIPAGKNWITTG.-------CSQRCTC.T-------GGLVQCHDFQ--CPS-----
ZAN_MOUSE/4563-4618----------------GCKDAHGVLIPESKTWVSRG.-------CTKNCTC.K-------GGTVQCHDFS--CPT-----
ZAN_MOUSE/4683-4738----------------DCKDFQGSLIKTGQTWISSG.-------CSKICTC.K-------GGFFQCQSYK--CPS-----
ZAN_MOUSE/4803-4858----------------GCKDIRGNIIPAGNTWISSD.-------CTQSCAC.T-------DGVIQCQNFV--CPS-----
ZAN_MOUSE/5204-5258----------------GCT.ADGIYYKLGDSFVTND.-------CSQHCTCAS-------QGILLCEPYG--CRA-----
ZAN_RABIT/551-605----------------PCL.YNNNNYEPEAEWFSPN.-------CTELCHCWP-------GGRIECQISQ--CKT-----
ZAN_RABIT/935-990----------------GCLDPSRGYFKVGEQWFKSD.-------CKQLCICEG-------SNQIRCQPWK--CGP-----
ZAN_RABIT/1323-1379----------------GCTDFEGSYHLVRESWYTDNT-------CSRLCTCSL-------HNNITCRQTA--CKP-----
ZAN_RABIT/1727-1782----------------SCRDAQGGSIPSGKSWVSSG.-------CSQKCAC.T-------EGSIQCRAFH--CPS-----
ZAN_RABIT/2130-2184----------------GCT.SNGIYYQLGHSFVTAD.-------CSQRCTCAS-------SGVLLCEPFS--CRP-----
ZAN_PIG/740-794----------------DCF.YNDNYYKLGTDWFSPN.-------CTEHCHCRP-------SSRMECQTFK--CGT-----
ZAN_PIG/1512-1568----------------GCTTQRGSYHPVGESWYTDNS-------CSRLCTCSA-------HNNISCRQAS--CKP-----
ZAN_PIG/1908-1963----------------GCRDARGTFLPVGRFRLSSG.-------CSQMCVC.T-------AGAIECRPFT--CPS-----
F5H4B5_HUMAN/1094-1148----------------NCF.YNNDYYEPGAEWFSPN.-------CTEHCRCWP-------GSRVECQISQ--CGT-----
F5H4B5_HUMAN/1480-1535----------------GCLHPAGSYFKVGERWYKPG.-------CKELCVCES-------NNRIRCQPWR--CRA-----
H3CDL3_TETNG/143-198----------------.ATTRTESYRPVGDTWFTSKN-------CSERCKCNG-------NHNVTCEPWQ--CSP-----

PF05825

MSMB_MOUSE/21-113VCSIENREIFPNQMSDDCMDADGNKHFLNTPWK.KN--------CT-WCSCDK-------TSITCCTNAT--RPL--SYD
MSMB_RAT/21-113ACSIQRLKRLPNEKSDECTDVDGGKHVLNTYWQ.KN--------CE-WCFCEK-------TAITCCTKTL--IPV--SYD
MSMB_PIG/21-111QCYFIPNQSL...KPNECQDLKGVSHPLNSVWKTKD--------CE-ECTCGQ-------NAISCCNTAA--IPT--GYD
MSMB_MACMU/21-114SCSFIPNERFPGDSTRECTDLKGNKHPINSKWKTDN--------CE-RCTCYK-------TEIICCTLIA--TPV--GYD

PF00093

TSP2_MOUSE/320-374.ATCANP................SFVEGECCP..SC--
VWF_HUMAN/2431-2494.DSCRSG.............FTYVLHEGECCG..RC--
VWF_HUMAN/2257-2325CGLCEVA..............RLRQNADQCCPEYEC--
VWF_HUMAN/2582-2644.NPCPLG.............YKEENNTGECCG..RC--
CO1A1_HUMAN/40-95TKNCPGA................EVPEGECCP..VC--
CO5A2_HUMAN/41-96VLDCADP................VTPPGECCP..VC--
CO1A1_CHICK/33-88TSDCPNA................EIPFGECCP..IC--
CO3A1_MOUSE/33-89PLDCPNP................EIPFGECCA..IC--
SOG_DROME/941-1019.STCQQQTRVTSKRRLFEKPDAAAPAIDECCS.TQC--
TSP1_HUMAN/318-372.MPCSNAT............VP....DGECCP..RC--
TSP2_CHICK/326-380.VSCADP.............SF...IEGECCP..VC--
CYR61_CHICK/100-163NLGCPSP..............RLVKVPGQCCEEWVC--
NOV_CHICK/106-169GPDCPFP..............RKIEVPGECCEKWVC--
SOG_DROME/832-898.LQCSEK.............LAYRPDKKACCK..IC--
CO2A1_MOUSE/34-88.PDCLNP................EIPFGECCP..IC--
SOG_DROME/744-803.LKCKST.............EQLLQRDGECCP..SC--
CTGF_HUMAN/103-166SPDCPFP..............RRVKLPGKCCEEWVC--
SOG_DROME/102-174.AKCDDP................ISLPGKCCK..TC--
F6XC33_XENTR/326-380.MPCTNA................TIPDGECCP..RC--

PF12714

C3YHS2_BRAFL/245-297VTVPTCE-----------------.NGAVLEE------
C3YHS2_BRAFL/348-403VTVPECV-----------------.NGAEPEY------
ZAN_MOUSE/1221-1275HTVCQLK-----------------.SDQYQCE------
ZAN_MOUSE/1609-1664QEACSYK-----------------.NGVLGCH------
ZAN_MOUSE/2399-2454GTYCKNS-----------------NDGSSNCV------
ZAN_MOUSE/2519-2574GTYCQDI-----------------EDGTSNCA------
ZAN_MOUSE/2639-2694GTYCKDI-----------------KDDASNCT------
ZAN_MOUSE/2759-2814EAYCQDM-----------------EDGNSNCT------
ZAN_MOUSE/2879-2934GTYCKDM-----------------EDDSSSCA------
ZAN_MOUSE/3119-3174KTYCKDL-----------------KDGSSNCT------
ZAN_MOUSE/3239-3294GTYCKNG-----------------DNGSSNCT------
ZAN_MOUSE/3356-3411ETYCKDN-----------------SEGSSTCT------
ZAN_MOUSE/3476-3531GTYCKDN-----------------EDGNSNCA------
ZAN_MOUSE/3596-3651GAYCKDN-----------------EDGSSNCA------
ZAN_MOUSE/3836-3887GTYC...-----------------.NHNNNCA------
ZAN_MOUSE/3952-4007GTYCKDS-----------------GDGSSNCT------
ZAN_MOUSE/4088-4143GTYCKES-----------------NDSSRTCA------
ZAN_MOUSE/4208-4258GTQCQ..-----------------...NSSCS------
ZAN_MOUSE/4323-4378GTECKDS-----------------VDGGSNCT------
ZAN_MOUSE/4443-4498GAECQDI-----------------EDGNSNCV------
ZAN_MOUSE/4563-4618GSRCLDN-----------------NEGNSNCV------
ZAN_MOUSE/4683-4738GTQCEES-----------------EDGSSNCV------
ZAN_MOUSE/4803-4858GSHCQYN-----------------EDGSSDCA------
ZAN_MOUSE/5204-5258GESCMVA-----------------.NFTRGCF------
ZAN_RABIT/551-605HTKCQLK-----------------.NGQYECQ------
ZAN_RABIT/935-990HEVCSQQ-----------------.SGIYGCH------
ZAN_RABIT/1323-1379GQQCWAV-----------------.DGLLRCR------
ZAN_RABIT/1727-1782RSHCKLN-----------------SNGNSNCV------
ZAN_RABIT/2130-2184GESCTLG-----------------.NLTRGCF------
ZAN_PIG/740-794HTVCQLK-----------------.NGQYGCH------
ZAN_PIG/1512-1568SQMCWPQ-----------------.DGLIRCR------
ZAN_PIG/1908-1963GSQCEPN-----------------EDGKDFCQ------
F5H4B5_HUMAN/1094-1148HTVCQLK-----------------.NGQYGCH------
F5H4B5_HUMAN/1480-1535QEFCGQQ-----------------.DGIYGCH------
H3CDL3_TETNG/143-198AQECKVV-----------------.DGLLDCH------

PF05825

MSMB_MOUSE/21-113KDNCDVQ----FHPENCTYSVVDRKNPGKTCRVDSWTM
MSMB_RAT/21-113KKRCQRQ----FHSENCTYSVVERTNPGKTCPVNGWTI
MSMB_PIG/21-111TNKCQKI----LNKKTCTYTVVEKKDPGKTCDVTGWVL
MSMB_MACMU/21-114KKKCQRI----FKKEDCKYIVVEKKNPKKTCPIDQWIL

This alignment is coloured according to the ClustalX colouring scheme:

 
Glycine (G)
 
Proline (P)
 
Small or hydrophobic (C,A,V,L,I,M,F,W)
 
Hydroxyl or amine amino acids (S,T,N,Q)
 
Charged amino-acids (D,E,R,K)
 
Histidine or tyrosine (H,Y)

For UniProt-based alignments, we also add some additional mark-up to the alignments where appropriate. Active site information is shown as follows:

 
Active site (residue annotated in SwissProt as an active site)
 
Predicted active site (residue aligns in a Pfam alignment with a SwissProt active site)
 
Predicted active site (residue annotated in SwissProt as a potential active site)

Some UniProt sequences can be mapped to protein structures, in which case we also show the secondary structure definition. These lines are shown below the sequence to which they apply and are marked (SS). The meaning of each of the symbols is as follows:

C
Random coil
H
Alpha-helix
G
3(10) helix
I
Pi-helix
E
Hydrogen bonded beta-strand (extended strand)
B
Residue in isolated beta-bridge
T
H-bonded turn (3-turn, 4-turn, or 5-turn)
S
Bend (five-residue bend centered at residue i)
Close window