Planaria EST Database


DrC_00731

BLASTX 2.2.13 [Nov-27-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= DrC_00731
         (1088 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P25782|CYSP2_HOMAM  Digestive cysteine proteinase 2 precu...   303   4e-82
sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   302   1e-81
sp|P06797|CATL_MOUSE  Cathepsin L precursor (Major excreted ...   300   4e-81
sp|P07711|CATL_HUMAN  Cathepsin L precursor (Major excreted ...   300   6e-81
sp|Q9GLE3|CATK_PIG  Cathepsin K precursor                         299   8e-81
sp|P43235|CATK_HUMAN  Cathepsin K precursor (Cathepsin O) (C...   298   2e-80
sp|P61277|CATK_MACMU  Cathepsin K precursor >gi|47117667|sp|...   298   2e-80
sp|P07154|CATL_RAT  Cathepsin L precursor (Major excreted pr...   297   3e-80
sp|Q95029|CATL_DROME  Cathepsin L precursor (Cysteine protei...   297   4e-80
sp|P43236|CATK_RABIT  Cathepsin K precursor (OC-2 protein)        296   7e-80
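
The summary table above can be read programmatically. The following is a minimal sketch, not part of the BLAST output: the regular expression and the `parse_summary` helper are assumptions made for illustration, and the three sample rows are copied from the table (descriptions ending in "..." are truncated by BLAST itself).

```python
import re

# Three rows copied from the summary table; truncated descriptions kept as-is.
SUMMARY = """\
sp|P25782|CYSP2_HOMAM  Digestive cysteine proteinase 2 precu...   303   4e-82
sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   302   1e-81
sp|Q9GLE3|CATK_PIG  Cathepsin K precursor                         299   8e-81
"""

# Hypothetical pattern: identifier, free-text description, bit score, E-value.
ROW = re.compile(r"^(\S+)\s+(.*?)\s+(\d+)\s+(\d*\.?\d*e[-+]?\d+|\d+\.?\d*)\s*$")


def parse_summary(text):
    """Return one dict per hit line that matches the ROW pattern."""
    hits = []
    for line in text.splitlines():
        m = ROW.match(line)
        if not m:
            continue
        _db, accession, entry = m.group(1).split("|")
        ev = m.group(4)
        # Older BLAST versions may print values such as "e-105" with no mantissa.
        evalue = float("1" + ev if ev.startswith("e") else ev)
        hits.append({
            "accession": accession,
            "entry": entry,
            "description": m.group(2),
            "bits": int(m.group(3)),
            "evalue": evalue,
        })
    return hits


hits = parse_summary(SUMMARY)
for h in hits:
    print(h["accession"], h["entry"], h["bits"], h["evalue"])
```
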
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precursor
          Length = 323

 Score =  303 bits (777), Expect = 4e-82
 Identities = 157/306 (51%), Positives = 201/306 (65%), Gaps = 7/306 (2%)
 Frame = +1
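
The percentages on the Identities line appear to be the count divided by the alignment length, truncated to an integer (201/306 is about 65.7% but is printed as 65%). That truncation is inferred from the numbers in this report, not taken from BLAST documentation. A small sketch with hypothetical helper names:

```python
import re

# Line copied from the HSP header above.
HSP_LINE = " Identities = 157/306 (51%), Positives = 201/306 (65%), Gaps = 7/306 (2%)"


def parse_counts(line):
    """Return {label: (count, alignment_length)} for each 'Label = a/b' pair."""
    pairs = re.findall(r"(\w+) = (\d+)/(\d+)", line)
    return {label.lower(): (int(a), int(b)) for label, a, b in pairs}


def truncated_percent(count, length):
    """Integer percentage, rounded down (assumed to match the report)."""
    return 100 * count // length


counts = parse_counts(HSP_LINE)
for label, (count, length) in counts.items():
    print(label, count, length, truncated_percent(count, length))
```
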

Query: 79  WQVFKTKFNKNYT-AIDELIRKTIWIENIKYIQHHNVKYDLGHHSHSLGINEFSDLTYKE 255
           W+ FK K+ + Y  A ++  R+ I+ +N KYI+  N KY+ G  + +L +N+F D+T +E
Sbjct: 20  WEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEE 79

Query: 256 FEKLYLLSKNIEYNDG--IDYLPPLNIDNLPDSVDWRTKGYVTNVKNQGQCGSCWAFSTT 429
           F  +  +  NI         + P          VDWRTKG VT VK+QGQCGSCWAFSTT
Sbjct: 80  FNAV--MKGNIPRRSAPVSVFYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFSTT 137

Query: 430 GSLEGQHFRKHKVLQNISEQQLVDCVTKNS--GCNGGWMNIAFEYI-SSHGIESEDNYPY 600
           GSLEGQHF K   L +++EQQLVDC       GCNGGWMN AF+YI +++GI++E  YPY
Sbjct: 138 GSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPY 197

Query: 601 QAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVATVGPISVAIDVGYS-FQQYKQGV 777
           +A+ G+C FD + V A C G  NI S +E  L  AV  +GPISV ID  +S FQ Y  GV
Sbjct: 198 EARDGSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGV 257

Query: 778 YYEAKCDPTIQNHAVLVVGYGVENGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATT 957
           YYE  C P+  +HAVL VGYG E G  +WLVKNSW  SWG  GYIKMS++R+NNCGIAT 
Sbjct: 258 YYEPSCSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATV 317

Query: 958 ASFPIV 975
           AS+P+V
Sbjct: 318 ASYPLV 323
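
In a BLASTX alignment the Query coordinates are nucleotide positions in the EST, while the Sbjct coordinates are residue positions in the protein. The sketch below converts the query span of this alignment (79 to 975, Frame = +1) into a codon count. The `query_span` helper is hypothetical and handles forward frames only.

```python
def query_span(start, end):
    """Return (frame, codons) for a forward-strand HSP.

    start and end are 1-based, inclusive nucleotide coordinates.
    """
    length_nt = end - start + 1
    if length_nt % 3 != 0:
        raise ValueError("span is not a whole number of codons")
    frame = (start - 1) % 3 + 1  # positions 1, 4, 7, ... are frame +1
    return frame, length_nt // 3


frame, codons = query_span(79, 975)
subject_residues = 323 - 20 + 1  # Sbjct coordinates are already in residues
print(frame, codons, subject_residues)
```
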
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score =  302 bits (774), Expect = 1e-81
 Identities = 155/314 (49%), Positives = 205/314 (65%), Gaps = 8/314 (2%)
 Frame = +1

Query: 58  NEEFNAEWQVFKTKFNKNYTAIDELIRKTIWIENIKYIQHHNVKYDLGHHSHSLGINEFS 237
           ++  + +W  +K    + Y A +E  R+ +W +N+K I+ HN +Y  G H  ++ +N F 
Sbjct: 22  DQNLDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFG 81

Query: 238 DLTYKEFEKLYLLSKNIEYNDGIDYLPPLNIDNLPDSVDWRTKGYVTNVKNQGQCGSCWA 417
           D+T +EF ++    +N ++  G  +  PL +D LP SVDWR KGYVT VKNQ QCGSCWA
Sbjct: 82  DMTNEEFRQMMGCFRNQKFRKGKVFREPLFLD-LPKSVDWRKKGYVTPVKNQKQCGSCWA 140

Query: 418 FSTTGSLEGQHFRKHKVLQNISEQQLVDCVTK--NSGCNGGWMNIAFEYISSH-GIESED 588
           FS TG+LEGQ FRK   L ++SEQ LVDC     N GCNGG+M  AF+Y+  + G++SE+
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEE 200

Query: 589 NYPYQAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVATVGPISVAIDVGY-SFQQY 765
           +YPY A    C +     VAN  GF  +    EK L  AVATVGPISVA+D G+ SFQ Y
Sbjct: 201 SYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFY 260

Query: 766 KQGVYYEAKCDPTIQNHAVLVVGYGVE----NGHKYWLVKNSWGPSWGMNGYIKMSKDRD 933
           K G+Y+E  C     +H VLVVGYG E    N  KYWLVKNSWGP WG NGY+K++KD++
Sbjct: 261 KSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKN 320

Query: 934 NNCGIATTASFPIV 975
           N+CGIAT AS+P V
Sbjct: 321 NHCGIATAASYPNV 334
>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
           proteinase) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  300 bits (769), Expect = 4e-81
 Identities = 156/314 (49%), Positives = 207/314 (65%), Gaps = 8/314 (2%)
 Frame = +1

Query: 58  NEEFNAEWQVFKTKFNKNYTAIDELIRKTIWIENIKYIQHHNVKYDLGHHSHSLGINEFS 237
           ++ F+AEW  +K+   + Y   +E  R+ IW +N++ IQ HN +Y  G H  S+ +N F 
Sbjct: 22  DQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFG 81

Query: 238 DLTYKEFEKLYLLSKNIEYNDGIDYLPPLNIDNLPDSVDWRTKGYVTNVKNQGQCGSCWA 417
           D+T +EF ++    ++ ++  G  +  PL +  +P SVDWR KG VT VKNQGQCGSCWA
Sbjct: 82  DMTNEEFRQVVNGYRHQKHKKGRLFQEPLML-KIPKSVDWREKGCVTPVKNQGQCGSCWA 140

Query: 418 FSTTGSLEGQHFRKHKVLQNISEQQLVDC--VTKNSGCNGGWMNIAFEYISSH-GIESED 588
           FS +G LEGQ F K   L ++SEQ LVDC     N GCNGG M+ AF+YI  + G++SE+
Sbjct: 141 FSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEE 200

Query: 589 NYPYQAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVATVGPISVAIDVGY-SFQQY 765
           +YPY+AK G+C +     VAN  GF +I    EK L  AVATVGPISVA+D  + S Q Y
Sbjct: 201 SYPYEAKDGSCKYRAEFAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFY 259

Query: 766 KQGVYYEAKCDPTIQNHAVLVVGYGVE----NGHKYWLVKNSWGPSWGMNGYIKMSKDRD 933
             G+YYE  C     +H VL+VGYG E    N +KYWLVKNSWG  WGM GYIK++KDRD
Sbjct: 260 SSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRD 319

Query: 934 NNCGIATTASFPIV 975
           N+CG+AT AS+P+V
Sbjct: 320 NHCGLATAASYPVV 333
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
           Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 333

 Score =  300 bits (767), Expect = 6e-81
 Identities = 162/331 (48%), Positives = 214/331 (64%), Gaps = 9/331 (2%)
 Frame = +1

Query: 10  LIIIEFTQFITSH-LIINEEFNAEWQVFKTKFNKNYTAIDELIRKTIWIENIKYIQHHNV 186
           LI+  F   I S  L  +    A+W  +K   N+ Y   +E  R+ +W +N+K I+ HN 
Sbjct: 5   LILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQ 64

Query: 187 KYDLGHHSHSLGINEFSDLTYKEFEKLYLLSKNIEYNDGIDYLPPLNIDNLPDSVDWRTK 366
           +Y  G HS ++ +N F D+T +EF ++    +N +   G  +  PL  +  P SVDWR K
Sbjct: 65  EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYE-APRSVDWREK 123

Query: 367 GYVTNVKNQGQCGSCWAFSTTGSLEGQHFRKHKVLQNISEQQLVDC--VTKNSGCNGGWM 540
           GYVT VKNQGQCGSCWAFS TG+LEGQ FRK   L ++SEQ LVDC     N GCNGG M
Sbjct: 124 GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLM 183

Query: 541 NIAFEYISSH-GIESEDNYPYQAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVATV 717
           + AF+Y+  + G++SE++YPY+A + +C ++    VAN  GF +I    EK L  AVATV
Sbjct: 184 DYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKALMKAVATV 242

Query: 718 GPISVAIDVGY-SFQQYKQGVYYEAKCDPTIQNHAVLVVGYGVE----NGHKYWLVKNSW 882
           GPISVAID G+ SF  YK+G+Y+E  C     +H VLVVGYG E    + +KYWLVKNSW
Sbjct: 243 GPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSW 302

Query: 883 GPSWGMNGYIKMSKDRDNNCGIATTASFPIV 975
           G  WGM GY+KM+KDR N+CGIA+ AS+P V
Sbjct: 303 GEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>sp|Q9GLE3|CATK_PIG Cathepsin K precursor
          Length = 330

 Score =  299 bits (766), Expect = 8e-81
 Identities = 149/327 (45%), Positives = 214/327 (65%), Gaps = 7/327 (2%)
 Frame = +1

Query: 10  LIIIEFTQFITSHLIINEEFNAEWQVFKTKFNKNYTA-IDELIRKTIWIENIKYIQHHNV 186
           L ++     ++S L   E  + +W+++K  + K Y + +DE+ R+ IW +N+K+I  HN+
Sbjct: 4   LKVVLLLPVMSSALYPEEILDTQWELWKKTYRKQYNSKVDEISRRLIWEKNLKHISIHNL 63

Query: 187 KYDLGHHSHSLGINEFSDLTYKEFEK----LYLLSKNIEYNDGIDYLPPLNIDNLPDSVD 354
           +  LG H++ L +N   D+T +E  +    L +   +   ND + Y+P       PDS+D
Sbjct: 64  EASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPPSHSRSNDTL-YIPDWE-GRTPDSID 121

Query: 355 WRTKGYVTNVKNQGQCGSCWAFSTTGSLEGQHFRKHKVLQNISEQQLVDCVTKNSGCNGG 534
           +R KGYVT VKNQGQCGSCWAFS+ G+LEGQ  +K   L N+S Q LVDCV++N GC GG
Sbjct: 122 YRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGG 181

Query: 535 WMNIAFEYI-SSHGIESEDNYPYQAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVA 711
           +M  AF+Y+  + GI+SED YPY  +  NC+++ +   A C+G++ I   NEK L  AVA
Sbjct: 182 YMTNAFQYVQKNRGIDSEDAYPYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALKRAVA 241

Query: 712 TVGPISVAIDVGY-SFQQYKQGVYYEAKCDPTIQNHAVLVVGYGVENGHKYWLVKNSWGP 888
            VGP+SVAID    SFQ Y +GVYY+  C+    NHAVL VGYG++ G K+W++KNSWG 
Sbjct: 242 RVGPVSVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGKKHWIIKNSWGE 301

Query: 889 SWGMNGYIKMSKDRDNNCGIATTASFP 969
           +WG  GYI M+++++N CGIA  ASFP
Sbjct: 302 NWGNKGYILMARNKNNACGIANLASFP 328
>sp|P43235|CATK_HUMAN Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2)
          Length = 329

 Score =  298 bits (762), Expect = 2e-80
 Identities = 147/309 (47%), Positives = 207/309 (66%), Gaps = 6/309 (1%)
 Frame = +1

Query: 61  EEFNAEWQVFKTKFNKNYT-AIDELIRKTIWIENIKYIQHHNVKYDLGHHSHSLGINEFS 237
           E  +  W+++K    K Y   +DE+ R+ IW +N+KYI  HN++  LG H++ L +N   
Sbjct: 20  EILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLG 79

Query: 238 DLTYKEF-EKLYLLSKNIEYNDGID--YLPPLNIDNLPDSVDWRTKGYVTNVKNQGQCGS 408
           D+T +E  +K+  L   + ++   D  Y+P       PDSVD+R KGYVT VKNQGQCGS
Sbjct: 80  DMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWE-GRAPDSVDYRKKGYVTPVKNQGQCGS 138

Query: 409 CWAFSTTGSLEGQHFRKHKVLQNISEQQLVDCVTKNSGCNGGWMNIAFEYI-SSHGIESE 585
           CWAFS+ G+LEGQ  +K   L N+S Q LVDCV++N GC GG+M  AF+Y+  + GI+SE
Sbjct: 139 CWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSE 198

Query: 586 DNYPYQAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVATVGPISVAIDVGY-SFQQ 762
           D YPY  ++ +C+++ +   A C+G++ I   NEK L  AVA VGP+SVAID    SFQ 
Sbjct: 199 DAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQF 258

Query: 763 YKQGVYYEAKCDPTIQNHAVLVVGYGVENGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNC 942
           Y +GVYY+  C+    NHAVL VGYG++ G+K+W++KNSWG +WG  GYI M+++++N C
Sbjct: 259 YSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNAC 318

Query: 943 GIATTASFP 969
           GIA  ASFP
Sbjct: 319 GIANLASFP 327
>sp|P61277|CATK_MACMU Cathepsin K precursor
 sp|P61276|CATK_MACFA Cathepsin K precursor
          Length = 329

 Score =  298 bits (762), Expect = 2e-80
 Identities = 147/310 (47%), Positives = 208/310 (67%), Gaps = 7/310 (2%)
 Frame = +1

Query: 61  EEFNAEWQVFKTKFNKNYTA-IDELIRKTIWIENIKYIQHHNVKYDLGHHSHSLGINEFS 237
           E  +  W+++K    K Y + +DE+ R+ IW +N+KYI  HN++  LG H++ L +N   
Sbjct: 20  EILDTHWELWKKTHRKQYNSKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLG 79

Query: 238 DLTYKEFEK----LYLLSKNIEYNDGIDYLPPLNIDNLPDSVDWRTKGYVTNVKNQGQCG 405
           D+T +E  +    L + + +   ND + Y+P       PDSVD+R KGYVT VKNQGQCG
Sbjct: 80  DMTNEEVVQKMTGLKVPASHSRSNDTL-YIPDWE-GRAPDSVDYRKKGYVTPVKNQGQCG 137

Query: 406 SCWAFSTTGSLEGQHFRKHKVLQNISEQQLVDCVTKNSGCNGGWMNIAFEYI-SSHGIES 582
           SCWAFS+ G+LEGQ  +K   L N+S Q LVDCV++N GC GG+M  AF+Y+  + GI+S
Sbjct: 138 SCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDS 197

Query: 583 EDNYPYQAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVATVGPISVAIDVGY-SFQ 759
           ED YPY  ++ +C+++ +   A C+G++ I   NEK L  AVA VGP+SVAID    SFQ
Sbjct: 198 EDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQ 257

Query: 760 QYKQGVYYEAKCDPTIQNHAVLVVGYGVENGHKYWLVKNSWGPSWGMNGYIKMSKDRDNN 939
            Y +GVYY+  C+    NHAVL VGYG++ G+K+W++KNSWG +WG  GYI M+++++N 
Sbjct: 258 FYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNA 317

Query: 940 CGIATTASFP 969
           CGIA  ASFP
Sbjct: 318 CGIANLASFP 327
>sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted protein) (MEP) (Cyclic
           protein 2) (CP-2) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  297 bits (761), Expect = 3e-80
 Identities = 154/314 (49%), Positives = 207/314 (65%), Gaps = 8/314 (2%)
 Frame = +1

Query: 58  NEEFNAEWQVFKTKFNKNYTAIDELIRKTIWIENIKYIQHHNVKYDLGHHSHSLGINEFS 237
           ++ FNA+W  +K+   + Y   +E  R+ +W +N++ IQ HN +Y  G H  ++ +N F 
Sbjct: 22  DQTFNAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFG 81

Query: 238 DLTYKEFEKLYLLSKNIEYNDGIDYLPPLNIDNLPDSVDWRTKGYVTNVKNQGQCGSCWA 417
           D+T +EF ++    ++ ++  G  +  PL +  +P +VDWR KG VT VKNQGQCGSCWA
Sbjct: 82  DMTNEEFRQIVNGYRHQKHKKGRLFQEPLMLQ-IPKTVDWREKGCVTPVKNQGQCGSCWA 140

Query: 418 FSTTGSLEGQHFRKHKVLQNISEQQLVDCV--TKNSGCNGGWMNIAFEYISSH-GIESED 588
           FS +G LEGQ F K   L ++SEQ LVDC     N GCNGG M+ AF+YI  + G++SE+
Sbjct: 141 FSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEE 200

Query: 589 NYPYQAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVATVGPISVAIDVGY-SFQQY 765
           +YPY+AK G+C +     VAN  GF +I    EK L  AVATVGPISVA+D  + S Q Y
Sbjct: 201 SYPYEAKDGSCKYRAEYAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFY 259

Query: 766 KQGVYYEAKCDPTIQNHAVLVVGYGVE----NGHKYWLVKNSWGPSWGMNGYIKMSKDRD 933
             G+YYE  C     +H VLVVGYG E    N  KYWLVKNSWG  WGM+GYIK++KDR+
Sbjct: 260 SSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRN 319

Query: 934 NNCGIATTASFPIV 975
           N+CG+AT AS+PIV
Sbjct: 320 NHCGLATAASYPIV 333
>sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine proteinase 1) [Contains: Cathepsin
           L heavy chain; Cathepsin L light chain]
          Length = 341

 Score =  297 bits (760), Expect = 4e-80
 Identities = 158/315 (50%), Positives = 210/315 (66%), Gaps = 15/315 (4%)
 Frame = +1

Query: 76  EWQVFKTKFNKNYT-AIDELIRKTIWIENIKYIQHHNVKYDLGHHSHSLGINEFSDLTYK 252
           EW  FK +  KNY    +E  R  I+ EN   I  HN ++  G  S  L +N+++DL + 
Sbjct: 28  EWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHH 87

Query: 253 EFEKL-----YLLSKNIEYND----GIDYLPPLNIDNLPDSVDWRTKGYVTNVKNQGQCG 405
           EF +L     Y L K +   D    G+ ++ P ++  LP SVDWRTKG VT VK+QG CG
Sbjct: 88  EFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHV-TLPKSVDWRTKGAVTAVKDQGHCG 146

Query: 406 SCWAFSTTGSLEGQHFRKHKVLQNISEQQLVDCVTK--NSGCNGGWMNIAFEYISSH-GI 576
           SCWAFS+TG+LEGQHFRK  VL ++SEQ LVDC TK  N+GCNGG M+ AF YI  + GI
Sbjct: 147 SCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 206

Query: 577 ESEDNYPYQAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVATVGPISVAIDVGY-S 753
           ++E +YPY+A   +C F+K  V A  +GF +I   +EK +A AVATVGP+SVAID  + S
Sbjct: 207 DTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHES 266

Query: 754 FQQYKQGVYYEAKCDPTIQNHAVLVVGYGV-ENGHKYWLVKNSWGPSWGMNGYIKMSKDR 930
           FQ Y +GVY E +CD    +H VLVVG+G  E+G  YWLVKNSWG +WG  G+IKM +++
Sbjct: 267 FQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRNK 326

Query: 931 DNNCGIATTASFPIV 975
           +N CGIA+ +S+P+V
Sbjct: 327 ENQCGIASASSYPLV 341
>sp|P43236|CATK_RABIT Cathepsin K precursor (OC-2 protein)
          Length = 329

 Score =  296 bits (758), Expect = 7e-80
 Identities = 146/310 (47%), Positives = 207/310 (66%), Gaps = 7/310 (2%)
 Frame = +1

Query: 61  EEFNAEWQVFKTKFNKNYTA-IDELIRKTIWIENIKYIQHHNVKYDLGHHSHSLGINEFS 237
           E  + +W+++K  ++K Y + +DE+ R+ IW +N+K+I  HN++  LG H++ L +N   
Sbjct: 20  EILDTQWELWKKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLG 79

Query: 238 DLTYKEFEK----LYLLSKNIEYNDGIDYLPPLNIDNLPDSVDWRTKGYVTNVKNQGQCG 405
           D+T +E  +    L +       ND + Y+P       PDS+D+R KGYVT VKNQGQCG
Sbjct: 80  DMTSEEVVQKMTGLKVPPSRSHSNDTL-YIPDWE-GRTPDSIDYRKKGYVTPVKNQGQCG 137

Query: 406 SCWAFSTTGSLEGQHFRKHKVLQNISEQQLVDCVTKNSGCNGGWMNIAFEYIS-SHGIES 582
           SCWAFS+ G+LEGQ  +K   L N+S Q LVDCV++N GC GG+M  AF+Y+  + GI+S
Sbjct: 138 SCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENYGCGGGYMTNAFQYVQRNRGIDS 197

Query: 583 EDNYPYQAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVATVGPISVAIDVGY-SFQ 759
           ED YPY  +  +C+++ +   A C+G++ I   NEK L  AVA VGP+SVAID    SFQ
Sbjct: 198 EDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQ 257

Query: 760 QYKQGVYYEAKCDPTIQNHAVLVVGYGVENGHKYWLVKNSWGPSWGMNGYIKMSKDRDNN 939
            Y +GVYY+  C     NHAVL VGYG++ G+K+W++KNSWG SWG  GYI M+++++N 
Sbjct: 258 FYSKGVYYDENCSSDNVNHAVLAVGYGIQKGNKHWIIKNSWGESWGNKGYILMARNKNNA 317

Query: 940 CGIATTASFP 969
           CGIA  ASFP
Sbjct: 318 CGIANLASFP 327
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 122,278,779
Number of Sequences: 369166
Number of extensions: 2593843
Number of successful extensions: 7730
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 6674
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 7053
length of database: 68,354,980
effective HSP length: 112
effective length of database: 47,664,660
effective search space used: 11916165000
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
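
The search-space figures above can be reproduced from the other numbers in this report, and the Karlin-Altschul relation E = K * m * n * exp(-lambda * S) then gives an approximate E-value for a raw score. This is a sketch under stated assumptions: dividing the query length by 3 for the translated search is a simplification that happens to match the reported effective search space, and lambda and K are only printed to a few digits, so the E-value is approximate.

```python
import math

# Values copied from this report.
QUERY_NT = 1088              # query length in nucleotides
DB_LETTERS = 68_354_980      # length of database
DB_SEQS = 184_735            # sequences in the first listed database
HSP_LEN = 112                # effective HSP length
LAMBDA_GAPPED = 0.267        # gapped lambda
K_GAPPED = 0.0410            # gapped K


def effective_search_space(query_nt, db_letters, db_seqs, hsp_len):
    """Effective query length times effective database length."""
    eff_query = query_nt // 3 - hsp_len          # assumed: translated length
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query * eff_db


def expect(raw_score, space, lam, k):
    """Karlin-Altschul expected number of chance hits with score >= raw_score."""
    return k * space * math.exp(-lam * raw_score)


space = effective_search_space(QUERY_NT, DB_LETTERS, DB_SEQS, HSP_LEN)
e_top = expect(777, space, LAMBDA_GAPPED, K_GAPPED)  # raw score of the top hit
print(space, e_top)
```

With these inputs the search space equals the reported 11916165000, and the E-value for the top hit (raw score 777) comes out close to the reported 4e-82.
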

Cluster detail

DrC_00731

  1. Dr_sW_009_M03
  2. Dr_sW_007_I01
  3. Dr_sW_019_B02
  4. Dr_sW_011_A02
  5. Dr_sW_015_C18
  6. Dr_sW_002_E23
  7. Dr_sW_006_E15
  8. Dr_sW_025_O05
  9. Dr_sW_003_B01
  10. Dr_sW_002_O06
  11. Dr_sW_006_H24
  12. Dr_sW_026_O18
  13. Dr_sW_028_K05