Planarian EST Database


Dr_sW_028_K05

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_028_K05
         (581 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P25782|CYSP2_HOMAM  Digestive cysteine proteinase 2 precu...   194   1e-49
sp|Q9GLE3|CATK_PIG  Cathepsin K precursor                         183   2e-46
sp|P43235|CATK_HUMAN  Cathepsin K precursor (Cathepsin O) (C...   183   3e-46
sp|P61277|CATK_MACMU  Cathepsin K precursor >gi|47117667|sp|...   183   3e-46
sp|P25784|CYSP3_HOMAM  Digestive cysteine proteinase 3 precu...   183   3e-46
sp|O35186|CATK_RAT  Cathepsin K precursor                         182   4e-46
sp|P43236|CATK_RABIT  Cathepsin K precursor (OC-2 protein)        182   5e-46
sp|P06797|CATL_MOUSE  Cathepsin L precursor (Major excreted ...   181   1e-45
sp|P55097|CATK_MOUSE  Cathepsin K precursor                       180   2e-45
sp|P07154|CATL_RAT  Cathepsin L precursor (Major excreted pr...   180   3e-45
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precursor
          Length = 323

 Score =  194 bits (493), Expect = 1e-49
 Identities = 95/166 (57%), Positives = 116/166 (69%), Gaps = 4/166 (2%)
 Frame = +3

Query: 3   QLVDCVTKNS--GCNGGWMNIAFEYI-SSHGIESEDNYPYQAKQGNCVFDKSKVVANCKG 173
           QLVDC       GCNGGWMN AF+YI +++GI++E  YPY+A+ G+C FD + V A C G
Sbjct: 158 QLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPYEARDGSCRFDSNSVAATCSG 217

Query: 174 FQNINSCNEKDLAVAVATVGPISVAIDVGYS-FQQYKQGVYYEAKCDPTIQNHAVLVVGY 350
             NI S +E  L  AV  +GPISV ID  +S FQ Y  GVYYE  C P+  +HAVL VGY
Sbjct: 218 HTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPSYLDHAVLAVGY 277

Query: 351 GVENGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFPIV 488
           G E G  +WLVKNSW  SWG  GYIKMS++R+NNCGIAT AS+P+V
Sbjct: 278 GSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVASYPLV 323
>sp|Q9GLE3|CATK_PIG Cathepsin K precursor
          Length = 330

 Score =  183 bits (465), Expect = 2e-46
 Identities = 84/161 (52%), Positives = 114/161 (70%), Gaps = 2/161 (1%)
 Frame = +3

Query: 6   LVDCVTKNSGCNGGWMNIAFEYISSH-GIESEDNYPYQAKQGNCVFDKSKVVANCKGFQN 182
           LVDCV++N GC GG+M  AF+Y+  + GI+SED YPY  +  NC+++ +   A C+G++ 
Sbjct: 168 LVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDENCMYNPTGKAAKCRGYRE 227

Query: 183 INSCNEKDLAVAVATVGPISVAIDVGY-SFQQYKQGVYYEAKCDPTIQNHAVLVVGYGVE 359
           I   NEK L  AVA VGP+SVAID    SFQ Y +GVYY+  C+    NHAVL VGYG++
Sbjct: 228 IPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQ 287

Query: 360 NGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFP 482
            G K+W++KNSWG +WG  GYI M+++++N CGIA  ASFP
Sbjct: 288 KGKKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFP 328
>sp|P43235|CATK_HUMAN Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2)
          Length = 329

 Score =  183 bits (464), Expect = 3e-46
 Identities = 83/161 (51%), Positives = 116/161 (72%), Gaps = 2/161 (1%)
 Frame = +3

Query: 6   LVDCVTKNSGCNGGWMNIAFEYISSH-GIESEDNYPYQAKQGNCVFDKSKVVANCKGFQN 182
           LVDCV++N GC GG+M  AF+Y+  + GI+SED YPY  ++ +C+++ +   A C+G++ 
Sbjct: 167 LVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYRE 226

Query: 183 INSCNEKDLAVAVATVGPISVAIDVGY-SFQQYKQGVYYEAKCDPTIQNHAVLVVGYGVE 359
           I   NEK L  AVA VGP+SVAID    SFQ Y +GVYY+  C+    NHAVL VGYG++
Sbjct: 227 IPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQ 286

Query: 360 NGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFP 482
            G+K+W++KNSWG +WG  GYI M+++++N CGIA  ASFP
Sbjct: 287 KGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>sp|P61277|CATK_MACMU Cathepsin K precursor
 sp|P61276|CATK_MACFA Cathepsin K precursor
          Length = 329

 Score =  183 bits (464), Expect = 3e-46
 Identities = 83/161 (51%), Positives = 116/161 (72%), Gaps = 2/161 (1%)
 Frame = +3

Query: 6   LVDCVTKNSGCNGGWMNIAFEYISSH-GIESEDNYPYQAKQGNCVFDKSKVVANCKGFQN 182
           LVDCV++N GC GG+M  AF+Y+  + GI+SED YPY  ++ +C+++ +   A C+G++ 
Sbjct: 167 LVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYRE 226

Query: 183 INSCNEKDLAVAVATVGPISVAIDVGY-SFQQYKQGVYYEAKCDPTIQNHAVLVVGYGVE 359
           I   NEK L  AVA VGP+SVAID    SFQ Y +GVYY+  C+    NHAVL VGYG++
Sbjct: 227 IPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQ 286

Query: 360 NGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFP 482
            G+K+W++KNSWG +WG  GYI M+++++N CGIA  ASFP
Sbjct: 287 KGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precursor
          Length = 321

 Score =  183 bits (464), Expect = 3e-46
 Identities = 92/166 (55%), Positives = 112/166 (67%), Gaps = 4/166 (2%)
 Frame = +3

Query: 3   QLVDCVTK--NSGCNGGWMNIAFEYISSHG-IESEDNYPYQAKQGNCVFDKSKVVANCKG 173
           QLVDC T   N GC GGWM  AF+YI  +G I++E +YPY+A+  +C FD + + A C G
Sbjct: 157 QLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEAEDRSCRFDANSIGAICTG 216

Query: 174 FQNINSCNEKDLAVAVATVGPISVAIDVG-YSFQQYKQGVYYEAKCDPTIQNHAVLVVGY 350
              +    E+ L  AV+ VGPISVAID   +SFQ Y  GVYYE  C PT  +H VL VGY
Sbjct: 217 SVEVQH-TEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQNCSPTFLDHGVLAVGY 275

Query: 351 GVENGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFPIV 488
           G E+   YWLVKNSWG SWG  GYIKMS++RDNNCGIA+  S+P V
Sbjct: 276 GTESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNNCGIASEPSYPTV 321
>sp|O35186|CATK_RAT Cathepsin K precursor
          Length = 329

 Score =  182 bits (463), Expect = 4e-46
 Identities = 83/161 (51%), Positives = 114/161 (70%), Gaps = 2/161 (1%)
 Frame = +3

Query: 6   LVDCVTKNSGCNGGWMNIAFEYISSHG-IESEDNYPYQAKQGNCVFDKSKVVANCKGFQN 182
           LVDCV++N GC GG+M  AF+Y+  +G I+SED YPY  +  +C+++ +   A C+G++ 
Sbjct: 167 LVDCVSENYGCGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYRE 226

Query: 183 INSCNEKDLAVAVATVGPISVAIDVGY-SFQQYKQGVYYEAKCDPTIQNHAVLVVGYGVE 359
           I   NEK L  AVA VGP+SV+ID    SFQ Y +GVYY+  CD    NHAVLVVGYG +
Sbjct: 227 IPVGNEKALKRAVARVGPVSVSIDASLTSFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQ 286

Query: 360 NGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFP 482
            G+KYW++KNSWG SWG  GY+ ++++++N CGI   ASFP
Sbjct: 287 KGNKYWIIKNSWGESWGNKGYVLLARNKNNACGITNLASFP 327
>sp|P43236|CATK_RABIT Cathepsin K precursor (OC-2 protein)
          Length = 329

 Score =  182 bits (462), Expect = 5e-46
 Identities = 84/161 (52%), Positives = 114/161 (70%), Gaps = 2/161 (1%)
 Frame = +3

Query: 6   LVDCVTKNSGCNGGWMNIAFEYIS-SHGIESEDNYPYQAKQGNCVFDKSKVVANCKGFQN 182
           LVDCV++N GC GG+M  AF+Y+  + GI+SED YPY  +  +C+++ +   A C+G++ 
Sbjct: 167 LVDCVSENYGCGGGYMTNAFQYVQRNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYRE 226

Query: 183 INSCNEKDLAVAVATVGPISVAIDVGY-SFQQYKQGVYYEAKCDPTIQNHAVLVVGYGVE 359
           I   NEK L  AVA VGP+SVAID    SFQ Y +GVYY+  C     NHAVL VGYG++
Sbjct: 227 IPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCSSDNVNHAVLAVGYGIQ 286

Query: 360 NGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFP 482
            G+K+W++KNSWG SWG  GYI M+++++N CGIA  ASFP
Sbjct: 287 KGNKHWIIKNSWGESWGNKGYILMARNKNNACGIANLASFP 327
>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
           proteinase) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  181 bits (459), Expect = 1e-45
 Identities = 93/169 (55%), Positives = 116/169 (68%), Gaps = 8/169 (4%)
 Frame = +3

Query: 6   LVDC--VTKNSGCNGGWMNIAFEYISSHG-IESEDNYPYQAKQGNCVFDKSKVVANCKGF 176
           LVDC     N GCNGG M+ AF+YI  +G ++SE++YPY+AK G+C +     VAN  GF
Sbjct: 166 LVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGF 225

Query: 177 QNINSCNEKDLAVAVATVGPISVAIDVGY-SFQQYKQGVYYEAKCDPTIQNHAVLVVGYG 353
            +I    EK L  AVATVGPISVA+D  + S Q Y  G+YYE  C     +H VL+VGYG
Sbjct: 226 VDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYG 284

Query: 354 VE----NGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFPIV 488
            E    N +KYWLVKNSWG  WGM GYIK++KDRDN+CG+AT AS+P+V
Sbjct: 285 YEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>sp|P55097|CATK_MOUSE Cathepsin K precursor
          Length = 329

 Score =  180 bits (457), Expect = 2e-45
 Identities = 84/161 (52%), Positives = 112/161 (69%), Gaps = 2/161 (1%)
 Frame = +3

Query: 6   LVDCVTKNSGCNGGWMNIAFEYISSHG-IESEDNYPYQAKQGNCVFDKSKVVANCKGFQN 182
           LVDCVT+N GC GG+M  AF+Y+  +G I+SED YPY  +  +C+++ +   A C+G++ 
Sbjct: 167 LVDCVTENYGCGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYRE 226

Query: 183 INSCNEKDLAVAVATVGPISVAIDVGY-SFQQYKQGVYYEAKCDPTIQNHAVLVVGYGVE 359
           I   NEK L  AVA VGPISV+ID    SFQ Y +GVYY+  CD    NHAVLVVGYG +
Sbjct: 227 IPVGNEKALKRAVARVGPISVSIDASLASFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQ 286

Query: 360 NGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFP 482
            G K+W++KNSWG SWG  GY  ++++++N CGI   ASFP
Sbjct: 287 KGSKHWIIKNSWGESWGNKGYALLARNKNNACGITNMASFP 327
>sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted protein) (MEP) (Cyclic
           protein 2) (CP-2) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  180 bits (456), Expect = 3e-45
 Identities = 94/169 (55%), Positives = 116/169 (68%), Gaps = 8/169 (4%)
 Frame = +3

Query: 6   LVDCVTK--NSGCNGGWMNIAFEYISSHG-IESEDNYPYQAKQGNCVFDKSKVVANCKGF 176
           LVDC     N GCNGG M+ AF+YI  +G ++SE++YPY+AK G+C +     VAN  GF
Sbjct: 166 LVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGF 225

Query: 177 QNINSCNEKDLAVAVATVGPISVAIDVGY-SFQQYKQGVYYEAKCDPTIQNHAVLVVGYG 353
            +I    EK L  AVATVGPISVA+D  + S Q Y  G+YYE  C     +H VLVVGYG
Sbjct: 226 VDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYG 284

Query: 354 VE----NGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFPIV 488
            E    N  KYWLVKNSWG  WGM+GYIK++KDR+N+CG+AT AS+PIV
Sbjct: 285 YEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIV 333
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 69,720,567
Number of Sequences: 369166
Number of extensions: 1425456
Number of successful extensions: 3920
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3376
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3513
length of database: 68,354,980
effective HSP length: 105
effective length of database: 48,957,805
effective search space used: 4308286840
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
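
Note on reading the scores: the bit scores and Expect values in the hit list can be approximately reproduced from the raw scores (the numbers in parentheses after "bits") and the gapped statistics printed above. The sketch below assumes the standard Karlin-Altschul relations, S' = (lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-S'). The function and constant names are illustrative and are not part of BLAST. Because lambda and K are printed to only a few digits, the results match the report only approximately.

```python
import math

# Values copied from the "Gapped" statistics and the footer of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 4308286840


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw scores of the first two hits (CYSP2_HOMAM and CATK_PIG).
    for raw in (493, 465):
        bits = bit_score(raw)
        print(f"raw={raw}  bits={bits:.1f}  E={expect_value(bits):.1e}")
```

For the top hit (raw score 493) this gives a bit score within one bit of the reported 194 and an Expect value of the same order of magnitude as the reported 1e-49.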