Planarian EST Database


Dr_sW_002_O06

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_002_O06
         (683 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P25782|CYSP2_HOMAM  Digestive cysteine proteinase 2 precu...   234   2e-61
sp|P25784|CYSP3_HOMAM  Digestive cysteine proteinase 3 precu...   223   4e-58
sp|Q95029|CATL_DROME  Cathepsin L precursor (Cysteine protei...   220   3e-57
sp|Q26636|CATL_SARPE  Cathepsin L precursor [Contains: Cathe...   216   6e-56
sp|Q9GLE3|CATK_PIG  Cathepsin K precursor                         214   2e-55
sp|P43235|CATK_HUMAN  Cathepsin K precursor (Cathepsin O) (C...   213   3e-55
sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   213   3e-55
sp|P61277|CATK_MACMU  Cathepsin K precursor >gi|47117667|sp|...   213   3e-55
sp|P06797|CATL_MOUSE  Cathepsin L precursor (Major excreted ...   213   4e-55
sp|P43236|CATK_RABIT  Cathepsin K precursor (OC-2 protein)        213   5e-55
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precursor
          Length = 323

 Score =  234 bits (597), Expect = 2e-61
 Identities = 115/194 (59%), Positives = 139/194 (71%), Gaps = 4/194 (2%)
 Frame = +1

Query: 1   SCWAFSTTGSLEGQHFRKHKVLQNISEQQLVDCVTKNS--GCNGGWMNIAFEYI-SSHGI 171
           SCWAFSTTGSLEGQHF K   L +++EQQLVDC       GCNGGWMN AF+YI +++GI
Sbjct: 130 SCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGI 189

Query: 172 ESEDNYPYQAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVATVGPISVAIDVGYS- 348
           ++E  YPY+A+ G+C FD + V A C G  NI S +E  L  AV  +GPISV ID  +S 
Sbjct: 190 DTEAAYPYEARDGSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSS 249

Query: 349 FQQYKQGVYYEAKCDPTIQNHAVLVVGYGVENGHKYWLVKNSWGPSWGMNGYIKMSKDRD 528
           FQ Y  GVYYE  C P+  +HAVL VGYG E G  +WLVKNSW  SWG  GYIKMS++R+
Sbjct: 250 FQFYSSGVYYEPSCSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRN 309

Query: 529 NNCGIATTASFPIV 570
           NNCGIAT AS+P+V
Sbjct: 310 NNCGIATVASYPLV 323
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precursor
          Length = 321

 Score =  223 bits (568), Expect = 4e-58
 Identities = 111/194 (57%), Positives = 135/194 (69%), Gaps = 4/194 (2%)
 Frame = +1

Query: 1   SCWAFSTTGSLEGQHFRKHKVLQNISEQQLVDCVTK--NSGCNGGWMNIAFEYISSHG-I 171
           SCWAFS TG+LEGQHF K+  L ++SEQQLVDC T   N GC GGWM  AF+YI  +G I
Sbjct: 129 SCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGI 188

Query: 172 ESEDNYPYQAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVATVGPISVAIDVG-YS 348
           ++E +YPY+A+  +C FD + + A C G   +    E+ L  AV+ VGPISVAID   +S
Sbjct: 189 DTESSYPYEAEDRSCRFDANSIGAICTGSVEVQH-TEEALQEAVSGVGPISVAIDASHFS 247

Query: 349 FQQYKQGVYYEAKCDPTIQNHAVLVVGYGVENGHKYWLVKNSWGPSWGMNGYIKMSKDRD 528
           FQ Y  GVYYE  C PT  +H VL VGYG E+   YWLVKNSWG SWG  GYIKMS++RD
Sbjct: 248 FQFYSSGVYYEQNCSPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDAGYIKMSRNRD 307

Query: 529 NNCGIATTASFPIV 570
           NNCGIA+  S+P V
Sbjct: 308 NNCGIASEPSYPTV 321
>sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine proteinase 1) [Contains: Cathepsin
           L heavy chain; Cathepsin L light chain]
          Length = 341

 Score =  220 bits (560), Expect = 3e-57
 Identities = 110/195 (56%), Positives = 144/195 (73%), Gaps = 5/195 (2%)
 Frame = +1

Query: 1   SCWAFSTTGSLEGQHFRKHKVLQNISEQQLVDCVTK--NSGCNGGWMNIAFEYISSHG-I 171
           SCWAFS+TG+LEGQHFRK  VL ++SEQ LVDC TK  N+GCNGG M+ AF YI  +G I
Sbjct: 147 SCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 206

Query: 172 ESEDNYPYQAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVATVGPISVAIDVGY-S 348
           ++E +YPY+A   +C F+K  V A  +GF +I   +EK +A AVATVGP+SVAID  + S
Sbjct: 207 DTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHES 266

Query: 349 FQQYKQGVYYEAKCDPTIQNHAVLVVGYGV-ENGHKYWLVKNSWGPSWGMNGYIKMSKDR 525
           FQ Y +GVY E +CD    +H VLVVG+G  E+G  YWLVKNSWG +WG  G+IKM +++
Sbjct: 267 FQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRNK 326

Query: 526 DNNCGIATTASFPIV 570
           +N CGIA+ +S+P+V
Sbjct: 327 ENQCGIASASSYPLV 341
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 339

 Score =  216 bits (549), Expect = 6e-56
 Identities = 108/195 (55%), Positives = 142/195 (72%), Gaps = 5/195 (2%)
 Frame = +1

Query: 1   SCWAFSTTGSLEGQHFRKHKVLQNISEQQLVDCVTK--NSGCNGGWMNIAFEYISSHG-I 171
           SCWAFS+TG+LEGQHFRK  VL ++SEQ LVDC TK  N+GCNGG M+ AF YI  +G I
Sbjct: 145 SCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 204

Query: 172 ESEDNYPYQAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVATVGPISVAIDVGY-S 348
           ++E +YPY+    +C F+K+ + A   GF +I   +E+ +  AVAT+GP+SVAID  + S
Sbjct: 205 DTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASHES 264

Query: 349 FQQYKQGVYYEAKCDPTIQNHAVLVVGYGV-ENGHKYWLVKNSWGPSWGMNGYIKMSKDR 525
           FQ Y +GVY E +CD    +H VLVVGYG  E+G  YWLVKNSWG +WG  GYIKM++++
Sbjct: 265 FQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMARNQ 324

Query: 526 DNNCGIATTASFPIV 570
           +N CGIAT +S+P V
Sbjct: 325 NNQCGIATASSYPTV 339
>sp|Q9GLE3|CATK_PIG Cathepsin K precursor
          Length = 330

 Score =  214 bits (544), Expect = 2e-55
 Identities = 100/190 (52%), Positives = 134/190 (70%), Gaps = 2/190 (1%)
 Frame = +1

Query: 1   SCWAFSTTGSLEGQHFRKHKVLQNISEQQLVDCVTKNSGCNGGWMNIAFEYISSH-GIES 177
           SCWAFS+ G+LEGQ  +K   L N+S Q LVDCV++N GC GG+M  AF+Y+  + GI+S
Sbjct: 139 SCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDS 198

Query: 178 EDNYPYQAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVATVGPISVAIDVGY-SFQ 354
           ED YPY  +  NC+++ +   A C+G++ I   NEK L  AVA VGP+SVAID    SFQ
Sbjct: 199 EDAYPYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQ 258

Query: 355 QYKQGVYYEAKCDPTIQNHAVLVVGYGVENGHKYWLVKNSWGPSWGMNGYIKMSKDRDNN 534
            Y +GVYY+  C+    NHAVL VGYG++ G K+W++KNSWG +WG  GYI M+++++N 
Sbjct: 259 FYSKGVYYDENCNSDNLNHAVLAVGYGIQKGKKHWIIKNSWGENWGNKGYILMARNKNNA 318

Query: 535 CGIATTASFP 564
           CGIA  ASFP
Sbjct: 319 CGIANLASFP 328
>sp|P43235|CATK_HUMAN Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2)
          Length = 329

 Score =  213 bits (543), Expect = 3e-55
 Identities = 99/190 (52%), Positives = 136/190 (71%), Gaps = 2/190 (1%)
 Frame = +1

Query: 1   SCWAFSTTGSLEGQHFRKHKVLQNISEQQLVDCVTKNSGCNGGWMNIAFEYISSH-GIES 177
           SCWAFS+ G+LEGQ  +K   L N+S Q LVDCV++N GC GG+M  AF+Y+  + GI+S
Sbjct: 138 SCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDS 197

Query: 178 EDNYPYQAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVATVGPISVAIDVGY-SFQ 354
           ED YPY  ++ +C+++ +   A C+G++ I   NEK L  AVA VGP+SVAID    SFQ
Sbjct: 198 EDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQ 257

Query: 355 QYKQGVYYEAKCDPTIQNHAVLVVGYGVENGHKYWLVKNSWGPSWGMNGYIKMSKDRDNN 534
            Y +GVYY+  C+    NHAVL VGYG++ G+K+W++KNSWG +WG  GYI M+++++N 
Sbjct: 258 FYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNA 317

Query: 535 CGIATTASFP 564
           CGIA  ASFP
Sbjct: 318 CGIANLASFP 327
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score =  213 bits (543), Expect = 3e-55
 Identities = 110/198 (55%), Positives = 135/198 (68%), Gaps = 8/198 (4%)
 Frame = +1

Query: 1   SCWAFSTTGSLEGQHFRKHKVLQNISEQQLVDCVTK--NSGCNGGWMNIAFEYISSHG-I 171
           SCWAFS TG+LEGQ FRK   L ++SEQ LVDC     N GCNGG+M  AF+Y+  +G +
Sbjct: 137 SCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGL 196

Query: 172 ESEDNYPYQAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVATVGPISVAIDVGYS- 348
           +SE++YPY A    C +     VAN  GF  +    EK L  AVATVGPISVA+D G+S 
Sbjct: 197 DSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSS 256

Query: 349 FQQYKQGVYYEAKCDPTIQNHAVLVVGYGVE----NGHKYWLVKNSWGPSWGMNGYIKMS 516
           FQ YK G+Y+E  C     +H VLVVGYG E    N  KYWLVKNSWGP WG NGY+K++
Sbjct: 257 FQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIA 316

Query: 517 KDRDNNCGIATTASFPIV 570
           KD++N+CGIAT AS+P V
Sbjct: 317 KDKNNHCGIATAASYPNV 334
>sp|P61277|CATK_MACMU Cathepsin K precursor
 sp|P61276|CATK_MACFA Cathepsin K precursor
          Length = 329

 Score =  213 bits (543), Expect = 3e-55
 Identities = 99/190 (52%), Positives = 136/190 (71%), Gaps = 2/190 (1%)
 Frame = +1

Query: 1   SCWAFSTTGSLEGQHFRKHKVLQNISEQQLVDCVTKNSGCNGGWMNIAFEYISSH-GIES 177
           SCWAFS+ G+LEGQ  +K   L N+S Q LVDCV++N GC GG+M  AF+Y+  + GI+S
Sbjct: 138 SCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDS 197

Query: 178 EDNYPYQAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVATVGPISVAIDVGY-SFQ 354
           ED YPY  ++ +C+++ +   A C+G++ I   NEK L  AVA VGP+SVAID    SFQ
Sbjct: 198 EDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQ 257

Query: 355 QYKQGVYYEAKCDPTIQNHAVLVVGYGVENGHKYWLVKNSWGPSWGMNGYIKMSKDRDNN 534
            Y +GVYY+  C+    NHAVL VGYG++ G+K+W++KNSWG +WG  GYI M+++++N 
Sbjct: 258 FYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNA 317

Query: 535 CGIATTASFP 564
           CGIA  ASFP
Sbjct: 318 CGIANLASFP 327
>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
           proteinase) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  213 bits (542), Expect = 4e-55
 Identities = 110/198 (55%), Positives = 136/198 (68%), Gaps = 8/198 (4%)
 Frame = +1

Query: 1   SCWAFSTTGSLEGQHFRKHKVLQNISEQQLVDC--VTKNSGCNGGWMNIAFEYISSHG-I 171
           SCWAFS +G LEGQ F K   L ++SEQ LVDC     N GCNGG M+ AF+YI  +G +
Sbjct: 137 SCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGL 196

Query: 172 ESEDNYPYQAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVATVGPISVAIDVGY-S 348
           +SE++YPY+AK G+C +     VAN  GF +I    EK L  AVATVGPISVA+D  + S
Sbjct: 197 DSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPS 255

Query: 349 FQQYKQGVYYEAKCDPTIQNHAVLVVGYGVE----NGHKYWLVKNSWGPSWGMNGYIKMS 516
            Q Y  G+YYE  C     +H VL+VGYG E    N +KYWLVKNSWG  WGM GYIK++
Sbjct: 256 LQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIA 315

Query: 517 KDRDNNCGIATTASFPIV 570
           KDRDN+CG+AT AS+P+V
Sbjct: 316 KDRDNHCGLATAASYPVV 333
>sp|P43236|CATK_RABIT Cathepsin K precursor (OC-2 protein)
          Length = 329

 Score =  213 bits (541), Expect = 5e-55
 Identities = 100/190 (52%), Positives = 134/190 (70%), Gaps = 2/190 (1%)
 Frame = +1

Query: 1   SCWAFSTTGSLEGQHFRKHKVLQNISEQQLVDCVTKNSGCNGGWMNIAFEYIS-SHGIES 177
           SCWAFS+ G+LEGQ  +K   L N+S Q LVDCV++N GC GG+M  AF+Y+  + GI+S
Sbjct: 138 SCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENYGCGGGYMTNAFQYVQRNRGIDS 197

Query: 178 EDNYPYQAKQGNCVFDKSKVVANCKGFQNINSCNEKDLAVAVATVGPISVAIDVGY-SFQ 354
           ED YPY  +  +C+++ +   A C+G++ I   NEK L  AVA VGP+SVAID    SFQ
Sbjct: 198 EDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQ 257

Query: 355 QYKQGVYYEAKCDPTIQNHAVLVVGYGVENGHKYWLVKNSWGPSWGMNGYIKMSKDRDNN 534
            Y +GVYY+  C     NHAVL VGYG++ G+K+W++KNSWG SWG  GYI M+++++N 
Sbjct: 258 FYSKGVYYDENCSSDNVNHAVLAVGYGIQKGNKHWIIKNSWGESWGNKGYILMARNKNNA 317

Query: 535 CGIATTASFP 564
           CGIA  ASFP
Sbjct: 318 CGIANLASFP 327
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 79,232,364
Number of Sequences: 369166
Number of extensions: 1638910
Number of successful extensions: 4597
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3923
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4094
length of database: 68,354,980
effective HSP length: 107
effective length of database: 48,588,335
effective search space used: 5830600200
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
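
Note on the statistics above: the bit scores and E values in the alignments can be approximately reproduced from the gapped Lambda and K, the raw score shown in parentheses after each bit score, and the effective search space. The following is a minimal sketch, not BLAST's own code. It assumes the standard Karlin-Altschul relations, and it assumes that the translated query length is the nucleotide length divided by 3 (integer division). The function and constant names are illustrative only.

```python
import math

# Values copied from the statistics footer of this report.
LAMBDA_GAPPED = 0.267          # gapped Lambda
K_GAPPED = 0.0410              # gapped K
QUERY_NT_LENGTH = 683          # "(683 letters)"
EFFECTIVE_HSP_LENGTH = 107     # "effective HSP length"
EFFECTIVE_DB_LENGTH = 48_588_335   # "effective length of database"


def effective_search_space() -> int:
    """Effective query length (in residues) times effective database length.

    Assumption: for a translated search the query length in residues
    is taken as nucleotides // 3.
    """
    query_aa_length = QUERY_NT_LENGTH // 3
    return (query_aa_length - EFFECTIVE_HSP_LENGTH) * EFFECTIVE_DB_LENGTH


def bit_score(raw_score: int) -> float:
    """Convert a raw alignment score to bits: (Lambda*S - ln K) / ln 2."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect_value(raw_score: int) -> float:
    """Karlin-Altschul estimate: E = N * K * exp(-Lambda * S)."""
    return effective_search_space() * K_GAPPED * math.exp(-LAMBDA_GAPPED * raw_score)


if __name__ == "__main__":
    top_hit_raw_score = 597  # sp|P25782|CYSP2_HOMAM: "234 bits (597)"
    print(f"search space: {effective_search_space()}")
    print(f"bit score:    {bit_score(top_hit_raw_score):.1f}")
    print(f"E value:      {expect_value(top_hit_raw_score):.1e}")
```

For the top hit (raw score 597) this gives a search space of 5830600200, matching the footer, a bit score near 234.6, and an E value near 1.4e-61, against the reported 234 bits and 2e-61. The small differences are expected: Lambda and K are printed to only three significant figures, and the sketch leaves out any further adjustments BLAST applies internally.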