Planarian EST Database


Dr_sW_028_K21

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_028_K21
         (723 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P07154|CATL_RAT  Cathepsin L precursor (Major excreted pr...   215   8e-56
sp|P06797|CATL_MOUSE  Cathepsin L precursor (Major excreted ...   211   1e-54
sp|Q95029|CATL_DROME  Cathepsin L precursor (Cysteine protei...   208   1e-53
sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   207   2e-53
sp|P25784|CYSP3_HOMAM  Digestive cysteine proteinase 3 precu...   202   6e-52
sp|Q26636|CATL_SARPE  Cathepsin L precursor [Contains: Cathe...   202   1e-51
sp|P25782|CYSP2_HOMAM  Digestive cysteine proteinase 2 precu...   198   1e-50
sp|P07711|CATL_HUMAN  Cathepsin L precursor (Major excreted ...   196   5e-50
sp|Q10991|CATL_SHEEP  Cathepsin L [Contains: Cathepsin L hea...   195   9e-50
sp|P25975|CATL_BOVIN  Cathepsin L precursor [Contains: Cathe...   195   1e-49
>sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted protein) (MEP) (Cyclic
           protein 2) (CP-2) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  215 bits (548), Expect = 8e-56
 Identities = 103/170 (60%), Positives = 127/170 (74%), Gaps = 3/170 (1%)
 Frame = +3

Query: 69  KQQLVDCSNDFGNQGCNGGLMDSAFQYIMQYG-LEKESDYPYTAQDGDCEYSKEKVVAHC 245
           +Q LVDCS+D GNQGCNGGLMD AFQYI + G L+ E  YPY A+DG C+Y  E  VA+ 
Sbjct: 163 EQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVAND 222

Query: 246 QGFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKGGVYDEENCSSTQLDHGVLAV 425
            GF DI    E+ L + +ATVGPISV +DAS+PS QFY  G+Y E NCSS  LDHGVL V
Sbjct: 223 TGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVV 281

Query: 426 GYGND--EDSQQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYP 569
           GYG +  + ++  YWLVKNSWGK WG++GYIK++KD++N CG+AT ASYP
Sbjct: 282 GYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYP 331
>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
           proteinase) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  211 bits (538), Expect = 1e-54
 Identities = 105/170 (61%), Positives = 125/170 (73%), Gaps = 3/170 (1%)
 Frame = +3

Query: 69  KQQLVDCSNDFGNQGCNGGLMDSAFQYIMQYG-LEKESDYPYTAQDGDCEYSKEKVVAHC 245
           +Q LVDCS+  GNQGCNGGLMD AFQYI + G L+ E  YPY A+DG C+Y  E  VA+ 
Sbjct: 163 EQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVAND 222

Query: 246 QGFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKGGVYDEENCSSTQLDHGVLAV 425
            GF DI    E+ L + +ATVGPISV +DAS+PS QFY  G+Y E NCSS  LDHGVL V
Sbjct: 223 TGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLV 281

Query: 426 GYGND-EDSQQN-YWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYP 569
           GYG +  DS +N YWLVKNSWG  WG+ GYIK++KD+DN CG+AT ASYP
Sbjct: 282 GYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYP 331
>sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine proteinase 1) [Contains: Cathepsin
           L heavy chain; Cathepsin L light chain]
          Length = 341

 Score =  208 bits (530), Expect = 1e-53
 Identities = 98/168 (58%), Positives = 124/168 (73%), Gaps = 1/168 (0%)
 Frame = +3

Query: 69  KQQLVDCSNDFGNQGCNGGLMDSAFQYIMQYG-LEKESDYPYTAQDGDCEYSKEKVVAHC 245
           +Q LVDCS  +GN GCNGGLMD+AF+YI   G ++ E  YPY A D  C ++K  V A  
Sbjct: 173 EQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATD 232

Query: 246 QGFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKGGVYDEENCSSTQLDHGVLAV 425
           +GFTDI  G E+ +AE +ATVGP+SV IDAS+ SFQFY  GVY+E  C +  LDHGVL V
Sbjct: 233 RGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVV 292

Query: 426 GYGNDEDSQQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYP 569
           G+G DE S ++YWLVKNSWG +WG  G+IKM ++K+NQCGIA+ +SYP
Sbjct: 293 GFGTDE-SGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYP 339

 Score = 31.2 bits (69), Expect = 2.9
 Identities = 13/23 (56%), Positives = 19/23 (82%)
 Frame = +2

Query: 2   FSTTGSMEGQYFKNNKQLVSFSE 70
           FS+TG++EGQ+F+ +  LVS SE
Sbjct: 151 FSSTGALEGQHFRKSGVLVSLSE 173
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score =  207 bits (528), Expect = 2e-53
 Identities = 101/187 (54%), Positives = 127/187 (67%), Gaps = 3/187 (1%)
 Frame = +3

Query: 24  KGNILKIISS*LVFQKQQLVDCSNDFGNQGCNGGLMDSAFQYIMQYG-LEKESDYPYTAQ 200
           +G + +     +   +Q LVDCS   GNQGCNGG M  AFQY+ + G L+ E  YPY A 
Sbjct: 148 EGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAV 207

Query: 201 DGDCEYSKEKVVAHCQGFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKGGVYDE 380
           D  C+Y  E  VA+  GFT ++ G E+ L + +ATVGPISV +DA + SFQFYK G+Y E
Sbjct: 208 DEICKYRPENSVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFE 267

Query: 381 ENCSSTQLDHGVLAVGYGNDEDSQQN--YWLVKNSWGKSWGINGYIKMSKDKDNQCGIAT 554
            +CSS  LDHGVL VGYG +  +  N  YWLVKNSWG  WG NGY+K++KDK+N CGIAT
Sbjct: 268 PDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIAT 327

Query: 555 MASYPNM 575
            ASYPN+
Sbjct: 328 AASYPNV 334

 Score = 30.4 bits (67), Expect = 5.0
 Identities = 13/23 (56%), Positives = 17/23 (73%)
 Frame = +2

Query: 2   FSTTGSMEGQYFKNNKQLVSFSE 70
           FS TG++EGQ F+   +LVS SE
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSE 163
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precursor
          Length = 321

 Score =  202 bits (515), Expect = 6e-52
 Identities = 99/170 (58%), Positives = 121/170 (71%), Gaps = 1/170 (0%)
 Frame = +3

Query: 69  KQQLVDCSNDFGNQGCNGGLMDSAFQYIMQYG-LEKESDYPYTAQDGDCEYSKEKVVAHC 245
           +QQLVDCS D+GN GC GG M SAF YI   G ++ ES YPY A+D  C +    + A C
Sbjct: 155 EQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEAEDRSCRFDANSIGAIC 214

Query: 246 QGFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKGGVYDEENCSSTQLDHGVLAV 425
            G  ++ H +EE L E ++ VGPISV IDAS+ SFQFY  GVY E+NCS T LDHGVLAV
Sbjct: 215 TGSVEVQH-TEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQNCSPTFLDHGVLAV 273

Query: 426 GYGNDEDSQQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 575
           GYG   +S ++YWLVKNSWG SWG  GYIKMS+++DN CGIA+  SYP +
Sbjct: 274 GYGT--ESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNNCGIASEPSYPTV 321

 Score = 33.1 bits (74), Expect = 0.77
 Identities = 14/23 (60%), Positives = 18/23 (78%)
 Frame = +2

Query: 2   FSTTGSMEGQYFKNNKQLVSFSE 70
           FS TG++EGQ+F  N +LVS SE
Sbjct: 133 FSATGALEGQHFLKNDELVSLSE 155
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 339

 Score =  202 bits (513), Expect = 1e-51
 Identities = 94/170 (55%), Positives = 119/170 (70%), Gaps = 1/170 (0%)
 Frame = +3

Query: 69  KQQLVDCSNDFGNQGCNGGLMDSAFQYIMQYG-LEKESDYPYTAQDGDCEYSKEKVVAHC 245
           +Q LVDCS  +GN GCNGGLMD+AF+YI   G ++ E  YPY   D  C ++K  + A  
Sbjct: 171 EQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATD 230

Query: 246 QGFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKGGVYDEENCSSTQLDHGVLAV 425
            GF DI  G EE + + +AT+GP+SV IDAS+ SFQ Y  GVY+E  C    LDHGVL V
Sbjct: 231 TGFVDIPEGDEEKMKKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDEQNLDHGVLVV 290

Query: 426 GYGNDEDSQQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 575
           GYG DE S  +YWLVKNSWG +WG  GYIKM+++++NQCGIAT +SYP +
Sbjct: 291 GYGTDE-SGMDYWLVKNSWGTTWGEQGYIKMARNQNNQCGIATASSYPTV 339

 Score = 30.0 bits (66), Expect = 6.5
 Identities = 13/23 (56%), Positives = 18/23 (78%)
 Frame = +2

Query: 2   FSTTGSMEGQYFKNNKQLVSFSE 70
           FS+TG++EGQ+F+    LVS SE
Sbjct: 149 FSSTGALEGQHFRKAGVLVSLSE 171
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precursor
          Length = 323

 Score =  198 bits (504), Expect = 1e-50
 Identities = 94/168 (55%), Positives = 120/168 (71%), Gaps = 1/168 (0%)
 Frame = +3

Query: 69  KQQLVDCSNDFGNQGCNGGLMDSAFQYIM-QYGLEKESDYPYTAQDGDCEYSKEKVVAHC 245
           +QQLVDCS  +G QGCNGG M+ AF YI    G++ E+ YPY A+DG C +    V A C
Sbjct: 156 EQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPYEARDGSCRFDSNSVAATC 215

Query: 246 QGFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKGGVYDEENCSSTQLDHGVLAV 425
            G T+I+ GSE  L + +  +GPISV IDA++ SFQFY  GVY E +CS + LDH VLAV
Sbjct: 216 SGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPSYLDHAVLAV 275

Query: 426 GYGNDEDSQQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYP 569
           GYG+  +  Q++WLVKNSW  SWG  GYIKMS++++N CGIAT+ASYP
Sbjct: 276 GYGS--EGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVASYP 321

 Score = 31.2 bits (69), Expect = 2.9
 Identities = 13/23 (56%), Positives = 17/23 (73%)
 Frame = +2

Query: 2   FSTTGSMEGQYFKNNKQLVSFSE 70
           FSTTGS+EGQ+F     L+S +E
Sbjct: 134 FSTTGSLEGQHFLKTGSLISLAE 156
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
           Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 333

 Score =  196 bits (498), Expect = 5e-50
 Identities = 96/187 (51%), Positives = 124/187 (66%), Gaps = 3/187 (1%)
 Frame = +3

Query: 24  KGNILKIISS*LVFQKQQLVDCSNDFGNQGCNGGLMDSAFQYIMQYG-LEKESDYPYTAQ 200
           +G + +     +   +Q LVDCS   GN+GCNGGLMD AFQY+   G L+ E  YPY A 
Sbjct: 148 EGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEAT 207

Query: 201 DGDCEYSKEKVVAHCQGFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKGGVYDE 380
           +  C+Y+ +  VA+  GF DI    E+ L + +ATVGPISV IDA + SF FYK G+Y E
Sbjct: 208 EESCKYNPKYSVANDTGFVDIPK-QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFE 266

Query: 381 ENCSSTQLDHGVLAVGYG--NDEDSQQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIAT 554
            +CSS  +DHGVL VGYG  + E     YWLVKNSWG+ WG+ GY+KM+KD+ N CGIA+
Sbjct: 267 PDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIAS 326

Query: 555 MASYPNM 575
            ASYP +
Sbjct: 327 AASYPTV 333

 Score = 30.0 bits (66), Expect = 6.5
 Identities = 12/23 (52%), Positives = 17/23 (73%)
 Frame = +2

Query: 2   FSTTGSMEGQYFKNNKQLVSFSE 70
           FS TG++EGQ F+   +L+S SE
Sbjct: 141 FSATGALEGQMFRKTGRLISLSE 163
>sp|Q10991|CATL_SHEEP Cathepsin L [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]
          Length = 217

 Score =  195 bits (496), Expect = 9e-50
 Identities = 98/185 (52%), Positives = 121/185 (65%), Gaps = 1/185 (0%)
 Frame = +3

Query: 24  KGNILKIISS*LVFQKQQLVDCSNDFGNQGCNGGLMDSAFQYIMQYG-LEKESDYPYTAQ 200
           +G + +     +   +Q LVD S   GNQGCNGGLMD+AFQYI + G L+ E  YPY A 
Sbjct: 35  EGQMFRKTGKLVSLSEQNLVDSSRPQGNQGCNGGLMDNAFQYIKENGGLDSEESYPYEAT 94

Query: 201 DGDCEYSKEKVVAHCQGFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKGGVYDE 380
           D  C Y  E   A   GF DI    E+ L + +ATVGPISV IDA + SFQFYK G+Y +
Sbjct: 95  DTSCNYKPEYSAAKDTGFVDIPQ-REKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYD 153

Query: 381 ENCSSTQLDHGVLAVGYGNDEDSQQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMA 560
            +CSS  LDHGVL VGYG  E +   +W+VKNSWG  WG  GY+KM+KD++N CGIAT A
Sbjct: 154 PDCSSKDLDHGVLVVGYG-FEGTNNKFWIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAA 212

Query: 561 SYPNM 575
           SYP +
Sbjct: 213 SYPTV 217

 Score = 30.4 bits (67), Expect = 5.0
 Identities = 13/23 (56%), Positives = 17/23 (73%)
 Frame = +2

Query: 2  FSTTGSMEGQYFKNNKQLVSFSE 70
          FS TG++EGQ F+   +LVS SE
Sbjct: 28 FSATGALEGQMFRKTGKLVSLSE 50
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  195 bits (495), Expect = 1e-49
 Identities = 101/188 (53%), Positives = 124/188 (65%), Gaps = 4/188 (2%)
 Frame = +3

Query: 24  KGNILKIISS*LVFQKQQLVDCSNDFGNQGCNGGLMDSAFQYIMQYG-LEKESDYPYTAQ 200
           +G + +     +   +Q LVDCS   GNQGCNGGLMD+AFQYI   G L+ E  YPY A 
Sbjct: 148 EGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLAT 207

Query: 201 D-GDCEYSKEKVVAHCQGFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKGGVYD 377
           D   C Y  E   A+  GF DI    E+ L + +ATVGPISV IDA + SFQFYK G+Y 
Sbjct: 208 DTNSCNYKPECSAANDTGFVDIPQ-REKALMKAVATVGPISVAIDAGHTSFQFYKSGIYY 266

Query: 378 EENCSSTQLDHGVLAVGYGND-EDSQQN-YWLVKNSWGKSWGINGYIKMSKDKDNQCGIA 551
           + +CS   LDHGVL VGYG +  DS  N +W+VKNSWG  WG NGY+KM+KD++N CGIA
Sbjct: 267 DPDCSCKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIA 326

Query: 552 TMASYPNM 575
           T ASYP +
Sbjct: 327 TAASYPTV 334

 Score = 30.4 bits (67), Expect = 5.0
 Identities = 13/23 (56%), Positives = 17/23 (73%)
 Frame = +2

Query: 2   FSTTGSMEGQYFKNNKQLVSFSE 70
           FS TG++EGQ F+   +LVS SE
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSE 163
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 84,214,759
Number of Sequences: 369166
Number of extensions: 1744915
Number of successful extensions: 5137
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4459
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4684
length of database: 68,354,980
effective HSP length: 107
effective length of database: 48,588,335
effective search space used: 6462248555
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
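
Note on the statistics above: the bit scores and Expect values in this report can be approximately reproduced from the gapped Lambda and K, the raw scores shown in parentheses, and the effective search space. The sketch below is not part of the BLAST output. It applies the standard Karlin-Altschul relations S' = (Lambda*S - ln K) / ln 2 and E = (search space) * 2^(-S'). All constants are copied from the footer; the function names are hypothetical. Because Lambda and K are printed to only three significant figures, the results agree with the reported values only approximately.

```python
import math

# Values copied from the statistics footer of the report above.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
DB_LETTERS = 68_354_980
DB_SEQUENCES = 184_735
EFFECTIVE_HSP_LENGTH = 107
EFFECTIVE_SEARCH_SPACE = 6_462_248_555


def effective_db_length() -> int:
    """Database letters minus one effective HSP length per database sequence."""
    return DB_LETTERS - EFFECTIVE_HSP_LENGTH * DB_SEQUENCES


def bit_score(raw_score: int) -> float:
    """Normalized score S' = (Lambda * S - ln K) / ln 2."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect_value(raw_score: int) -> float:
    """Expected number of chance HSPs: E = search_space * 2^(-S')."""
    return EFFECTIVE_SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    print("effective database length:", effective_db_length())
    # Raw scores taken from the top CATL_RAT HSP (548) and a frame +2 HSP (69).
    for raw in (548, 69):
        print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.1e}")
```

The computed effective database length equals the 48,588,335 reported in the footer. Dividing the effective search space by it gives 133, which is presumably the effective length of the translated query; that figure is inferred here and is not printed in the report.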