Planarian EST Database


Dr_sW_007_C05

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_007_C05
         (758 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P07711|CATL_HUMAN  Cathepsin L precursor (Major excreted ...   262   8e-70
sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   258   9e-69
sp|Q9GL24|CATL_CANFA  Cathepsin L precursor [Contains: Cathe...   256   6e-68
sp|P25975|CATL_BOVIN  Cathepsin L precursor [Contains: Cathe...   255   1e-67
sp|Q28944|CATL_PIG  Cathepsin L precursor [Contains: Catheps...   247   3e-65
sp|Q24940|CATLP_FASHE  Cathepsin L-like proteinase precursor      246   5e-65
sp|P06797|CATL_MOUSE  Cathepsin L precursor (Major excreted ...   243   3e-64
sp|P07154|CATL_RAT  Cathepsin L precursor (Major excreted pr...   240   3e-63
sp|P43236|CATK_RABIT  Cathepsin K precursor (OC-2 protein)        238   1e-62
sp|Q26636|CATL_SARPE  Cathepsin L precursor [Contains: Cathe...   237   2e-62
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
           Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 333

 Score =  262 bits (669), Expect = 8e-70
 Identities = 132/227 (58%), Positives = 157/227 (69%), Gaps = 1/227 (0%)
 Frame = +2

Query: 20  RRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGSTY 199
           RR +WE+N+K I+ HN EY  GKH++++ +N F DMT+EEF+    G    KP  +G  +
Sbjct: 48  RRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPR-KGKVF 106

Query: 200 MAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQ 379
             P      P SVDWR+KGYVTPVKNQ QCGSCW+FSATG+LEGQ FRK  RLIS SEQ 
Sbjct: 107 QEPLFYEA-PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQN 165

Query: 380 LVDCSGAYGNYGCGAGLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGF 556
           LVDCSG  GN GC  GLMD AF+Y++D  G++SE  YPY AT+ +CK NP   V   TGF
Sbjct: 166 LVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGF 225

Query: 557 TDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCST 697
            DI  Q E  L  AVATVGP+SVAIDAGH SF  YK GIY E  CS+
Sbjct: 226 VDIPKQ-EKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSS 271
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score =  258 bits (660), Expect = 9e-69
 Identities = 125/230 (54%), Positives = 158/230 (68%), Gaps = 1/230 (0%)
 Frame = +2

Query: 20  RRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGSTY 199
           RR +WE+N+K I+ HN EY  GKH +++ +N F DMTNEEF+ + +G  + +   +G  +
Sbjct: 48  RRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFR-QMMGCFRNQKFRKGKVF 106

Query: 200 MAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQ 379
             P  +  LP SVDWR+KGYVTPVKNQ+QCGSCW+FSATG+LEGQ FRK  +L+S SEQ 
Sbjct: 107 REPLFLD-LPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQN 165

Query: 380 LVDCSGAYGNYGCGAGLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGF 556
           LVDCS   GN GC  G M  AF+Y+K+  G++SE  YPY A D  CK  P   V   TGF
Sbjct: 166 LVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGF 225

Query: 557 TDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQI 706
           T +    E  L  AVATVGP+SVA+DAGH+SFQ YKSGIY E  CS+  +
Sbjct: 226 TVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNL 275
>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 333

 Score =  256 bits (653), Expect = 6e-68
 Identities = 129/231 (55%), Positives = 155/231 (67%), Gaps = 2/231 (0%)
 Frame = +2

Query: 20  RRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGSTY 199
           RR +WE+N+K I+ HN EY  GKH +++ +N F DMTNEEF+    G    K   +G  +
Sbjct: 48  RRAVWEKNMKMIELHNREYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFQNQKHK-KGKMF 106

Query: 200 MAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQ 379
             P     +P SVDWR+KGYVTPVKNQ QCGSCW+FSATG+LEGQ FRK  +L+S SEQ 
Sbjct: 107 QEPL-FAEIPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQN 165

Query: 380 LVDCSGAYGNYGCGAGLMDNAFRYIKDQ-GIESEGDYPYTATD-GTCKRNPSKIVTKCTG 553
           LVDCS A GN GC  GLMDNAFRY+KD  G++SE  YPY   D  TC   P       TG
Sbjct: 166 LVDCSRAQGNEGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTG 225

Query: 554 FTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQI 706
           F D+  Q E  L  AVAT+GP+SVAIDAGH SFQ YKSGIY +  CS+  +
Sbjct: 226 FVDL-PQREKALMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDL 275
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  255 bits (651), Expect = 1e-67
 Identities = 132/227 (58%), Positives = 152/227 (66%), Gaps = 2/227 (0%)
 Frame = +2

Query: 20  RRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGSTY 199
           RR +WE+N K I  HN EY  GKH + + +N F DMTNEEF+    G    K   +G  +
Sbjct: 48  RRAVWEKNKKIIDLHNQEYSEGKHAFRMAMNAFGDMTNEEFRQVMNGFQNQKHK-KGKLF 106

Query: 200 MAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQ 379
             P  + V P SVDW +KGYVTPVKNQ QCGSCW+FSATG+LEGQ FRK  +L+S SEQ 
Sbjct: 107 HEPLLVDV-PKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQN 165

Query: 380 LVDCSGAYGNYGCGAGLMDNAFRYIKDQ-GIESEGDYPYTATD-GTCKRNPSKIVTKCTG 553
           LVDCS A GN GC  GLMDNAF+YIKD  G++SE  YPY ATD  +C   P       TG
Sbjct: 166 LVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTG 225

Query: 554 FTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCS 694
           F DI  Q E  L  AVATVGP+SVAIDAGH SFQ YKSGIY +  CS
Sbjct: 226 FVDI-PQREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCS 271
>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  247 bits (630), Expect = 3e-65
 Identities = 126/231 (54%), Positives = 154/231 (66%), Gaps = 2/231 (0%)
 Frame = +2

Query: 20  RRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGSTY 199
           RR +WE+N+K I+ HN EY  GKH +S+ +N F DMTNEEF+    G    K   +G  +
Sbjct: 48  RRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFRQVMNGFQNQKHK-KGKVF 106

Query: 200 MAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQ 379
                + V P SVDWR+KGYVT VKNQ QCGSCW+FSATG+LEGQ FRK  +L+S SEQ 
Sbjct: 107 HESLVLEV-PKSVDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQN 165

Query: 380 LVDCSGAYGNYGCGAGLMDNAFRYIKDQ-GIESEGDYPYTATD-GTCKRNPSKIVTKCTG 553
           LVDCS   GN GC  GLMDNAF+Y+KD  G+++E  YPY   +  +C   P       TG
Sbjct: 166 LVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTG 225

Query: 554 FTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQI 706
           F DI  Q E  L  AVATVGP+SVAIDAGH+SFQ YKSGIY +  CS+  +
Sbjct: 226 FVDI-PQREKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDL 275
>sp|Q24940|CATLP_FASHE Cathepsin L-like proteinase precursor
          Length = 326

 Score =  246 bits (628), Expect = 5e-65
 Identities = 117/232 (50%), Positives = 154/232 (66%)
 Frame = +2

Query: 11  DITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEEFKAKYLGIMKTKPTLEG 190
           D  RR IWE+N+K+IQ+HNL +DLG  TY+LGLNQF DMT EEFKAKYL  M     +  
Sbjct: 37  DQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEFKAKYLTEMSRASDILS 96

Query: 191 STYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFS 370
                  N   +P  +DWR+ GYVT VK+Q  CGSCW+FS TG++EGQY +     ISFS
Sbjct: 97  HGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFS 156

Query: 371 EQQLVDCSGAYGNYGCGAGLMDNAFRYIKDQGIESEGDYPYTATDGTCKRNPSKIVTKCT 550
           EQQLVDCSG +GN GC  GLM+NA++Y+K  G+E+E  YPYTA +G C+ N    V K T
Sbjct: 157 EQQLVDCSGPWGNNGCSGGLMENAYQYLKQFGLETESSYPYTAVEGQCRYNKQLGVAKVT 216

Query: 551 GFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQI 706
           G+  + S +E +L N V    P +VA+D   + F +Y+SGIY  ++CS  ++
Sbjct: 217 GYYTVHSGSEVELKNLVGARRPAAVAVDV-ESDFMMYRSGIYQSQTCSPLRV 267
>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
           proteinase) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  243 bits (621), Expect = 3e-64
 Identities = 127/230 (55%), Positives = 153/230 (66%), Gaps = 1/230 (0%)
 Frame = +2

Query: 20  RRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGSTY 199
           RR IWE+N++ IQ HN EY  G+H +S+ +N F DMTNEEF+    G    K   +G  +
Sbjct: 48  RRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKHK-KGRLF 106

Query: 200 MAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQ 379
             P  + + P SVDWR+KG VTPVKNQ QCGSCW+FSA+G LEGQ F K  +LIS SEQ 
Sbjct: 107 QEPLMLKI-PKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQN 165

Query: 380 LVDCSGAYGNYGCGAGLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGF 556
           LVDCS A GN GC  GLMD AF+YIK+  G++SE  YPY A DG+CK      V   TGF
Sbjct: 166 LVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGF 225

Query: 557 TDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQI 706
            DI  Q E  L  AVATVGP+SVA+DA H S Q Y SGIY E +CS+  +
Sbjct: 226 VDI-PQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNL 274
>sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted protein) (MEP) (Cyclic
           protein 2) (CP-2) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  240 bits (612), Expect = 3e-63
 Identities = 124/230 (53%), Positives = 152/230 (66%), Gaps = 1/230 (0%)
 Frame = +2

Query: 20  RRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGSTY 199
           RR +WE+N++ IQ HN EY  GKH +++ +N F DMTNEEF+    G    K   +G  +
Sbjct: 48  RRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEEFRQIVNGYRHQKHK-KGRLF 106

Query: 200 MAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQ 379
             P  + + P +VDWR+KG VTPVKNQ QCGSCW+FSA+G LEGQ F K  +LIS SEQ 
Sbjct: 107 QEPLMLQI-PKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQN 165

Query: 380 LVDCSGAYGNYGCGAGLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGF 556
           LVDCS   GN GC  GLMD AF+YIK+  G++SE  YPY A DG+CK      V   TGF
Sbjct: 166 LVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGF 225

Query: 557 TDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQI 706
            DI  Q E  L  AVATVGP+SVA+DA H S Q Y SGIY E +CS+  +
Sbjct: 226 VDI-PQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDL 274
>sp|P43236|CATK_RABIT Cathepsin K precursor (OC-2 protein)
          Length = 329

 Score =  238 bits (607), Expect = 1e-62
 Identities = 121/238 (50%), Positives = 157/238 (65%), Gaps = 3/238 (1%)
 Frame = +2

Query: 2   ELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEEFKAKYLGIMKTKPT 181
           ++ +I+RRLIWE+NLK+I  HNLE  LG HTY L +N   DMT+EE   K  G+ K  P+
Sbjct: 40  KVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGL-KVPPS 98

Query: 182 LEGS--TYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNR 355
              S  T   P+  G  P S+D+R+KGYVTPVKNQ QCGSCW+FS+ G+LEGQ  +K  +
Sbjct: 99  RSHSNDTLYIPDWEGRTPDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGK 158

Query: 356 LISFSEQQLVDCSGAYGNYGCGAGLMDNAFRYI-KDQGIESEGDYPYTATDGTCKRNPSK 532
           L++ S Q LVDC     NYGCG G M NAF+Y+ +++GI+SE  YPY   D +C  NP+ 
Sbjct: 159 LLNLSPQNLVDCVSE--NYGCGGGYMTNAFQYVQRNRGIDSEDAYPYVGQDESCMYNPTG 216

Query: 533 IVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQI 706
              KC G+ +I   NE  L  AVA VGPVSVAIDA   SFQ Y  G+Y +E+CS+  +
Sbjct: 217 KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCSSDNV 274
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 339

 Score =  237 bits (605), Expect = 2e-62
 Identities = 124/238 (52%), Positives = 155/238 (65%), Gaps = 8/238 (3%)
 Frame = +2

Query: 2   ELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEEFKAKYLG------- 160
           E+ +  R  I+ +N   I KHN  +  GK +Y LGLN++ADM + EFK    G       
Sbjct: 42  EVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADMLHHEFKETMNGYNHTLRQ 101

Query: 161 IMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYF 340
           +M+ +  L G+TY+ P ++ V P SVDWR+ G VT VK+Q  CGSCW+FS+TG+LEGQ+F
Sbjct: 102 LMRERTGLVGATYIPPAHVTV-PKSVDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQHF 160

Query: 341 RKNNRLISFSEQQLVDCSGAYGNYGCGAGLMDNAFRYIKDQ-GIESEGDYPYTATDGTCK 517
           RK   L+S SEQ LVDCS  YGN GC  GLMDNAFRYIKD  GI++E  YPY   D +C 
Sbjct: 161 RKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCH 220

Query: 518 RNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESC 691
            N + I    TGF DI   +E  +  AVAT+GPVSVAIDA H SFQLY  G+YNE  C
Sbjct: 221 FNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASHESFQLYSEGVYNEPEC 278
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 85,613,907
Number of Sequences: 369166
Number of extensions: 1698615
Number of successful extensions: 5297
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4616
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4880
length of database: 68,354,980
effective HSP length: 108
effective length of database: 48,403,600
effective search space used: 6970118400
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
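
Note on the statistics above: the effective lengths, bit scores and Expect values in this report can be cross-checked with a few lines of arithmetic. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations (bit score = (Lambda*S - ln K) / ln 2, E = search space * 2^-bits), that the 758-letter nucleotide query is treated as 758/3 translated residues, and that the printed bit scores are truncated to integers. The function names are hypothetical, and Lambda and K are the rounded gapped values printed in the report, so Expect values are reproduced only approximately.

```python
import math

# Values copied from the report; Lambda and K are the rounded "Gapped" figures.
QUERY_NT = 758
DB_LETTERS = 68_354_980
DB_SEQS = 184_735
EFF_HSP_LEN = 108
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410


def effective_search_space(query_nt, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    query_aa = query_nt // 3                 # assumption: translated query length
    eff_query = query_aa - hsp_len           # length correction on the query
    eff_db = db_letters - db_seqs * hsp_len  # same correction for every db sequence
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits (Karlin-Altschul normalization)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


eff_query, eff_db, space = effective_search_space(
    QUERY_NT, DB_LETTERS, DB_SEQS, EFF_HSP_LEN
)
print(eff_query, eff_db, space)  # 144 48403600 6970118400

# Raw scores are the parenthesized values on the "Score =" lines.
for raw in (669, 660, 628):
    bits = bit_score(raw, LAMBDA_GAPPED, K_GAPPED)
    print(raw, int(bits), f"{expect(bits, space):.1e}")
```

The computed search space matches the "effective search space used" line exactly, and the truncated bit scores match the hit table (262, 258, 246). The Expect values land close to the reported ones but not always on the same rounded digit, which is expected given the rounded Lambda and K.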