Planarian EST Database


Dr_sW_012_B18

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_012_B18
         (771 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   238   1e-62
sp|P07711|CATL_HUMAN  Cathepsin L precursor (Major excreted ...   237   3e-62
sp|Q9GL24|CATL_CANFA  Cathepsin L precursor [Contains: Cathe...   236   7e-62
sp|P25975|CATL_BOVIN  Cathepsin L precursor [Contains: Cathe...   233   3e-61
sp|Q28944|CATL_PIG  Cathepsin L precursor [Contains: Catheps...   229   8e-60
sp|Q26636|CATL_SARPE  Cathepsin L precursor [Contains: Cathe...   224   3e-58
sp|P06797|CATL_MOUSE  Cathepsin L precursor (Major excreted ...   221   1e-57
sp|Q24940|CATLP_FASHE  Cathepsin L-like proteinase precursor      221   2e-57
sp|P07154|CATL_RAT  Cathepsin L precursor (Major excreted pr...   221   2e-57
sp|P43236|CATK_RABIT  Cathepsin K precursor (OC-2 protein)        219   6e-57
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score =  238 bits (608), Expect = 1e-62
 Identities = 128/262 (48%), Positives = 165/262 (62%), Gaps = 6/262 (2%)
 Frame = +3

Query: 3   SLVVVAI-----TAVPQKFSVNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQ 167
           SLV+ A      +AVP KF  N  L+ +W  +K T  R Y    +  RR +WE+N+K I+
Sbjct: 4   SLVLAAFCLGIASAVP-KFDQN--LDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIE 60

Query: 168 KHNLEYDLGKHTYSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASV 347
            HN EY  GKH +++ +N F DMTNEEF+ + +G  + +   +G  +  P  +  LP SV
Sbjct: 61  LHNGEYSQGKHGFTMAMNAFGDMTNEEFR-QMMGCFRNQKFRKGKVFREPLFLD-LPKSV 118

Query: 348 DWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXX 527
           DWR+KGYVTPVKNQ+QCGSCW+FSATG+LEGQ FRK  +L+S SEQ LVDCS        
Sbjct: 119 DWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGC 178

Query: 528 XXXLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLAN 704
               M  AF+Y+K+  G++SE  YPY A D  CK  P   V   TGFT +    E  L  
Sbjct: 179 NGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMK 238

Query: 705 AVATVGPVSVAIDAGHASFQLY 770
           AVATVGP+SVA+DAGH+SFQ Y
Sbjct: 239 AVATVGPISVAMDAGHSSFQFY 260
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
           Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 333

 Score =  237 bits (604), Expect = 3e-62
 Identities = 124/238 (52%), Positives = 151/238 (63%), Gaps = 1/238 (0%)
 Frame = +3

Query: 60  LNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMT 239
           L  +W  +K    R Y    +  RR +WE+N+K I+ HN EY  GKH++++ +N F DMT
Sbjct: 25  LEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMT 84

Query: 240 NEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFS 419
           +EEF+    G    KP  +G  +  P      P SVDWR+KGYVTPVKNQ QCGSCW+FS
Sbjct: 85  SEEFRQVMNGFQNRKPR-KGKVFQEPLFYEA-PRSVDWREKGYVTPVKNQGQCGSCWAFS 142

Query: 420 ATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEGDY 596
           ATG+LEGQ FRK  RLIS SEQ LVDCS           LMD AF+Y++D  G++SE  Y
Sbjct: 143 ATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESY 202

Query: 597 PYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 770
           PY AT+ +CK NP   V   TGF DI  Q E  L  AVATVGP+SVAIDAGH SF  Y
Sbjct: 203 PYEATEESCKYNPKYSVANDTGFVDIPKQ-EKALMKAVATVGPISVAIDAGHESFLFY 259
>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 333

 Score =  236 bits (601), Expect = 7e-62
 Identities = 126/254 (49%), Positives = 155/254 (61%), Gaps = 2/254 (0%)
 Frame = +3

Query: 15  VAITAVPQKFSVNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLG 194
           + I +   KF  +  LN +W  +K T  R Y    +  RR +WE+N+K I+ HN EY  G
Sbjct: 12  LGIASAAPKF--DQSLNAQWYQWKATHRRLYGMNEEGWRRAVWEKNMKMIELHNREYSQG 69

Query: 195 KHTYSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVT 374
           KH +++ +N F DMTNEEF+    G    K   +G  +  P     +P SVDWR+KGYVT
Sbjct: 70  KHGFTMAMNAFGDMTNEEFRQVMNGFQNQKHK-KGKMFQEPL-FAEIPKSVDWREKGYVT 127

Query: 375 PVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAF 554
           PVKNQ QCGSCW+FSATG+LEGQ FRK  +L+S SEQ LVDCS           LMDNAF
Sbjct: 128 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAF 187

Query: 555 RYIKDQ-GIESEGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPV 728
           RY+KD  G++SE  YPY   D  TC   P       TGF D+  Q E  L  AVAT+GP+
Sbjct: 188 RYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDL-PQREKALMKAVATLGPI 246

Query: 729 SVAIDAGHASFQLY 770
           SVAIDAGH SFQ Y
Sbjct: 247 SVAIDAGHQSFQFY 260
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  233 bits (595), Expect = 3e-61
 Identities = 127/257 (49%), Positives = 153/257 (59%), Gaps = 2/257 (0%)
 Frame = +3

Query: 6   LVVVAITAVPQKFSVNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEY 185
           L V+ +        ++  L+  W  +K T  R Y    +  RR +WE+N K I  HN EY
Sbjct: 7   LTVLCLGVASAAPKLDPNLDAHWHQWKATHRRLYGMNEEEWRRAVWEKNKKIIDLHNQEY 66

Query: 186 DLGKHTYSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKG 365
             GKH + + +N F DMTNEEF+    G    K   +G  +  P  + V P SVDW +KG
Sbjct: 67  SEGKHAFRMAMNAFGDMTNEEFRQVMNGFQNQKHK-KGKLFHEPLLVDV-PKSVDWTKKG 124

Query: 366 YVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMD 545
           YVTPVKNQ QCGSCW+FSATG+LEGQ FRK  +L+S SEQ LVDCS           LMD
Sbjct: 125 YVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMD 184

Query: 546 NAFRYIKDQ-GIESEGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATV 719
           NAF+YIKD  G++SE  YPY ATD  +C   P       TGF DI  Q E  L  AVATV
Sbjct: 185 NAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDI-PQREKALMKAVATV 243

Query: 720 GPVSVAIDAGHASFQLY 770
           GP+SVAIDAGH SFQ Y
Sbjct: 244 GPISVAIDAGHTSFQFY 260
>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  229 bits (583), Expect = 8e-60
 Identities = 121/243 (49%), Positives = 152/243 (62%), Gaps = 2/243 (0%)
 Frame = +3

Query: 48  VNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQF 227
           ++  L+ +W  +K T GR Y    +  RR +WE+N+K I+ HN EY  GKH +S+ +N F
Sbjct: 21  LDQNLDADWYKWKATHGRLYGMNEEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAF 80

Query: 228 ADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSC 407
            DMTNEEF+    G    K   +G  +     + V P SVDWR+KGYVT VKNQ QCGSC
Sbjct: 81  GDMTNEEFRQVMNGFQNQKHK-KGKVFHESLVLEV-PKSVDWREKGYVTAVKNQGQCGSC 138

Query: 408 WSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIES 584
           W+FSATG+LEGQ FRK  +L+S SEQ LVDCS           LMDNAF+Y+KD  G+++
Sbjct: 139 WAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDT 198

Query: 585 EGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASF 761
           E  YPY   +  +C   P       TGF DI  Q E  L  AVATVGP+SVAIDAGH+SF
Sbjct: 199 EESYPYLGRETNSCTYKPECSAANDTGFVDI-PQREKALMKAVATVGPISVAIDAGHSSF 257

Query: 762 QLY 770
           Q Y
Sbjct: 258 QFY 260
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 339

 Score =  224 bits (570), Expect = 3e-58
 Identities = 124/264 (46%), Positives = 162/264 (61%), Gaps = 9/264 (3%)
 Frame = +3

Query: 6   LVVVAITAVPQKFSVNSELNEEWETYKTTFGRKY-DELTDITRRLIWEQNLKYIQKHNLE 182
           + ++A+ A+ Q  S    + EEW TYK    + Y +E+ +  R  I+ +N   I KHN  
Sbjct: 6   VALLALVALTQAISPLDLIKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQL 65

Query: 183 YDLGKHTYSLGLNQFADMTNEEFKAKYLG-------IMKTKPTLEGSTYMAPENIGVLPA 341
           +  GK +Y LGLN++ADM + EFK    G       +M+ +  L G+TY+ P ++ V P 
Sbjct: 66  FAQGKVSYKLGLNKYADMLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTV-PK 124

Query: 342 SVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXX 521
           SVDWR+ G VT VK+Q  CGSCW+FS+TG+LEGQ+FRK   L+S SEQ LVDCS      
Sbjct: 125 SVDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNN 184

Query: 522 XXXXXLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDL 698
                LMDNAFRYIKD  GI++E  YPY   D +C  N + I    TGF DI   +E  +
Sbjct: 185 GCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKM 244

Query: 699 ANAVATVGPVSVAIDAGHASFQLY 770
             AVAT+GPVSVAIDA H SFQLY
Sbjct: 245 KKAVATMGPVSVAIDASHESFQLY 268
>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
           proteinase) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  221 bits (564), Expect = 1e-57
 Identities = 120/235 (51%), Positives = 146/235 (62%), Gaps = 1/235 (0%)
 Frame = +3

Query: 69  EWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEE 248
           EW  +K+T  R Y    +  RR IWE+N++ IQ HN EY  G+H +S+ +N F DMTNEE
Sbjct: 28  EWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEE 87

Query: 249 FKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATG 428
           F+    G    K   +G  +  P  + + P SVDWR+KG VTPVKNQ QCGSCW+FSA+G
Sbjct: 88  FRQVVNGYRHQKHK-KGRLFQEPLMLKI-PKSVDWREKGCVTPVKNQGQCGSCWAFSASG 145

Query: 429 SLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEGDYPYT 605
            LEGQ F K  +LIS SEQ LVDCS           LMD AF+YIK+  G++SE  YPY 
Sbjct: 146 CLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYE 205

Query: 606 ATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 770
           A DG+CK      V   TGF DI  Q E  L  AVATVGP+SVA+DA H S Q Y
Sbjct: 206 AKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDASHPSLQFY 259
>sp|Q24940|CATLP_FASHE Cathepsin L-like proteinase precursor
          Length = 326

 Score =  221 bits (563), Expect = 2e-57
 Identities = 108/233 (46%), Positives = 144/233 (61%)
 Frame = +3

Query: 72  WETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEEF 251
           W  +K  + ++Y+   D  RR IWE+N+K+IQ+HNL +DLG  TY+LGLNQF DMT EEF
Sbjct: 21  WHQWKRMYNKEYNGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEF 80

Query: 252 KAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGS 431
           KAKYL  M     +         N   +P  +DWR+ GYVT VK+Q  CGSCW+FS TG+
Sbjct: 81  KAKYLTEMSRASDILSHGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGT 140

Query: 432 LEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQGIESEGDYPYTAT 611
           +EGQY +     ISFSEQQLVDCS           LM+NA++Y+K  G+E+E  YPYTA 
Sbjct: 141 MEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQFGLETESSYPYTAV 200

Query: 612 DGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 770
           +G C+ N    V K TG+  + S +E +L N V    P +VA+D   + F +Y
Sbjct: 201 EGQCRYNKQLGVAKVTGYYTVHSGSEVELKNLVGARRPAAVAVDV-ESDFMMY 252
>sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted protein) (MEP) (Cyclic
           protein 2) (CP-2) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  221 bits (563), Expect = 2e-57
 Identities = 118/237 (49%), Positives = 147/237 (62%), Gaps = 1/237 (0%)
 Frame = +3

Query: 63  NEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTN 242
           N +W  +K+T  R Y    +  RR +WE+N++ IQ HN EY  GKH +++ +N F DMTN
Sbjct: 26  NAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTN 85

Query: 243 EEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSA 422
           EEF+    G    K   +G  +  P  + + P +VDWR+KG VTPVKNQ QCGSCW+FSA
Sbjct: 86  EEFRQIVNGYRHQKHK-KGRLFQEPLMLQI-PKTVDWREKGCVTPVKNQGQCGSCWAFSA 143

Query: 423 TGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEGDYP 599
           +G LEGQ F K  +LIS SEQ LVDCS           LMD AF+YIK+  G++SE  YP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203

Query: 600 YTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 770
           Y A DG+CK      V   TGF DI  Q E  L  AVATVGP+SVA+DA H S Q Y
Sbjct: 204 YEAKDGSCKYRAEYAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDASHPSLQFY 259
>sp|P43236|CATK_RABIT Cathepsin K precursor (OC-2 protein)
          Length = 329

 Score =  219 bits (558), Expect = 6e-57
 Identities = 121/259 (46%), Positives = 159/259 (61%), Gaps = 4/259 (1%)
 Frame = +3

Query: 6   LVVVAITAVPQKFSVNSELNEEWETYKTTFGRKYDELTD-ITRRLIWEQNLKYIQKHNLE 182
           L VV+    P++      L+ +WE +K T+ ++Y+   D I+RRLIWE+NLK+I  HNLE
Sbjct: 9   LPVVSFALHPEEI-----LDTQWELWKKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLE 63

Query: 183 YDLGKHTYSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGS--TYMAPENIGVLPASVDWR 356
             LG HTY L +N   DMT+EE   K  G+ K  P+   S  T   P+  G  P S+D+R
Sbjct: 64  ASLGVHTYELAMNHLGDMTSEEVVQKMTGL-KVPPSRSHSNDTLYIPDWEGRTPDSIDYR 122

Query: 357 QKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXX 536
           +KGYVTPVKNQ QCGSCW+FS+ G+LEGQ  +K  +L++ S Q LVDC            
Sbjct: 123 KKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENYGCGGG 180

Query: 537 LMDNAFRYI-KDQGIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVA 713
            M NAF+Y+ +++GI+SE  YPY   D +C  NP+    KC G+ +I   NE  L  AVA
Sbjct: 181 YMTNAFQYVQRNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVA 240

Query: 714 TVGPVSVAIDAGHASFQLY 770
            VGPVSVAIDA   SFQ Y
Sbjct: 241 RVGPVSVAIDASLTSFQFY 259
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 83,373,739
Number of Sequences: 369166
Number of extensions: 1600622
Number of successful extensions: 4692
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4221
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4355
length of database: 68,354,980
effective HSP length: 108
effective length of database: 48,403,600
effective search space used: 7163732800
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)