Planarian EST Database


Dr_sW_003_J22

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_003_J22
         (752 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   235   8e-62
sp|P07711|CATL_HUMAN  Cathepsin L precursor (Major excreted ...   235   1e-61
sp|Q9GL24|CATL_CANFA  Cathepsin L precursor [Contains: Cathe...   233   5e-61
sp|P25975|CATL_BOVIN  Cathepsin L precursor [Contains: Cathe...   230   4e-60
sp|Q28944|CATL_PIG  Cathepsin L precursor [Contains: Catheps...   226   5e-59
sp|Q26636|CATL_SARPE  Cathepsin L precursor [Contains: Cathe...   220   4e-57
sp|Q24940|CATLP_FASHE  Cathepsin L-like proteinase precursor      219   5e-57
sp|P06797|CATL_MOUSE  Cathepsin L precursor (Major excreted ...   219   8e-57
sp|P07154|CATL_RAT  Cathepsin L precursor (Major excreted pr...   218   1e-56
sp|P43236|CATK_RABIT  Cathepsin K precursor (OC-2 protein)        216   5e-56
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score =  235 bits (600), Expect = 8e-62
 Identities = 123/248 (49%), Positives = 159/248 (64%), Gaps = 1/248 (0%)
 Frame = +1

Query: 7   TAVPQKFSVNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHT 186
           +AVP KF  N  L+ +W  +K T  R Y    +  RR +WE+N+K I+ HN EY  GKH 
Sbjct: 16  SAVP-KFDQN--LDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIELHNGEYSQGKHG 72

Query: 187 YSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVK 366
           +++ +N F DMTNEEF+ + +G  + +   +G  +  P  +  LP SVDWR+KGYVTPVK
Sbjct: 73  FTMAMNAFGDMTNEEFR-QMMGCFRNQKFRKGKVFREPLFLD-LPKSVDWRKKGYVTPVK 130

Query: 367 NQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYI 546
           NQ+QCGSCW+FSATG+LEGQ FRK  +L+S SEQ LVDCS            M  AF+Y+
Sbjct: 131 NQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYV 190

Query: 547 KDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAI 723
           K+  G++SE  YPY A D  CK  P   V   TGFT +    E  L  AVATVGP+SVA+
Sbjct: 191 KENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMKAVATVGPISVAM 250

Query: 724 DAGHASFQ 747
           DAGH+SFQ
Sbjct: 251 DAGHSSFQ 258
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
           Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 333

 Score =  235 bits (599), Expect = 1e-61
 Identities = 123/235 (52%), Positives = 150/235 (63%), Gaps = 1/235 (0%)
 Frame = +1

Query: 43  LNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMT 222
           L  +W  +K    R Y    +  RR +WE+N+K I+ HN EY  GKH++++ +N F DMT
Sbjct: 25  LEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMT 84

Query: 223 NEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFS 402
           +EEF+    G    KP  +G  +  P      P SVDWR+KGYVTPVKNQ QCGSCW+FS
Sbjct: 85  SEEFRQVMNGFQNRKPR-KGKVFQEPLFYEA-PRSVDWREKGYVTPVKNQGQCGSCWAFS 142

Query: 403 ATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEGDY 579
           ATG+LEGQ FRK  RLIS SEQ LVDCS           LMD AF+Y++D  G++SE  Y
Sbjct: 143 ATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESY 202

Query: 580 PYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASF 744
           PY AT+ +CK NP   V   TGF DI  Q E  L  AVATVGP+SVAIDAGH SF
Sbjct: 203 PYEATEESCKYNPKYSVANDTGFVDIPKQ-EKALMKAVATVGPISVAIDAGHESF 256
>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 333

 Score =  233 bits (593), Expect = 5e-61
 Identities = 125/250 (50%), Positives = 153/250 (61%), Gaps = 2/250 (0%)
 Frame = +1

Query: 4   ITAVPQKFSVNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKH 183
           I +   KF  +  LN +W  +K T  R Y    +  RR +WE+N+K I+ HN EY  GKH
Sbjct: 14  IASAAPKF--DQSLNAQWYQWKATHRRLYGMNEEGWRRAVWEKNMKMIELHNREYSQGKH 71

Query: 184 TYSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPV 363
            +++ +N F DMTNEEF+    G    K   +G  +  P     +P SVDWR+KGYVTPV
Sbjct: 72  GFTMAMNAFGDMTNEEFRQVMNGFQNQKHK-KGKMFQEPL-FAEIPKSVDWREKGYVTPV 129

Query: 364 KNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRY 543
           KNQ QCGSCW+FSATG+LEGQ FRK  +L+S SEQ LVDCS           LMDNAFRY
Sbjct: 130 KNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRY 189

Query: 544 IKDQ-GIESEGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSV 717
           +KD  G++SE  YPY   D  TC   P       TGF D+  Q E  L  AVAT+GP+SV
Sbjct: 190 VKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDL-PQREKALMKAVATLGPISV 248

Query: 718 AIDAGHASFQ 747
           AIDAGH SFQ
Sbjct: 249 AIDAGHQSFQ 258
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  230 bits (586), Expect = 4e-60
 Identities = 124/241 (51%), Positives = 148/241 (61%), Gaps = 2/241 (0%)
 Frame = +1

Query: 31  VNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQF 210
           ++  L+  W  +K T  R Y    +  RR +WE+N K I  HN EY  GKH + + +N F
Sbjct: 21  LDPNLDAHWHQWKATHRRLYGMNEEEWRRAVWEKNKKIIDLHNQEYSEGKHAFRMAMNAF 80

Query: 211 ADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSC 390
            DMTNEEF+    G    K   +G  +  P  + V P SVDW +KGYVTPVKNQ QCGSC
Sbjct: 81  GDMTNEEFRQVMNGFQNQKHK-KGKLFHEPLLVDV-PKSVDWTKKGYVTPVKNQGQCGSC 138

Query: 391 WSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIES 567
           W+FSATG+LEGQ FRK  +L+S SEQ LVDCS           LMDNAF+YIKD  G++S
Sbjct: 139 WAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLDS 198

Query: 568 EGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASF 744
           E  YPY ATD  +C   P       TGF DI  Q E  L  AVATVGP+SVAIDAGH SF
Sbjct: 199 EESYPYLATDTNSCNYKPECSAANDTGFVDI-PQREKALMKAVATVGPISVAIDAGHTSF 257

Query: 745 Q 747
           Q
Sbjct: 258 Q 258
>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  226 bits (576), Expect = 5e-59
 Identities = 120/241 (49%), Positives = 151/241 (62%), Gaps = 2/241 (0%)
 Frame = +1

Query: 31  VNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQF 210
           ++  L+ +W  +K T GR Y    +  RR +WE+N+K I+ HN EY  GKH +S+ +N F
Sbjct: 21  LDQNLDADWYKWKATHGRLYGMNEEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAF 80

Query: 211 ADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSC 390
            DMTNEEF+    G    K   +G  +     + V P SVDWR+KGYVT VKNQ QCGSC
Sbjct: 81  GDMTNEEFRQVMNGFQNQKHK-KGKVFHESLVLEV-PKSVDWREKGYVTAVKNQGQCGSC 138

Query: 391 WSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIES 567
           W+FSATG+LEGQ FRK  +L+S SEQ LVDCS           LMDNAF+Y+KD  G+++
Sbjct: 139 WAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDT 198

Query: 568 EGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASF 744
           E  YPY   +  +C   P       TGF DI  Q E  L  AVATVGP+SVAIDAGH+SF
Sbjct: 199 EESYPYLGRETNSCTYKPECSAANDTGFVDI-PQREKALMKAVATVGPISVAIDAGHSSF 257

Query: 745 Q 747
           Q
Sbjct: 258 Q 258
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 339

 Score =  220 bits (560), Expect = 4e-57
 Identities = 123/259 (47%), Positives = 158/259 (61%), Gaps = 9/259 (3%)
 Frame = +1

Query: 1   AITAVPQKFSVNSELNEEWETYKTTFGRKY-DELTDITRRLIWEQNLKYIQKHNLEYDLG 177
           A+ A+ Q  S    + EEW TYK    + Y +E+ +  R  I+ +N   I KHN  +  G
Sbjct: 10  ALVALTQAISPLDLIKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQG 69

Query: 178 KHTYSLGLNQFADMTNEEFKAKYLG-------IMKTKPTLEGSTYMAPENIGVLPASVDW 336
           K +Y LGLN++ADM + EFK    G       +M+ +  L G+TY+ P ++ V P SVDW
Sbjct: 70  KVSYKLGLNKYADMLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTV-PKSVDW 128

Query: 337 RQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXX 516
           R+ G VT VK+Q  CGSCW+FS+TG+LEGQ+FRK   L+S SEQ LVDCS          
Sbjct: 129 REHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNG 188

Query: 517 XLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAV 693
            LMDNAFRYIKD  GI++E  YPY   D +C  N + I    TGF DI   +E  +  AV
Sbjct: 189 GLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAV 248

Query: 694 ATVGPVSVAIDAGHASFQL 750
           AT+GPVSVAIDA H SFQL
Sbjct: 249 ATMGPVSVAIDASHESFQL 267
>sp|Q24940|CATLP_FASHE Cathepsin L-like proteinase precursor
          Length = 326

 Score =  219 bits (559), Expect = 5e-57
 Identities = 106/224 (47%), Positives = 140/224 (62%)
 Frame = +1

Query: 55  WETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEEF 234
           W  +K  + ++Y+   D  RR IWE+N+K+IQ+HNL +DLG  TY+LGLNQF DMT EEF
Sbjct: 21  WHQWKRMYNKEYNGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEF 80

Query: 235 KAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGS 414
           KAKYL  M     +         N   +P  +DWR+ GYVT VK+Q  CGSCW+FS TG+
Sbjct: 81  KAKYLTEMSRASDILSHGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGT 140

Query: 415 LEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQGIESEGDYPYTAT 594
           +EGQY +     ISFSEQQLVDCS           LM+NA++Y+K  G+E+E  YPYTA 
Sbjct: 141 MEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQFGLETESSYPYTAV 200

Query: 595 DGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAID 726
           +G C+ N    V K TG+  + S +E +L N V    P +VA+D
Sbjct: 201 EGQCRYNKQLGVAKVTGYYTVHSGSEVELKNLVGARRPAAVAVD 244
>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
           proteinase) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  219 bits (557), Expect = 8e-57
 Identities = 119/233 (51%), Positives = 145/233 (62%), Gaps = 1/233 (0%)
 Frame = +1

Query: 52  EWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEE 231
           EW  +K+T  R Y    +  RR IWE+N++ IQ HN EY  G+H +S+ +N F DMTNEE
Sbjct: 28  EWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEE 87

Query: 232 FKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATG 411
           F+    G    K   +G  +  P  + + P SVDWR+KG VTPVKNQ QCGSCW+FSA+G
Sbjct: 88  FRQVVNGYRHQKHK-KGRLFQEPLMLKI-PKSVDWREKGCVTPVKNQGQCGSCWAFSASG 145

Query: 412 SLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEGDYPYT 588
            LEGQ F K  +LIS SEQ LVDCS           LMD AF+YIK+  G++SE  YPY 
Sbjct: 146 CLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYE 205

Query: 589 ATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQ 747
           A DG+CK      V   TGF DI  Q E  L  AVATVGP+SVA+DA H S Q
Sbjct: 206 AKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDASHPSLQ 257
>sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted protein) (MEP) (Cyclic
           protein 2) (CP-2) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  218 bits (556), Expect = 1e-56
 Identities = 117/235 (49%), Positives = 146/235 (62%), Gaps = 1/235 (0%)
 Frame = +1

Query: 46  NEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTN 225
           N +W  +K+T  R Y    +  RR +WE+N++ IQ HN EY  GKH +++ +N F DMTN
Sbjct: 26  NAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTN 85

Query: 226 EEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSA 405
           EEF+    G    K   +G  +  P  + + P +VDWR+KG VTPVKNQ QCGSCW+FSA
Sbjct: 86  EEFRQIVNGYRHQKHK-KGRLFQEPLMLQI-PKTVDWREKGCVTPVKNQGQCGSCWAFSA 143

Query: 406 TGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEGDYP 582
           +G LEGQ F K  +LIS SEQ LVDCS           LMD AF+YIK+  G++SE  YP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203

Query: 583 YTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQ 747
           Y A DG+CK      V   TGF DI  Q E  L  AVATVGP+SVA+DA H S Q
Sbjct: 204 YEAKDGSCKYRAEYAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDASHPSLQ 257
>sp|P43236|CATK_RABIT Cathepsin K precursor (OC-2 protein)
          Length = 329

 Score =  216 bits (550), Expect = 5e-56
 Identities = 116/239 (48%), Positives = 151/239 (63%), Gaps = 4/239 (1%)
 Frame = +1

Query: 43  LNEEWETYKTTFGRKYDELTD-ITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADM 219
           L+ +WE +K T+ ++Y+   D I+RRLIWE+NLK+I  HNLE  LG HTY L +N   DM
Sbjct: 22  LDTQWELWKKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDM 81

Query: 220 TNEEFKAKYLGIMKTKPTLEGS--TYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCW 393
           T+EE   K  G+ K  P+   S  T   P+  G  P S+D+R+KGYVTPVKNQ QCGSCW
Sbjct: 82  TSEEVVQKMTGL-KVPPSRSHSNDTLYIPDWEGRTPDSIDYRKKGYVTPVKNQGQCGSCW 140

Query: 394 SFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYI-KDQGIESE 570
           +FS+ G+LEGQ  +K  +L++ S Q LVDC             M NAF+Y+ +++GI+SE
Sbjct: 141 AFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENYGCGGGYMTNAFQYVQRNRGIDSE 198

Query: 571 GDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQ 747
             YPY   D +C  NP+    KC G+ +I   NE  L  AVA VGPVSVAIDA   SFQ
Sbjct: 199 DAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQ 257
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.315    0.131    0.389 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 81,216,881
Number of Sequences: 369166
Number of extensions: 1562889
Number of successful extensions: 4556
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4102
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4224
length of database: 68,354,980
effective HSP length: 108
effective length of database: 48,403,600
effective search space used: 6873311200
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)