Planarian EST Database


Dr_sW_025_L05

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_025_L05
         (534 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q9GL24|CATL_CANFA  Cathepsin L precursor [Contains: Cathe...   177   1e-44
sp|Q28944|CATL_PIG  Cathepsin L precursor [Contains: Catheps...   172   5e-43
sp|P07711|CATL_HUMAN  Cathepsin L precursor (Major excreted ...   170   2e-42
sp|Q24940|CATLP_FASHE  Cathepsin L-like proteinase precursor      169   3e-42
sp|P25975|CATL_BOVIN  Cathepsin L precursor [Contains: Cathe...   169   3e-42
sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   168   7e-42
sp|P06797|CATL_MOUSE  Cathepsin L precursor (Major excreted ...   159   4e-39
sp|P07154|CATL_RAT  Cathepsin L precursor (Major excreted pr...   158   7e-39
sp|P61277|CATK_MACMU  Cathepsin K precursor >gi|47117667|sp|...   157   2e-38
sp|P43236|CATK_RABIT  Cathepsin K precursor (OC-2 protein)        156   3e-38

>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 333

 Score =  177 bits (449), Expect = 1e-44
 Identities = 87/173 (50%), Positives = 109/173 (63%)
 Frame = +1

Query: 16  NSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFA 195
           +  LN +W  +K T  R Y    +  RR +WE+N+K I+ HN EY  GKH +++ +N F 
Sbjct: 22  DQSLNAQWYQWKATHRRLYGMNEEGWRRAVWEKNMKMIELHNREYSQGKHGFTMAMNAFG 81

Query: 196 DMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCW 375
           DMTNEEF+    G    K   +G  +  P     +P SVDWR+KGYVTPVKNQ QCGSCW
Sbjct: 82  DMTNEEFRQVMNGFQNQKHK-KGKMFQEPL-FAEIPKSVDWREKGYVTPVKNQGQCGSCW 139

Query: 376 SFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYITD 534
           +FSATG+LEGQ FRK  +L+S SEQ LVDCS           LMDNAFRY+ D
Sbjct: 140 AFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRYVKD 192
>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  172 bits (436), Expect = 5e-43
 Identities = 86/174 (49%), Positives = 110/174 (63%)
 Frame = +1

Query: 13  VNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQF 192
           ++  L+ +W  +K T GR Y    +  RR +WE+N+K I+ HN EY  GKH +S+ +N F
Sbjct: 21  LDQNLDADWYKWKATHGRLYGMNEEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAF 80

Query: 193 ADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSC 372
            DMTNEEF+    G    K   +G  +     + V P SVDWR+KGYVT VKNQ QCGSC
Sbjct: 81  GDMTNEEFRQVMNGFQNQKHK-KGKVFHESLVLEV-PKSVDWREKGYVTAVKNQGQCGSC 138

Query: 373 WSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYITD 534
           W+FSATG+LEGQ FRK  +L+S SEQ LVDCS           LMDNAF+Y+ D
Sbjct: 139 WAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKD 192
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
           Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 333

 Score =  170 bits (430), Expect = 2e-42
 Identities = 85/170 (50%), Positives = 106/170 (62%)
 Frame = +1

Query: 25  LNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMT 204
           L  +W  +K    R Y    +  RR +WE+N+K I+ HN EY  GKH++++ +N F DMT
Sbjct: 25  LEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMT 84

Query: 205 NEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFS 384
           +EEF+    G    KP  +G  +  P      P SVDWR+KGYVTPVKNQ QCGSCW+FS
Sbjct: 85  SEEFRQVMNGFQNRKPR-KGKVFQEPLFYEA-PRSVDWREKGYVTPVKNQGQCGSCWAFS 142

Query: 385 ATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYITD 534
           ATG+LEGQ FRK  RLIS SEQ LVDCS           LMD AF+Y+ D
Sbjct: 143 ATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQD 192
>sp|Q24940|CATLP_FASHE Cathepsin L-like proteinase precursor
          Length = 326

 Score =  169 bits (429), Expect = 3e-42
 Identities = 81/164 (49%), Positives = 105/164 (64%)
 Frame = +1

Query: 37  WETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEEF 216
           W  +K  + ++Y+   D  RR IWE+N+K+IQ+HNL +DLG  TY+LGLNQF DMT EEF
Sbjct: 21  WHQWKRMYNKEYNGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEF 80

Query: 217 KAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGS 396
           KAKYL  M     +         N   +P  +DWR+ GYVT VK+Q  CGSCW+FS TG+
Sbjct: 81  KAKYLTEMSRASDILSHGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGT 140

Query: 397 LEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYI 528
           +EGQY +     ISFSEQQLVDCS           LM+NA++Y+
Sbjct: 141 MEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYL 184
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  169 bits (429), Expect = 3e-42
 Identities = 86/174 (49%), Positives = 106/174 (60%)
 Frame = +1

Query: 13  VNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQF 192
           ++  L+  W  +K T  R Y    +  RR +WE+N K I  HN EY  GKH + + +N F
Sbjct: 21  LDPNLDAHWHQWKATHRRLYGMNEEEWRRAVWEKNKKIIDLHNQEYSEGKHAFRMAMNAF 80

Query: 193 ADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSC 372
            DMTNEEF+    G    K   +G  +  P  + V P SVDW +KGYVTPVKNQ QCGSC
Sbjct: 81  GDMTNEEFRQVMNGFQNQKHK-KGKLFHEPLLVDV-PKSVDWTKKGYVTPVKNQGQCGSC 138

Query: 373 WSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYITD 534
           W+FSATG+LEGQ FRK  +L+S SEQ LVDCS           LMDNAF+YI D
Sbjct: 139 WAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKD 192
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score =  168 bits (426), Expect = 7e-42
 Identities = 84/177 (47%), Positives = 113/177 (63%)
 Frame = +1

Query: 4   KFSVNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGL 183
           KF  N  L+ +W  +K T  R Y    +  RR +WE+N+K I+ HN EY  GKH +++ +
Sbjct: 20  KFDQN--LDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMAM 77

Query: 184 NQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQC 363
           N F DMTNEEF+ + +G  + +   +G  +  P  +  LP SVDWR+KGYVTPVKNQ+QC
Sbjct: 78  NAFGDMTNEEFR-QMMGCFRNQKFRKGKVFREPLFLD-LPKSVDWRKKGYVTPVKNQKQC 135

Query: 364 GSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYITD 534
           GSCW+FSATG+LEGQ FRK  +L+S SEQ LVDCS            M  AF+Y+ +
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKE 192
>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
           proteinase) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  159 bits (402), Expect = 4e-39
 Identities = 83/167 (49%), Positives = 104/167 (62%)
 Frame = +1

Query: 34  EWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEE 213
           EW  +K+T  R Y    +  RR IWE+N++ IQ HN EY  G+H +S+ +N F DMTNEE
Sbjct: 28  EWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEE 87

Query: 214 FKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATG 393
           F+    G    K   +G  +  P  + + P SVDWR+KG VTPVKNQ QCGSCW+FSA+G
Sbjct: 88  FRQVVNGYRHQKHK-KGRLFQEPLMLKI-PKSVDWREKGCVTPVKNQGQCGSCWAFSASG 145

Query: 394 SLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYITD 534
            LEGQ F K  +LIS SEQ LVDCS           LMD AF+YI +
Sbjct: 146 CLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKE 192
>sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted protein) (MEP) (Cyclic
           protein 2) (CP-2) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  158 bits (400), Expect = 7e-39
 Identities = 81/169 (47%), Positives = 105/169 (62%)
 Frame = +1

Query: 28  NEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTN 207
           N +W  +K+T  R Y    +  RR +WE+N++ IQ HN EY  GKH +++ +N F DMTN
Sbjct: 26  NAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTN 85

Query: 208 EEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSA 387
           EEF+    G    K   +G  +  P  + + P +VDWR+KG VTPVKNQ QCGSCW+FSA
Sbjct: 86  EEFRQIVNGYRHQKHK-KGRLFQEPLMLQI-PKTVDWREKGCVTPVKNQGQCGSCWAFSA 143

Query: 388 TGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYITD 534
           +G LEGQ F K  +LIS SEQ LVDCS           LMD AF+YI +
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKE 192
>sp|P61277|CATK_MACMU Cathepsin K precursor
 sp|P61276|CATK_MACFA Cathepsin K precursor
          Length = 329

 Score =  157 bits (396), Expect = 2e-38
 Identities = 82/170 (48%), Positives = 105/170 (61%), Gaps = 2/170 (1%)
 Frame = +1

Query: 25  LNEEWETYKTTFGRKYDELTD-ITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADM 201
           L+  WE +K T  ++Y+   D I+RRLIWE+NLKYI  HNLE  LG HTY L +N   DM
Sbjct: 22  LDTHWELWKKTHRKQYNSKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDM 81

Query: 202 TNEEFKAKYLGI-MKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWS 378
           TNEE   K  G+ +    +    T   P+  G  P SVD+R+KGYVTPVKNQ QCGSCW+
Sbjct: 82  TNEEVVQKMTGLKVPASHSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWA 141

Query: 379 FSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYI 528
           FS+ G+LEGQ  +K  +L++ S Q LVDC             M NAF+Y+
Sbjct: 142 FSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYV 189
>sp|P43236|CATK_RABIT Cathepsin K precursor (OC-2 protein)
          Length = 329

 Score =  156 bits (395), Expect = 3e-38
 Identities = 82/171 (47%), Positives = 109/171 (63%), Gaps = 3/171 (1%)
 Frame = +1

Query: 25  LNEEWETYKTTFGRKYDELTD-ITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADM 201
           L+ +WE +K T+ ++Y+   D I+RRLIWE+NLK+I  HNLE  LG HTY L +N   DM
Sbjct: 22  LDTQWELWKKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDM 81

Query: 202 TNEEFKAKYLGIMKTKPTLEGS--TYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCW 375
           T+EE   K  G+ K  P+   S  T   P+  G  P S+D+R+KGYVTPVKNQ QCGSCW
Sbjct: 82  TSEEVVQKMTGL-KVPPSRSHSNDTLYIPDWEGRTPDSIDYRKKGYVTPVKNQGQCGSCW 140

Query: 376 SFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYI 528
           +FS+ G+LEGQ  +K  +L++ S Q LVDC             M NAF+Y+
Sbjct: 141 AFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENYGCGGGYMTNAFQYV 189
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.316    0.132    0.399 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 55,203,218
Number of Sequences: 369166
Number of extensions: 1078787
Number of successful extensions: 3126
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 2905
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2993
length of database: 68,354,980
effective HSP length: 103
effective length of database: 49,327,275
effective search space used: 3650218350
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
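
Note on the scores: the bit scores and Expect values listed for each hit can be approximately reproduced from the raw scores in parentheses and the gapped Lambda and K printed above, using the standard Karlin-Altschul relations S' = (Lambda*S - ln K) / ln 2 and E = N * 2^(-S'), where N is the "effective search space used". The sketch below is illustrative only and is not part of the BLAST output. The function names are hypothetical, and because the report prints Lambda and K to only three significant figures, the results agree with the listed values only to within rounding.

```python
import math

# Parameters copied from the "Gapped" statistics block of this report.
LAMBDA = 0.267             # gapped Lambda
K = 0.0410                 # gapped K
SEARCH_SPACE = 3650218350  # "effective search space used"


def bit_score(raw_score):
    """Convert a raw alignment score to a normalized bit score."""
    return (LAMBDA * raw_score - math.log(K)) / math.log(2)


def expect(raw_score):
    """Approximate Expect value for a raw score in this search space."""
    return SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores taken from the first and last alignments in the report.
    for name, raw in [("CATL_CANFA", 449), ("CATK_RABIT", 395)]:
        print(f"{name}: {bit_score(raw):.1f} bits, E ~ {expect(raw):.1e}")
```

For the raw scores 449 and 395, the computed bit scores fall within one bit of the 177 and 156 bits shown in the report, which appears to drop the fractional part when printing. The computed Expect values have the same order of magnitude as the listed 1e-44 and 3e-38.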