Planarian EST Database


Dr_sW_020_P07

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_020_P07
         (664 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q95029|CATL_DROME  Cathepsin L precursor (Cysteine protei...   224   2e-58
sp|Q26636|CATL_SARPE  Cathepsin L precursor [Contains: Cathe...   223   3e-58
sp|Q10991|CATL_SHEEP  Cathepsin L [Contains: Cathepsin L hea...   221   1e-57
sp|P25782|CYSP2_HOMAM  Digestive cysteine proteinase 2 precu...   220   3e-57
sp|P07711|CATL_HUMAN  Cathepsin L precursor (Major excreted ...   218   9e-57
sp|P25975|CATL_BOVIN  Cathepsin L precursor [Contains: Cathe...   216   3e-56
sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   215   7e-56
sp|Q9GL24|CATL_CANFA  Cathepsin L precursor [Contains: Cathe...   215   7e-56
sp|P25784|CYSP3_HOMAM  Digestive cysteine proteinase 3 precu...   214   2e-55
sp|Q28944|CATL_PIG  Cathepsin L precursor [Contains: Catheps...   209   5e-54
>sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine proteinase 1) [Contains: Cathepsin
           L heavy chain; Cathepsin L light chain]
          Length = 341

 Score =  224 bits (571), Expect = 2e-58
 Identities = 110/192 (57%), Positives = 134/192 (69%), Gaps = 2/192 (1%)
 Frame = +2

Query: 26  SFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESE 202
           +FS+TG+LEGQ+FRK+  L+S SEQ LVDCS           LMDNAFRYIKD G I++E
Sbjct: 150 AFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTE 209

Query: 203 GDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQL 382
             YPY A D +C  N   +     GFTDI   +E  +A AVATVGPVSVAIDA H SFQ 
Sbjct: 210 KSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQF 269

Query: 383 YKSGIYNEESCSTTQLDHGVLAVGYGT-QIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQ 559
           Y  G+YNE  C    LDHGVL VG+GT + G+ YW+VKNSW  TWG+ G+IKM ++K+NQ
Sbjct: 270 YSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQ 329

Query: 560 CGIATMASYPLV 595
           CGIA+ +SYPLV
Sbjct: 330 CGIASASSYPLV 341
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 339

 Score =  223 bits (568), Expect = 3e-58
 Identities = 111/192 (57%), Positives = 131/192 (68%), Gaps = 2/192 (1%)
 Frame = +2

Query: 26  SFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESE 202
           +FS+TG+LEGQ+FRK   L+S SEQ LVDCS           LMDNAFRYIKD G I++E
Sbjct: 148 AFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTE 207

Query: 203 GDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQL 382
             YPY   D +C  N + I    TGF DI   +E  +  AVAT+GPVSVAIDA H SFQL
Sbjct: 208 KSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASHESFQL 267

Query: 383 YKSGIYNEESCSTTQLDHGVLAVGYGT-QIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQ 559
           Y  G+YNE  C    LDHGVL VGYGT + G  YW+VKNSW  TWGE GYIKM++++ NQ
Sbjct: 268 YSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMARNQNNQ 327

Query: 560 CGIATMASYPLV 595
           CGIAT +SYP V
Sbjct: 328 CGIATASSYPTV 339
>sp|Q10991|CATL_SHEEP Cathepsin L [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]
          Length = 217

 Score =  221 bits (563), Expect = 1e-57
 Identities = 113/192 (58%), Positives = 131/192 (68%), Gaps = 2/192 (1%)
 Frame = +2

Query: 26  SFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESE 202
           +FSATG+LEGQ FRK  +L+S SEQ LVD S           LMDNAF+YIK+ G ++SE
Sbjct: 27  AFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQGNQGCNGGLMDNAFQYIKENGGLDSE 86

Query: 203 GDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQL 382
             YPY ATD +C   P     K TGF DI  Q E  L  AVATVGP+SVAIDAGH+SFQ 
Sbjct: 87  ESYPYEATDTSCNYKPEYSAAKDTGFVDIP-QREKALMKAVATVGPISVAIDAGHSSFQF 145

Query: 383 YKSGIYNEESCSTTQLDHGVLAVGYGTQ-IGKKYWIVKNSWDVTWGESGYIKMSKDKKNQ 559
           YKSGIY +  CS+  LDHGVL VGYG +    K+WIVKNSW   WG  GY+KM+KD+ N 
Sbjct: 146 YKSGIYYDPDCSSKDLDHGVLVVGYGFEGTNNKFWIVKNSWGPEWGNKGYVKMAKDQNNH 205

Query: 560 CGIATMASYPLV 595
           CGIAT ASYP V
Sbjct: 206 CGIATAASYPTV 217
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precursor
          Length = 323

 Score =  220 bits (560), Expect = 3e-57
 Identities = 104/191 (54%), Positives = 137/191 (71%), Gaps = 1/191 (0%)
 Frame = +2

Query: 26  SFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIK-DQGIESE 202
           +FS TGSLEGQ+F K   LIS +EQQLVDCS            M++AF YIK + GI++E
Sbjct: 133 AFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTE 192

Query: 203 GDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQL 382
             YPY A DG+C+ + + +   C+G T+I S +ET L  AV  +GP+SV IDA H+SFQ 
Sbjct: 193 AAYPYEARDGSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQF 252

Query: 383 YKSGIYNEESCSTTQLDHGVLAVGYGTQIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQC 562
           Y SG+Y E SCS + LDH VLAVGYG++ G+ +W+VKNSW  +WG++GYIKMS+++ N C
Sbjct: 253 YSSGVYYEPSCSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNC 312

Query: 563 GIATMASYPLV 595
           GIAT+ASYPLV
Sbjct: 313 GIATVASYPLV 323
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
           Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 333

 Score =  218 bits (556), Expect = 9e-57
 Identities = 112/195 (57%), Positives = 131/195 (67%), Gaps = 5/195 (2%)
 Frame = +2

Query: 26  SFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESE 202
           +FSATG+LEGQ FRK  RLIS SEQ LVDCS           LMD AF+Y++D G ++SE
Sbjct: 140 AFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSE 199

Query: 203 GDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQL 382
             YPY AT+ +CK NP   V   TGF DI  Q E  L  AVATVGP+SVAIDAGH SF  
Sbjct: 200 ESYPYEATEESCKYNPKYSVANDTGFVDIPKQ-EKALMKAVATVGPISVAIDAGHESFLF 258

Query: 383 YKSGIYNEESCSTTQLDHGVLAVGYGTQI----GKKYWIVKNSWDVTWGESGYIKMSKDK 550
           YK GIY E  CS+  +DHGVL VGYG +       KYW+VKNSW   WG  GY+KM+KD+
Sbjct: 259 YKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDR 318

Query: 551 KNQCGIATMASYPLV 595
           +N CGIA+ ASYP V
Sbjct: 319 RNHCGIASAASYPTV 333
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  216 bits (551), Expect = 3e-56
 Identities = 114/196 (58%), Positives = 130/196 (66%), Gaps = 6/196 (3%)
 Frame = +2

Query: 26  SFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESE 202
           +FSATG+LEGQ FRK  +L+S SEQ LVDCS           LMDNAF+YIKD G ++SE
Sbjct: 140 AFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLDSE 199

Query: 203 GDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQ 379
             YPY ATD  +C   P       TGF DI  Q E  L  AVATVGP+SVAIDAGH SFQ
Sbjct: 200 ESYPYLATDTNSCNYKPECSAANDTGFVDIP-QREKALMKAVATVGPISVAIDAGHTSFQ 258

Query: 380 LYKSGIYNEESCSTTQLDHGVLAVGYGTQ----IGKKYWIVKNSWDVTWGESGYIKMSKD 547
            YKSGIY +  CS   LDHGVL VGYG +       K+WIVKNSW   WG +GY+KM+KD
Sbjct: 259 FYKSGIYYDPDCSCKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKMAKD 318

Query: 548 KKNQCGIATMASYPLV 595
           + N CGIAT ASYP V
Sbjct: 319 QNNHCGIATAASYPTV 334
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score =  215 bits (548), Expect = 7e-56
 Identities = 108/195 (55%), Positives = 127/195 (65%), Gaps = 5/195 (2%)
 Frame = +2

Query: 26  SFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESE 202
           +FSATG+LEGQ FRK  +L+S SEQ LVDCS            M  AF+Y+K+ G ++SE
Sbjct: 140 AFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSE 199

Query: 203 GDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQL 382
             YPY A D  CK  P   V   TGFT +    E  L  AVATVGP+SVA+DAGH+SFQ 
Sbjct: 200 ESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQF 259

Query: 383 YKSGIYNEESCSTTQLDHGVLAVGYG----TQIGKKYWIVKNSWDVTWGESGYIKMSKDK 550
           YKSGIY E  CS+  LDHGVL VGYG         KYW+VKNSW   WG +GY+K++KDK
Sbjct: 260 YKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDK 319

Query: 551 KNQCGIATMASYPLV 595
            N CGIAT ASYP V
Sbjct: 320 NNHCGIATAASYPNV 334
>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 333

 Score =  215 bits (548), Expect = 7e-56
 Identities = 112/195 (57%), Positives = 129/195 (66%), Gaps = 5/195 (2%)
 Frame = +2

Query: 26  SFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESE 202
           +FSATG+LEGQ FRK  +L+S SEQ LVDCS           LMDNAFRY+KD G ++SE
Sbjct: 140 AFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRYVKDNGGLDSE 199

Query: 203 GDYPYTATDG-TCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQ 379
             YPY   D  TC   P       TGF D+  Q E  L  AVAT+GP+SVAIDAGH SFQ
Sbjct: 200 ESYPYLGRDTETCNYKPECSAANDTGFVDLP-QREKALMKAVATLGPISVAIDAGHQSFQ 258

Query: 380 LYKSGIYNEESCSTTQLDHGVLAVGY---GTQIGKKYWIVKNSWDVTWGESGYIKMSKDK 550
            YKSGIY +  CS+  LDHGVL VGY   GT    K+WIVKNSW   WG +GY+KM+KD+
Sbjct: 259 FYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKFWIVKNSWGPEWGWNGYVKMAKDQ 318

Query: 551 KNQCGIATMASYPLV 595
            N CGIAT ASYP V
Sbjct: 319 NNHCGIATAASYPTV 333
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precursor
          Length = 321

 Score =  214 bits (545), Expect = 2e-55
 Identities = 106/191 (55%), Positives = 134/191 (70%), Gaps = 1/191 (0%)
 Frame = +2

Query: 26  SFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESE 202
           +FSATG+LEGQ+F KN+ L+S SEQQLVDCS            M +AF YIKD G I++E
Sbjct: 132 AFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTE 191

Query: 203 GDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQL 382
             YPY A D +C+ + + I   CTG  ++Q   E  L  AV+ VGP+SVAIDA H SFQ 
Sbjct: 192 SSYPYEAEDRSCRFDANSIGAICTGSVEVQHTEEA-LQEAVSGVGPISVAIDASHFSFQF 250

Query: 383 YKSGIYNEESCSTTQLDHGVLAVGYGTQIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQC 562
           Y SG+Y E++CS T LDHGVLAVGYGT+  K YW+VKNSW  +WG++GYIKMS+++ N C
Sbjct: 251 YSSGVYYEQNCSPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNNC 310

Query: 563 GIATMASYPLV 595
           GIA+  SYP V
Sbjct: 311 GIASEPSYPTV 321
>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  209 bits (532), Expect = 5e-54
 Identities = 108/196 (55%), Positives = 130/196 (66%), Gaps = 6/196 (3%)
 Frame = +2

Query: 26  SFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESE 202
           +FSATG+LEGQ FRK  +L+S SEQ LVDCS           LMDNAF+Y+KD G +++E
Sbjct: 140 AFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTE 199

Query: 203 GDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQ 379
             YPY   +  +C   P       TGF DI  Q E  L  AVATVGP+SVAIDAGH+SFQ
Sbjct: 200 ESYPYLGRETNSCTYKPECSAANDTGFVDIP-QREKALMKAVATVGPISVAIDAGHSSFQ 258

Query: 380 LYKSGIYNEESCSTTQLDHGVLAVGYGTQ----IGKKYWIVKNSWDVTWGESGYIKMSKD 547
            YKSGIY +  CS+  LDHGVL VGYG +       K+WIVKNSW   WG +GY+KM+KD
Sbjct: 259 FYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNGYVKMAKD 318

Query: 548 KKNQCGIATMASYPLV 595
           + N CGI+T ASYP V
Sbjct: 319 QNNHCGISTAASYPTV 334
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 72,490,993
Number of Sequences: 369166
Number of extensions: 1380432
Number of successful extensions: 4353
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3855
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3941
length of database: 68,354,980
effective HSP length: 106
effective length of database: 48,773,070
effective search space used: 5560129980
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
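
Note on the scores above: the bit scores and E-values in the alignment headers can be roughly cross-checked against the "Gapped" Lambda and K and the "effective search space used" printed in this footer. The sketch below assumes the standard Karlin-Altschul relations, bits = (Lambda * raw - ln K) / ln 2 and E = search_space * 2^(-bits). The function and variable names are illustrative and not part of BLAST. Because the report prints Lambda and K rounded to three significant figures, the recomputed E-value is expected to agree only approximately with the printed one.

```python
import math

# Values copied from the "Gapped" statistics block and footer of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 5560129980  # "effective search space used"

# Raw score -> bit score as printed in the ten alignment headers above.
REPORTED = {
    571: 224, 568: 223, 563: 221, 560: 220, 556: 218,
    551: 216, 548: 215, 545: 214, 532: 209,
}


def bit_score(raw, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score (Karlin-Altschul)."""
    return (lam * raw - math.log(k)) / math.log(2)


def expect(raw, space=SEARCH_SPACE):
    """Approximate E-value for a raw score in the given search space."""
    return space * 2.0 ** (-bit_score(raw))


if __name__ == "__main__":
    for raw, printed in REPORTED.items():
        # Truncating (not rounding) the recomputed value is what matches
        # the printed bit scores in this report.
        print(f"raw={raw}  bits={bit_score(raw):.2f}  "
              f"printed={printed}  E~{expect(raw):.1e}")
```

For the top hit (raw score 571) this gives a bit score that truncates to the printed 224 and an E-value of the same order of magnitude as the printed 2e-58.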