Planarian EST Database


Dr_sW_021_D02

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_021_D02
         (689 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q10991|CATL_SHEEP  Cathepsin L [Contains: Cathepsin L hea...   256   4e-68
sp|P07711|CATL_HUMAN  Cathepsin L precursor (Major excreted ...   253   3e-67
sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   251   1e-66
sp|P25975|CATL_BOVIN  Cathepsin L precursor [Contains: Cathe...   251   1e-66
sp|Q9GL24|CATL_CANFA  Cathepsin L precursor [Contains: Cathe...   250   2e-66
sp|P25782|CYSP2_HOMAM  Digestive cysteine proteinase 2 precu...   249   4e-66
sp|Q95029|CATL_DROME  Cathepsin L precursor (Cysteine protei...   249   6e-66
sp|Q26636|CATL_SARPE  Cathepsin L precursor [Contains: Cathe...   247   2e-65
sp|P25784|CYSP3_HOMAM  Digestive cysteine proteinase 3 precu...   244   2e-64
sp|Q28944|CATL_PIG  Cathepsin L precursor [Contains: Catheps...   241   1e-63
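
The one-line summaries above follow a fixed layout: a SwissProt identifier, a truncated description, the bit score, and the E value. As a minimal sketch (the `Hit` type, the `parse_hit_line` name, and the regular expression are illustrative assumptions, not part of BLAST), one such line could be split into fields like this:

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'Sequences producing significant alignments' table."""
    accession: str
    entry_name: str
    description: str
    bits: int
    evalue: float


# Assumed layout: sp|ACCESSION|ENTRY  description...  bits  evalue
_HIT_RE = re.compile(r"^sp\|(\w+)\|(\w+)\s+(.*?)\s+(\d+)\s+(\S+)\s*$")


def parse_hit_line(line: str) -> Optional[Hit]:
    """Return a Hit for a summary line, or None if the line does not match."""
    m = _HIT_RE.match(line)
    if m is None:
        return None
    acc, name, desc, bits, evalue = m.groups()
    # Some reports abbreviate very small E values as e.g. 'e-100';
    # prefixing '1' makes such a token parseable as a float.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return Hit(acc, name, desc, int(bits), float(evalue))


if __name__ == "__main__":
    row = "sp|Q10991|CATL_SHEEP  Cathepsin L [Contains: Cathepsin L hea...   256   4e-68"
    print(parse_hit_line(row))
```

Lines that do not start with an `sp|` identifier, such as the column header, yield `None` and can be skipped.
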
>sp|Q10991|CATL_SHEEP Cathepsin L [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]
          Length = 217

 Score =  256 bits (654), Expect = 4e-68
 Identities = 128/208 (61%), Positives = 146/208 (70%), Gaps = 2/208 (0%)
 Frame = +3

Query: 3   GYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLM 182
           GYVTPVKNQ QCGSCW+FSATG+LEGQ FRK  +L+S SEQ LVD S           LM
Sbjct: 11  GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQGNQGCNGGLM 70

Query: 183 DNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATV 359
           DNAF+YIK+  G++SE  YPY ATD +C   P     K TGF DI  Q E  L  AVATV
Sbjct: 71  DNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDTGFVDI-PQREKALMKAVATV 129

Query: 360 GPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQ-IGKKYWIVKNSWDVT 536
           GP+SVAIDAGH+SFQ YKSGIY +  CS+  LDHGVL VGYG +    K+WIVKNSW   
Sbjct: 130 GPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTNNKFWIVKNSWGPE 189

Query: 537 WGESGYIKMSKDKKNQCGIATMASYPLV 620
           WG  GY+KM+KD+ N CGIAT ASYP V
Sbjct: 190 WGNKGYVKMAKDQNNHCGIATAASYPTV 217
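
In a BLASTX alignment the Query coordinates count nucleotides while the Sbjct coordinates count amino acids, so each aligned query residue covers three query positions. The sketch below (function names are illustrative assumptions) checks the alignment above against that convention for a plus-strand frame:

```python
def plus_frame(query_start: int) -> int:
    """Reading frame (1, 2 or 3) of a plus-strand hit that starts at the
    given 1-based nucleotide position."""
    return (query_start - 1) % 3 + 1


def codons_spanned(query_start: int, query_end: int) -> int:
    """Number of codons covered by an inclusive 1-based nucleotide range."""
    span = query_end - query_start + 1
    if span % 3 != 0:
        raise ValueError("nucleotide range is not a whole number of codons")
    return span // 3


if __name__ == "__main__":
    print(plus_frame(3))            # frame of a hit starting at nucleotide 3
    print(codons_spanned(3, 182))   # first alignment row
    print(codons_spanned(3, 620))   # whole alignment
```

For this hit the query range 3 to 620 corresponds to 206 translated residues in frame +3; the alignment length of 208 in the Identities line also counts gap columns.
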
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
           Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 333

 Score =  253 bits (647), Expect = 3e-67
 Identities = 127/211 (60%), Positives = 146/211 (69%), Gaps = 5/211 (2%)
 Frame = +3

Query: 3   GYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLM 182
           GYVTPVKNQ QCGSCW+FSATG+LEGQ FRK  RLIS SEQ LVDCS           LM
Sbjct: 124 GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLM 183

Query: 183 DNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATV 359
           D AF+Y++D  G++SE  YPY AT+ +CK NP   V   TGF DI  Q E  L  AVATV
Sbjct: 184 DYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQ-EKALMKAVATV 242

Query: 360 GPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQI----GKKYWIVKNSW 527
           GP+SVAIDAGH SF  YK GIY E  CS+  +DHGVL VGYG +       KYW+VKNSW
Sbjct: 243 GPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSW 302

Query: 528 DVTWGESGYIKMSKDKKNQCGIATMASYPLV 620
              WG  GY+KM+KD++N CGIA+ ASYP V
Sbjct: 303 GEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score =  251 bits (642), Expect = 1e-66
 Identities = 123/211 (58%), Positives = 143/211 (67%), Gaps = 5/211 (2%)
 Frame = +3

Query: 3   GYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLM 182
           GYVTPVKNQ+QCGSCW+FSATG+LEGQ FRK  +L+S SEQ LVDCS            M
Sbjct: 124 GYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFM 183

Query: 183 DNAFRYIKDQG-IESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATV 359
             AF+Y+K+ G ++SE  YPY A D  CK  P   V   TGFT +    E  L  AVATV
Sbjct: 184 ARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMKAVATV 243

Query: 360 GPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYG----TQIGKKYWIVKNSW 527
           GP+SVA+DAGH+SFQ YKSGIY E  CS+  LDHGVL VGYG         KYW+VKNSW
Sbjct: 244 GPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSW 303

Query: 528 DVTWGESGYIKMSKDKKNQCGIATMASYPLV 620
              WG +GY+K++KDK N CGIAT ASYP V
Sbjct: 304 GPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  251 bits (642), Expect = 1e-66
 Identities = 129/212 (60%), Positives = 145/212 (68%), Gaps = 6/212 (2%)
 Frame = +3

Query: 3   GYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLM 182
           GYVTPVKNQ QCGSCW+FSATG+LEGQ FRK  +L+S SEQ LVDCS           LM
Sbjct: 124 GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLM 183

Query: 183 DNAFRYIKDQ-GIESEGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVAT 356
           DNAF+YIKD  G++SE  YPY ATD  +C   P       TGF DI  Q E  L  AVAT
Sbjct: 184 DNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDI-PQREKALMKAVAT 242

Query: 357 VGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQ----IGKKYWIVKNS 524
           VGP+SVAIDAGH SFQ YKSGIY +  CS   LDHGVL VGYG +       K+WIVKNS
Sbjct: 243 VGPISVAIDAGHTSFQFYKSGIYYDPDCSCKDLDHGVLVVGYGFEGTDSNNNKFWIVKNS 302

Query: 525 WDVTWGESGYIKMSKDKKNQCGIATMASYPLV 620
           W   WG +GY+KM+KD+ N CGIAT ASYP V
Sbjct: 303 WGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 333

 Score =  250 bits (639), Expect = 2e-66
 Identities = 127/211 (60%), Positives = 144/211 (68%), Gaps = 5/211 (2%)
 Frame = +3

Query: 3   GYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLM 182
           GYVTPVKNQ QCGSCW+FSATG+LEGQ FRK  +L+S SEQ LVDCS           LM
Sbjct: 124 GYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLM 183

Query: 183 DNAFRYIKDQ-GIESEGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVAT 356
           DNAFRY+KD  G++SE  YPY   D  TC   P       TGF D+  Q E  L  AVAT
Sbjct: 184 DNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDL-PQREKALMKAVAT 242

Query: 357 VGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGY---GTQIGKKYWIVKNSW 527
           +GP+SVAIDAGH SFQ YKSGIY +  CS+  LDHGVL VGY   GT    K+WIVKNSW
Sbjct: 243 LGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKFWIVKNSW 302

Query: 528 DVTWGESGYIKMSKDKKNQCGIATMASYPLV 620
              WG +GY+KM+KD+ N CGIAT ASYP V
Sbjct: 303 GPEWGWNGYVKMAKDQNNHCGIATAASYPTV 333
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precursor
          Length = 323

 Score =  249 bits (637), Expect = 4e-66
 Identities = 117/207 (56%), Positives = 151/207 (72%), Gaps = 1/207 (0%)
 Frame = +3

Query: 3   GYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLM 182
           G VTPVK+Q QCGSCW+FS TGSLEGQ+F K   LIS +EQQLVDCS            M
Sbjct: 117 GAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWM 176

Query: 183 DNAFRYIK-DQGIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATV 359
           ++AF YIK + GI++E  YPY A DG+C+ + + +   C+G T+I S +ET L  AV  +
Sbjct: 177 NDAFDYIKANNGIDTEAAYPYEARDGSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDI 236

Query: 360 GPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQIGKKYWIVKNSWDVTW 539
           GP+SV IDA H+SFQ Y SG+Y E SCS + LDH VLAVGYG++ G+ +W+VKNSW  +W
Sbjct: 237 GPISVTIDAAHSSFQFYSSGVYYEPSCSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSW 296

Query: 540 GESGYIKMSKDKKNQCGIATMASYPLV 620
           G++GYIKMS+++ N CGIAT+ASYPLV
Sbjct: 297 GDAGYIKMSRNRNNNCGIATVASYPLV 323
>sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine proteinase 1) [Contains: Cathepsin
           L heavy chain; Cathepsin L light chain]
          Length = 341

 Score =  249 bits (635), Expect = 6e-66
 Identities = 121/208 (58%), Positives = 146/208 (70%), Gaps = 2/208 (0%)
 Frame = +3

Query: 3   GYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLM 182
           G VT VK+Q  CGSCW+FS+TG+LEGQ+FRK+  L+S SEQ LVDCS           LM
Sbjct: 134 GAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLM 193

Query: 183 DNAFRYIKDQG-IESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATV 359
           DNAFRYIKD G I++E  YPY A D +C  N   +     GFTDI   +E  +A AVATV
Sbjct: 194 DNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATV 253

Query: 360 GPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGT-QIGKKYWIVKNSWDVT 536
           GPVSVAIDA H SFQ Y  G+YNE  C    LDHGVL VG+GT + G+ YW+VKNSW  T
Sbjct: 254 GPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTT 313

Query: 537 WGESGYIKMSKDKKNQCGIATMASYPLV 620
           WG+ G+IKM ++K+NQCGIA+ +SYPLV
Sbjct: 314 WGDKGFIKMLRNKENQCGIASASSYPLV 341
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 339

 Score =  247 bits (631), Expect = 2e-65
 Identities = 122/208 (58%), Positives = 143/208 (68%), Gaps = 2/208 (0%)
 Frame = +3

Query: 3   GYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLM 182
           G VT VK+Q  CGSCW+FS+TG+LEGQ+FRK   L+S SEQ LVDCS           LM
Sbjct: 132 GAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLM 191

Query: 183 DNAFRYIKDQG-IESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATV 359
           DNAFRYIKD G I++E  YPY   D +C  N + I    TGF DI   +E  +  AVAT+
Sbjct: 192 DNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATM 251

Query: 360 GPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGT-QIGKKYWIVKNSWDVT 536
           GPVSVAIDA H SFQLY  G+YNE  C    LDHGVL VGYGT + G  YW+VKNSW  T
Sbjct: 252 GPVSVAIDASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTT 311

Query: 537 WGESGYIKMSKDKKNQCGIATMASYPLV 620
           WGE GYIKM++++ NQCGIAT +SYP V
Sbjct: 312 WGEQGYIKMARNQNNQCGIATASSYPTV 339
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precursor
          Length = 321

 Score =  244 bits (622), Expect = 2e-64
 Identities = 118/205 (57%), Positives = 148/205 (72%), Gaps = 1/205 (0%)
 Frame = +3

Query: 9   VTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDN 188
           VTPVK+Q+QCGSCW+FSATG+LEGQ+F KN+ L+S SEQQLVDCS            M +
Sbjct: 118 VTPVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTS 177

Query: 189 AFRYIKDQG-IESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGP 365
           AF YIKD G I++E  YPY A D +C+ + + I   CTG  ++Q   E  L  AV+ VGP
Sbjct: 178 AFDYIKDNGGIDTESSYPYEAEDRSCRFDANSIGAICTGSVEVQHTEEA-LQEAVSGVGP 236

Query: 366 VSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQIGKKYWIVKNSWDVTWGE 545
           +SVAIDA H SFQ Y SG+Y E++CS T LDHGVLAVGYGT+  K YW+VKNSW  +WG+
Sbjct: 237 ISVAIDASHFSFQFYSSGVYYEQNCSPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGD 296

Query: 546 SGYIKMSKDKKNQCGIATMASYPLV 620
           +GYIKMS+++ N CGIA+  SYP V
Sbjct: 297 AGYIKMSRNRDNNCGIASEPSYPTV 321
>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  241 bits (615), Expect = 1e-63
 Identities = 122/212 (57%), Positives = 144/212 (67%), Gaps = 6/212 (2%)
 Frame = +3

Query: 3   GYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLM 182
           GYVT VKNQ QCGSCW+FSATG+LEGQ FRK  +L+S SEQ LVDCS           LM
Sbjct: 124 GYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLM 183

Query: 183 DNAFRYIKDQ-GIESEGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVAT 356
           DNAF+Y+KD  G+++E  YPY   +  +C   P       TGF DI  Q E  L  AVAT
Sbjct: 184 DNAFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFVDI-PQREKALMKAVAT 242

Query: 357 VGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQ----IGKKYWIVKNS 524
           VGP+SVAIDAGH+SFQ YKSGIY +  CS+  LDHGVL VGYG +       K+WIVKNS
Sbjct: 243 VGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNS 302

Query: 525 WDVTWGESGYIKMSKDKKNQCGIATMASYPLV 620
           W   WG +GY+KM+KD+ N CGI+T ASYP V
Sbjct: 303 WGPEWGWNGYVKMAKDQNNHCGISTAASYPTV 334
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 
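
The gapped Lambda and K values relate the raw scores shown in parentheses in each alignment header to the reported bit scores through the standard Karlin-Altschul normalisation, S' = (Lambda * S - ln K) / ln 2. The following is a sketch only: `bit_score` is an illustrative name, and because Lambda and K are printed rounded, the result is approximate.

```python
import math


def bit_score(raw_score: int, lam: float = 0.267, k: float = 0.0410) -> float:
    """Convert a raw alignment score to bits: (lam * S - ln K) / ln 2.

    The defaults are the rounded gapped Lambda and K printed in this report.
    """
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores of the first two hits in this report.
    for raw in (654, 647):
        print(raw, round(bit_score(raw), 2))
```

With these rounded parameters, raw scores 654 and 647 come out a little above 256 and 253 bits respectively, which agrees with the report if the printed values are truncated to whole bits (an inference from these numbers, not a documented behaviour).
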


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 74,802,883
Number of Sequences: 369166
Number of extensions: 1424699
Number of successful extensions: 4583
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4017
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4137
length of database: 68,354,980
effective HSP length: 107
effective length of database: 48,588,335
effective search space used: 5927776870
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
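
The length figures in the summary above are mutually consistent under a simple length correction: the effective HSP length is subtracted once per database sequence and once from the translated query, and the product of the two effective lengths gives the search space. The sketch below reproduces those numbers and then estimates an E value as K * N * exp(-Lambda * S). The function names are illustrative assumptions, and the real program applies further details (for example lower bounds on effective lengths) that are not modelled here.

```python
import math


def effective_db_length(db_letters: int, n_sequences: int, hsp_len: int) -> int:
    """Database letters minus one effective HSP length per sequence."""
    return db_letters - n_sequences * hsp_len


def effective_query_length(query_nt: int, hsp_len: int) -> int:
    """Translated query length (whole codons) minus the effective HSP length."""
    return query_nt // 3 - hsp_len


def expect(raw_score: int, search_space: int,
           lam: float = 0.267, k: float = 0.0410) -> float:
    """Karlin-Altschul expectation E = K * N * exp(-lam * S), using the
    rounded gapped parameters printed in this report."""
    return k * search_space * math.exp(-lam * raw_score)


if __name__ == "__main__":
    db_eff = effective_db_length(68_354_980, 184_735, 107)
    q_eff = effective_query_length(689, 107)
    space = db_eff * q_eff
    print(db_eff, q_eff, space)
    print(f"{expect(654, space):.1e}")  # compare with Expect of the top hit
```

The E value estimated for the top hit (raw score 654) lands within the same order of magnitude as the reported 4e-68; the small difference is expected because Lambda and K are shown with limited precision.
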