Planarian EST Database


Dr_sW_018_F15

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_018_F15
         (592 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q26636|CATL_SARPE  Cathepsin L precursor [Contains: Cathe...   201   1e-51
sp|Q95029|CATL_DROME  Cathepsin L precursor (Cysteine protei...   201   1e-51
sp|P25782|CYSP2_HOMAM  Digestive cysteine proteinase 2 precu...   198   7e-51
sp|Q10991|CATL_SHEEP  Cathepsin L [Contains: Cathepsin L hea...   197   2e-50
sp|P07711|CATL_HUMAN  Cathepsin L precursor (Major excreted ...   194   1e-49
sp|P25975|CATL_BOVIN  Cathepsin L precursor [Contains: Cathe...   192   4e-49
sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   191   9e-49
sp|Q9GL24|CATL_CANFA  Cathepsin L precursor [Contains: Cathe...   191   9e-49
sp|P25784|CYSP3_HOMAM  Digestive cysteine proteinase 3 precu...   189   3e-48
sp|P07154|CATL_RAT  Cathepsin L precursor (Major excreted pr...   188   8e-48
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 339

 Score =  201 bits (511), Expect = 1e-51
 Identities = 100/174 (57%), Positives = 116/174 (66%), Gaps = 2/174 (1%)
 Frame = +2

Query: 8   LISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESEGDYPYTATDGTCKRNPSK 184
           L+S SEQ LVDCS           LMDNAFRYIKD G I++E  YPY   D +C  N + 
Sbjct: 166 LVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKAT 225

Query: 185 IVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDH 364
           I    TGF DI   +E  +  AVAT+GPVSVAIDA H SFQLY  G+YNE  C    LDH
Sbjct: 226 IGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDEQNLDH 285

Query: 365 GVLAVGYGT-QIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 523
           GVL VGYGT + G  YW+VKNSW  TWGE GYIKM++++ NQCGIAT +SYP V
Sbjct: 286 GVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMARNQNNQCGIATASSYPTV 339
>sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine proteinase 1) [Contains: Cathepsin
           L heavy chain; Cathepsin L light chain]
          Length = 341

 Score =  201 bits (511), Expect = 1e-51
 Identities = 99/174 (56%), Positives = 118/174 (67%), Gaps = 2/174 (1%)
 Frame = +2

Query: 8   LISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESEGDYPYTATDGTCKRNPSK 184
           L+S SEQ LVDCS           LMDNAFRYIKD G I++E  YPY A D +C  N   
Sbjct: 168 LVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGT 227

Query: 185 IVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDH 364
           +     GFTDI   +E  +A AVATVGPVSVAIDA H SFQ Y  G+YNE  C    LDH
Sbjct: 228 VGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDH 287

Query: 365 GVLAVGYGT-QIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 523
           GVL VG+GT + G+ YW+VKNSW  TWG+ G+IKM ++K+NQCGIA+ +SYPLV
Sbjct: 288 GVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 341
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precursor
          Length = 323

 Score =  198 bits (504), Expect = 7e-51
 Identities = 93/173 (53%), Positives = 124/173 (71%), Gaps = 1/173 (0%)
 Frame = +2

Query: 8   LISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIK-DQGIESEGDYPYTATDGTCKRNPSK 184
           LIS +EQQLVDCS            M++AF YIK + GI++E  YPY A DG+C+ + + 
Sbjct: 151 LISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPYEARDGSCRFDSNS 210

Query: 185 IVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDH 364
           +   C+G T+I S +ET L  AV  +GP+SV IDA H+SFQ Y SG+Y E SCS + LDH
Sbjct: 211 VAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPSYLDH 270

Query: 365 GVLAVGYGTQIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 523
            VLAVGYG++ G+ +W+VKNSW  +WG++GYIKMS+++ N CGIAT+ASYPLV
Sbjct: 271 AVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVASYPLV 323
>sp|Q10991|CATL_SHEEP Cathepsin L [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]
          Length = 217

 Score =  197 bits (501), Expect = 2e-50
 Identities = 101/175 (57%), Positives = 117/175 (66%), Gaps = 2/175 (1%)
 Frame = +2

Query: 5   RLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESEGDYPYTATDGTCKRNPS 181
           +L+S SEQ LVD S           LMDNAF+YIK+ G ++SE  YPY ATD +C   P 
Sbjct: 44  KLVSLSEQNLVDSSRPQGNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYKPE 103

Query: 182 KIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLD 361
               K TGF DI  Q E  L  AVATVGP+SVAIDAGH+SFQ YKSGIY +  CS+  LD
Sbjct: 104 YSAAKDTGFVDIP-QREKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLD 162

Query: 362 HGVLAVGYGTQ-IGKKYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 523
           HGVL VGYG +    K+WIVKNSW   WG  GY+KM+KD+ N CGIAT ASYP V
Sbjct: 163 HGVLVVGYGFEGTNNKFWIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAASYPTV 217
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
           Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 333

 Score =  194 bits (494), Expect = 1e-49
 Identities = 100/178 (56%), Positives = 117/178 (65%), Gaps = 5/178 (2%)
 Frame = +2

Query: 5   RLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESEGDYPYTATDGTCKRNPS 181
           RLIS SEQ LVDCS           LMD AF+Y++D G ++SE  YPY AT+ +CK NP 
Sbjct: 157 RLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPK 216

Query: 182 KIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLD 361
             V   TGF DI  Q E  L  AVATVGP+SVAIDAGH SF  YK GIY E  CS+  +D
Sbjct: 217 YSVANDTGFVDIPKQ-EKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD 275

Query: 362 HGVLAVGYGTQI----GKKYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 523
           HGVL VGYG +       KYW+VKNSW   WG  GY+KM+KD++N CGIA+ ASYP V
Sbjct: 276 HGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  192 bits (489), Expect = 4e-49
 Identities = 102/179 (56%), Positives = 116/179 (64%), Gaps = 6/179 (3%)
 Frame = +2

Query: 5   RLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESEGDYPYTATD-GTCKRNP 178
           +L+S SEQ LVDCS           LMDNAF+YIKD G ++SE  YPY ATD  +C   P
Sbjct: 157 KLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKP 216

Query: 179 SKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQL 358
                  TGF DI  Q E  L  AVATVGP+SVAIDAGH SFQ YKSGIY +  CS   L
Sbjct: 217 ECSAANDTGFVDIP-QREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSCKDL 275

Query: 359 DHGVLAVGYGTQ----IGKKYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 523
           DHGVL VGYG +       K+WIVKNSW   WG +GY+KM+KD+ N CGIAT ASYP V
Sbjct: 276 DHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score =  191 bits (486), Expect = 9e-49
 Identities = 96/178 (53%), Positives = 113/178 (63%), Gaps = 5/178 (2%)
 Frame = +2

Query: 5   RLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESEGDYPYTATDGTCKRNPS 181
           +L+S SEQ LVDCS            M  AF+Y+K+ G ++SE  YPY A D  CK  P 
Sbjct: 157 KLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPE 216

Query: 182 KIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLD 361
             V   TGFT +    E  L  AVATVGP+SVA+DAGH+SFQ YKSGIY E  CS+  LD
Sbjct: 217 NSVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLD 276

Query: 362 HGVLAVGYG----TQIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 523
           HGVL VGYG         KYW+VKNSW   WG +GY+K++KDK N CGIAT ASYP V
Sbjct: 277 HGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334
>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 333

 Score =  191 bits (486), Expect = 9e-49
 Identities = 100/178 (56%), Positives = 115/178 (64%), Gaps = 5/178 (2%)
 Frame = +2

Query: 5   RLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESEGDYPYTATDG-TCKRNP 178
           +L+S SEQ LVDCS           LMDNAFRY+KD G ++SE  YPY   D  TC   P
Sbjct: 157 KLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKP 216

Query: 179 SKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQL 358
                  TGF D+  Q E  L  AVAT+GP+SVAIDAGH SFQ YKSGIY +  CS+  L
Sbjct: 217 ECSAANDTGFVDLP-QREKALMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDL 275

Query: 359 DHGVLAVGY---GTQIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 523
           DHGVL VGY   GT    K+WIVKNSW   WG +GY+KM+KD+ N CGIAT ASYP V
Sbjct: 276 DHGVLVVGYGFEGTDSNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 333
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precursor
          Length = 321

 Score =  189 bits (481), Expect = 3e-48
 Identities = 94/175 (53%), Positives = 119/175 (68%), Gaps = 1/175 (0%)
 Frame = +2

Query: 2   NRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESEGDYPYTATDGTCKRNP 178
           + L+S SEQQLVDCS            M +AF YIKD G I++E  YPY A D +C+ + 
Sbjct: 148 DELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEAEDRSCRFDA 207

Query: 179 SKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQL 358
           + I   CTG  ++Q   E  L  AV+ VGP+SVAIDA H SFQ Y SG+Y E++CS T L
Sbjct: 208 NSIGAICTGSVEVQHTEEA-LQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQNCSPTFL 266

Query: 359 DHGVLAVGYGTQIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 523
           DHGVLAVGYGT+  K YW+VKNSW  +WG++GYIKMS+++ N CGIA+  SYP V
Sbjct: 267 DHGVLAVGYGTESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNNCGIASEPSYPTV 321
>sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted protein) (MEP) (Cyclic
           protein 2) (CP-2) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  188 bits (478), Expect = 8e-48
 Identities = 100/178 (56%), Positives = 116/178 (65%), Gaps = 5/178 (2%)
 Frame = +2

Query: 5   RLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESEGDYPYTATDGTCKRNPS 181
           +LIS SEQ LVDCS           LMD AF+YIK+ G ++SE  YPY A DG+CK    
Sbjct: 157 KLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAE 216

Query: 182 KIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLD 361
             V   TGF DI  Q E  L  AVATVGP+SVA+DA H S Q Y SGIY E +CS+  LD
Sbjct: 217 YAVANDTGFVDIPQQ-EKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLD 275

Query: 362 HGVLAVGY---GTQIGK-KYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 523
           HGVL VGY   GT   K KYW+VKNSW   WG  GYIK++KD+ N CG+AT ASYP+V
Sbjct: 276 HGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIV 333
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 64,080,522
Number of Sequences: 369166
Number of extensions: 1259473
Number of successful extensions: 3907
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3439
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3516
length of database: 68,354,980
effective HSP length: 105
effective length of database: 48,957,805
effective search space used: 4455160255
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)