Planarian EST Database


Dr_sW_028_G17

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_028_G17
         (335 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q9R013|CATF_MOUSE  Cathepsin F precursor                       120   1e-27
sp|Q9VN93|CPR1_DROME  Putative cysteine proteinase CG12163 p...   119   3e-27
sp|Q9UBX1|CATF_HUMAN  Cathepsin F precursor (CATSF)               117   8e-27
sp|Q26534|CATL_SCHMA  Cathepsin L precursor (SMCL1)               115   3e-26
sp|Q10716|CYSP1_MAIZE  Cysteine proteinase 1 precursor             91   8e-19
sp|P04988|CYSP1_DICDI  Cysteine proteinase 1 precursor             90   1e-18
sp|P43296|RD19A_ARATH  Cysteine proteinase RD19a precursor (...    84   1e-16
sp|P25784|CYSP3_HOMAM  Digestive cysteine proteinase 3 precu...    83   2e-16
sp|P25804|CYSP_PEA  Cysteine proteinase 15A precursor (Turgo...    81   9e-16
sp|P43295|A494_ARATH  Probable cysteine proteinase A494 prec...    79   3e-15
>sp|Q9R013|CATF_MOUSE Cathepsin F precursor
          Length = 462

 Score =  120 bits (300), Expect = 1e-27
 Identities = 55/109 (50%), Positives = 71/109 (65%)
 Frame = +1

Query: 4   GGLPSQAYKVIQQMGGLETESDYPYKADRKTCMLDKSKIAVYINGSESIDSSETTMAAWC 183
           GGLPS AY  I+ +GGLETE DY Y+   +TC        VYIN S  +  +E  +AAW 
Sbjct: 313 GGLPSNAYAAIKNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVELSRNENKIAAWL 372

Query: 184 SINGPISIGINAFAMQFYRGGISHPFKIFCNPDHLDHGVLIVGFNTTSS 330
           +  GPIS+ INAF MQFYR GI+HPF+  C+P  +DH VL+VG+   S+
Sbjct: 373 AQKGPISVAINAFGMQFYRHGIAHPFRPLCSPWFIDHAVLLVGYGNRSN 421
>sp|Q9VN93|CPR1_DROME Putative cysteine proteinase CG12163 precursor
          Length = 614

 Score =  119 bits (297), Expect = 3e-27
 Identities = 56/109 (51%), Positives = 74/109 (67%), Gaps = 1/109 (0%)
 Frame = +1

Query: 1   NGGLPSQAYKVIQQMGGLETESDYPYKADRKTCMLDKSKIAVYINGSESIDS-SETTMAA 177
           NGGL   AYK I+ +GGLE E++YPYKA +  C  +++   V + G   +   +ET M  
Sbjct: 457 NGGLMDNAYKAIKDIGGLEYEAEYPYKAKKNQCHFNRTLSHVQVAGFVDLPKGNETAMQE 516

Query: 178 WCSINGPISIGINAFAMQFYRGGISHPFKIFCNPDHLDHGVLIVGFNTT 324
           W   NGPISIGINA AMQFYRGG+SHP+K  C+  +LDHGVL+VG+  +
Sbjct: 517 WLLANGPISIGINANAMQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVS 565
>sp|Q9UBX1|CATF_HUMAN Cathepsin F precursor (CATSF)
          Length = 484

 Score =  117 bits (293), Expect = 8e-27
 Identities = 54/108 (50%), Positives = 69/108 (63%)
 Frame = +1

Query: 4   GGLPSQAYKVIQQMGGLETESDYPYKADRKTCMLDKSKIAVYINGSESIDSSETTMAAWC 183
           GGLPS AY  I+ +GGLETE DY Y+   ++C     K  VYIN S  +  +E  +AAW 
Sbjct: 335 GGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWL 394

Query: 184 SINGPISIGINAFAMQFYRGGISHPFKIFCNPDHLDHGVLIVGFNTTS 327
           +  GPIS+ INAF MQFYR GIS P +  C+P  +DH VL+VG+   S
Sbjct: 395 AKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRS 442
>sp|Q26534|CATL_SCHMA Cathepsin L precursor (SMCL1)
          Length = 319

 Score =  115 bits (288), Expect = 3e-26
 Identities = 55/108 (50%), Positives = 72/108 (66%)
 Frame = +1

Query: 1   NGGLPSQAYKVIQQMGGLETESDYPYKADRKTCMLDKSKIAVYINGSESIDSSETTMAAW 180
           NGGLPS AY+ I +MGGL  E +YPY A  + C L    +AVYIN S ++   ET +AAW
Sbjct: 168 NGGLPSNAYESIIKMGGLMLEDNYPYDAKNEKCHLKTDGVAVYINSSVNLTQDETELAAW 227

Query: 181 CSINGPISIGINAFAMQFYRGGISHPFKIFCNPDHLDHGVLIVGFNTT 324
              N  IS+G+NA  +QFY+ GISHP+ IFC+   LDH VL+VG+  +
Sbjct: 228 LYHNSTISVGMNALLLQFYQHGISHPWWIFCSKYLLDHAVLLVGYGVS 275
>sp|Q10716|CYSP1_MAIZE Cysteine proteinase 1 precursor
          Length = 371

 Score = 90.9 bits (224), Expect = 8e-19
 Identities = 47/108 (43%), Positives = 64/108 (59%)
 Frame = +1

Query: 1   NGGLPSQAYKVIQQMGGLETESDYPYKADRKTCMLDKSKIAVYINGSESIDSSETTMAAW 180
           NGGL + A+  +Q+ GGLE+E DYPY      C  DKSKI   +     +   E  ++A 
Sbjct: 209 NGGLMTTAFSYLQKAGGLESEKDYPYTGSDGKCKFDKSKIVASVQNFSVVSVDEAQISAN 268

Query: 181 CSINGPISIGINAFAMQFYRGGISHPFKIFCNPDHLDHGVLIVGFNTT 324
              +GP++IGINA  MQ Y GG+S P+   C   HLDHGVL+VG+  +
Sbjct: 269 LIKHGPLAIGINAAYMQTYIGGVSCPY--ICG-RHLDHGVLLVGYGAS 313
>sp|P04988|CYSP1_DICDI Cysteine proteinase 1 precursor
          Length = 343

 Score = 90.1 bits (222), Expect = 1e-18
 Identities = 47/111 (42%), Positives = 66/111 (59%), Gaps = 1/111 (0%)
 Frame = +1

Query: 1   NGGLPSQAYKVIQQMGGLETESDYPYKADRKT-CMLDKSKIAVYINGSESIDSSETTMAA 177
           NGGL   AY  I + GG++TES YPY A+  T C  + + I   I+    I  +ET MA 
Sbjct: 191 NGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAG 250

Query: 178 WCSINGPISIGINAFAMQFYRGGISHPFKIFCNPDHLDHGVLIVGFNTTSS 330
           +    GP++I  +A   QFY GG+   F I CNP+ LDHG+LIVG++  ++
Sbjct: 251 YIVSTGPLAIAADAVEWQFYIGGV---FDIPCNPNSLDHGILIVGYSAKNT 298
>sp|P43296|RD19A_ARATH Cysteine proteinase RD19a precursor (RD19)
          Length = 368

 Score = 83.6 bits (205), Expect = 1e-16
 Identities = 48/106 (45%), Positives = 63/106 (59%), Gaps = 1/106 (0%)
 Frame = +1

Query: 1   NGGLPSQAYKVIQQMGGLETESDYPYKA-DRKTCMLDKSKIAVYINGSESIDSSETTMAA 177
           NGGL + A++   + GGL  E DYPY   D KTC LDKSKI   ++    I   E  +AA
Sbjct: 207 NGGLMNSAFEYTLKTGGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVISIDEEQIAA 266

Query: 178 WCSINGPISIGINAFAMQFYRGGISHPFKIFCNPDHLDHGVLIVGF 315
               NGP+++ INA  MQ Y GG+S P+   C    L+HGVL+VG+
Sbjct: 267 NLVKNGPLAVAINAGYMQTYIGGVSCPY--ICT-RRLNHGVLLVGY 309
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precursor
          Length = 321

 Score = 83.2 bits (204), Expect = 2e-16
 Identities = 43/111 (38%), Positives = 65/111 (58%), Gaps = 2/111 (1%)
 Frame = +1

Query: 4   GGLPSQAYKVIQQMGGLETESDYPYKADRKTCMLDKSKIAVYINGSESIDSSETTMAAWC 183
           GG  + A+  I+  GG++TES YPY+A+ ++C  D + I     GS  +  +E  +    
Sbjct: 172 GGWMTSAFDYIKDNGGIDTESSYPYEAEDRSCRFDANSIGAICTGSVEVQHTEEALQEAV 231

Query: 184 SINGPISIGINA--FAMQFYRGGISHPFKIFCNPDHLDHGVLIVGFNTTSS 330
           S  GPIS+ I+A  F+ QFY  G+   ++  C+P  LDHGVL VG+ T S+
Sbjct: 232 SGVGPISVAIDASHFSFQFYSSGVY--YEQNCSPTFLDHGVLAVGYGTEST 280
>sp|P25804|CYSP_PEA Cysteine proteinase 15A precursor (Turgor-responsive protein 15A)
          Length = 363

 Score = 80.9 bits (198), Expect = 9e-16
 Identities = 41/105 (39%), Positives = 59/105 (56%)
 Frame = +1

Query: 1   NGGLPSQAYKVIQQMGGLETESDYPYKADRKTCMLDKSKIAVYINGSESIDSSETTMAAW 180
           NGGL + A++ + + GG+  E DY Y     +C  DKSK+   ++    +   E  +AA 
Sbjct: 204 NGGLMNNAFEYLLESGGVVQEKDYAYTGRDGSCKFDKSKVVASVSNFSVVTLDEDQIAAN 263

Query: 181 CSINGPISIGINAFAMQFYRGGISHPFKIFCNPDHLDHGVLIVGF 315
              NGP+++ INA  MQ Y  G+S P+   C    LDHGVL+VGF
Sbjct: 264 LVKNGPLAVAINAAWMQTYMSGVSCPY--VCAKSRLDHGVLLVGF 306
>sp|P43295|A494_ARATH Probable cysteine proteinase A494 precursor
          Length = 361

 Score = 79.3 bits (194), Expect = 3e-15
 Identities = 44/108 (40%), Positives = 65/108 (60%), Gaps = 1/108 (0%)
 Frame = +1

Query: 1   NGGLPSQAYKVIQQMGGLETESDYPYKA-DRKTCMLDKSKIAVYINGSESIDSSETTMAA 177
           NGGL + A++   + GGL  E DYPY   D  +C LD+SKI   ++    +  +E  +AA
Sbjct: 204 NGGLMNSAFEYTLKTGGLMREKDYPYTGTDGGSCKLDRSKIVASVSNFSVVSINEDQIAA 263

Query: 178 WCSINGPISIGINAFAMQFYRGGISHPFKIFCNPDHLDHGVLIVGFNT 321
               NGP+++ INA  MQ Y GG+S P+   C+   L+HGVL+VG+ +
Sbjct: 264 NLIKNGPLAVAINAAYMQTYIGGVSCPY--ICS-RRLNHGVLLVGYGS 308
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 37,431,925
Number of Sequences: 369166
Number of extensions: 663522
Number of successful extensions: 1588
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 1469
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1479
length of database: 68,354,980
effective HSP length: 79
effective length of database: 53,760,915
effective search space used: 1720349280
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
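
Note on the statistics above: the following is a minimal sketch, not part of the BLAST output, of how the reported bit scores and Expect values relate to the footer parameters through the standard Karlin-Altschul relations. All numeric constants are copied from this report. The function names are hypothetical, and the assumption that the translated query length is the nucleotide length divided by 3 (integer division) is an inference checked only against the "effective search space used" line.

```python
import math

# Values copied from the statistics footer of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
DB_LETTERS = 68_354_980
DB_SEQUENCES = 184_735
EFFECTIVE_HSP_LENGTH = 79
QUERY_NT = 335  # query length in nucleotides


def effective_search_space(query_nt, db_letters, db_sequences, hsp_len):
    """Effective query length times effective database length.

    Assumption: for a translated (BLASTX) query, the query length in
    residues is taken as query_nt // 3.
    """
    effective_query = query_nt // 3 - hsp_len
    effective_db = db_letters - db_sequences * hsp_len
    return effective_query * effective_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


SEARCH_SPACE = effective_search_space(
    QUERY_NT, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
# Top hit (CATF_MOUSE) has a raw score of 300 in this report.
TOP_BITS = bit_score(300, GAPPED_LAMBDA, GAPPED_K)
TOP_EXPECT = expect_value(TOP_BITS, SEARCH_SPACE)

if __name__ == "__main__":
    print("effective search space:", SEARCH_SPACE)
    print("bit score for raw score 300: %.1f" % TOP_BITS)
    print("Expect: %.1e" % TOP_EXPECT)
```

With the footer values, the computed search space equals the reported 1720349280, and the raw score of 300 converts to a bit score of about 120 with an Expect on the order of 1e-27, consistent with the first hit. Small differences from the printed values are expected because Lambda and K are shown rounded.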