Planarian EST Database


Dr_sW_022_I14

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_022_I14
         (769 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P55112|NAS4_CAEEL  Zinc metalloproteinase nas-4 precursor...   104   3e-22
sp|P31579|LCE_ORYLA  Low choriolytic enzyme precursor (Hatch...    89   2e-17
sp|P31581|HCE21_ORYLA  High choriolytic enzyme 2 precursor (...    88   3e-17
sp|P31580|HCE23_ORYLA  High choriolytic enzyme 1 precursor (...    87   4e-17
sp|P07584|ASTA_ASTFL  Astacin precursor (Crayfish small-mole...    86   8e-17
sp|Q18439|NAS8_CAEEL  Zinc metalloproteinase nas-8 precursor...    86   1e-16
sp|P55113|NAS7_CAEEL  Zinc metalloproteinase nas-7 precursor...    84   5e-16
sp|Q9XTD6|NAS12_CAEEL  Zinc metalloproteinase nas-12 precurs...    83   7e-16
sp|P55115|NAS15_CAEEL  Zinc metalloproteinase nas-15 precurs...    83   7e-16
sp|Q20191|NAS13_CAEEL  Zinc metalloproteinase nas-13 precurs...    82   1e-15
>sp|P55112|NAS4_CAEEL Zinc metalloproteinase nas-4 precursor (Nematode astacin 4)
          Length = 315

 Score =  104 bits (259), Expect = 3e-22
 Identities = 58/169 (34%), Positives = 96/169 (56%), Gaps = 5/169 (2%)
 Frame = +3

Query: 177 KRWPNNQVLFSFAPAFPNDKKGIVRECLVELQKDLGNCVKFSESSASHYIEVQSL--DQG 350
           +RWPNN++ ++ +  + +  + ++   + E       CVKF     S + +   +  D+G
Sbjct: 102 RRWPNNEIPYTLSSQYGSYARSVIANAMNEYHTK--TCVKFVARDPSKHHDYLWIHPDEG 159

Query: 351 CYSLLGYTGGPKQPLNLQNPGCMYSKGTVKHEFIHALGFMHTHMRKDRDNHINVKWDRIL 530
           CYSL+G TGG KQP++L + GC+   GT+ HE +HA+GF H   R+DRD++I+V W  ++
Sbjct: 160 CYSLVGKTGG-KQPVSLDS-GCI-QVGTIVHELMHAVGFFHEQSRQDRDSYIDVVWQNVM 216

Query: 531 TSHCSQFVKCD---GCEVDGPYETHSVMHYPSYGFACVPGENVVFKRDG 668
                QF K +      +D PY+  S+MHY  Y F+    + +V K+ G
Sbjct: 217 NGADDQFEKYNLNVISHLDEPYDYASIMHYGPYAFSGSGKKTLVPKKSG 265
>sp|P31579|LCE_ORYLA Low choriolytic enzyme precursor (Hatching enzyme zinc-protease LCE
           subunit) (Choriolysin L)
          Length = 271

 Score = 88.6 bits (218), Expect = 2e-17
 Identities = 60/190 (31%), Positives = 96/190 (50%), Gaps = 11/190 (5%)
 Frame = +3

Query: 90  LNNMQM-----GDMVLDKSDIALINGFKAHYSNTKRWPNN-----QVLFSFAPAFPNDKK 239
           +NN  M     GD+VL K+  A+   F A   ++ RWP +     +V +  +  + +D+K
Sbjct: 52  MNNNSMEELLEGDLVLPKTRNAM-KCFGA--PDSCRWPKSSNGIVKVPYVVSDNYESDEK 108

Query: 240 GIVRECLVELQKDLGNCVKF-SESSASHYIEVQSLDQGCYSLLGYTGGPKQPLNLQNPGC 416
             +R  + E  +    C+ F   ++   Y+ ++    GC S++GY G  KQ + LQ  GC
Sbjct: 109 ETIRNAMKEFAEK--TCIHFVPRNNERAYLSLEPRF-GCKSMMGYVGD-KQVVVLQRFGC 164

Query: 417 MYSKGTVKHEFIHALGFMHTHMRKDRDNHINVKWDRILTSHCSQFVKCDGCEVDGPYETH 596
           +     ++HE +HALGF H H R DRD H+ + W+ I+      F K D   +  PY+  
Sbjct: 165 I-KHAVIQHELLHALGFYHEHTRSDRDQHVKINWENIIKDFTHNFDKNDTDNLGTPYDYG 223

Query: 597 SVMHYPSYGF 626
           S+MHY    F
Sbjct: 224 SIMHYGRTAF 233
>sp|P31581|HCE21_ORYLA High choriolytic enzyme 2 precursor (Hatching enzyme zinc-protease
           HCE 2 subunit) (Choriolysin H 2)
          Length = 279

 Score = 87.8 bits (216), Expect = 3e-17
 Identities = 47/123 (38%), Positives = 70/123 (56%), Gaps = 1/123 (0%)
 Frame = +3

Query: 288 CVKFSESSASH-YIEVQSLDQGCYSLLGYTGGPKQPLNLQNPGCMYSKGTVKHEFIHALG 464
           C++F   +  + +I V S + GCYS LG  GG +Q L+L   GCMYS G ++HE  HALG
Sbjct: 129 CIRFVRRTNEYDFISVVSKN-GCYSELGRKGG-QQELSLNRGGCMYS-GIIQHELNHALG 185

Query: 465 FMHTHMRKDRDNHINVKWDRILTSHCSQFVKCDGCEVDGPYETHSVMHYPSYGFACVPGE 644
           F H   R DRD+++ + W  I+ +    F K D   ++ PY+  S+MHY    F+   G 
Sbjct: 186 FQHEQTRSDRDSYVRINWQNIIPASAYNFNKHDTNNLNTPYDYSSIMHYGRDAFSIAYGR 245

Query: 645 NVV 653
           + +
Sbjct: 246 DSI 248
>sp|P31580|HCE23_ORYLA High choriolytic enzyme 1 precursor (Hatching enzyme zinc-protease
           HCE 1 subunit) (Choriolysin H 1)
          Length = 270

 Score = 87.4 bits (215), Expect = 4e-17
 Identities = 46/123 (37%), Positives = 70/123 (56%), Gaps = 1/123 (0%)
 Frame = +3

Query: 288 CVKFSESSASH-YIEVQSLDQGCYSLLGYTGGPKQPLNLQNPGCMYSKGTVKHEFIHALG 464
           C++F   +  + +I V S   GCYS LG  GG +Q L++   GCMYS G ++HE  HALG
Sbjct: 120 CIRFVRRTNEYDFISVVS-KTGCYSELGRKGG-QQELSINRGGCMYS-GIIQHELNHALG 176

Query: 465 FMHTHMRKDRDNHINVKWDRILTSHCSQFVKCDGCEVDGPYETHSVMHYPSYGFACVPGE 644
           F H   R DRD+++ + W+ I+ +    F K D   ++ PY+  S+MHY    F+   G 
Sbjct: 177 FQHEQTRSDRDSYVRINWENIIPASAYNFNKHDTNNLNTPYDYSSIMHYGRDAFSIAYGR 236

Query: 645 NVV 653
           + +
Sbjct: 237 DSI 239
>sp|P07584|ASTA_ASTFL Astacin precursor (Crayfish small-molecule proteinase)
          Length = 251

 Score = 86.3 bits (212), Expect = 8e-17
 Identities = 49/151 (32%), Positives = 83/151 (54%), Gaps = 2/151 (1%)
 Frame = +3

Query: 183 WPNNQVLFSFAPAFPNDKKGIVRECLVELQKDLGNCVKF-SESSASHYIEVQSLDQGCYS 359
           W    + ++FA     D+  I+   + EL++    C++F   ++ S Y+E+ +   GC+S
Sbjct: 59  WSGGVIPYTFAGVSGADQSAILSG-MQELEEK--TCIRFVPRTTESDYVEIFTSGSGCWS 115

Query: 360 LLGYTGGPKQPLNLQNPGCMYSKGTVKHEFIHALGFMHTHMRKDRDNHINVKWDRILTSH 539
            +G   G +Q ++LQ  GC+Y  GT+ HE +HA+GF H H R DRDN++ + +  +  S 
Sbjct: 116 YVGRISGAQQ-VSLQANGCVYH-GTIIHELMHAIGFYHEHTRMDRDNYVTINYQNVDPSM 173

Query: 540 CSQF-VKCDGCEVDGPYETHSVMHYPSYGFA 629
            S F +      V   Y+ +S+MHY  Y F+
Sbjct: 174 TSNFDIDTYSRYVGEDYQYYSIMHYGKYSFS 204
>sp|Q18439|NAS8_CAEEL Zinc metalloproteinase nas-8 precursor (Nematode astacin 8)
          Length = 403

 Score = 85.9 bits (211), Expect = 1e-16
 Identities = 63/190 (33%), Positives = 96/190 (50%), Gaps = 11/190 (5%)
 Frame = +3

Query: 168 SNTKRWPNNQVLFSFAPAFPNDKKGIVRECLVELQKDLGNCVKFSESSA--SHYIEVQSL 341
           + T++WPN ++ +  +  + ND++  V     +   D   CV+F   +A  + Y+ +  +
Sbjct: 116 TGTRKWPNGRIPYVISNQY-NDRERAVLARSFQAYHDK-TCVRFVPRTAVDNDYLYIGKI 173

Query: 342 DQGCYSLLGYTGGPKQPLNLQNPGCMYSKGTVKHEFIHALGFMHTHMRKDRDNHINVKWD 521
           D GCYS +G  GG +Q L+L N GC+    T  HE +H++GF H H R DRD HI + W 
Sbjct: 174 D-GCYSDVGRAGG-RQELSLDN-GCL-QYDTAIHELMHSVGFYHEHERWDRDEHITILWH 229

Query: 522 RILTSHCSQFVKCDGCE---VDGPYETHSVMHYPSYGFACVPGENVVFKRD------GGL 674
            I      QF K D  E       Y+ +S+MHY S  F+    E +V K+       G  
Sbjct: 230 NIDREAYDQFGKVDLAESSYYGQLYDYYSIMHYDSLAFSKNGFETMVAKQSEMTAVIGAA 289

Query: 675 IDYNACYIKK 704
           ID++   I K
Sbjct: 290 IDFSPIDILK 299
>sp|P55113|NAS7_CAEEL Zinc metalloproteinase nas-7 precursor (Nematode astacin 7)
          Length = 382

 Score = 83.6 bits (205), Expect = 5e-16
 Identities = 55/175 (31%), Positives = 91/175 (52%), Gaps = 7/175 (4%)
 Frame = +3

Query: 126 KSDIALINGFKAHYSN--TKRWPNNQVLFSFAPAFPNDKKGIVRECLVELQKDLGNCVKF 299
           KSDI L    K +  +   K WPN ++ ++ +P +   ++ ++ + + +  +    C++F
Sbjct: 68  KSDIRLPRRHKRNGVSRAAKLWPNARIPYAISPHYSPHERALLAKAVKQYHEK--TCIRF 125

Query: 300 --SESSASHYIEVQSLDQGCYSLLGYTGGPKQPLNLQNPGCMYSKGTVKHEFIHALGFMH 473
              ++    Y+ +  +D GC+S +G T G  Q L+L N GCM    T+ HE +H +GF H
Sbjct: 126 VPRQTGEPDYLFIGKVD-GCFSEVGRTSGV-QVLSLDN-GCM-EYATIIHEMMHVVGFYH 181

Query: 474 THMRKDRDNHINVKWDRILTSHCSQFVKCDGCEVD---GPYETHSVMHYPSYGFA 629
            H R DRDN I++ W  I      QF K D  +      PY+  S++HY S  F+
Sbjct: 182 EHERWDRDNFIDIIWQNIDRGALDQFGKVDLSKTSYYGQPYDYKSILHYDSLAFS 236
>sp|Q9XTD6|NAS12_CAEEL Zinc metalloproteinase nas-12 precursor (Nematode astacin 12)
          Length = 384

 Score = 83.2 bits (204), Expect = 7e-16
 Identities = 70/226 (30%), Positives = 105/226 (46%), Gaps = 16/226 (7%)
 Frame = +3

Query: 6   FCL--LLVIATINYVAGQCGCPRQKLL-KRGLNNMQMGDMVL----------DKSDIALI 146
           FCL  LL+   I+    Q     Q+L+ +    +   GDM+L           K     I
Sbjct: 11  FCLGYLLLFCKISNAVKQSWEINQELITEANKEHTVFGDMLLTPAQLIRYENSKDSDLSI 70

Query: 147 NGFKAHYSNTKRWPNNQVLFSFAPAFPNDKKGIVRECLVELQKDLGNCVKFSESSASH-Y 323
            G     S+  RW NN V +  +P +   +K I+   L   ++   +C KF E +  + Y
Sbjct: 71  RGVSIKGSSMNRWSNNIVPYVISPQYSPAQKQILVSSLRYFERV--SCFKFVERTTQNDY 128

Query: 324 IEVQSLDQGCYSLLGYTGGPKQPLNLQNPGCMYSKGTVKHEFIHALGFMHTHMRKDRDNH 503
           + +  LD GCYS +G  GG +Q L+L    C+ +   + HE +HA+GF H H R DRD+ 
Sbjct: 129 LFIVPLD-GCYSYVGKIGG-RQTLSLA-ADCI-ADYIIWHEMMHAIGFEHEHQRPDRDSF 184

Query: 504 INVKWDRILTSHCSQFVKCDGCEVDGP--YETHSVMHYPSYGFACV 635
           I V +  ++      F K     V+ P  Y+  S+MHY  Y F  V
Sbjct: 185 IRVDYANVIPGQMINFDKLKTSHVEYPDIYDFKSIMHYDGYAFGRV 230
>sp|P55115|NAS15_CAEEL Zinc metalloproteinase nas-15 precursor (Nematode astacin 15)
          Length = 571

 Score = 83.2 bits (204), Expect = 7e-16
 Identities = 50/180 (27%), Positives = 93/180 (51%), Gaps = 5/180 (2%)
 Frame = +3

Query: 153 FKAHYSNTKRWPNNQVLFSFAPAFPNDKKGIVRECLVELQKDLGNCVKF--SESSASHYI 326
           + A  +  + WP  ++ ++ +  + +  + ++   + E       C+++   E++  +Y+
Sbjct: 113 YNAIKNRLQLWPEGRIPYTISSQYSSYSRSLIAASMQEYASH--TCIRWVPKEAADVNYV 170

Query: 327 EVQSLDQGCYSLLGYTGGPKQPLNLQNPGCMYSKGTVKHEFIHALGFMHTHMRKDRDNHI 506
            +   D+GCYS++G  GG KQ L+L + GC+  KG + HE +HA+GF H   R DRD+HI
Sbjct: 171 HIYP-DRGCYSMVGKMGG-KQSLSLGS-GCI-QKGIILHELMHAVGFFHEQSRTDRDDHI 226

Query: 507 NVKWDRILTSHCSQFVKCDGCEVDG---PYETHSVMHYPSYGFACVPGENVVFKRDGGLI 677
            + W+ I      QF K     +      Y+  S+MHY +  F+      ++ K++G  I
Sbjct: 227 TIMWNNIQAGMQGQFEKYGHGTIQSLGTGYDYGSIMHYGTKAFSRNGQPTMIPKKNGATI 286
>sp|Q20191|NAS13_CAEEL Zinc metalloproteinase nas-13 precursor (Nematode astacin 13)
          Length = 527

 Score = 82.4 bits (202), Expect = 1e-15
 Identities = 51/155 (32%), Positives = 80/155 (51%), Gaps = 5/155 (3%)
 Frame = +3

Query: 180 RWPNNQVLFSFAPAFPNDKKGIVRECLVELQKDLGNCVKFSESSASH--YIEVQSLDQGC 353
           +W   ++ ++ +  + +  +  + E + E +K    C+ FS  SA    YI +   D GC
Sbjct: 195 KWEQARIPYTISSQYSSYSRSKIAEAIEEYRKK--TCIDFSPKSAGDLDYIHIVP-DDGC 251

Query: 354 YSLLGYTGGPKQPLNLQNPGCMYSKGTVKHEFIHALGFMHTHMRKDRDNHINVKWDRILT 533
           YSL+G  GG KQP++L + GC+  KG + HE +HA+GF H   R DRD ++ + W  +  
Sbjct: 252 YSLVGRIGG-KQPVSLGD-GCI-QKGIIIHELMHAVGFFHEQSRADRDEYVKINWSNVEA 308

Query: 534 SHCSQFVKCDGCEVD---GPYETHSVMHYPSYGFA 629
               QF K     +D     Y+  SVMHY    F+
Sbjct: 309 GLQDQFDKYSLNMIDHLGTKYDYGSVMHYAPTAFS 343
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 98,932,329
Number of Sequences: 369166
Number of extensions: 2178962
Number of successful extensions: 5020
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4776
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4962
length of database: 68,354,980
effective HSP length: 108
effective length of database: 48,403,600
effective search space used: 7115329200
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)