Planarian EST Database


Dr_sW_003_H18

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_003_H18
         (473 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P69525|TMPS9_MOUSE  Transmembrane protease, serine 9 (Pol...    86   6e-17
sp|P69526|TMPS9_RAT  Transmembrane protease, serine 9 (Polys...    86   6e-17
sp|P98073|ENTK_HUMAN  Enteropeptidase precursor (Enterokinas...    85   7e-17
sp|Q91VE3|KLK7_MOUSE  Kallikrein-7 precursor (Stratum corneu...    85   7e-17
sp|P98072|ENTK_BOVIN  Enteropeptidase precursor (Enterokinas...    84   2e-16
sp|Q9UKR3|KLK13_HUMAN  Kallikrein 13 precursor (Kallikrein-l...    83   4e-16
sp|P98074|ENTK_PIG  Enteropeptidase precursor (Enterokinase)...    81   1e-15
sp|Q7Z410|TMPS9_HUMAN  Transmembrane protease, serine 9 (Pol...    81   1e-15
sp|P26262|KLKB1_MOUSE  Plasma kallikrein precursor (Plasma p...    80   2e-15
sp|P97435|ENTK_MOUSE  Enteropeptidase (Enterokinase) [Contai...    80   2e-15

>sp|P69525|TMPS9_MOUSE Transmembrane protease, serine 9 (Polyserase-1) (Polyserine
            protease-1) (Polyserase-I) [Contains: Serase-1; Serase-2;
            Serase-3]
          Length = 1065

 Score = 85.5 bits (210), Expect = 6e-17
 Identities = 48/111 (43%), Positives = 66/111 (59%)
 Frame = +1

Query: 49   GDRCEIVGWGSKLYSGKNISHILQKATLIILSNRECAKLYDSGIIERNKLCAGSGPLSTN 228
            G RC I GWGS L  G +++  LQKA + +LS + C + Y   I  R  LCAG      +
Sbjct: 952  GARCVITGWGS-LREGGSMARQLQKAAVRVLSEQTCRRFYPVQISSR-MLCAGFPQGGVD 1009

Query: 229  SCAGDSGGPLFFKDGLNQNIQIGINSYGSRPCGLPNVPSVFTRVAHVIRWI 381
            SC+GD+GGPL  ++   Q +  G+ S+G   CG P+ P V+TRVA V+ WI
Sbjct: 1010 SCSGDAGGPLACREPSGQWVLTGVTSWG-YGCGRPHFPGVYTRVAAVLGWI 1059

 Score = 68.2 bits (165), Expect = 9e-12
 Identities = 42/113 (37%), Positives = 57/113 (50%)
 Frame = +1

Query: 46  PGDRCEIVGWGSKLYSGKNISHILQKATLIILSNRECAKLYDSGIIERNKLCAGSGPLST 225
           PG +C I GWG           +LQKAT+ +L    C+ LY   + +R  +CAG      
Sbjct: 324 PGKKCLISGWGYLKEDFLVKPEVLQKATVELLDQSLCSSLYGHSLTDR-MVCAGYLDGKV 382

Query: 226 NSCAGDSGGPLFFKDGLNQNIQIGINSYGSRPCGLPNVPSVFTRVAHVIRWIL 384
           +SC GDSGGPL  ++   +    GI S+G   C     P V+TRV  +  WIL
Sbjct: 383 DSCQGDSGGPLVCEEPSGRFFLAGIVSWGI-GCAEARRPGVYTRVTRLRDWIL 434

 Score = 65.9 bits (159), Expect = 5e-11
 Identities = 40/112 (35%), Positives = 56/112 (50%)
 Frame = +1

Query: 49  GDRCEIVGWGSKLYSGKNISHILQKATLIILSNRECAKLYDSGIIERNKLCAGSGPLSTN 228
           G +C I GWG+          ILQKA++ I+  + C  LY+  + +R  LCAG      +
Sbjct: 625 GRKCMISGWGNMQEGNATKPDILQKASVGIIEQKMCGALYNFSLTDR-MLCAGFLEGRVD 683

Query: 229 SCAGDSGGPLFFKDGLNQNIQIGINSYGSRPCGLPNVPSVFTRVAHVIRWIL 384
           SC GDSGGPL  ++        GI S+G   C     P V+ R+  +  WIL
Sbjct: 684 SCQGDSGGPLACEETPGVFYLAGIVSWGI-GCAQAKKPGVYARITRLKDWIL 734

>sp|P69526|TMPS9_RAT Transmembrane protease, serine 9 (Polyserase-1) (Polyserine
            protease-1) (Polyserase-I) [Contains: Serase-1; Serase-2;
            Serase-3]
          Length = 1061

 Score = 85.5 bits (210), Expect = 6e-17
 Identities = 48/111 (43%), Positives = 66/111 (59%)
 Frame = +1

Query: 49   GDRCEIVGWGSKLYSGKNISHILQKATLIILSNRECAKLYDSGIIERNKLCAGSGPLSTN 228
            G RC I GWGS L  G +++  LQKA + +LS + C + Y   I  R  LCAG      +
Sbjct: 948  GARCVITGWGS-LREGGSMARQLQKAAVRVLSEQTCRRFYPVQISSR-MLCAGFPQGGVD 1005

Query: 229  SCAGDSGGPLFFKDGLNQNIQIGINSYGSRPCGLPNVPSVFTRVAHVIRWI 381
            SC+GD+GGPL  ++   Q +  G+ S+G   CG P+ P V+TRVA V+ WI
Sbjct: 1006 SCSGDAGGPLACREPSGQWVLTGVTSWG-YGCGRPHFPGVYTRVAAVLGWI 1055

 Score = 65.9 bits (159), Expect = 5e-11
 Identities = 40/112 (35%), Positives = 56/112 (50%)
 Frame = +1

Query: 49  GDRCEIVGWGSKLYSGKNISHILQKATLIILSNRECAKLYDSGIIERNKLCAGSGPLSTN 228
           G +C I GWG+          ILQKA++ I+  + C  LY+  + +R  LCAG      +
Sbjct: 625 GRKCMISGWGNMQEGNATKPDILQKASVGIIEQKMCGALYNFSLTDR-MLCAGFLEGRVD 683

Query: 229 SCAGDSGGPLFFKDGLNQNIQIGINSYGSRPCGLPNVPSVFTRVAHVIRWIL 384
           SC GDSGGPL  ++        GI S+G   C     P V+ R+  +  WIL
Sbjct: 684 SCQGDSGGPLACEETPGVFYLAGIVSWGI-GCAQAKKPGVYARITRLKDWIL 734

 Score = 65.1 bits (157), Expect = 8e-11
 Identities = 40/113 (35%), Positives = 56/113 (49%)
 Frame = +1

Query: 46  PGDRCEIVGWGSKLYSGKNISHILQKATLIILSNRECAKLYDSGIIERNKLCAGSGPLST 225
           P  +C I GWG           +LQKAT+ +L    C+ LY   + +R  +CAG      
Sbjct: 324 PRKKCLISGWGYLKEDFLVKPEVLQKATVELLDQNLCSSLYGHSLTDR-MVCAGYLDGKV 382

Query: 226 NSCAGDSGGPLFFKDGLNQNIQIGINSYGSRPCGLPNVPSVFTRVAHVIRWIL 384
           +SC GDSGGPL  ++   +    G+ S+G   C     P V+TRV  +  WIL
Sbjct: 383 DSCQGDSGGPLVCEEPSGRFFLAGVVSWGI-GCAEARRPGVYTRVTRLRDWIL 434

>sp|P98073|ENTK_HUMAN Enteropeptidase precursor (Enterokinase) [Contains: Enteropeptidase
            non-catalytic heavy chain; Enteropeptidase catalytic
            light chain]
          Length = 1019

 Score = 85.1 bits (209), Expect = 7e-17
 Identities = 45/112 (40%), Positives = 63/112 (56%)
 Frame = +1

Query: 46   PGDRCEIVGWGSKLYSGKNISHILQKATLIILSNRECAKLYDSGIIERNKLCAGSGPLST 225
            PG  C I GWG+ +Y G   ++ILQ+A + +LSN  C +      I  N +CAG      
Sbjct: 906  PGRNCSIAGWGTVVYQGTT-ANILQEADVPLLSNERCQQQMPEYNITENMICAGYEEGGI 964

Query: 226  NSCAGDSGGPLFFKDGLNQNIQIGINSYGSRPCGLPNVPSVFTRVAHVIRWI 381
            +SC GDSGGPL  ++  N+    G+ S+G + C LPN P V+ RV+    WI
Sbjct: 965  DSCQGDSGGPLMCQEN-NRWFLAGVTSFGYK-CALPNRPGVYARVSRFTEWI 1014

>sp|Q91VE3|KLK7_MOUSE Kallikrein-7 precursor (Stratum corneum chymotryptic enzyme)
           (Thymopsin)
          Length = 249

 Score = 85.1 bits (209), Expect = 7e-17
 Identities = 43/113 (38%), Positives = 62/113 (54%)
 Frame = +1

Query: 46  PGDRCEIVGWGSKLYSGKNISHILQKATLIILSNRECAKLYDSGIIERNKLCAGSGPLST 225
           PG  C + GWG+           L  + + ++S+REC K+Y   ++ +  LCAG     T
Sbjct: 136 PGTSCTVSGWGTTTSPDVTFPSDLMCSDVKLISSRECKKVYKD-LLGKTMLCAGIPDSKT 194

Query: 226 NSCAGDSGGPLFFKDGLNQNIQIGINSYGSRPCGLPNVPSVFTRVAHVIRWIL 384
           N+C GDSGGPL   D L      G+ S+G+ PCG PN P V+T+V    RW++
Sbjct: 195 NTCNGDSGGPLVCNDTLQ-----GLVSWGTYPCGQPNDPGVYTQVCKYKRWVM 242

>sp|P98072|ENTK_BOVIN Enteropeptidase precursor (Enterokinase) [Contains: Enteropeptidase
            non-catalytic heavy chain; Enteropeptidase catalytic
            light chain]
          Length = 1035

 Score = 83.6 bits (205), Expect = 2e-16
 Identities = 44/112 (39%), Positives = 63/112 (56%)
 Frame = +1

Query: 46   PGDRCEIVGWGSKLYSGKNISHILQKATLIILSNRECAKLYDSGIIERNKLCAGSGPLST 225
            PG  C I GWG+ +Y G   + +LQ+A + +LSN +C +      I  N +CAG      
Sbjct: 922  PGRICSIAGWGALIYQGST-ADVLQEADVPLLSNEKCQQQMPEYNITENMVCAGYEAGGV 980

Query: 226  NSCAGDSGGPLFFKDGLNQNIQIGINSYGSRPCGLPNVPSVFTRVAHVIRWI 381
            +SC GDSGGPL  ++  N+ +  G+ S+G + C LPN P V+ RV     WI
Sbjct: 981  DSCQGDSGGPLMCQEN-NRWLLAGVTSFGYQ-CALPNRPGVYARVPRFTEWI 1030

>sp|Q9UKR3|KLK13_HUMAN Kallikrein 13 precursor (Kallikrein-like protein 4) (KLK-L4)
          Length = 277

 Score = 82.8 bits (203), Expect = 4e-16
 Identities = 46/112 (41%), Positives = 60/112 (53%)
 Frame = +1

Query: 46  PGDRCEIVGWGSKLYSGKNISHILQKATLIILSNRECAKLYDSGIIERNKLCAGSGPLST 225
           PG  C + GWG+      N    LQ A + + S+ EC ++Y  G I  N LCAG+     
Sbjct: 153 PGTTCRVSGWGTTTSPQVNYPKTLQCANIQLRSDEECRQVYP-GKITDNMLCAGTKEGGK 211

Query: 226 NSCAGDSGGPLFFKDGLNQNIQIGINSYGSRPCGLPNVPSVFTRVAHVIRWI 381
           +SC GDSGGPL     L      GI S+G  PCG P+ P V+TRV+  + WI
Sbjct: 212 DSCEGDSGGPLVCNRTL-----YGIVSWGDFPCGQPDRPGVYTRVSRYVLWI 258

>sp|P98074|ENTK_PIG Enteropeptidase precursor (Enterokinase) [Contains: Enteropeptidase
            non-catalytic mini chain; Enteropeptidase non-catalytic
            heavy chain; Enteropeptidase catalytic light chain]
          Length = 1034

 Score = 81.3 bits (199), Expect = 1e-15
 Identities = 45/112 (40%), Positives = 61/112 (54%)
 Frame = +1

Query: 46   PGDRCEIVGWGSKLYSGKNISHILQKATLIILSNRECAKLYDSGIIERNKLCAGSGPLST 225
            PG  C I GWG  +Y G   + ILQ+A + +LSN +C +      I  N +CAG      
Sbjct: 921  PGRICSIAGWGKVIYQGSP-ADILQEADVPLLSNEKCQQQMPEYNITENMMCAGYEEGGI 979

Query: 226  NSCAGDSGGPLFFKDGLNQNIQIGINSYGSRPCGLPNVPSVFTRVAHVIRWI 381
            +SC GDSGGPL   +  N+ +  G+ S+G + C LPN P V+ RV     WI
Sbjct: 980  DSCQGDSGGPLMCLEN-NRWLLAGVTSFGYQ-CALPNRPGVYARVPKFTEWI 1029

>sp|Q7Z410|TMPS9_HUMAN Transmembrane protease, serine 9 (Polyserase-1) (Polyserase-I)
            (Polyserine protease-1) [Contains: Serase-1; Serase-2;
            Serase-3]
          Length = 1059

 Score = 80.9 bits (198), Expect = 1e-15
 Identities = 46/111 (41%), Positives = 65/111 (58%)
 Frame = +1

Query: 49   GDRCEIVGWGSKLYSGKNISHILQKATLIILSNRECAKLYDSGIIERNKLCAGSGPLSTN 228
            G RC I GWGS +  G +++  LQKA + +LS + C + Y   I  R  LCAG      +
Sbjct: 946  GTRCVITGWGS-VREGGSMARQLQKAAVRLLSEQTCRRFYPVQISSR-MLCAGFPQGGVD 1003

Query: 229  SCAGDSGGPLFFKDGLNQNIQIGINSYGSRPCGLPNVPSVFTRVAHVIRWI 381
            SC+GD+GGPL  ++   + +  G+ S+G   CG P+ P V+TRVA V  WI
Sbjct: 1004 SCSGDAGGPLACREPSGRWVLTGVTSWG-YGCGRPHFPGVYTRVAAVRGWI 1053

 Score = 67.8 bits (164), Expect = 1e-11
 Identities = 39/112 (34%), Positives = 58/112 (51%)
 Frame = +1

Query: 49  GDRCEIVGWGSKLYSGKNISHILQKATLIILSNRECAKLYDSGIIERNKLCAGSGPLSTN 228
           G +C I GWG+          +LQKA++ I+  + C+ LY+  + +R  +CAG      +
Sbjct: 623 GRKCMISGWGNTQEGNATKPELLQKASVGIIDQKTCSVLYNFSLTDR-MICAGFLEGKVD 681

Query: 229 SCAGDSGGPLFFKDGLNQNIQIGINSYGSRPCGLPNVPSVFTRVAHVIRWIL 384
           SC GDSGGPL  ++        GI S+G   C     P V+TR+  +  WIL
Sbjct: 682 SCQGDSGGPLACEEAPGVFYLAGIVSWGI-GCAQVKKPGVYTRITRLKGWIL 732

 Score = 65.1 bits (157), Expect = 8e-11
 Identities = 41/113 (36%), Positives = 55/113 (48%)
 Frame = +1

Query: 46  PGDRCEIVGWGSKLYSGKNISHILQKATLIILSNRECAKLYDSGIIERNKLCAGSGPLST 225
           P  +C I GWG           +LQKAT+ +L    CA LY   + +R  +CAG      
Sbjct: 322 PSKKCLISGWGYLKEDFLVKPEVLQKATVELLDQALCASLYGHSLTDR-MVCAGYLDGKV 380

Query: 226 NSCAGDSGGPLFFKDGLNQNIQIGINSYGSRPCGLPNVPSVFTRVAHVIRWIL 384
           +SC GDSGGPL  ++   +    GI S+G   C     P V+ RV  +  WIL
Sbjct: 381 DSCQGDSGGPLVCEEPSGRFFLAGIVSWGI-GCAEARRPGVYARVTRLRDWIL 432

>sp|P26262|KLKB1_MOUSE Plasma kallikrein precursor (Plasma prekallikrein) (Kininogenin)
           (Fletcher factor) [Contains: Plasma kallikrein heavy
           chain; Plasma kallikrein light chain]
          Length = 638

 Score = 80.5 bits (197), Expect = 2e-15
 Identities = 43/109 (39%), Positives = 62/109 (56%)
 Frame = +1

Query: 58  CEIVGWGSKLYSGKNISHILQKATLIILSNRECAKLYDSGIIERNKLCAGSGPLSTNSCA 237
           C + GWG     G+   +ILQKAT+ ++ N EC K Y   +I +  +CAG     T++C 
Sbjct: 517 CWVTGWGYTKEQGET-QNILQKATIPLVPNEECQKKYRDYVINKQMICAGYKEGGTDACK 575

Query: 238 GDSGGPLFFKDGLNQNIQIGINSYGSRPCGLPNVPSVFTRVAHVIRWIL 384
           GDSGGPL  K      + +GI S+G   CG  + P V+T+V+  + WIL
Sbjct: 576 GDSGGPLVCKHSGRWQL-VGITSWG-EGCGRKDQPGVYTKVSEYMDWIL 622

>sp|P97435|ENTK_MOUSE Enteropeptidase (Enterokinase) [Contains: Enteropeptidase
            non-catalytic heavy chain; Enteropeptidase catalytic
            light chain]
          Length = 1069

 Score = 80.5 bits (197), Expect = 2e-15
 Identities = 42/114 (36%), Positives = 66/114 (57%), Gaps = 1/114 (0%)
 Frame = +1

Query: 43   VPGDRCEIVGWG-SKLYSGKNISHILQKATLIILSNRECAKLYDSGIIERNKLCAGSGPL 219
            +PG  C I GWG  K+ +G  +  +L++A + ++SN +C +      I  + +CAG    
Sbjct: 954  IPGRTCSIAGWGYDKINAGSTVD-VLKEADVPLISNEKCQQQLPEYNITESMICAGYEEG 1012

Query: 220  STNSCAGDSGGPLFFKDGLNQNIQIGINSYGSRPCGLPNVPSVFTRVAHVIRWI 381
              +SC GDSGGPL  ++  N+   +G+ S+G + C LPN P V+ RV+  I WI
Sbjct: 1013 GIDSCQGDSGGPLMCQEN-NRWFLVGVTSFGVQ-CALPNHPGVYVRVSQFIEWI 1064

  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 60,374,481
Number of Sequences: 369166
Number of extensions: 1325234
Number of successful extensions: 3712
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3129
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3208
length of database: 68,354,980
effective HSP length: 101
effective length of database: 49,696,745
effective search space used: 2783017720
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
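
Note on the scores: each alignment reports a bit score followed by a raw score in parentheses, for example "85.5 bits (210)". The two are usually related by the standard Karlin-Altschul conversion, bit score = (Lambda * raw - ln K) / ln 2. The sketch below is a minimal illustration in Python, assuming the gapped Lambda and K printed in the statistics block are the ones that apply to these alignments. The function name `bit_score` is hypothetical and is not part of BLAST.

```python
import math

# Gapped Karlin-Altschul parameters copied from the statistics block
# of this report (assumption: these apply to the alignments shown).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score.

    Uses the standard conversion S' = (lambda * S - ln K) / ln 2.
    """
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores taken from the parentheses in the alignment headers.
    for raw in (210, 209, 197, 165):
        print(f"raw {raw} -> {bit_score(raw):.1f} bits")
```

With the rounded Lambda and K shown in the report, the computed values should agree with the printed bit scores to about one decimal place; small differences are expected because the report prints the parameters with limited precision.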