Planarian EST Database


Dr_sW_028_J02

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_028_J02
         (589 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q9UKR3|KLK13_HUMAN  Kallikrein 13 precursor (Kallikrein-l...   101   1e-21
sp|Q91VE3|KLK7_MOUSE  Kallikrein-7 precursor (Stratum corneu...    96   5e-20
sp|P69525|TMPS9_MOUSE  Transmembrane protease, serine 9 (Pol...    95   1e-19
sp|P69526|TMPS9_RAT  Transmembrane protease, serine 9 (Polys...    95   1e-19
sp|Q61955|NRPN_MOUSE  Neuropsin precursor (NP) (Kallikrein 8)      94   3e-19
sp|O88780|NRPN_RAT  Neuropsin precursor (NP) (Kallikrein-8) ...    93   6e-19
sp|P98073|ENTK_HUMAN  Enteropeptidase precursor (Enterokinas...    92   7e-19
sp|P26262|KLKB1_MOUSE  Plasma kallikrein precursor (Plasma p...    92   7e-19
sp|Q7Z410|TMPS9_HUMAN  Transmembrane protease, serine 9 (Pol...    91   2e-18
sp|P97435|ENTK_MOUSE  Enteropeptidase (Enterokinase) [Contai...    90   4e-18
>sp|Q9UKR3|KLK13_HUMAN Kallikrein 13 precursor (Kallikrein-like protein 4) (KLK-L4)
          Length = 277

 Score =  101 bits (252), Expect = 1e-21
 Identities = 61/166 (36%), Positives = 83/166 (50%), Gaps = 1/166 (0%)
 Frame = +2

Query: 20  QVTKTIIHPSYRHEYAYLE-GNDVALIKFDGYISKNQYDYIDYNELDVNAISKPGDRCEV 196
           +V  +I HP YR    +L   +D+ L++    +      YI    L  N    PG  C V
Sbjct: 102 EVVHSIPHPEYRRSPTHLNHDHDIMLLELQSPVQLT--GYIQTLPLSHNNRLTPGTTCRV 159

Query: 197 VGWGSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDS 376
            GWG+   P  +  + LQ A I + S+ EC +++       + +CAG+      SC GDS
Sbjct: 160 SGWGTTTSPQVNYPKTLQCANIQLRSDEECRQVYPG-KITDNMLCAGTKEGGKDSCEGDS 218

Query: 377 GGPLFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWIR 514
           GGPL C     +    GIVS+G  PCGQP+ P VYTRVS    WIR
Sbjct: 219 GGPLVC-----NRTLYGIVSWGDFPCGQPDRPGVYTRVSRYVLWIR 259
>sp|Q91VE3|KLK7_MOUSE Kallikrein-7 precursor (Stratum corneum chymotryptic enzyme)
           (Thymopsin)
          Length = 249

 Score = 96.3 bits (238), Expect = 5e-20
 Identities = 54/164 (32%), Positives = 87/164 (53%)
 Frame = +2

Query: 20  QVTKTIIHPSYRHEYAYLEGNDVALIKFDGYISKNQYDYIDYNELDVNAISKPGDRCEVV 199
           + TK+  HP Y  +      ND+ L++ D  +  +    ++  +L  +    PG  C V 
Sbjct: 90  KATKSFRHPGYSTK---THVNDIMLVRLDEPVKMSSK--VEAVQLPEHC-EPPGTSCTVS 143

Query: 200 GWGSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDSG 379
           GWG+   P       L  + + +IS+ EC +++ ++   ++ +CAG     T++C GDSG
Sbjct: 144 GWGTTTSPDVTFPSDLMCSDVKLISSRECKKVYKDLLG-KTMLCAGIPDSKTNTCNGDSG 202

Query: 380 GPLFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWI 511
           GPL C D  +     G+VS+GT PCGQPN P VYT+V +   W+
Sbjct: 203 GPLVCNDTLQ-----GLVSWGTYPCGQPNDPGVYTQVCKYKRWV 241
>sp|P69525|TMPS9_MOUSE Transmembrane protease, serine 9 (Polyserase-1) (Polyserine
            protease-1) (Polyserase-I) [Contains: Serase-1; Serase-2;
            Serase-3]
          Length = 1065

 Score = 94.7 bits (234), Expect = 1e-19
 Identities = 58/161 (36%), Positives = 85/161 (52%), Gaps = 4/161 (2%)
 Frame = +2

Query: 50   YRHEY--AYLEGNDVALIKFDGYISKNQYDYIDYNELDVNAISKPGD--RCEVVGWGSML 217
            YRH +   Y    DVAL++  G + +++        + +   ++P D  RC + GWGS L
Sbjct: 909  YRHPFYNIYTLDYDVALLELAGPVRRSRL----VRPICLPGPARPPDGARCVITGWGS-L 963

Query: 218  PPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDSGGPLFCT 397
              G   +R LQKAA+ ++S   C R +  V      +CAG  +    SC GD+GGPL C 
Sbjct: 964  REGGSMARQLQKAAVRVLSEQTCRRFYP-VQISSRMLCAGFPQGGVDSCSGDAGGPLACR 1022

Query: 398  DGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWIRAN 520
            +     +  G+ S+G   CG+P+ P VYTRV+ V GWI  N
Sbjct: 1023 EPSGQWVLTGVTSWG-YGCGRPHFPGVYTRVAAVLGWIGQN 1062

 Score = 78.6 bits (192), Expect = 1e-14
 Identities = 41/111 (36%), Positives = 57/111 (51%)
 Frame = +2

Query: 179 GDRCEVVGWGSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTS 358
           G +C + GWG+M          LQKA++ II    CG ++ N +     +CAG       
Sbjct: 625 GRKCMISGWGNMQEGNATKPDILQKASVGIIEQKMCGALY-NFSLTDRMLCAGFLEGRVD 683

Query: 359 SCRGDSGGPLFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWI 511
           SC+GDSGGPL C +        GIVS+G + C Q   P VY R++ +  WI
Sbjct: 684 SCQGDSGGPLACEETPGVFYLAGIVSWG-IGCAQAKKPGVYARITRLKDWI 733

 Score = 76.3 bits (186), Expect = 5e-14
 Identities = 40/112 (35%), Positives = 57/112 (50%)
 Frame = +2

Query: 176 PGDRCEVVGWGSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLST 355
           PG +C + GWG +          LQKA + ++  S C  ++ +    R  +CAG      
Sbjct: 324 PGKKCLISGWGYLKEDFLVKPEVLQKATVELLDQSLCSSLYGHSLTDRM-VCAGYLDGKV 382

Query: 356 SSCRGDSGGPLFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWI 511
            SC+GDSGGPL C +        GIVS+G + C +   P VYTRV+ +  WI
Sbjct: 383 DSCQGDSGGPLVCEEPSGRFFLAGIVSWG-IGCAEARRPGVYTRVTRLRDWI 433
>sp|P69526|TMPS9_RAT Transmembrane protease, serine 9 (Polyserase-1) (Polyserine
            protease-1) (Polyserase-I) [Contains: Serase-1; Serase-2;
            Serase-3]
          Length = 1061

 Score = 94.7 bits (234), Expect = 1e-19
 Identities = 58/161 (36%), Positives = 85/161 (52%), Gaps = 4/161 (2%)
 Frame = +2

Query: 50   YRHEY--AYLEGNDVALIKFDGYISKNQYDYIDYNELDVNAISKP--GDRCEVVGWGSML 217
            YRH +   Y    DVAL++  G + +++        + +   ++P  G RC + GWGS L
Sbjct: 905  YRHPFYNIYTLDYDVALLELAGPVRRSRL----VRPICLPGPTRPPEGARCVITGWGS-L 959

Query: 218  PPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDSGGPLFCT 397
              G   +R LQKAA+ ++S   C R +  V      +CAG  +    SC GD+GGPL C 
Sbjct: 960  REGGSMARQLQKAAVRVLSEQTCRRFYP-VQISSRMLCAGFPQGGVDSCSGDAGGPLACR 1018

Query: 398  DGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWIRAN 520
            +     +  G+ S+G   CG+P+ P VYTRV+ V GWI  N
Sbjct: 1019 EPSGQWVLTGVTSWG-YGCGRPHFPGVYTRVAAVLGWIGQN 1058

 Score = 78.6 bits (192), Expect = 1e-14
 Identities = 41/111 (36%), Positives = 57/111 (51%)
 Frame = +2

Query: 179 GDRCEVVGWGSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTS 358
           G +C + GWG+M          LQKA++ II    CG ++ N +     +CAG       
Sbjct: 625 GRKCMISGWGNMQEGNATKPDILQKASVGIIEQKMCGALY-NFSLTDRMLCAGFLEGRVD 683

Query: 359 SCRGDSGGPLFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWI 511
           SC+GDSGGPL C +        GIVS+G + C Q   P VY R++ +  WI
Sbjct: 684 SCQGDSGGPLACEETPGVFYLAGIVSWG-IGCAQAKKPGVYARITRLKDWI 733

 Score = 71.6 bits (174), Expect = 1e-12
 Identities = 37/112 (33%), Positives = 56/112 (50%)
 Frame = +2

Query: 176 PGDRCEVVGWGSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLST 355
           P  +C + GWG +          LQKA + ++  + C  ++ +    R  +CAG      
Sbjct: 324 PRKKCLISGWGYLKEDFLVKPEVLQKATVELLDQNLCSSLYGHSLTDRM-VCAGYLDGKV 382

Query: 356 SSCRGDSGGPLFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWI 511
            SC+GDSGGPL C +        G+VS+G + C +   P VYTRV+ +  WI
Sbjct: 383 DSCQGDSGGPLVCEEPSGRFFLAGVVSWG-IGCAEARRPGVYTRVTRLRDWI 433
>sp|Q61955|NRPN_MOUSE Neuropsin precursor (NP) (Kallikrein 8)
          Length = 260

 Score = 94.0 bits (232), Expect = 3e-19
 Identities = 56/165 (33%), Positives = 83/165 (50%)
 Frame = +2

Query: 20  QVTKTIIHPSYRHEYAYLEGNDVALIKFDGYISKNQYDYIDYNELDVNAISKPGDRCEVV 199
           QV ++I HP Y +       +D+ LI+     S N  D +   +L  N   K G +C + 
Sbjct: 99  QVAQSIQHPCYNNSNPEDHSHDIMLIRLQN--SANLGDKVKPVQL-ANLCPKVGQKCIIS 155

Query: 200 GWGSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDSG 379
           GWG++  P ++    L  A + I S ++C R +         +CAGS      +C+GDSG
Sbjct: 156 GWGTVTSPQENFPNTLNCAEVKIYSQNKCERAYPG-KITEGMVCAGSSN-GADTCQGDSG 213

Query: 380 GPLFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWIR 514
           GPL C DG       GI S+G+ PCG+P  P VYT++     WI+
Sbjct: 214 GPLVC-DGMLQ----GITSWGSDPCGKPEKPGVYTKICRYTTWIK 253
>sp|O88780|NRPN_RAT Neuropsin precursor (NP) (Kallikrein-8) (Brain serine protease 1)
          Length = 260

 Score = 92.8 bits (229), Expect = 6e-19
 Identities = 53/165 (32%), Positives = 81/165 (49%)
 Frame = +2

Query: 20  QVTKTIIHPSYRHEYAYLEGNDVALIKFDGYISKNQYDYIDYNELDVNAISKPGDRCEVV 199
           QV ++I HP +         +D+ LI+     S N  D +   EL  N   K G +C + 
Sbjct: 99  QVARSIQHPCFNSSNPEDHSHDIMLIRLQN--SANLGDKVKPIEL-ANLCPKVGQKCIIS 155

Query: 200 GWGSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDSG 379
           GWG++  P ++    L  A + I S ++C R +         +CAGS      +C+GDSG
Sbjct: 156 GWGTVTSPQENFPNTLNCAEVKIYSQNKCERAYPG-KITEGMVCAGSSN-GADTCQGDSG 213

Query: 380 GPLFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWIR 514
           GPL C       +  GI ++G+ PCG+P  P VYT++     WI+
Sbjct: 214 GPLVCNG-----VLQGITTWGSDPCGKPEKPGVYTKICRYTNWIK 253
>sp|P98073|ENTK_HUMAN Enteropeptidase precursor (Enterokinase) [Contains: Enteropeptidase
            non-catalytic heavy chain; Enteropeptidase catalytic
            light chain]
          Length = 1019

 Score = 92.4 bits (228), Expect = 7e-19
 Identities = 54/166 (32%), Positives = 85/166 (51%), Gaps = 1/166 (0%)
 Frame = +2

Query: 23   VTKTIIHPSYRHEYAYLEGNDVALIKFDGYISKNQYDYIDYNEL-DVNAISKPGDRCEVV 199
            + + +I+P Y       + ND+A++  +  +  N  DYI    L + N +  PG  C + 
Sbjct: 859  IDEIVINPHYNRRR---KDNDIAMMHLEFKV--NYTDYIQPICLPEENQVFPPGRNCSIA 913

Query: 200  GWGSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDSG 379
            GWG+++  G  T+  LQ+A + ++SN  C +         + ICAG       SC+GDSG
Sbjct: 914  GWGTVVYQGT-TANILQEADVPLLSNERCQQQMPEYNITENMICAGYEEGGIDSCQGDSG 972

Query: 380  GPLFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWIRA 517
            GPL C +  +  +  G+ S+G   C  PN P VY RVS    WI++
Sbjct: 973  GPLMCQENNRWFL-AGVTSFG-YKCALPNRPGVYARVSRFTEWIQS 1016
>sp|P26262|KLKB1_MOUSE Plasma kallikrein precursor (Plasma prekallikrein) (Kininogenin)
           (Fletcher factor) [Contains: Plasma kallikrein heavy
           chain; Plasma kallikrein light chain]
          Length = 638

 Score = 92.4 bits (228), Expect = 7e-19
 Identities = 61/167 (36%), Positives = 86/167 (51%), Gaps = 3/167 (1%)
 Frame = +2

Query: 20  QVTKTIIHPSYRHEYAYLEGN-DVALIKFDGYISKNQYD--YIDYNELDVNAISKPGDRC 190
           ++ + IIH     EY   EGN D+ALIK    ++  ++       ++ D N I      C
Sbjct: 465 RIKELIIH----QEYKVSEGNYDIALIKLQTPLNYTEFQKPICLPSKADTNTIYT---NC 517

Query: 191 EVVGWGSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRG 370
            V GWG     G+ T   LQKA I ++ N EC + + +    +  ICAG     T +C+G
Sbjct: 518 WVTGWGYTKEQGE-TQNILQKATIPLVPNEECQKKYRDYVINKQMICAGYKEGGTDACKG 576

Query: 371 DSGGPLFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWI 511
           DSGGPL C    +  + VGI S+G   CG+ + P VYT+VSE   WI
Sbjct: 577 DSGGPLVCKHSGRWQL-VGITSWGE-GCGRKDQPGVYTKVSEYMDWI 621
>sp|Q7Z410|TMPS9_HUMAN Transmembrane protease, serine 9 (Polyserase-1) (Polyserase-I)
            (Polyserine protease-1) [Contains: Serase-1; Serase-2;
            Serase-3]
          Length = 1059

 Score = 90.9 bits (224), Expect = 2e-18
 Identities = 59/172 (34%), Positives = 84/172 (48%), Gaps = 8/172 (4%)
 Frame = +2

Query: 20   QVTKTIIHPSYRHEYAYLEGNDVALIKFDGYISKNQYDYIDYNELDVNAISKP------- 178
            +V +   HP Y     Y    DVAL++  G + +++          V  I  P       
Sbjct: 897  RVARIYKHPFYN---LYTLDYDVALLELAGPVRRSRL---------VRPICLPEPAPRPP 944

Query: 179  -GDRCEVVGWGSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLST 355
             G RC + GWGS+   G   +R LQKAA+ ++S   C R +  V      +CAG  +   
Sbjct: 945  DGTRCVITGWGSVREGGS-MARQLQKAAVRLLSEQTCRRFYP-VQISSRMLCAGFPQGGV 1002

Query: 356  SSCRGDSGGPLFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWI 511
             SC GD+GGPL C +     +  G+ S+G   CG+P+ P VYTRV+ V GWI
Sbjct: 1003 DSCSGDAGGPLACREPSGRWVLTGVTSWG-YGCGRPHFPGVYTRVAAVRGWI 1053

 Score = 80.9 bits (198), Expect = 2e-15
 Identities = 50/161 (31%), Positives = 76/161 (47%)
 Frame = +2

Query: 29   KTIIHPSYRHEYAYLEGNDVALIKFDGYISKNQYDYIDYNELDVNAISKPGDRCEVVGWG 208
            + ++HP Y          D+A+++    ++ N+Y       L +      G +C + GWG
Sbjct: 577  RVVLHPLYNPGILDF---DLAVLELASPLAFNKYIQPVCLPLAIQKFPV-GRKCMISGWG 632

Query: 209  SMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDSGGPL 388
            +           LQKA++ II    C  ++ N +     ICAG       SC+GDSGGPL
Sbjct: 633  NTQEGNATKPELLQKASVGIIDQKTCSVLY-NFSLTDRMICAGFLEGKVDSCQGDSGGPL 691

Query: 389  FCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWI 511
             C +        GIVS+G + C Q   P VYTR++ + GWI
Sbjct: 692  ACEEAPGVFYLAGIVSWG-IGCAQVKKPGVYTRITRLKGWI 731

 Score = 71.2 bits (173), Expect = 2e-12
 Identities = 38/115 (33%), Positives = 56/115 (48%)
 Frame = +2

Query: 167 ISKPGDRCEVVGWGSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGR 346
           I  P  +C + GWG +          LQKA + ++  + C  ++ +    R  +CAG   
Sbjct: 319 IFPPSKKCLISGWGYLKEDFLVKPEVLQKATVELLDQALCASLYGHSLTDRM-VCAGYLD 377

Query: 347 LSTSSCRGDSGGPLFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWI 511
               SC+GDSGGPL C +        GIVS+G + C +   P VY RV+ +  WI
Sbjct: 378 GKVDSCQGDSGGPLVCEEPSGRFFLAGIVSWG-IGCAEARRPGVYARVTRLRDWI 431
>sp|P97435|ENTK_MOUSE Enteropeptidase (Enterokinase) [Contains: Enteropeptidase
            non-catalytic heavy chain; Enteropeptidase catalytic
            light chain]
          Length = 1069

 Score = 90.1 bits (222), Expect = 4e-18
 Identities = 58/166 (34%), Positives = 82/166 (49%), Gaps = 1/166 (0%)
 Frame = +2

Query: 23   VTKTIIHPSYRHEYAYLEGNDVALIKFDGYISKNQYDYIDYNEL-DVNAISKPGDRCEVV 199
            V + +I+P Y         ND+A++  +  +  N  DYI    L + N I  PG  C + 
Sbjct: 908  VDQIVINPHYDRRRKV---NDIAMMHLEFKV--NYTDYIQPICLPEENQIFIPGRTCSIA 962

Query: 200  GWGSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDSG 379
            GWG        T   L++A + +ISN +C +         S ICAG       SC+GDSG
Sbjct: 963  GWGYDKINAGSTVDVLKEADVPLISNEKCQQQLPEYNITESMICAGYEEGGIDSCQGDSG 1022

Query: 380  GPLFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWIRA 517
            GPL C +  +  + VG+ S+G V C  PN P VY RVS+   WI +
Sbjct: 1023 GPLMCQENNRWFL-VGVTSFG-VQCALPNHPGVYVRVSQFIEWIHS 1066
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 67,603,285
Number of Sequences: 369166
Number of extensions: 1424460
Number of successful extensions: 4964
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4247
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4358
length of database: 68,354,980
effective HSP length: 105
effective length of database: 48,957,805
effective search space used: 4406202450
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)