Planarian EST Database


Dr_sW_001_K01

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_001_K01
         (553 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q9UKR3|KLK13_HUMAN  Kallikrein 13 precursor (Kallikrein-l...   100   4e-21
sp|Q91VE3|KLK7_MOUSE  Kallikrein-7 precursor (Stratum corneu...    96   6e-20
sp|P69525|TMPS9_MOUSE  Transmembrane protease, serine 9 (Pol...    95   1e-19
sp|P69526|TMPS9_RAT  Transmembrane protease, serine 9 (Polys...    95   1e-19
sp|P98073|ENTK_HUMAN  Enteropeptidase precursor (Enterokinas...    92   1e-18
sp|P26262|KLKB1_MOUSE  Plasma kallikrein precursor (Plasma p...    91   1e-18
sp|Q61955|NRPN_MOUSE  Neuropsin precursor (NP) (Kallikrein 8)      91   2e-18
sp|Q7Z410|TMPS9_HUMAN  Transmembrane protease, serine 9 (Pol...    90   3e-18
sp|O88780|NRPN_RAT  Neuropsin precursor (NP) (Kallikrein-8) ...    89   5e-18
sp|P97435|ENTK_MOUSE  Enteropeptidase (Enterokinase) [Contai...    89   7e-18
>sp|Q9UKR3|KLK13_HUMAN Kallikrein 13 precursor (Kallikrein-like protein 4) (KLK-L4)
          Length = 277

 Score = 99.8 bits (247), Expect = 4e-21
 Identities = 60/162 (37%), Positives = 81/162 (50%), Gaps = 1/162 (0%)
 Frame = +3

Query: 9   TIIHPSYRHEYAYLE-GNDVALIKFDGYISKNQYDYIDYNELDVNAISKPGDRCEVVGWG 185
           +I HP YR    +L   +D+ L++    +      YI    L  N    PG  C V GWG
Sbjct: 106 SIPHPEYRRSPTHLNHDHDIMLLELQSPVQLT--GYIQTLPLSHNNRLTPGTTCRVSGWG 163

Query: 186 SMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDSGGPL 365
           +   P  +  + LQ A I + S+ EC +++       + +CAG+      SC GDSGGPL
Sbjct: 164 TTTSPQVNYPKTLQCANIQLRSDEECRQVYPG-KITDNMLCAGTKEGGKDSCEGDSGGPL 222

Query: 366 FCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWIR 491
            C     +    GIVS+G  PCGQP+ P VYTRVS    WIR
Sbjct: 223 VC-----NRTLYGIVSWGDFPCGQPDRPGVYTRVSRYVLWIR 259
>sp|Q91VE3|KLK7_MOUSE Kallikrein-7 precursor (Stratum corneum chymotryptic enzyme)
           (Thymopsin)
          Length = 249

 Score = 95.9 bits (237), Expect = 6e-20
 Identities = 54/162 (33%), Positives = 86/162 (53%)
 Frame = +3

Query: 3   TKTIIHPSYRHEYAYLEGNDVALIKFDGYISKNQYDYIDYNELDVNAISKPGDRCEVVGW 182
           TK+  HP Y  +      ND+ L++ D  +  +    ++  +L  +    PG  C V GW
Sbjct: 92  TKSFRHPGYSTK---THVNDIMLVRLDEPVKMSSK--VEAVQLPEHC-EPPGTSCTVSGW 145

Query: 183 GSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDSGGP 362
           G+   P       L  + + +IS+ EC +++ ++   ++ +CAG     T++C GDSGGP
Sbjct: 146 GTTTSPDVTFPSDLMCSDVKLISSRECKKVYKDLLG-KTMLCAGIPDSKTNTCNGDSGGP 204

Query: 363 LFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWI 488
           L C D  +     G+VS+GT PCGQPN P VYT+V +   W+
Sbjct: 205 LVCNDTLQ-----GLVSWGTYPCGQPNDPGVYTQVCKYKRWV 241
>sp|P69525|TMPS9_MOUSE Transmembrane protease, serine 9 (Polyserase-1) (Polyserine
            protease-1) (Polyserase-I) [Contains: Serase-1; Serase-2;
            Serase-3]
          Length = 1065

 Score = 94.7 bits (234), Expect = 1e-19
 Identities = 58/161 (36%), Positives = 85/161 (52%), Gaps = 4/161 (2%)
 Frame = +3

Query: 27   YRHEY--AYLEGNDVALIKFDGYISKNQYDYIDYNELDVNAISKPGD--RCEVVGWGSML 194
            YRH +   Y    DVAL++  G + +++        + +   ++P D  RC + GWGS L
Sbjct: 909  YRHPFYNIYTLDYDVALLELAGPVRRSRL----VRPICLPGPARPPDGARCVITGWGS-L 963

Query: 195  PPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDSGGPLFCT 374
              G   +R LQKAA+ ++S   C R +  V      +CAG  +    SC GD+GGPL C 
Sbjct: 964  REGGSMARQLQKAAVRVLSEQTCRRFYP-VQISSRMLCAGFPQGGVDSCSGDAGGPLACR 1022

Query: 375  DGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWIRAN 497
            +     +  G+ S+G   CG+P+ P VYTRV+ V GWI  N
Sbjct: 1023 EPSGQWVLTGVTSWG-YGCGRPHFPGVYTRVAAVLGWIGQN 1062

 Score = 78.6 bits (192), Expect = 1e-14
 Identities = 41/111 (36%), Positives = 57/111 (51%)
 Frame = +3

Query: 156 GDRCEVVGWGSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTS 335
           G +C + GWG+M          LQKA++ II    CG ++ N +     +CAG       
Sbjct: 625 GRKCMISGWGNMQEGNATKPDILQKASVGIIEQKMCGALY-NFSLTDRMLCAGFLEGRVD 683

Query: 336 SCRGDSGGPLFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWI 488
           SC+GDSGGPL C +        GIVS+G + C Q   P VY R++ +  WI
Sbjct: 684 SCQGDSGGPLACEETPGVFYLAGIVSWG-IGCAQAKKPGVYARITRLKDWI 733

 Score = 76.3 bits (186), Expect = 5e-14
 Identities = 40/112 (35%), Positives = 57/112 (50%)
 Frame = +3

Query: 153 PGDRCEVVGWGSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLST 332
           PG +C + GWG +          LQKA + ++  S C  ++ +    R  +CAG      
Sbjct: 324 PGKKCLISGWGYLKEDFLVKPEVLQKATVELLDQSLCSSLYGHSLTDRM-VCAGYLDGKV 382

Query: 333 SSCRGDSGGPLFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWI 488
            SC+GDSGGPL C +        GIVS+G + C +   P VYTRV+ +  WI
Sbjct: 383 DSCQGDSGGPLVCEEPSGRFFLAGIVSWG-IGCAEARRPGVYTRVTRLRDWI 433
>sp|P69526|TMPS9_RAT Transmembrane protease, serine 9 (Polyserase-1) (Polyserine
            protease-1) (Polyserase-I) [Contains: Serase-1; Serase-2;
            Serase-3]
          Length = 1061

 Score = 94.7 bits (234), Expect = 1e-19
 Identities = 58/161 (36%), Positives = 85/161 (52%), Gaps = 4/161 (2%)
 Frame = +3

Query: 27   YRHEY--AYLEGNDVALIKFDGYISKNQYDYIDYNELDVNAISKP--GDRCEVVGWGSML 194
            YRH +   Y    DVAL++  G + +++        + +   ++P  G RC + GWGS L
Sbjct: 905  YRHPFYNIYTLDYDVALLELAGPVRRSRL----VRPICLPGPTRPPEGARCVITGWGS-L 959

Query: 195  PPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDSGGPLFCT 374
              G   +R LQKAA+ ++S   C R +  V      +CAG  +    SC GD+GGPL C 
Sbjct: 960  REGGSMARQLQKAAVRVLSEQTCRRFYP-VQISSRMLCAGFPQGGVDSCSGDAGGPLACR 1018

Query: 375  DGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWIRAN 497
            +     +  G+ S+G   CG+P+ P VYTRV+ V GWI  N
Sbjct: 1019 EPSGQWVLTGVTSWG-YGCGRPHFPGVYTRVAAVLGWIGQN 1058

 Score = 78.6 bits (192), Expect = 1e-14
 Identities = 41/111 (36%), Positives = 57/111 (51%)
 Frame = +3

Query: 156 GDRCEVVGWGSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTS 335
           G +C + GWG+M          LQKA++ II    CG ++ N +     +CAG       
Sbjct: 625 GRKCMISGWGNMQEGNATKPDILQKASVGIIEQKMCGALY-NFSLTDRMLCAGFLEGRVD 683

Query: 336 SCRGDSGGPLFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWI 488
           SC+GDSGGPL C +        GIVS+G + C Q   P VY R++ +  WI
Sbjct: 684 SCQGDSGGPLACEETPGVFYLAGIVSWG-IGCAQAKKPGVYARITRLKDWI 733

 Score = 71.6 bits (174), Expect = 1e-12
 Identities = 37/112 (33%), Positives = 56/112 (50%)
 Frame = +3

Query: 153 PGDRCEVVGWGSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLST 332
           P  +C + GWG +          LQKA + ++  + C  ++ +    R  +CAG      
Sbjct: 324 PRKKCLISGWGYLKEDFLVKPEVLQKATVELLDQNLCSSLYGHSLTDRM-VCAGYLDGKV 382

Query: 333 SSCRGDSGGPLFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWI 488
            SC+GDSGGPL C +        G+VS+G + C +   P VYTRV+ +  WI
Sbjct: 383 DSCQGDSGGPLVCEEPSGRFFLAGVVSWG-IGCAEARRPGVYTRVTRLRDWI 433
>sp|P98073|ENTK_HUMAN Enteropeptidase precursor (Enterokinase) [Contains: Enteropeptidase
            non-catalytic heavy chain; Enteropeptidase catalytic
            light chain]
          Length = 1019

 Score = 91.7 bits (226), Expect = 1e-18
 Identities = 54/162 (33%), Positives = 83/162 (51%), Gaps = 1/162 (0%)
 Frame = +3

Query: 12   IIHPSYRHEYAYLEGNDVALIKFDGYISKNQYDYIDYNEL-DVNAISKPGDRCEVVGWGS 188
            +I+P Y       + ND+A++  +  +  N  DYI    L + N +  PG  C + GWG+
Sbjct: 863  VINPHYNRRR---KDNDIAMMHLEFKV--NYTDYIQPICLPEENQVFPPGRNCSIAGWGT 917

Query: 189  MLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDSGGPLF 368
            ++  G  T+  LQ+A + ++SN  C +         + ICAG       SC+GDSGGPL 
Sbjct: 918  VVYQGT-TANILQEADVPLLSNERCQQQMPEYNITENMICAGYEEGGIDSCQGDSGGPLM 976

Query: 369  CTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWIRA 494
            C +  +  +  G+ S+G   C  PN P VY RVS    WI++
Sbjct: 977  CQENNRWFL-AGVTSFG-YKCALPNRPGVYARVSRFTEWIQS 1016
>sp|P26262|KLKB1_MOUSE Plasma kallikrein precursor (Plasma prekallikrein) (Kininogenin)
           (Fletcher factor) [Contains: Plasma kallikrein heavy
           chain; Plasma kallikrein light chain]
          Length = 638

 Score = 91.3 bits (225), Expect = 1e-18
 Identities = 61/162 (37%), Positives = 83/162 (51%), Gaps = 3/162 (1%)
 Frame = +3

Query: 12  IIHPSYRHEYAYLEGN-DVALIKFDGYISKNQYD--YIDYNELDVNAISKPGDRCEVVGW 182
           IIH     EY   EGN D+ALIK    ++  ++       ++ D N I      C V GW
Sbjct: 470 IIH----QEYKVSEGNYDIALIKLQTPLNYTEFQKPICLPSKADTNTIYT---NCWVTGW 522

Query: 183 GSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDSGGP 362
           G     G+ T   LQKA I ++ N EC + + +    +  ICAG     T +C+GDSGGP
Sbjct: 523 GYTKEQGE-TQNILQKATIPLVPNEECQKKYRDYVINKQMICAGYKEGGTDACKGDSGGP 581

Query: 363 LFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWI 488
           L C    +  + VGI S+G   CG+ + P VYT+VSE   WI
Sbjct: 582 LVCKHSGRWQL-VGITSWGE-GCGRKDQPGVYTKVSEYMDWI 621
>sp|Q61955|NRPN_MOUSE Neuropsin precursor (NP) (Kallikrein 8)
          Length = 260

 Score = 90.5 bits (223), Expect = 2e-18
 Identities = 54/162 (33%), Positives = 81/162 (50%)
 Frame = +3

Query: 6   KTIIHPSYRHEYAYLEGNDVALIKFDGYISKNQYDYIDYNELDVNAISKPGDRCEVVGWG 185
           ++I HP Y +       +D+ LI+     S N  D +   +L  N   K G +C + GWG
Sbjct: 102 QSIQHPCYNNSNPEDHSHDIMLIRLQN--SANLGDKVKPVQL-ANLCPKVGQKCIISGWG 158

Query: 186 SMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDSGGPL 365
           ++  P ++    L  A + I S ++C R +         +CAGS      +C+GDSGGPL
Sbjct: 159 TVTSPQENFPNTLNCAEVKIYSQNKCERAYPG-KITEGMVCAGSSN-GADTCQGDSGGPL 216

Query: 366 FCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWIR 491
            C DG       GI S+G+ PCG+P  P VYT++     WI+
Sbjct: 217 VC-DGMLQ----GITSWGSDPCGKPEKPGVYTKICRYTTWIK 253
>sp|Q7Z410|TMPS9_HUMAN Transmembrane protease, serine 9 (Polyserase-1) (Polyserase-I)
            (Polyserine protease-1) [Contains: Serase-1; Serase-2;
            Serase-3]
          Length = 1059

 Score = 90.1 bits (222), Expect = 3e-18
 Identities = 57/164 (34%), Positives = 82/164 (50%), Gaps = 10/164 (6%)
 Frame = +3

Query: 27   YRHEY--AYLEGNDVALIKFDGYISKNQYDYIDYNELDVNAISKP--------GDRCEVV 176
            Y+H +   Y    DVAL++  G + +++          V  I  P        G RC + 
Sbjct: 902  YKHPFYNLYTLDYDVALLELAGPVRRSRL---------VRPICLPEPAPRPPDGTRCVIT 952

Query: 177  GWGSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDSG 356
            GWGS+   G   +R LQKAA+ ++S   C R +  V      +CAG  +    SC GD+G
Sbjct: 953  GWGSVREGGS-MARQLQKAAVRLLSEQTCRRFYP-VQISSRMLCAGFPQGGVDSCSGDAG 1010

Query: 357  GPLFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWI 488
            GPL C +     +  G+ S+G   CG+P+ P VYTRV+ V GWI
Sbjct: 1011 GPLACREPSGRWVLTGVTSWG-YGCGRPHFPGVYTRVAAVRGWI 1053

 Score = 80.9 bits (198), Expect = 2e-15
 Identities = 50/161 (31%), Positives = 76/161 (47%)
 Frame = +3

Query: 6    KTIIHPSYRHEYAYLEGNDVALIKFDGYISKNQYDYIDYNELDVNAISKPGDRCEVVGWG 185
            + ++HP Y          D+A+++    ++ N+Y       L +      G +C + GWG
Sbjct: 577  RVVLHPLYNPGILDF---DLAVLELASPLAFNKYIQPVCLPLAIQKFPV-GRKCMISGWG 632

Query: 186  SMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDSGGPL 365
            +           LQKA++ II    C  ++ N +     ICAG       SC+GDSGGPL
Sbjct: 633  NTQEGNATKPELLQKASVGIIDQKTCSVLY-NFSLTDRMICAGFLEGKVDSCQGDSGGPL 691

Query: 366  FCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWI 488
             C +        GIVS+G + C Q   P VYTR++ + GWI
Sbjct: 692  ACEEAPGVFYLAGIVSWG-IGCAQVKKPGVYTRITRLKGWI 731

 Score = 71.2 bits (173), Expect = 2e-12
 Identities = 38/115 (33%), Positives = 56/115 (48%)
 Frame = +3

Query: 144 ISKPGDRCEVVGWGSMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGR 323
           I  P  +C + GWG +          LQKA + ++  + C  ++ +    R  +CAG   
Sbjct: 319 IFPPSKKCLISGWGYLKEDFLVKPEVLQKATVELLDQALCASLYGHSLTDRM-VCAGYLD 377

Query: 324 LSTSSCRGDSGGPLFCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWI 488
               SC+GDSGGPL C +        GIVS+G + C +   P VY RV+ +  WI
Sbjct: 378 GKVDSCQGDSGGPLVCEEPSGRFFLAGIVSWG-IGCAEARRPGVYARVTRLRDWI 431
>sp|O88780|NRPN_RAT Neuropsin precursor (NP) (Kallikrein-8) (Brain serine protease 1)
          Length = 260

 Score = 89.4 bits (220), Expect = 5e-18
 Identities = 51/162 (31%), Positives = 79/162 (48%)
 Frame = +3

Query: 6   KTIIHPSYRHEYAYLEGNDVALIKFDGYISKNQYDYIDYNELDVNAISKPGDRCEVVGWG 185
           ++I HP +         +D+ LI+     S N  D +   EL  N   K G +C + GWG
Sbjct: 102 RSIQHPCFNSSNPEDHSHDIMLIRLQN--SANLGDKVKPIEL-ANLCPKVGQKCIISGWG 158

Query: 186 SMLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDSGGPL 365
           ++  P ++    L  A + I S ++C R +         +CAGS      +C+GDSGGPL
Sbjct: 159 TVTSPQENFPNTLNCAEVKIYSQNKCERAYPG-KITEGMVCAGSSN-GADTCQGDSGGPL 216

Query: 366 FCTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWIR 491
            C       +  GI ++G+ PCG+P  P VYT++     WI+
Sbjct: 217 VCNG-----VLQGITTWGSDPCGKPEKPGVYTKICRYTNWIK 253
>sp|P97435|ENTK_MOUSE Enteropeptidase (Enterokinase) [Contains: Enteropeptidase
            non-catalytic heavy chain; Enteropeptidase catalytic
            light chain]
          Length = 1069

 Score = 89.0 bits (219), Expect = 7e-18
 Identities = 57/162 (35%), Positives = 80/162 (49%), Gaps = 1/162 (0%)
 Frame = +3

Query: 12   IIHPSYRHEYAYLEGNDVALIKFDGYISKNQYDYIDYNEL-DVNAISKPGDRCEVVGWGS 188
            +I+P Y         ND+A++  +  +  N  DYI    L + N I  PG  C + GWG 
Sbjct: 912  VINPHYDRRRKV---NDIAMMHLEFKV--NYTDYIQPICLPEENQIFIPGRTCSIAGWGY 966

Query: 189  MLPPGQDTSRFLQKAAINIISNSECGRIFSNVAYVRSNICAGSGRLSTSSCRGDSGGPLF 368
                   T   L++A + +ISN +C +         S ICAG       SC+GDSGGPL 
Sbjct: 967  DKINAGSTVDVLKEADVPLISNEKCQQQLPEYNITESMICAGYEEGGIDSCQGDSGGPLM 1026

Query: 369  CTDGRKHHIQVGIVSYGTVPCGQPNVPSVYTRVSEVAGWIRA 494
            C +  +  + VG+ S+G V C  PN P VY RVS+   WI +
Sbjct: 1027 CQENNRWFL-VGVTSFG-VQCALPNHPGVYVRVSQFIEWIHS 1066
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 64,276,781
Number of Sequences: 369166
Number of extensions: 1355634
Number of successful extensions: 4753
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4048
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4150
length of database: 68,354,980
effective HSP length: 104
effective length of database: 49,142,540
effective search space used: 3882260660
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)