Planaria EST Database


DrC_02770

BLASTX 2.2.13 [Nov-27-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= DrC_02770
         (537 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P98072|ENTK_BOVIN  Enteropeptidase precursor (Enterokinas...    94   3e-19
sp|Q05511|HEPS_RAT  Serine protease hepsin [Contains: Serine...    89   5e-18
sp|P05981|HEPS_HUMAN  Serine protease hepsin (Transmembrane ...    89   5e-18
sp|P98073|ENTK_HUMAN  Enteropeptidase precursor (Enterokinas...    89   7e-18
sp|O35453|HEPS_MOUSE  Serine protease hepsin [Contains: Seri...    88   1e-17
sp|P98074|ENTK_PIG  Enteropeptidase precursor (Enterokinase)...    87   2e-17
sp|P97435|ENTK_MOUSE  Enteropeptidase (Enterokinase) [Contai...    86   4e-17
sp|P03952|KLKB1_HUMAN  Plasma kallikrein precursor (Plasma p...    79   5e-15
sp|P08419|ELA2_PIG  Elastase-2 precursor                           79   9e-15
sp|Q8VHJ4|TM11D_RAT  Transmembrane protease, serine 11D prec...    79   9e-15

>sp|P98072|ENTK_BOVIN Enteropeptidase precursor (Enterokinase) [Contains: Enteropeptidase
            non-catalytic heavy chain; Enteropeptidase catalytic
            light chain]
          Length = 1035

 Score = 93.6 bits (231), Expect = 3e-19
 Identities = 53/148 (35%), Positives = 74/148 (50%), Gaps = 5/148 (3%)
 Frame = +3

Query: 12   GEWPWIVSLKNPVLRKLLEKHTPTYMNWSLVGSDNHICGGTLINSEWIVTALHCMYHYIP 191
            G WPW+V+L               Y +      D  +CG +L++ +W+V+A HC+Y    
Sbjct: 810  GAWPWVVAL---------------YFD------DQQVCGASLVSRDWLVSAAHCVYGRNM 848

Query: 192  QPVPILSNWSATLGAYNISGKSEHSISI-LIDKVVYRKSFFDN--NNDFAMMHLSQPVNF 362
            +P    S W A LG +  S  +   I   LID++V    +     NND AMMHL   VN+
Sbjct: 849  EP----SKWKAVLGLHMASNLTSPQIETRLIDQIVINPHYNKRRKNNDIAMMHLEMKVNY 904

Query: 363  TDYIYPACLP--TTLATPGQMCYAVGWG 440
            TDYI P CLP    +  PG++C   GWG
Sbjct: 905  TDYIQPICLPEENQVFPPGRICSIAGWG 932

>sp|Q05511|HEPS_RAT Serine protease hepsin [Contains: Serine protease hepsin
           non-catalytic chain; Serine protease hepsin catalytic
           chain]
          Length = 416

 Score = 89.4 bits (220), Expect = 5e-18
 Identities = 51/167 (30%), Positives = 78/167 (46%), Gaps = 11/167 (6%)
 Frame = +3

Query: 3   SYYGEWPWIVSLKNPVLRKLLEKHTPTYMNWSLVGSDNHICGGTLINSEWIVTALHCMYH 182
           S  G WPW VSL+                         H+CGG+L++ +W++TA HC   
Sbjct: 168 SSLGRWPWQVSLRY---------------------DGTHLCGGSLLSGDWVLTAAHCF-- 204

Query: 183 YIPQPVPILSNWSATLGAYNISGKSEHSISILIDKVVYRKSFF--------DNNNDFAMM 338
             P+   +LS W    GA  ++  S H++ + +  V+Y   +         +N+ND A++
Sbjct: 205 --PERNRVLSRWRVFAGA--VARTSPHAVQLGVQAVIYHGGYLPFRDPTIDENSNDIALV 260

Query: 339 HLSQPVNFTDYIYPACLPTT--LATPGQMCYAVGWG-TSFSIQNTVI 470
           HLS  +  T+YI P CLP        G++C   GWG T F  Q  V+
Sbjct: 261 HLSSSLPLTEYIQPVCLPAAGQALVDGKVCTVTGWGNTQFYGQQAVV 307

>sp|P05981|HEPS_HUMAN Serine protease hepsin (Transmembrane protease, serine 1)
           [Contains: Serine protease hepsin non-catalytic chain;
           Serine protease hepsin catalytic chain]
          Length = 417

 Score = 89.4 bits (220), Expect = 5e-18
 Identities = 48/155 (30%), Positives = 73/155 (47%), Gaps = 10/155 (6%)
 Frame = +3

Query: 12  GEWPWIVSLKNPVLRKLLEKHTPTYMNWSLVGSDNHICGGTLINSEWIVTALHCMYHYIP 191
           G WPW VSL+                         H+CGG+L++ +W++TA HC     P
Sbjct: 172 GRWPWQVSLRY---------------------DGAHLCGGSLLSGDWVLTAAHCF----P 206

Query: 192 QPVPILSNWSATLGAYNISGKSEHSISILIDKVVYRKSFF--------DNNNDFAMMHLS 347
           +   +LS W    GA  ++  S H + + +  VVY   +         +N+ND A++HLS
Sbjct: 207 ERNRVLSRWRVFAGA--VAQASPHGLQLGVQAVVYHGGYLPFRDPNSEENSNDIALVHLS 264

Query: 348 QPVNFTDYIYPACLPTT--LATPGQMCYAVGWGTS 446
            P+  T+YI P CLP        G++C   GWG +
Sbjct: 265 SPLPLTEYIQPVCLPAAGQALVDGKICTVTGWGNT 299

>sp|P98073|ENTK_HUMAN Enteropeptidase precursor (Enterokinase) [Contains: Enteropeptidase
            non-catalytic heavy chain; Enteropeptidase catalytic
            light chain]
          Length = 1019

 Score = 89.0 bits (219), Expect = 7e-18
 Identities = 54/149 (36%), Positives = 75/149 (50%), Gaps = 5/149 (3%)
 Frame = +3

Query: 12   GEWPWIVSLKNPVLRKLLEKHTPTYMNWSLVGSDNHICGGTLINSEWIVTALHCMYHYIP 191
            G WPW+V L               Y    L      +CG +L++S+W+V+A HC+Y    
Sbjct: 794  GAWPWVVGL---------------YYGGRL------LCGASLVSSDWLVSAAHCVYGRNL 832

Query: 192  QPVPILSNWSATLGAYNISG-KSEHSISILIDKVVYRKSFFDN--NNDFAMMHLSQPVNF 362
            +P    S W+A LG +  S   S  ++  LID++V    +     +ND AMMHL   VN+
Sbjct: 833  EP----SKWTAILGLHMKSNLTSPQTVPRLIDEIVINPHYNRRRKDNDIAMMHLEFKVNY 888

Query: 363  TDYIYPACLP--TTLATPGQMCYAVGWGT 443
            TDYI P CLP    +  PG+ C   GWGT
Sbjct: 889  TDYIQPICLPEENQVFPPGRNCSIAGWGT 917

>sp|O35453|HEPS_MOUSE Serine protease hepsin [Contains: Serine protease hepsin
           non-catalytic chain; Serine protease hepsin catalytic
           chain]
          Length = 436

 Score = 88.2 bits (217), Expect = 1e-17
 Identities = 50/167 (29%), Positives = 78/167 (46%), Gaps = 11/167 (6%)
 Frame = +3

Query: 3   SYYGEWPWIVSLKNPVLRKLLEKHTPTYMNWSLVGSDNHICGGTLINSEWIVTALHCMYH 182
           S  G WPW VSL+                         H+CGG+L++ +W++TA HC   
Sbjct: 188 SSLGRWPWQVSLRY---------------------DGTHLCGGSLLSGDWVLTAAHCF-- 224

Query: 183 YIPQPVPILSNWSATLGAYNISGKSEHSISILIDKVVYRKSFF--------DNNNDFAMM 338
             P+   +LS W    GA  ++  S H++ + +  V+Y   +         +N+ND A++
Sbjct: 225 --PERNRVLSRWRVFAGA--VARTSPHAVQLGVQAVIYHGGYLPFRDPTIDENSNDIALV 280

Query: 339 HLSQPVNFTDYIYPACLPTT--LATPGQMCYAVGWG-TSFSIQNTVI 470
           HLS  +  T+YI P CLP        G++C   GWG T F  Q  ++
Sbjct: 281 HLSSSLPLTEYIQPVCLPAAGQALVDGKVCTVTGWGNTQFYGQQAMV 327

>sp|P98074|ENTK_PIG Enteropeptidase precursor (Enterokinase) [Contains: Enteropeptidase
            non-catalytic mini chain; Enteropeptidase non-catalytic
            heavy chain; Enteropeptidase catalytic light chain]
          Length = 1034

 Score = 87.4 bits (215), Expect = 2e-17
 Identities = 52/148 (35%), Positives = 75/148 (50%), Gaps = 5/148 (3%)
 Frame = +3

Query: 12   GEWPWIVSLKNPVLRKLLEKHTPTYMNWSLVGSDNHICGGTLINSEWIVTALHCMYHYIP 191
            G WPW+V+L               Y N  L      +CG +L++ +W+V+A HC+Y    
Sbjct: 809  GAWPWVVAL---------------YYNGQL------LCGASLVSRDWLVSAAHCVYGRNL 847

Query: 192  QPVPILSNWSATLGAYNISG-KSEHSISILIDKVVYRKSFFDN--NNDFAMMHLSQPVNF 362
            +P    S W A LG +  S   S   ++ LID++V    +     ++D AMMHL   VN+
Sbjct: 848  EP----SKWKAILGLHMTSNLTSPQIVTRLIDEIVINPHYNRRRKDSDIAMMHLEFKVNY 903

Query: 363  TDYIYPACLP--TTLATPGQMCYAVGWG 440
            TDYI P CLP    +  PG++C   GWG
Sbjct: 904  TDYIQPICLPEENQVFPPGRICSIAGWG 931

>sp|P97435|ENTK_MOUSE Enteropeptidase (Enterokinase) [Contains: Enteropeptidase
            non-catalytic heavy chain; Enteropeptidase catalytic
            light chain]
          Length = 1069

 Score = 86.3 bits (212), Expect = 4e-17
 Identities = 50/148 (33%), Positives = 75/148 (50%), Gaps = 5/148 (3%)
 Frame = +3

Query: 12   GEWPWIVSLKNPVLRKLLEKHTPTYMNWSLVGSDNHICGGTLINSEWIVTALHCMYHYIP 191
            G WPW+V+L +       ++ T           D  +CG +L++S+W+V+A HC+Y    
Sbjct: 839  GAWPWVVALYHR------DRST-----------DRLLCGASLVSSDWLVSAAHCVYRRNL 881

Query: 192  QPVPILSNWSATLGAYNISG-KSEHSISILIDKVVYRKSFFDNN--NDFAMMHLSQPVNF 362
             P    + W+A LG +  S   S   +  ++D++V    +      ND AMMHL   VN+
Sbjct: 882  DP----TRWTAVLGLHMQSNLTSPQVVRRVVDQIVINPHYDRRRKVNDIAMMHLEFKVNY 937

Query: 363  TDYIYPACLP--TTLATPGQMCYAVGWG 440
            TDYI P CLP    +  PG+ C   GWG
Sbjct: 938  TDYIQPICLPEENQIFIPGRTCSIAGWG 965

>sp|P03952|KLKB1_HUMAN Plasma kallikrein precursor (Plasma prekallikrein) (Kininogenin)
           (Fletcher factor) [Contains: Plasma kallikrein heavy
           chain; Plasma kallikrein light chain]
          Length = 638

 Score = 79.3 bits (194), Expect = 5e-15
 Identities = 48/176 (27%), Positives = 85/176 (48%), Gaps = 4/176 (2%)
 Frame = +3

Query: 3   SYYGEWPWIVSLKNPVLRKLLEKHTPTYMNWSLVGSDNHICGGTLINSEWIVTALHCMYH 182
           S +GEWPW VSL+  +                   +  H+CGG+LI  +W++TA HC   
Sbjct: 397 SSWGEWPWQVSLQVKLT------------------AQRHLCGGSLIGHQWVLTAAHCF-- 436

Query: 183 YIPQPVPILSNWSATLGAYNISGKSEHSISILIDKVVYRKSF--FDNNNDFAMMHLSQPV 356
                +P+   W    G  N+S  ++ +    I +++  +++   + N+D A++ L  P+
Sbjct: 437 ---DGLPLQDVWRIYSGILNLSDITKDTPFSQIKEIIIHQNYKVSEGNHDIALIKLQAPL 493

Query: 357 NFTDYIYPACLPT--TLATPGQMCYAVGWGTSFSIQNTVINPILKHTTLLITQAKE 518
           N+T++  P CLP+    +T    C+  GWG  FS +   I  IL+   + +   +E
Sbjct: 494 NYTEFQKPICLPSKGDTSTIYTNCWVTGWG--FSKEKGEIQNILQKVNIPLVTNEE 547

>sp|P08419|ELA2_PIG Elastase-2 precursor
          Length = 269

 Score = 78.6 bits (192), Expect = 9e-15
 Identities = 49/169 (28%), Positives = 77/169 (45%), Gaps = 7/169 (4%)
 Frame = +3

Query: 18  WPWIVSLKNPVLRKLLEKHTPTYMNWSLVGSDNHICGGTLINSEWIVTALHCMYHYIPQP 197
           WPW VSL+                 +   G   H CGGTL++  W++TA HC        
Sbjct: 40  WPWQVSLQ-----------------YDSSGQWRHTCGGTLVDQSWVLTAAHC-------- 74

Query: 198 VPILSNWSATLGAYNISGKSEHSISILIDKVVYRKSF----FDNNNDFAMMHLSQPVNFT 365
           +     +   LG +++S     S+++ + K+V  + +      N ND A++ L+ PV+ T
Sbjct: 75  ISSSRTYRVVLGRHSLSTNEPGSLAVKVSKLVVHQDWNSNQLSNGNDIALLKLASPVSLT 134

Query: 366 DYIYPACLPT--TLATPGQMCYAVGWGTSFSIQNTVINP-ILKHTTLLI 503
           D I   CLP   T+     +CY  GWG    +Q    +P IL+   LL+
Sbjct: 135 DKIQLGCLPAAGTILPNNYVCYVTGWG---RLQTNGASPDILQQGQLLV 180

>sp|Q8VHJ4|TM11D_RAT Transmembrane protease, serine 11D precursor (Airway trypsin-like
           protease) (AT) (Adrenal secretory serine protease) (AsP)
           [Contains: Transmembrane protease, serine 11D
           non-catalytic chain; Transmembrane protease, serine 11D
           catalytic chain]
          Length = 417

 Score = 78.6 bits (192), Expect = 9e-15
 Identities = 51/157 (32%), Positives = 71/157 (45%), Gaps = 3/157 (1%)
 Frame = +3

Query: 12  GEWPWIVSLKNPVLRKLLEKHTPTYMNWSLVGSDNHICGGTLINSEWIVTALHCMYHYI- 188
           G+WPW VSL+                      ++ H CGGTLI++ W++TA HC   Y  
Sbjct: 195 GDWPWQVSLQL---------------------NNVHHCGGTLISNLWVLTAAHCFRSYSN 233

Query: 189 PQPVPILSNWSATLGAYNISGKSEHSISILIDKVVYRKSFFDNNNDFAMMHLSQPVNFTD 368
           PQ       W+AT G   IS +    +  ++    Y       +ND A++ L +PV FT 
Sbjct: 234 PQ------QWTATFGVSTISPRLRVRVRAILAHAEYNS--ITRDNDIAVVQLDRPVTFTR 285

Query: 369 YIYPACLP--TTLATPGQMCYAVGWGTSFSIQNTVIN 473
            I+  CLP  T    P  + Y  GWG+     NTV N
Sbjct: 286 NIHRVCLPAATQNIMPDSVAYVTGWGSLTYGGNTVTN 322

  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 69,871,221
Number of Sequences: 369166
Number of extensions: 1472947
Number of successful extensions: 4413
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3805
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3968
length of database: 68,354,980
effective HSP length: 103
effective length of database: 49,327,275
effective search space used: 3699545625
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)