Planarian EST Database


Dr_sW_018_E01

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_018_E01
         (681 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P11219|AGI_ORYSA  Lectin precursor (Agglutinin) [Contains...    52   1e-06
sp|P10968|AGI1_WHEAT  Agglutinin isolectin 1 precursor (WGA1...    48   3e-05
sp|P02876|AGI2_WHEAT  Agglutinin isolectin 2 precursor (WGA2...    44   4e-04
sp|P10969|AGI3_WHEAT  Agglutinin isolectin 3 precursor (WGA3)      44   4e-04
sp|P10039|TENA_CHICK  Tenascin precursor (TN) (Hexabrachion)...    43   9e-04
sp|P15312|AGI_HORVU  Root-specific lectin precursor                43   9e-04
sp|P10040|CRB_DROME  Crumbs protein precursor (95F)                41   0.003
sp|Q69Z28|ATS16_MOUSE  ADAMTS-16 precursor (A disintegrin an...    39   0.016
sp|Q8TE57|ATS16_HUMAN  ADAMTS-16 precursor (A disintegrin an...    38   0.021
sp|P22105|TENX_HUMAN  Tenascin-X precursor (TN-X) (Hexabrach...    37   0.037
>sp|P11219|AGI_ORYSA Lectin precursor (Agglutinin) [Contains: Lectin 10 kDa peptide;
           Lectin 8 kDa peptide]
          Length = 227

 Score = 52.0 bits (123), Expect = 1e-06
 Identities = 45/157 (28%), Positives = 58/157 (36%)
 Frame = +3

Query: 33  GGGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGI 212
           GGG  C NN QC S+Y Y        C   S++ G     GPC+ D  C  N   E+   
Sbjct: 78  GGGATCSNN-QCCSQYGY--------CGFGSEYCGSGCQNGPCRADIKCGRNANGELCPN 128

Query: 213 KYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGGGPCVRDW 392
             C  +     GY G G          S +C      G C  E    ++ GG        
Sbjct: 129 NMCCSQ----WGYCGLG----------SEFCGNGCQSGACCPEKRCGKQAGGD------- 167

Query: 393 QCYSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKD 503
           +C + +C + G   YC   G   G    SG C K  D
Sbjct: 168 KCPNNFCCSAG--GYCGLGGNYCGSGCQSGGCYKGGD 202

 Score = 39.7 bits (91), Expect = 0.007
 Identities = 45/155 (29%), Positives = 58/155 (37%), Gaps = 18/155 (11%)
 Frame = +3

Query: 96  AETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIKYCVQE-------SKQKIGYI 254
           A   + + +++  GK   G  C ++  C S F +   G  YC          S Q+ G  
Sbjct: 19  AAAAVAATNAQTCGKQNDGMICPHNL-CCSQFGYCGLGRDYCGTGCQSGACCSSQRCGSQ 77

Query: 255 GEGALCKNHHSCYS-GYCVKVGNEGFCAKESMKSEKVGGG---GPCVRDWQC----YSRY 410
           G GA C N+  C   GYC      GF       SE  G G   GPC  D +C        
Sbjct: 78  GGGATCSNNQCCSQYGYC------GF------GSEYCGSGCQNGPCRADIKCGRNANGEL 125

Query: 411 CFNN---GKLRYCAKDGKQFGEVGISGPCRKNKDC 506
           C NN    +  YC    +  G    SG C   K C
Sbjct: 126 CPNNMCCSQWGYCGLGSEFCGNGCQSGACCPEKRC 160
>sp|P10968|AGI1_WHEAT Agglutinin isolectin 1 precursor (WGA1) (Isolectin A)
          Length = 212

 Score = 47.8 bits (112), Expect = 3e-05
 Identities = 51/169 (30%), Positives = 60/169 (35%), Gaps = 7/169 (4%)
 Frame = +3

Query: 36  GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIK 215
           GG  C NN QC S+Y Y        C   +++ G    GGPC+ D  C S          
Sbjct: 77  GGATCTNN-QCCSQYGY--------CGFGAEYCGAGCQGGPCRADIKCGSQ--------- 118

Query: 216 YCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGG---GPCVR 386
                          G LC N+  C           GFC    + SE  GGG   G C  
Sbjct: 119 -------------AGGKLCPNNLCC--------SQWGFC---GLGSEFCGGGCQSGACST 154

Query: 387 DWQC----YSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRC 521
           D  C      R C NN    YC     ++G  GI GP      C SG C
Sbjct: 155 DKPCGKDAGGRVCTNN----YCC---SKWGSCGI-GPGYCGAGCQSGGC 195
>sp|P02876|AGI2_WHEAT Agglutinin isolectin 2 precursor (WGA2) (Isolectin D)
          Length = 213

 Score = 43.9 bits (102), Expect = 4e-04
 Identities = 49/169 (28%), Positives = 59/169 (34%), Gaps = 7/169 (4%)
 Frame = +3

Query: 36  GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIK 215
           GG  C NN  C S+Y +        C   +++ G    GGPC+ D  C S          
Sbjct: 78  GGATCPNN-HCCSQYGH--------CGFGAEYCGAGCQGGPCRADIKCGSQ--------- 119

Query: 216 YCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGG---GPCVR 386
                          G LC N+  C           GFC    + SE  GGG   G C  
Sbjct: 120 -------------SGGKLCPNNLCC--------SQWGFC---GLGSEFCGGGCQSGACST 155

Query: 387 DWQC----YSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRC 521
           D  C      R C NN    YC     ++G  GI GP      C SG C
Sbjct: 156 DKPCGKDAGGRVCTNN----YCC---SKWGSCGI-GPGYCGAGCQSGGC 196

 Score = 36.2 bits (82), Expect = 0.082
 Identities = 38/139 (27%), Positives = 56/139 (40%), Gaps = 15/139 (10%)
 Frame = +3

Query: 135 GKVGGGGPCKNDFFCASNFCHEINGIKYCVQE-------SKQKIGYIGEGALCKNHHSCY 293
           G+ G    C N+  C S + +   G  YC +        + ++ G    GA C N+H C 
Sbjct: 31  GEQGSNMECPNNL-CCSQYGYCGMGGDYCGKGCQNGACWTSKRCGSQAGGATCPNNHCCS 89

Query: 294 S-GYCVKVGNEGFCAKESMKSEKVGGGGPCVRDWQCYS----RYCFNN---GKLRYCAKD 449
             G+C      GF A+      +   GGPC  D +C S    + C NN    +  +C   
Sbjct: 90  QYGHC------GFGAEYCGAGCQ---GGPCRADIKCGSQSGGKLCPNNLCCSQWGFCGLG 140

Query: 450 GKQFGEVGISGPCRKNKDC 506
            +  G    SG C  +K C
Sbjct: 141 SEFCGGGCQSGACSTDKPC 159
>sp|P10969|AGI3_WHEAT Agglutinin isolectin 3 precursor (WGA3)
          Length = 186

 Score = 43.9 bits (102), Expect = 4e-04
 Identities = 48/163 (29%), Positives = 62/163 (38%), Gaps = 1/163 (0%)
 Frame = +3

Query: 36  GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIK 215
           GG  C NN  C S+Y +        C   +++ G    GGPC+ D  C S          
Sbjct: 51  GGKTCPNN-HCCSQYGH--------CGFGAEYCGAGCQGGPCRADIKCGSQ--------- 92

Query: 216 YCVQESKQKIGYIGEGALCKNHHSCYS-GYCVKVGNEGFCAKESMKSEKVGGGGPCVRDW 392
                          G LC N+  C   GYC  +G+E FC  E  ++       PC +D 
Sbjct: 93  -------------AGGKLCPNNLCCSQWGYC-GLGSE-FCG-EGCQNGACSTDKPCGKD- 135

Query: 393 QCYSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRC 521
               R C NN    YC     ++G  GI GP      C SG C
Sbjct: 136 -AGGRVCTNN----YCC---SKWGSCGI-GPGYCGAGCQSGGC 169
>sp|P10039|TENA_CHICK Tenascin precursor (TN) (Hexabrachion) (Cytotactin) (Neuronectin)
           (GMEM) (JI) (Miotendinous antigen)
           (Glioma-associated-extracellular matrix antigen) (GP
           150-225)
          Length = 1808

 Score = 42.7 bits (99), Expect = 9e-04
 Identities = 49/174 (28%), Positives = 69/174 (39%), Gaps = 15/174 (8%)
 Frame = +3

Query: 48  CKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHE----INGIK 215
           C N G+C +  C        +C  D  F G+  G   C ND       CH     +NG  
Sbjct: 413 CHNRGRCINGQC--------VC--DEGFIGEDCGELRCPND-------CHNRGRCVNGQC 455

Query: 216 YCVQESKQKIGYIGE--GAL-----CKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGGG 374
            C +      G+IGE  G L     C +H  C +G CV   +EG+  ++        G  
Sbjct: 456 ECHE------GFIGEDCGELRCPNDCNSHGRCVNGQCVC--DEGYTGEDC-------GEL 500

Query: 375 PCVRDWQCYSRYCFNNGKLRYCAKD----GKQFGEVGISGPCRKNKDCSSGRCV 524
            C  D  C++R     G+   C  D    G+  GE+     C ++  C  GRCV
Sbjct: 501 RCPND--CHNRGRCVEGR---CVCDNGFMGEDCGELSCPNDCHQHGRCVDGRCV 549

 Score = 40.8 bits (94), Expect = 0.003
 Identities = 44/167 (26%), Positives = 65/167 (38%), Gaps = 8/167 (4%)
 Frame = +3

Query: 48  CKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIKYCVQ 227
           C N G+C    C        +C  D  + G+  G   C ND F        ING  +C +
Sbjct: 289 CHNRGRCVDNEC--------VC--DEGYTGEDCGELICPNDCFDRGRC---INGTCFCEE 335

Query: 228 ESKQKIGYIGE--GAL-----CKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGGGPCVR 386
                 GY GE  G L     C  +  C +G CV   +EGF   +  +         C +
Sbjct: 336 ------GYTGEDCGELTCPNNCNGNGRCENGLCVC--HEGFVGDDCSQKR-------CPK 380

Query: 387 DWQCYSR-YCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRCV 524
           D  C +R +C +   + +    G+  GE+     C     C +G+CV
Sbjct: 381 D--CNNRGHCVDGRCVCHEGYLGEDCGELRCPNDCHNRGRCINGQCV 425

 Score = 30.4 bits (67), Expect = 4.5
 Identities = 52/205 (25%), Positives = 71/205 (34%), Gaps = 27/205 (13%)
 Frame = +3

Query: 48  CKNNGQCYSRYCY------KVDAETKICSKDSKFKGK-VGGGGPCKNDFF---CASNFCH 197
           C N G C    C         D     C  D   +GK V G   C   +    C    C 
Sbjct: 196 CLNRGLCVRGKCICEEGFTGEDCSQAACPSDCNDQGKCVDGVCVCFEGYTGPDCGEELCP 255

Query: 198 EINGIK------YCVQESKQKIGYIGEGA---LCKNHHSCYS-GYCVK---VGNEGFCAK 338
              GI        CV       G+ GE     LC N+  C++ G CV    V +EG+  +
Sbjct: 256 HGCGIHGRCVGGRCVCHE----GFTGEDCNEPLCPNN--CHNRGRCVDNECVCDEGYTGE 309

Query: 339 ESMKSEKVGGGGPCVRDWQCYSRYCFNNGKLRYCAKD--GKQFGEVGISGPCRKNKDCSS 512
           +          G  +    C+ R    NG   +C +   G+  GE+     C  N  C +
Sbjct: 310 DC---------GELICPNDCFDRGRCINGTC-FCEEGYTGEDCGELTCPNNCNGNGRCEN 359

Query: 513 GRCVSMKTKLQDGTESK--VKVCQN 581
           G CV  +  + D    K   K C N
Sbjct: 360 GLCVCHEGFVGDDCSQKRCPKDCNN 384
>sp|P15312|AGI_HORVU Root-specific lectin precursor
          Length = 212

 Score = 42.7 bits (99), Expect = 9e-04
 Identities = 48/167 (28%), Positives = 61/167 (36%), Gaps = 5/167 (2%)
 Frame = +3

Query: 36  GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIK 215
           GG  C NN  C S++ Y        C   +++ G    GGPC+ D  C S          
Sbjct: 77  GGKTCPNN-HCCSQWGY--------CGFGAEYCGAGCQGGPCRADIKCGSQ--------- 118

Query: 216 YCVQESKQKIGYIGEGALCKNHHSCYS-GYCVKVGNEGFCAKESMKSEKVGGGGPCVRDW 392
                          G LC N+  C   GYC  +G+E FC +          GG C  D 
Sbjct: 119 -------------AGGKLCPNNLCCSQWGYC-GLGSE-FCGEGCQ-------GGACSTDK 156

Query: 393 QC----YSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRC 521
            C      + C NN    YC     ++G  GI GP      C SG C
Sbjct: 157 PCGKAAGGKVCTNN----YCC---SKWGSCGI-GPGYCGAGCQSGGC 195
>sp|P10040|CRB_DROME Crumbs protein precursor (95F)
          Length = 2146

 Score = 40.8 bits (94), Expect = 0.003
 Identities = 46/179 (25%), Positives = 65/179 (36%), Gaps = 17/179 (9%)
 Frame = +3

Query: 36  GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASN-------FC 194
           G G C ++ + Y   C       K C KD+   G      PC+N   C  N       FC
Sbjct: 276 GHGTCSSSPEGYECRC-TARYSGKNCQKDN---GSPCAKNPCENGGSCLENSRGDYQCFC 331

Query: 195 HEINGIKYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGGG 374
              +  ++C  E       +    LC+ +    +G CV +G  G    E  K      G 
Sbjct: 332 DPNHSGQHCETE-------VNIHPLCQTNPCLNNGACVVIGGSGALTCECPKGY---AGA 381

Query: 375 PCVRDW-QCYSRYCFNNG----KLRYCAKDGKQFGEVGI-----SGPCRKNKDCSSGRC 521
            C  D  +C S+ C NNG    ++   + D    G  G         C KN   + GRC
Sbjct: 382 RCEVDTDECASQPCQNNGSCIDRINGFSCDCSGTGYTGAFCQTNVDECDKNPCLNGGRC 440
>sp|Q69Z28|ATS16_MOUSE ADAMTS-16 precursor (A disintegrin and metalloproteinase with
           thrombospondin motifs 16) (ADAM-TS 16) (ADAM-TS16)
          Length = 1222

 Score = 38.5 bits (88), Expect = 0.016
 Identities = 35/126 (27%), Positives = 45/126 (35%), Gaps = 5/126 (3%)
 Frame = +3

Query: 159 CKNDF---FCASNFCHEINGIKYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGF 329
           C  DF    C + +CH I       ++ + K     EG LC     C  G CVK G+EG 
Sbjct: 527 CMLDFRKDICKALWCHRIG------RKCETKFMPAAEGTLCGQDMWCRGGQCVKYGDEG- 579

Query: 330 CAKESMKSEKVGGGGPCVRDWQCYSRYCFN--NGKLRYCAKDGKQFGEVGISGPCRKNKD 503
                   +   G       W   SR C    + + R C       G     G  R  K 
Sbjct: 580 -------PKPTHGHWSDWSPWSPCSRTCGGGISHRDRLCTNPRPSHGGKFCQGSTRTLKL 632

Query: 504 CSSGRC 521
           C+S RC
Sbjct: 633 CNSQRC 638
>sp|Q8TE57|ATS16_HUMAN ADAMTS-16 precursor (A disintegrin and metalloproteinase with
           thrombospondin motifs 16) (ADAM-TS 16) (ADAM-TS16)
          Length = 1224

 Score = 38.1 bits (87), Expect = 0.021
 Identities = 33/126 (26%), Positives = 46/126 (36%), Gaps = 5/126 (3%)
 Frame = +3

Query: 159 CKNDF---FCASNFCHEINGIKYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGF 329
           C  DF    C + +CH I       ++ + K     EG +C +   C  G CVK G+EG 
Sbjct: 529 CMLDFKKDICKALWCHRIG------RKCETKFMPAAEGTICGHDMWCRGGQCVKYGDEG- 581

Query: 330 CAKESMKSEKVGGGGPCVRDWQCYSRYCFN--NGKLRYCAKDGKQFGEVGISGPCRKNKD 503
                   +   G       W   SR C    + + R C       G     G  R  K 
Sbjct: 582 -------PKPTHGHWSDWSSWSPCSRTCGGGVSHRSRLCTNPKPSHGGKFCEGSTRTLKL 634

Query: 504 CSSGRC 521
           C+S +C
Sbjct: 635 CNSQKC 640
>sp|P22105|TENX_HUMAN Tenascin-X precursor (TN-X) (Hexabrachion-like protein)
          Length = 4289

 Score = 37.4 bits (85), Expect = 0.037
 Identities = 48/206 (23%), Positives = 71/206 (34%), Gaps = 26/206 (12%)
 Frame = +3

Query: 42  GICKNNGQCYSRYCY------KVDAETKICSKDSKFKGKVGGGGPCKNDF-FCASNFCHE 200
           G C   G+C    C         D  ++ C +D +      G G C+N    C + +  E
Sbjct: 406 GDCNQRGRCEDGRCVCWPGYTGTDCGSRACPRDCR------GRGRCENGVCVCNAGYSGE 459

Query: 201 INGIKYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGG--- 371
             G++ C  + +      G G        C+ GY    G +  C   +   +  G G   
Sbjct: 460 DCGVRSCPGDCR------GRGRCESGRCMCWPGY---TGRD--CGTRACPGDCRGRGRCV 508

Query: 372 -GPCVRD-----WQCYSRYCFNNGKLRYCAKDGKQFGEVGISGP----------CRKNKD 503
            G CV +       C SR C  + +     +DG    + G SG           CR    
Sbjct: 509 DGRCVCNPGFTGEDCGSRRCPGDCRGHGLCEDGVCVCDAGYSGEDCSTRSCPGGCRGRGQ 568

Query: 504 CSSGRCVSMKTKLQDGTESKVKVCQN 581
           C  GRCV        G +  V+ C N
Sbjct: 569 CLDGRCVCEDG--YSGEDCGVRQCPN 592

 Score = 36.6 bits (83), Expect = 0.063
 Identities = 39/165 (23%), Positives = 59/165 (35%), Gaps = 1/165 (0%)
 Frame = +3

Query: 33  GGGGICKNNGQCYSRYCYK-VDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEING 209
           G GG C + G+C     Y   D  T+ C +D + +G+   G     +  C + +  +  G
Sbjct: 347 GEGGRCVD-GRCVCWPGYTGEDCSTRTCPRDCRGRGRCEDG-----ECICDTGYSGDDCG 400

Query: 210 IKYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGGGPCVRD 389
           ++ C  +             C     C  G CV     G+   +        G   C RD
Sbjct: 401 VRSCPGD-------------CNQRGRCEDGRCVCW--PGYTGTDC-------GSRACPRD 438

Query: 390 WQCYSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRCV 524
            +   R C N   +      G+  G     G CR    C SGRC+
Sbjct: 439 CRGRGR-CENGVCVCNAGYSGEDCGVRSCPGDCRGRGRCESGRCM 482
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 81,852,694
Number of Sequences: 369166
Number of extensions: 1929922
Number of successful extensions: 5730
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 5164
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5677
length of database: 68,354,980
effective HSP length: 107
effective length of database: 48,588,335
effective search space used: 5782011865
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)