Planarian EST Database


Dr_sW_018_E02

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_018_E02
         (642 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P11219|AGI_ORYSA  Lectin precursor (Agglutinin) [Contains...    50   6e-06
sp|P10968|AGI1_WHEAT  Agglutinin isolectin 1 precursor (WGA1...    48   2e-05
sp|P02876|AGI2_WHEAT  Agglutinin isolectin 2 precursor (WGA2...    44   4e-04
sp|P10969|AGI3_WHEAT  Agglutinin isolectin 3 precursor (WGA3)      44   4e-04
sp|P10039|TENA_CHICK  Tenascin precursor (TN) (Hexabrachion)...    43   8e-04
sp|P15312|AGI_HORVU  Root-specific lectin precursor                43   8e-04
sp|P10040|CRB_DROME  Crumbs protein precursor (95F)                41   0.003
sp|Q69Z28|ATS16_MOUSE  ADAMTS-16 precursor (A disintegrin an...    39   0.015
sp|Q8TE57|ATS16_HUMAN  ADAMTS-16 precursor (A disintegrin an...    38   0.019
sp|P22105|TENX_HUMAN  Tenascin-X precursor (TN-X) (Hexabrach...    37   0.033
>sp|P11219|AGI_ORYSA Lectin precursor (Agglutinin) [Contains: Lectin 10 kDa peptide;
           Lectin 8 kDa peptide]
          Length = 227

 Score = 49.7 bits (117), Expect = 6e-06
 Identities = 44/156 (28%), Positives = 57/156 (36%)
 Frame = +1

Query: 1   GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIK 180
           GG  C NN QC S+Y Y        C   S++ G     GPC+ D  C  N   E+    
Sbjct: 79  GGATCSNN-QCCSQYGY--------CGFGSEYCGSGCQNGPCRADIKCGRNANGELCPNN 129

Query: 181 YCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGGGPCVRDWQ 360
            C  +     GY G G          S +C      G C  E    ++ GG        +
Sbjct: 130 MCCSQ----WGYCGLG----------SEFCGNGCQSGACCPEKRCGKQAGGD-------K 168

Query: 361 CYSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKD 468
           C + +C + G   YC   G   G    SG C K  D
Sbjct: 169 CPNNFCCSAG--GYCGLGGNYCGSGCQSGGCYKGGD 202

 Score = 39.7 bits (91), Expect = 0.007
 Identities = 45/155 (29%), Positives = 58/155 (37%), Gaps = 18/155 (11%)
 Frame = +1

Query: 61  AETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIKYCVQE-------SKQKIGYI 219
           A   + + +++  GK   G  C ++  C S F +   G  YC          S Q+ G  
Sbjct: 19  AAAAVAATNAQTCGKQNDGMICPHNL-CCSQFGYCGLGRDYCGTGCQSGACCSSQRCGSQ 77

Query: 220 GEGALCKNHHSCYS-GYCVKVGNEGFCAKESMKSEKVGGG---GPCVRDWQC----YSRY 375
           G GA C N+  C   GYC      GF       SE  G G   GPC  D +C        
Sbjct: 78  GGGATCSNNQCCSQYGYC------GF------GSEYCGSGCQNGPCRADIKCGRNANGEL 125

Query: 376 CFNN---GKLRYCAKDGKQFGEVGISGPCRKNKDC 471
           C NN    +  YC    +  G    SG C   K C
Sbjct: 126 CPNNMCCSQWGYCGLGSEFCGNGCQSGACCPEKRC 160
>sp|P10968|AGI1_WHEAT Agglutinin isolectin 1 precursor (WGA1) (Isolectin A)
          Length = 212

 Score = 47.8 bits (112), Expect = 2e-05
 Identities = 51/169 (30%), Positives = 60/169 (35%), Gaps = 7/169 (4%)
 Frame = +1

Query: 1   GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIK 180
           GG  C NN QC S+Y Y        C   +++ G    GGPC+ D  C S          
Sbjct: 77  GGATCTNN-QCCSQYGY--------CGFGAEYCGAGCQGGPCRADIKCGSQ--------- 118

Query: 181 YCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGG---GPCVR 351
                          G LC N+  C           GFC    + SE  GGG   G C  
Sbjct: 119 -------------AGGKLCPNNLCC--------SQWGFC---GLGSEFCGGGCQSGACST 154

Query: 352 DWQC----YSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRC 486
           D  C      R C NN    YC     ++G  GI GP      C SG C
Sbjct: 155 DKPCGKDAGGRVCTNN----YCC---SKWGSCGI-GPGYCGAGCQSGGC 195
>sp|P02876|AGI2_WHEAT Agglutinin isolectin 2 precursor (WGA2) (Isolectin D)
          Length = 213

 Score = 43.9 bits (102), Expect = 4e-04
 Identities = 49/169 (28%), Positives = 59/169 (34%), Gaps = 7/169 (4%)
 Frame = +1

Query: 1   GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIK 180
           GG  C NN  C S+Y +        C   +++ G    GGPC+ D  C S          
Sbjct: 78  GGATCPNN-HCCSQYGH--------CGFGAEYCGAGCQGGPCRADIKCGSQ--------- 119

Query: 181 YCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGG---GPCVR 351
                          G LC N+  C           GFC    + SE  GGG   G C  
Sbjct: 120 -------------SGGKLCPNNLCC--------SQWGFC---GLGSEFCGGGCQSGACST 155

Query: 352 DWQC----YSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRC 486
           D  C      R C NN    YC     ++G  GI GP      C SG C
Sbjct: 156 DKPCGKDAGGRVCTNN----YCC---SKWGSCGI-GPGYCGAGCQSGGC 196

 Score = 36.2 bits (82), Expect = 0.074
 Identities = 38/139 (27%), Positives = 56/139 (40%), Gaps = 15/139 (10%)
 Frame = +1

Query: 100 GKVGGGGPCKNDFFCASNFCHEINGIKYCVQE-------SKQKIGYIGEGALCKNHHSCY 258
           G+ G    C N+  C S + +   G  YC +        + ++ G    GA C N+H C 
Sbjct: 31  GEQGSNMECPNNL-CCSQYGYCGMGGDYCGKGCQNGACWTSKRCGSQAGGATCPNNHCCS 89

Query: 259 S-GYCVKVGNEGFCAKESMKSEKVGGGGPCVRDWQCYS----RYCFNN---GKLRYCAKD 414
             G+C      GF A+      +   GGPC  D +C S    + C NN    +  +C   
Sbjct: 90  QYGHC------GFGAEYCGAGCQ---GGPCRADIKCGSQSGGKLCPNNLCCSQWGFCGLG 140

Query: 415 GKQFGEVGISGPCRKNKDC 471
            +  G    SG C  +K C
Sbjct: 141 SEFCGGGCQSGACSTDKPC 159
>sp|P10969|AGI3_WHEAT Agglutinin isolectin 3 precursor (WGA3)
          Length = 186

 Score = 43.9 bits (102), Expect = 4e-04
 Identities = 48/163 (29%), Positives = 62/163 (38%), Gaps = 1/163 (0%)
 Frame = +1

Query: 1   GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIK 180
           GG  C NN  C S+Y +        C   +++ G    GGPC+ D  C S          
Sbjct: 51  GGKTCPNN-HCCSQYGH--------CGFGAEYCGAGCQGGPCRADIKCGSQ--------- 92

Query: 181 YCVQESKQKIGYIGEGALCKNHHSCYS-GYCVKVGNEGFCAKESMKSEKVGGGGPCVRDW 357
                          G LC N+  C   GYC  +G+E FC  E  ++       PC +D 
Sbjct: 93  -------------AGGKLCPNNLCCSQWGYC-GLGSE-FCG-EGCQNGACSTDKPCGKD- 135

Query: 358 QCYSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRC 486
               R C NN    YC     ++G  GI GP      C SG C
Sbjct: 136 -AGGRVCTNN----YCC---SKWGSCGI-GPGYCGAGCQSGGC 169
>sp|P10039|TENA_CHICK Tenascin precursor (TN) (Hexabrachion) (Cytotactin) (Neuronectin)
           (GMEM) (JI) (Miotendinous antigen)
           (Glioma-associated-extracellular matrix antigen) (GP
           150-225)
          Length = 1808

 Score = 42.7 bits (99), Expect = 8e-04
 Identities = 49/174 (28%), Positives = 69/174 (39%), Gaps = 15/174 (8%)
 Frame = +1

Query: 13  CKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHE----INGIK 180
           C N G+C +  C        +C  D  F G+  G   C ND       CH     +NG  
Sbjct: 413 CHNRGRCINGQC--------VC--DEGFIGEDCGELRCPND-------CHNRGRCVNGQC 455

Query: 181 YCVQESKQKIGYIGE--GAL-----CKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGGG 339
            C +      G+IGE  G L     C +H  C +G CV   +EG+  ++        G  
Sbjct: 456 ECHE------GFIGEDCGELRCPNDCNSHGRCVNGQCVC--DEGYTGEDC-------GEL 500

Query: 340 PCVRDWQCYSRYCFNNGKLRYCAKD----GKQFGEVGISGPCRKNKDCSSGRCV 489
            C  D  C++R     G+   C  D    G+  GE+     C ++  C  GRCV
Sbjct: 501 RCPND--CHNRGRCVEGR---CVCDNGFMGEDCGELSCPNDCHQHGRCVDGRCV 549

 Score = 40.8 bits (94), Expect = 0.003
 Identities = 44/167 (26%), Positives = 65/167 (38%), Gaps = 8/167 (4%)
 Frame = +1

Query: 13  CKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIKYCVQ 192
           C N G+C    C        +C  D  + G+  G   C ND F        ING  +C +
Sbjct: 289 CHNRGRCVDNEC--------VC--DEGYTGEDCGELICPNDCFDRGRC---INGTCFCEE 335

Query: 193 ESKQKIGYIGE--GAL-----CKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGGGPCVR 351
                 GY GE  G L     C  +  C +G CV   +EGF   +  +         C +
Sbjct: 336 ------GYTGEDCGELTCPNNCNGNGRCENGLCVC--HEGFVGDDCSQKR-------CPK 380

Query: 352 DWQCYSR-YCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRCV 489
           D  C +R +C +   + +    G+  GE+     C     C +G+CV
Sbjct: 381 D--CNNRGHCVDGRCVCHEGYLGEDCGELRCPNDCHNRGRCINGQCV 425

 Score = 30.4 bits (67), Expect = 4.0
 Identities = 52/205 (25%), Positives = 71/205 (34%), Gaps = 27/205 (13%)
 Frame = +1

Query: 13  CKNNGQCYSRYCY------KVDAETKICSKDSKFKGK-VGGGGPCKNDFF---CASNFCH 162
           C N G C    C         D     C  D   +GK V G   C   +    C    C 
Sbjct: 196 CLNRGLCVRGKCICEEGFTGEDCSQAACPSDCNDQGKCVDGVCVCFEGYTGPDCGEELCP 255

Query: 163 EINGIK------YCVQESKQKIGYIGEGA---LCKNHHSCYS-GYCVK---VGNEGFCAK 303
              GI        CV       G+ GE     LC N+  C++ G CV    V +EG+  +
Sbjct: 256 HGCGIHGRCVGGRCVCHE----GFTGEDCNEPLCPNN--CHNRGRCVDNECVCDEGYTGE 309

Query: 304 ESMKSEKVGGGGPCVRDWQCYSRYCFNNGKLRYCAKD--GKQFGEVGISGPCRKNKDCSS 477
           +          G  +    C+ R    NG   +C +   G+  GE+     C  N  C +
Sbjct: 310 DC---------GELICPNDCFDRGRCINGTC-FCEEGYTGEDCGELTCPNNCNGNGRCEN 359

Query: 478 GRCVSMKTKLQDGTESK--VKVCQN 546
           G CV  +  + D    K   K C N
Sbjct: 360 GLCVCHEGFVGDDCSQKRCPKDCNN 384
>sp|P15312|AGI_HORVU Root-specific lectin precursor
          Length = 212

 Score = 42.7 bits (99), Expect = 8e-04
 Identities = 48/167 (28%), Positives = 61/167 (36%), Gaps = 5/167 (2%)
 Frame = +1

Query: 1   GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIK 180
           GG  C NN  C S++ Y        C   +++ G    GGPC+ D  C S          
Sbjct: 77  GGKTCPNN-HCCSQWGY--------CGFGAEYCGAGCQGGPCRADIKCGSQ--------- 118

Query: 181 YCVQESKQKIGYIGEGALCKNHHSCYS-GYCVKVGNEGFCAKESMKSEKVGGGGPCVRDW 357
                          G LC N+  C   GYC  +G+E FC +          GG C  D 
Sbjct: 119 -------------AGGKLCPNNLCCSQWGYC-GLGSE-FCGEGCQ-------GGACSTDK 156

Query: 358 QC----YSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRC 486
            C      + C NN    YC     ++G  GI GP      C SG C
Sbjct: 157 PCGKAAGGKVCTNN----YCC---SKWGSCGI-GPGYCGAGCQSGGC 195
>sp|P10040|CRB_DROME Crumbs protein precursor (95F)
          Length = 2146

 Score = 40.8 bits (94), Expect = 0.003
 Identities = 46/179 (25%), Positives = 65/179 (36%), Gaps = 17/179 (9%)
 Frame = +1

Query: 1   GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASN-------FC 159
           G G C ++ + Y   C       K C KD+   G      PC+N   C  N       FC
Sbjct: 276 GHGTCSSSPEGYECRC-TARYSGKNCQKDN---GSPCAKNPCENGGSCLENSRGDYQCFC 331

Query: 160 HEINGIKYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGGG 339
              +  ++C  E       +    LC+ +    +G CV +G  G    E  K      G 
Sbjct: 332 DPNHSGQHCETE-------VNIHPLCQTNPCLNNGACVVIGGSGALTCECPKGY---AGA 381

Query: 340 PCVRDW-QCYSRYCFNNG----KLRYCAKDGKQFGEVGI-----SGPCRKNKDCSSGRC 486
            C  D  +C S+ C NNG    ++   + D    G  G         C KN   + GRC
Sbjct: 382 RCEVDTDECASQPCQNNGSCIDRINGFSCDCSGTGYTGAFCQTNVDECDKNPCLNGGRC 440
>sp|Q69Z28|ATS16_MOUSE ADAMTS-16 precursor (A disintegrin and metalloproteinase with
           thrombospondin motifs 16) (ADAM-TS 16) (ADAM-TS16)
          Length = 1222

 Score = 38.5 bits (88), Expect = 0.015
 Identities = 35/126 (27%), Positives = 45/126 (35%), Gaps = 5/126 (3%)
 Frame = +1

Query: 124 CKNDF---FCASNFCHEINGIKYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGF 294
           C  DF    C + +CH I       ++ + K     EG LC     C  G CVK G+EG 
Sbjct: 527 CMLDFRKDICKALWCHRIG------RKCETKFMPAAEGTLCGQDMWCRGGQCVKYGDEG- 579

Query: 295 CAKESMKSEKVGGGGPCVRDWQCYSRYCFN--NGKLRYCAKDGKQFGEVGISGPCRKNKD 468
                   +   G       W   SR C    + + R C       G     G  R  K 
Sbjct: 580 -------PKPTHGHWSDWSPWSPCSRTCGGGISHRDRLCTNPRPSHGGKFCQGSTRTLKL 632

Query: 469 CSSGRC 486
           C+S RC
Sbjct: 633 CNSQRC 638
>sp|Q8TE57|ATS16_HUMAN ADAMTS-16 precursor (A disintegrin and metalloproteinase with
           thrombospondin motifs 16) (ADAM-TS 16) (ADAM-TS16)
          Length = 1224

 Score = 38.1 bits (87), Expect = 0.019
 Identities = 33/126 (26%), Positives = 46/126 (36%), Gaps = 5/126 (3%)
 Frame = +1

Query: 124 CKNDF---FCASNFCHEINGIKYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGF 294
           C  DF    C + +CH I       ++ + K     EG +C +   C  G CVK G+EG 
Sbjct: 529 CMLDFKKDICKALWCHRIG------RKCETKFMPAAEGTICGHDMWCRGGQCVKYGDEG- 581

Query: 295 CAKESMKSEKVGGGGPCVRDWQCYSRYCFN--NGKLRYCAKDGKQFGEVGISGPCRKNKD 468
                   +   G       W   SR C    + + R C       G     G  R  K 
Sbjct: 582 -------PKPTHGHWSDWSSWSPCSRTCGGGVSHRSRLCTNPKPSHGGKFCEGSTRTLKL 634

Query: 469 CSSGRC 486
           C+S +C
Sbjct: 635 CNSQKC 640
>sp|P22105|TENX_HUMAN Tenascin-X precursor (TN-X) (Hexabrachion-like protein)
          Length = 4289

 Score = 37.4 bits (85), Expect = 0.033
 Identities = 48/206 (23%), Positives = 71/206 (34%), Gaps = 26/206 (12%)
 Frame = +1

Query: 7   GICKNNGQCYSRYCY------KVDAETKICSKDSKFKGKVGGGGPCKNDF-FCASNFCHE 165
           G C   G+C    C         D  ++ C +D +      G G C+N    C + +  E
Sbjct: 406 GDCNQRGRCEDGRCVCWPGYTGTDCGSRACPRDCR------GRGRCENGVCVCNAGYSGE 459

Query: 166 INGIKYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGG--- 336
             G++ C  + +      G G        C+ GY    G +  C   +   +  G G   
Sbjct: 460 DCGVRSCPGDCR------GRGRCESGRCMCWPGY---TGRD--CGTRACPGDCRGRGRCV 508

Query: 337 -GPCVRD-----WQCYSRYCFNNGKLRYCAKDGKQFGEVGISGP----------CRKNKD 468
            G CV +       C SR C  + +     +DG    + G SG           CR    
Sbjct: 509 DGRCVCNPGFTGEDCGSRRCPGDCRGHGLCEDGVCVCDAGYSGEDCSTRSCPGGCRGRGQ 568

Query: 469 CSSGRCVSMKTKLQDGTESKVKVCQN 546
           C  GRCV        G +  V+ C N
Sbjct: 569 CLDGRCVCEDG--YSGEDCGVRQCPN 592

 Score = 36.2 bits (82), Expect = 0.074
 Identities = 39/183 (21%), Positives = 63/183 (34%), Gaps = 24/183 (13%)
 Frame = +1

Query: 13  CKNNGQCYSRYCY------KVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEING 174
           C   G+C    C         D  T+ C +D + +G+   G     +  C + +  +  G
Sbjct: 346 CGEGGRCVDGRCVCWPGYTGEDCSTRTCPRDCRGRGRCEDG-----ECICDTGYSGDDCG 400

Query: 175 IKYCVQESKQK-----------IGYIGEGAL-------CKNHHSCYSGYCVKVGNEGFCA 300
           ++ C  +  Q+            GY G           C+    C +G CV   N G+  
Sbjct: 401 VRSCPGDCNQRGRCEDGRCVCWPGYTGTDCGSRACPRDCRGRGRCENGVCVC--NAGYSG 458

Query: 301 KESMKSEKVGGGGPCVRDWQCYSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSG 480
           ++        G   C  D +   R C +   + +    G+  G     G CR    C  G
Sbjct: 459 EDC-------GVRSCPGDCRGRGR-CESGRCMCWPGYTGRDCGTRACPGDCRGRGRCVDG 510

Query: 481 RCV 489
           RCV
Sbjct: 511 RCV 513
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 77,537,041
Number of Sequences: 369166
Number of extensions: 1805006
Number of successful extensions: 5464
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4925
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5413
length of database: 68,354,980
effective HSP length: 106
effective length of database: 48,773,070
effective search space used: 5218718490
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)