Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= Dr_sW_018_E02
(642 letters)
Database: Non-redundant SwissProt sequences
184,735 sequences; 68,354,980 total letters
Score E
Sequences producing significant alignments: (bits) Value
sp|P11219|AGI_ORYSA Lectin precursor (Agglutinin) [Contains... 50 6e-06
sp|P10968|AGI1_WHEAT Agglutinin isolectin 1 precursor (WGA1... 48 2e-05
sp|P02876|AGI2_WHEAT Agglutinin isolectin 2 precursor (WGA2... 44 4e-04
sp|P10969|AGI3_WHEAT Agglutinin isolectin 3 precursor (WGA3) 44 4e-04
sp|P10039|TENA_CHICK Tenascin precursor (TN) (Hexabrachion)... 43 8e-04
sp|P15312|AGI_HORVU Root-specific lectin precursor 43 8e-04
sp|P10040|CRB_DROME Crumbs protein precursor (95F) 41 0.003
sp|Q69Z28|ATS16_MOUSE ADAMTS-16 precursor (A disintegrin an... 39 0.015
sp|Q8TE57|ATS16_HUMAN ADAMTS-16 precursor (A disintegrin an... 38 0.019
sp|P22105|TENX_HUMAN Tenascin-X precursor (TN-X) (Hexabrach... 37 0.033
>sp|P11219|AGI_ORYSA Lectin precursor (Agglutinin) [Contains: Lectin 10 kDa peptide;
Lectin 8 kDa peptide]
Length = 227
Score = 49.7 bits (117), Expect = 6e-06
Identities = 44/156 (28%), Positives = 57/156 (36%)
Frame = +1
Query: 1 GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIK 180
GG C NN QC S+Y Y C S++ G GPC+ D C N E+
Sbjct: 79 GGATCSNN-QCCSQYGY--------CGFGSEYCGSGCQNGPCRADIKCGRNANGELCPNN 129
Query: 181 YCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGGGPCVRDWQ 360
C + GY G G S +C G C E ++ GG +
Sbjct: 130 MCCSQ----WGYCGLG----------SEFCGNGCQSGACCPEKRCGKQAGGD-------K 168
Query: 361 CYSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKD 468
C + +C + G YC G G SG C K D
Sbjct: 169 CPNNFCCSAG--GYCGLGGNYCGSGCQSGGCYKGGD 202
Score = 39.7 bits (91), Expect = 0.007
Identities = 45/155 (29%), Positives = 58/155 (37%), Gaps = 18/155 (11%)
Frame = +1
Query: 61 AETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIKYCVQE-------SKQKIGYI 219
A + + +++ GK G C ++ C S F + G YC S Q+ G
Sbjct: 19 AAAAVAATNAQTCGKQNDGMICPHNL-CCSQFGYCGLGRDYCGTGCQSGACCSSQRCGSQ 77
Query: 220 GEGALCKNHHSCYS-GYCVKVGNEGFCAKESMKSEKVGGG---GPCVRDWQC----YSRY 375
G GA C N+ C GYC GF SE G G GPC D +C
Sbjct: 78 GGGATCSNNQCCSQYGYC------GF------GSEYCGSGCQNGPCRADIKCGRNANGEL 125
Query: 376 CFNN---GKLRYCAKDGKQFGEVGISGPCRKNKDC 471
C NN + YC + G SG C K C
Sbjct: 126 CPNNMCCSQWGYCGLGSEFCGNGCQSGACCPEKRC 160
>sp|P10968|AGI1_WHEAT Agglutinin isolectin 1 precursor (WGA1) (Isolectin A)
Length = 212
Score = 47.8 bits (112), Expect = 2e-05
Identities = 51/169 (30%), Positives = 60/169 (35%), Gaps = 7/169 (4%)
Frame = +1
Query: 1 GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIK 180
GG C NN QC S+Y Y C +++ G GGPC+ D C S
Sbjct: 77 GGATCTNN-QCCSQYGY--------CGFGAEYCGAGCQGGPCRADIKCGSQ--------- 118
Query: 181 YCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGG---GPCVR 351
G LC N+ C GFC + SE GGG G C
Sbjct: 119 -------------AGGKLCPNNLCC--------SQWGFC---GLGSEFCGGGCQSGACST 154
Query: 352 DWQC----YSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRC 486
D C R C NN YC ++G GI GP C SG C
Sbjct: 155 DKPCGKDAGGRVCTNN----YCC---SKWGSCGI-GPGYCGAGCQSGGC 195
>sp|P02876|AGI2_WHEAT Agglutinin isolectin 2 precursor (WGA2) (Isolectin D)
Length = 213
Score = 43.9 bits (102), Expect = 4e-04
Identities = 49/169 (28%), Positives = 59/169 (34%), Gaps = 7/169 (4%)
Frame = +1
Query: 1 GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIK 180
GG C NN C S+Y + C +++ G GGPC+ D C S
Sbjct: 78 GGATCPNN-HCCSQYGH--------CGFGAEYCGAGCQGGPCRADIKCGSQ--------- 119
Query: 181 YCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGG---GPCVR 351
G LC N+ C GFC + SE GGG G C
Sbjct: 120 -------------SGGKLCPNNLCC--------SQWGFC---GLGSEFCGGGCQSGACST 155
Query: 352 DWQC----YSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRC 486
D C R C NN YC ++G GI GP C SG C
Sbjct: 156 DKPCGKDAGGRVCTNN----YCC---SKWGSCGI-GPGYCGAGCQSGGC 196
Score = 36.2 bits (82), Expect = 0.074
Identities = 38/139 (27%), Positives = 56/139 (40%), Gaps = 15/139 (10%)
Frame = +1
Query: 100 GKVGGGGPCKNDFFCASNFCHEINGIKYCVQE-------SKQKIGYIGEGALCKNHHSCY 258
G+ G C N+ C S + + G YC + + ++ G GA C N+H C
Sbjct: 31 GEQGSNMECPNNL-CCSQYGYCGMGGDYCGKGCQNGACWTSKRCGSQAGGATCPNNHCCS 89
Query: 259 S-GYCVKVGNEGFCAKESMKSEKVGGGGPCVRDWQCYS----RYCFNN---GKLRYCAKD 414
G+C GF A+ + GGPC D +C S + C NN + +C
Sbjct: 90 QYGHC------GFGAEYCGAGCQ---GGPCRADIKCGSQSGGKLCPNNLCCSQWGFCGLG 140
Query: 415 GKQFGEVGISGPCRKNKDC 471
+ G SG C +K C
Sbjct: 141 SEFCGGGCQSGACSTDKPC 159
>sp|P10969|AGI3_WHEAT Agglutinin isolectin 3 precursor (WGA3)
Length = 186
Score = 43.9 bits (102), Expect = 4e-04
Identities = 48/163 (29%), Positives = 62/163 (38%), Gaps = 1/163 (0%)
Frame = +1
Query: 1 GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIK 180
GG C NN C S+Y + C +++ G GGPC+ D C S
Sbjct: 51 GGKTCPNN-HCCSQYGH--------CGFGAEYCGAGCQGGPCRADIKCGSQ--------- 92
Query: 181 YCVQESKQKIGYIGEGALCKNHHSCYS-GYCVKVGNEGFCAKESMKSEKVGGGGPCVRDW 357
G LC N+ C GYC +G+E FC E ++ PC +D
Sbjct: 93 -------------AGGKLCPNNLCCSQWGYC-GLGSE-FCG-EGCQNGACSTDKPCGKD- 135
Query: 358 QCYSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRC 486
R C NN YC ++G GI GP C SG C
Sbjct: 136 -AGGRVCTNN----YCC---SKWGSCGI-GPGYCGAGCQSGGC 169
>sp|P10039|TENA_CHICK Tenascin precursor (TN) (Hexabrachion) (Cytotactin) (Neuronectin)
(GMEM) (JI) (Miotendinous antigen)
(Glioma-associated-extracellular matrix antigen) (GP
150-225)
Length = 1808
Score = 42.7 bits (99), Expect = 8e-04
Identities = 49/174 (28%), Positives = 69/174 (39%), Gaps = 15/174 (8%)
Frame = +1
Query: 13 CKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHE----INGIK 180
C N G+C + C +C D F G+ G C ND CH +NG
Sbjct: 413 CHNRGRCINGQC--------VC--DEGFIGEDCGELRCPND-------CHNRGRCVNGQC 455
Query: 181 YCVQESKQKIGYIGE--GAL-----CKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGGG 339
C + G+IGE G L C +H C +G CV +EG+ ++ G
Sbjct: 456 ECHE------GFIGEDCGELRCPNDCNSHGRCVNGQCVC--DEGYTGEDC-------GEL 500
Query: 340 PCVRDWQCYSRYCFNNGKLRYCAKD----GKQFGEVGISGPCRKNKDCSSGRCV 489
C D C++R G+ C D G+ GE+ C ++ C GRCV
Sbjct: 501 RCPND--CHNRGRCVEGR---CVCDNGFMGEDCGELSCPNDCHQHGRCVDGRCV 549
Score = 40.8 bits (94), Expect = 0.003
Identities = 44/167 (26%), Positives = 65/167 (38%), Gaps = 8/167 (4%)
Frame = +1
Query: 13 CKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIKYCVQ 192
C N G+C C +C D + G+ G C ND F ING +C +
Sbjct: 289 CHNRGRCVDNEC--------VC--DEGYTGEDCGELICPNDCFDRGRC---INGTCFCEE 335
Query: 193 ESKQKIGYIGE--GAL-----CKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGGGPCVR 351
GY GE G L C + C +G CV +EGF + + C +
Sbjct: 336 ------GYTGEDCGELTCPNNCNGNGRCENGLCVC--HEGFVGDDCSQKR-------CPK 380
Query: 352 DWQCYSR-YCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRCV 489
D C +R +C + + + G+ GE+ C C +G+CV
Sbjct: 381 D--CNNRGHCVDGRCVCHEGYLGEDCGELRCPNDCHNRGRCINGQCV 425
Score = 30.4 bits (67), Expect = 4.0
Identities = 52/205 (25%), Positives = 71/205 (34%), Gaps = 27/205 (13%)
Frame = +1
Query: 13 CKNNGQCYSRYCY------KVDAETKICSKDSKFKGK-VGGGGPCKNDFF---CASNFCH 162
C N G C C D C D +GK V G C + C C
Sbjct: 196 CLNRGLCVRGKCICEEGFTGEDCSQAACPSDCNDQGKCVDGVCVCFEGYTGPDCGEELCP 255
Query: 163 EINGIK------YCVQESKQKIGYIGEGA---LCKNHHSCYS-GYCVK---VGNEGFCAK 303
GI CV G+ GE LC N+ C++ G CV V +EG+ +
Sbjct: 256 HGCGIHGRCVGGRCVCHE----GFTGEDCNEPLCPNN--CHNRGRCVDNECVCDEGYTGE 309
Query: 304 ESMKSEKVGGGGPCVRDWQCYSRYCFNNGKLRYCAKD--GKQFGEVGISGPCRKNKDCSS 477
+ G + C+ R NG +C + G+ GE+ C N C +
Sbjct: 310 DC---------GELICPNDCFDRGRCINGTC-FCEEGYTGEDCGELTCPNNCNGNGRCEN 359
Query: 478 GRCVSMKTKLQDGTESK--VKVCQN 546
G CV + + D K K C N
Sbjct: 360 GLCVCHEGFVGDDCSQKRCPKDCNN 384
>sp|P15312|AGI_HORVU Root-specific lectin precursor
Length = 212
Score = 42.7 bits (99), Expect = 8e-04
Identities = 48/167 (28%), Positives = 61/167 (36%), Gaps = 5/167 (2%)
Frame = +1
Query: 1 GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIK 180
GG C NN C S++ Y C +++ G GGPC+ D C S
Sbjct: 77 GGKTCPNN-HCCSQWGY--------CGFGAEYCGAGCQGGPCRADIKCGSQ--------- 118
Query: 181 YCVQESKQKIGYIGEGALCKNHHSCYS-GYCVKVGNEGFCAKESMKSEKVGGGGPCVRDW 357
G LC N+ C GYC +G+E FC + GG C D
Sbjct: 119 -------------AGGKLCPNNLCCSQWGYC-GLGSE-FCGEGCQ-------GGACSTDK 156
Query: 358 QC----YSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRC 486
C + C NN YC ++G GI GP C SG C
Sbjct: 157 PCGKAAGGKVCTNN----YCC---SKWGSCGI-GPGYCGAGCQSGGC 195
>sp|P10040|CRB_DROME Crumbs protein precursor (95F)
Length = 2146
Score = 40.8 bits (94), Expect = 0.003
Identities = 46/179 (25%), Positives = 65/179 (36%), Gaps = 17/179 (9%)
Frame = +1
Query: 1 GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASN-------FC 159
G G C ++ + Y C K C KD+ G PC+N C N FC
Sbjct: 276 GHGTCSSSPEGYECRC-TARYSGKNCQKDN---GSPCAKNPCENGGSCLENSRGDYQCFC 331
Query: 160 HEINGIKYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGGG 339
+ ++C E + LC+ + +G CV +G G E K G
Sbjct: 332 DPNHSGQHCETE-------VNIHPLCQTNPCLNNGACVVIGGSGALTCECPKGY---AGA 381
Query: 340 PCVRDW-QCYSRYCFNNG----KLRYCAKDGKQFGEVGI-----SGPCRKNKDCSSGRC 486
C D +C S+ C NNG ++ + D G G C KN + GRC
Sbjct: 382 RCEVDTDECASQPCQNNGSCIDRINGFSCDCSGTGYTGAFCQTNVDECDKNPCLNGGRC 440
>sp|Q69Z28|ATS16_MOUSE ADAMTS-16 precursor (A disintegrin and metalloproteinase with
thrombospondin motifs 16) (ADAM-TS 16) (ADAM-TS16)
Length = 1222
Score = 38.5 bits (88), Expect = 0.015
Identities = 35/126 (27%), Positives = 45/126 (35%), Gaps = 5/126 (3%)
Frame = +1
Query: 124 CKNDF---FCASNFCHEINGIKYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGF 294
C DF C + +CH I ++ + K EG LC C G CVK G+EG
Sbjct: 527 CMLDFRKDICKALWCHRIG------RKCETKFMPAAEGTLCGQDMWCRGGQCVKYGDEG- 579
Query: 295 CAKESMKSEKVGGGGPCVRDWQCYSRYCFN--NGKLRYCAKDGKQFGEVGISGPCRKNKD 468
+ G W SR C + + R C G G R K
Sbjct: 580 -------PKPTHGHWSDWSPWSPCSRTCGGGISHRDRLCTNPRPSHGGKFCQGSTRTLKL 632
Query: 469 CSSGRC 486
C+S RC
Sbjct: 633 CNSQRC 638
>sp|Q8TE57|ATS16_HUMAN ADAMTS-16 precursor (A disintegrin and metalloproteinase with
thrombospondin motifs 16) (ADAM-TS 16) (ADAM-TS16)
Length = 1224
Score = 38.1 bits (87), Expect = 0.019
Identities = 33/126 (26%), Positives = 46/126 (36%), Gaps = 5/126 (3%)
Frame = +1
Query: 124 CKNDF---FCASNFCHEINGIKYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGF 294
C DF C + +CH I ++ + K EG +C + C G CVK G+EG
Sbjct: 529 CMLDFKKDICKALWCHRIG------RKCETKFMPAAEGTICGHDMWCRGGQCVKYGDEG- 581
Query: 295 CAKESMKSEKVGGGGPCVRDWQCYSRYCFN--NGKLRYCAKDGKQFGEVGISGPCRKNKD 468
+ G W SR C + + R C G G R K
Sbjct: 582 -------PKPTHGHWSDWSSWSPCSRTCGGGVSHRSRLCTNPKPSHGGKFCEGSTRTLKL 634
Query: 469 CSSGRC 486
C+S +C
Sbjct: 635 CNSQKC 640
>sp|P22105|TENX_HUMAN Tenascin-X precursor (TN-X) (Hexabrachion-like protein)
Length = 4289
Score = 37.4 bits (85), Expect = 0.033
Identities = 48/206 (23%), Positives = 71/206 (34%), Gaps = 26/206 (12%)
Frame = +1
Query: 7 GICKNNGQCYSRYCY------KVDAETKICSKDSKFKGKVGGGGPCKNDF-FCASNFCHE 165
G C G+C C D ++ C +D + G G C+N C + + E
Sbjct: 406 GDCNQRGRCEDGRCVCWPGYTGTDCGSRACPRDCR------GRGRCENGVCVCNAGYSGE 459
Query: 166 INGIKYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGG--- 336
G++ C + + G G C+ GY G + C + + G G
Sbjct: 460 DCGVRSCPGDCR------GRGRCESGRCMCWPGY---TGRD--CGTRACPGDCRGRGRCV 508
Query: 337 -GPCVRD-----WQCYSRYCFNNGKLRYCAKDGKQFGEVGISGP----------CRKNKD 468
G CV + C SR C + + +DG + G SG CR
Sbjct: 509 DGRCVCNPGFTGEDCGSRRCPGDCRGHGLCEDGVCVCDAGYSGEDCSTRSCPGGCRGRGQ 568
Query: 469 CSSGRCVSMKTKLQDGTESKVKVCQN 546
C GRCV G + V+ C N
Sbjct: 569 CLDGRCVCEDG--YSGEDCGVRQCPN 592
Score = 36.2 bits (82), Expect = 0.074
Identities = 39/183 (21%), Positives = 63/183 (34%), Gaps = 24/183 (13%)
Frame = +1
Query: 13 CKNNGQCYSRYCY------KVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEING 174
C G+C C D T+ C +D + +G+ G + C + + + G
Sbjct: 346 CGEGGRCVDGRCVCWPGYTGEDCSTRTCPRDCRGRGRCEDG-----ECICDTGYSGDDCG 400
Query: 175 IKYCVQESKQK-----------IGYIGEGAL-------CKNHHSCYSGYCVKVGNEGFCA 300
++ C + Q+ GY G C+ C +G CV N G+
Sbjct: 401 VRSCPGDCNQRGRCEDGRCVCWPGYTGTDCGSRACPRDCRGRGRCENGVCVC--NAGYSG 458
Query: 301 KESMKSEKVGGGGPCVRDWQCYSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSG 480
++ G C D + R C + + + G+ G G CR C G
Sbjct: 459 EDC-------GVRSCPGDCRGRGR-CESGRCMCWPGYTGRDCGTRACPGDCRGRGRCVDG 510
Query: 481 RCV 489
RCV
Sbjct: 511 RCV 513
Database: Non-redundant SwissProt sequences
Posted date: Dec 6, 2005 7:40 AM
Number of letters in database: 68,354,980
Number of sequences in database: 184,735
Database: swissprot.01
Posted date: Dec 6, 2005 8:18 AM
Number of letters in database: 66,202,850
Number of sequences in database: 184,431
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 77,537,041
Number of Sequences: 369166
Number of extensions: 1805006
Number of successful extensions: 5464
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4925
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5413
length of database: 68,354,980
effective HSP length: 106
effective length of database: 48,773,070
effective search space used: 5218718490
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)