Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= Dr_sW_018_E01
(681 letters)
Database: Non-redundant SwissProt sequences
184,735 sequences; 68,354,980 total letters
Score E
Sequences producing significant alignments: (bits) Value
sp|P11219|AGI_ORYSA Lectin precursor (Agglutinin) [Contains... 52 1e-06
sp|P10968|AGI1_WHEAT Agglutinin isolectin 1 precursor (WGA1... 48 3e-05
sp|P02876|AGI2_WHEAT Agglutinin isolectin 2 precursor (WGA2... 44 4e-04
sp|P10969|AGI3_WHEAT Agglutinin isolectin 3 precursor (WGA3) 44 4e-04
sp|P10039|TENA_CHICK Tenascin precursor (TN) (Hexabrachion)... 43 9e-04
sp|P15312|AGI_HORVU Root-specific lectin precursor 43 9e-04
sp|P10040|CRB_DROME Crumbs protein precursor (95F) 41 0.003
sp|Q69Z28|ATS16_MOUSE ADAMTS-16 precursor (A disintegrin an... 39 0.016
sp|Q8TE57|ATS16_HUMAN ADAMTS-16 precursor (A disintegrin an... 38 0.021
sp|P22105|TENX_HUMAN Tenascin-X precursor (TN-X) (Hexabrach... 37 0.037
>sp|P11219|AGI_ORYSA Lectin precursor (Agglutinin) [Contains: Lectin 10 kDa peptide;
Lectin 8 kDa peptide]
Length = 227
Score = 52.0 bits (123), Expect = 1e-06
Identities = 45/157 (28%), Positives = 58/157 (36%)
Frame = +3
Query: 33 GGGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGI 212
GGG C NN QC S+Y Y C S++ G GPC+ D C N E+
Sbjct: 78 GGGATCSNN-QCCSQYGY--------CGFGSEYCGSGCQNGPCRADIKCGRNANGELCPN 128
Query: 213 KYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGGGPCVRDW 392
C + GY G G S +C G C E ++ GG
Sbjct: 129 NMCCSQ----WGYCGLG----------SEFCGNGCQSGACCPEKRCGKQAGGD------- 167
Query: 393 QCYSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKD 503
+C + +C + G YC G G SG C K D
Sbjct: 168 KCPNNFCCSAG--GYCGLGGNYCGSGCQSGGCYKGGD 202
Score = 39.7 bits (91), Expect = 0.007
Identities = 45/155 (29%), Positives = 58/155 (37%), Gaps = 18/155 (11%)
Frame = +3
Query: 96 AETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIKYCVQE-------SKQKIGYI 254
A + + +++ GK G C ++ C S F + G YC S Q+ G
Sbjct: 19 AAAAVAATNAQTCGKQNDGMICPHNL-CCSQFGYCGLGRDYCGTGCQSGACCSSQRCGSQ 77
Query: 255 GEGALCKNHHSCYS-GYCVKVGNEGFCAKESMKSEKVGGG---GPCVRDWQC----YSRY 410
G GA C N+ C GYC GF SE G G GPC D +C
Sbjct: 78 GGGATCSNNQCCSQYGYC------GF------GSEYCGSGCQNGPCRADIKCGRNANGEL 125
Query: 411 CFNN---GKLRYCAKDGKQFGEVGISGPCRKNKDC 506
C NN + YC + G SG C K C
Sbjct: 126 CPNNMCCSQWGYCGLGSEFCGNGCQSGACCPEKRC 160
>sp|P10968|AGI1_WHEAT Agglutinin isolectin 1 precursor (WGA1) (Isolectin A)
Length = 212
Score = 47.8 bits (112), Expect = 3e-05
Identities = 51/169 (30%), Positives = 60/169 (35%), Gaps = 7/169 (4%)
Frame = +3
Query: 36 GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIK 215
GG C NN QC S+Y Y C +++ G GGPC+ D C S
Sbjct: 77 GGATCTNN-QCCSQYGY--------CGFGAEYCGAGCQGGPCRADIKCGSQ--------- 118
Query: 216 YCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGG---GPCVR 386
G LC N+ C GFC + SE GGG G C
Sbjct: 119 -------------AGGKLCPNNLCC--------SQWGFC---GLGSEFCGGGCQSGACST 154
Query: 387 DWQC----YSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRC 521
D C R C NN YC ++G GI GP C SG C
Sbjct: 155 DKPCGKDAGGRVCTNN----YCC---SKWGSCGI-GPGYCGAGCQSGGC 195
>sp|P02876|AGI2_WHEAT Agglutinin isolectin 2 precursor (WGA2) (Isolectin D)
Length = 213
Score = 43.9 bits (102), Expect = 4e-04
Identities = 49/169 (28%), Positives = 59/169 (34%), Gaps = 7/169 (4%)
Frame = +3
Query: 36 GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIK 215
GG C NN C S+Y + C +++ G GGPC+ D C S
Sbjct: 78 GGATCPNN-HCCSQYGH--------CGFGAEYCGAGCQGGPCRADIKCGSQ--------- 119
Query: 216 YCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGG---GPCVR 386
G LC N+ C GFC + SE GGG G C
Sbjct: 120 -------------SGGKLCPNNLCC--------SQWGFC---GLGSEFCGGGCQSGACST 155
Query: 387 DWQC----YSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRC 521
D C R C NN YC ++G GI GP C SG C
Sbjct: 156 DKPCGKDAGGRVCTNN----YCC---SKWGSCGI-GPGYCGAGCQSGGC 196
Score = 36.2 bits (82), Expect = 0.082
Identities = 38/139 (27%), Positives = 56/139 (40%), Gaps = 15/139 (10%)
Frame = +3
Query: 135 GKVGGGGPCKNDFFCASNFCHEINGIKYCVQE-------SKQKIGYIGEGALCKNHHSCY 293
G+ G C N+ C S + + G YC + + ++ G GA C N+H C
Sbjct: 31 GEQGSNMECPNNL-CCSQYGYCGMGGDYCGKGCQNGACWTSKRCGSQAGGATCPNNHCCS 89
Query: 294 S-GYCVKVGNEGFCAKESMKSEKVGGGGPCVRDWQCYS----RYCFNN---GKLRYCAKD 449
G+C GF A+ + GGPC D +C S + C NN + +C
Sbjct: 90 QYGHC------GFGAEYCGAGCQ---GGPCRADIKCGSQSGGKLCPNNLCCSQWGFCGLG 140
Query: 450 GKQFGEVGISGPCRKNKDC 506
+ G SG C +K C
Sbjct: 141 SEFCGGGCQSGACSTDKPC 159
>sp|P10969|AGI3_WHEAT Agglutinin isolectin 3 precursor (WGA3)
Length = 186
Score = 43.9 bits (102), Expect = 4e-04
Identities = 48/163 (29%), Positives = 62/163 (38%), Gaps = 1/163 (0%)
Frame = +3
Query: 36 GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIK 215
GG C NN C S+Y + C +++ G GGPC+ D C S
Sbjct: 51 GGKTCPNN-HCCSQYGH--------CGFGAEYCGAGCQGGPCRADIKCGSQ--------- 92
Query: 216 YCVQESKQKIGYIGEGALCKNHHSCYS-GYCVKVGNEGFCAKESMKSEKVGGGGPCVRDW 392
G LC N+ C GYC +G+E FC E ++ PC +D
Sbjct: 93 -------------AGGKLCPNNLCCSQWGYC-GLGSE-FCG-EGCQNGACSTDKPCGKD- 135
Query: 393 QCYSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRC 521
R C NN YC ++G GI GP C SG C
Sbjct: 136 -AGGRVCTNN----YCC---SKWGSCGI-GPGYCGAGCQSGGC 169
>sp|P10039|TENA_CHICK Tenascin precursor (TN) (Hexabrachion) (Cytotactin) (Neuronectin)
(GMEM) (JI) (Miotendinous antigen)
(Glioma-associated-extracellular matrix antigen) (GP
150-225)
Length = 1808
Score = 42.7 bits (99), Expect = 9e-04
Identities = 49/174 (28%), Positives = 69/174 (39%), Gaps = 15/174 (8%)
Frame = +3
Query: 48 CKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHE----INGIK 215
C N G+C + C +C D F G+ G C ND CH +NG
Sbjct: 413 CHNRGRCINGQC--------VC--DEGFIGEDCGELRCPND-------CHNRGRCVNGQC 455
Query: 216 YCVQESKQKIGYIGE--GAL-----CKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGGG 374
C + G+IGE G L C +H C +G CV +EG+ ++ G
Sbjct: 456 ECHE------GFIGEDCGELRCPNDCNSHGRCVNGQCVC--DEGYTGEDC-------GEL 500
Query: 375 PCVRDWQCYSRYCFNNGKLRYCAKD----GKQFGEVGISGPCRKNKDCSSGRCV 524
C D C++R G+ C D G+ GE+ C ++ C GRCV
Sbjct: 501 RCPND--CHNRGRCVEGR---CVCDNGFMGEDCGELSCPNDCHQHGRCVDGRCV 549
Score = 40.8 bits (94), Expect = 0.003
Identities = 44/167 (26%), Positives = 65/167 (38%), Gaps = 8/167 (4%)
Frame = +3
Query: 48 CKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIKYCVQ 227
C N G+C C +C D + G+ G C ND F ING +C +
Sbjct: 289 CHNRGRCVDNEC--------VC--DEGYTGEDCGELICPNDCFDRGRC---INGTCFCEE 335
Query: 228 ESKQKIGYIGE--GAL-----CKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGGGPCVR 386
GY GE G L C + C +G CV +EGF + + C +
Sbjct: 336 ------GYTGEDCGELTCPNNCNGNGRCENGLCVC--HEGFVGDDCSQKR-------CPK 380
Query: 387 DWQCYSR-YCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRCV 524
D C +R +C + + + G+ GE+ C C +G+CV
Sbjct: 381 D--CNNRGHCVDGRCVCHEGYLGEDCGELRCPNDCHNRGRCINGQCV 425
Score = 30.4 bits (67), Expect = 4.5
Identities = 52/205 (25%), Positives = 71/205 (34%), Gaps = 27/205 (13%)
Frame = +3
Query: 48 CKNNGQCYSRYCY------KVDAETKICSKDSKFKGK-VGGGGPCKNDFF---CASNFCH 197
C N G C C D C D +GK V G C + C C
Sbjct: 196 CLNRGLCVRGKCICEEGFTGEDCSQAACPSDCNDQGKCVDGVCVCFEGYTGPDCGEELCP 255
Query: 198 EINGIK------YCVQESKQKIGYIGEGA---LCKNHHSCYS-GYCVK---VGNEGFCAK 338
GI CV G+ GE LC N+ C++ G CV V +EG+ +
Sbjct: 256 HGCGIHGRCVGGRCVCHE----GFTGEDCNEPLCPNN--CHNRGRCVDNECVCDEGYTGE 309
Query: 339 ESMKSEKVGGGGPCVRDWQCYSRYCFNNGKLRYCAKD--GKQFGEVGISGPCRKNKDCSS 512
+ G + C+ R NG +C + G+ GE+ C N C +
Sbjct: 310 DC---------GELICPNDCFDRGRCINGTC-FCEEGYTGEDCGELTCPNNCNGNGRCEN 359
Query: 513 GRCVSMKTKLQDGTESK--VKVCQN 581
G CV + + D K K C N
Sbjct: 360 GLCVCHEGFVGDDCSQKRCPKDCNN 384
>sp|P15312|AGI_HORVU Root-specific lectin precursor
Length = 212
Score = 42.7 bits (99), Expect = 9e-04
Identities = 48/167 (28%), Positives = 61/167 (36%), Gaps = 5/167 (2%)
Frame = +3
Query: 36 GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEINGIK 215
GG C NN C S++ Y C +++ G GGPC+ D C S
Sbjct: 77 GGKTCPNN-HCCSQWGY--------CGFGAEYCGAGCQGGPCRADIKCGSQ--------- 118
Query: 216 YCVQESKQKIGYIGEGALCKNHHSCYS-GYCVKVGNEGFCAKESMKSEKVGGGGPCVRDW 392
G LC N+ C GYC +G+E FC + GG C D
Sbjct: 119 -------------AGGKLCPNNLCCSQWGYC-GLGSE-FCGEGCQ-------GGACSTDK 156
Query: 393 QC----YSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRC 521
C + C NN YC ++G GI GP C SG C
Sbjct: 157 PCGKAAGGKVCTNN----YCC---SKWGSCGI-GPGYCGAGCQSGGC 195
>sp|P10040|CRB_DROME Crumbs protein precursor (95F)
Length = 2146
Score = 40.8 bits (94), Expect = 0.003
Identities = 46/179 (25%), Positives = 65/179 (36%), Gaps = 17/179 (9%)
Frame = +3
Query: 36 GGGICKNNGQCYSRYCYKVDAETKICSKDSKFKGKVGGGGPCKNDFFCASN-------FC 194
G G C ++ + Y C K C KD+ G PC+N C N FC
Sbjct: 276 GHGTCSSSPEGYECRC-TARYSGKNCQKDN---GSPCAKNPCENGGSCLENSRGDYQCFC 331
Query: 195 HEINGIKYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGGG 374
+ ++C E + LC+ + +G CV +G G E K G
Sbjct: 332 DPNHSGQHCETE-------VNIHPLCQTNPCLNNGACVVIGGSGALTCECPKGY---AGA 381
Query: 375 PCVRDW-QCYSRYCFNNG----KLRYCAKDGKQFGEVGI-----SGPCRKNKDCSSGRC 521
C D +C S+ C NNG ++ + D G G C KN + GRC
Sbjct: 382 RCEVDTDECASQPCQNNGSCIDRINGFSCDCSGTGYTGAFCQTNVDECDKNPCLNGGRC 440
>sp|Q69Z28|ATS16_MOUSE ADAMTS-16 precursor (A disintegrin and metalloproteinase with
thrombospondin motifs 16) (ADAM-TS 16) (ADAM-TS16)
Length = 1222
Score = 38.5 bits (88), Expect = 0.016
Identities = 35/126 (27%), Positives = 45/126 (35%), Gaps = 5/126 (3%)
Frame = +3
Query: 159 CKNDF---FCASNFCHEINGIKYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGF 329
C DF C + +CH I ++ + K EG LC C G CVK G+EG
Sbjct: 527 CMLDFRKDICKALWCHRIG------RKCETKFMPAAEGTLCGQDMWCRGGQCVKYGDEG- 579
Query: 330 CAKESMKSEKVGGGGPCVRDWQCYSRYCFN--NGKLRYCAKDGKQFGEVGISGPCRKNKD 503
+ G W SR C + + R C G G R K
Sbjct: 580 -------PKPTHGHWSDWSPWSPCSRTCGGGISHRDRLCTNPRPSHGGKFCQGSTRTLKL 632
Query: 504 CSSGRC 521
C+S RC
Sbjct: 633 CNSQRC 638
>sp|Q8TE57|ATS16_HUMAN ADAMTS-16 precursor (A disintegrin and metalloproteinase with
thrombospondin motifs 16) (ADAM-TS 16) (ADAM-TS16)
Length = 1224
Score = 38.1 bits (87), Expect = 0.021
Identities = 33/126 (26%), Positives = 46/126 (36%), Gaps = 5/126 (3%)
Frame = +3
Query: 159 CKNDF---FCASNFCHEINGIKYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGF 329
C DF C + +CH I ++ + K EG +C + C G CVK G+EG
Sbjct: 529 CMLDFKKDICKALWCHRIG------RKCETKFMPAAEGTICGHDMWCRGGQCVKYGDEG- 581
Query: 330 CAKESMKSEKVGGGGPCVRDWQCYSRYCFN--NGKLRYCAKDGKQFGEVGISGPCRKNKD 503
+ G W SR C + + R C G G R K
Sbjct: 582 -------PKPTHGHWSDWSSWSPCSRTCGGGVSHRSRLCTNPKPSHGGKFCEGSTRTLKL 634
Query: 504 CSSGRC 521
C+S +C
Sbjct: 635 CNSQKC 640
>sp|P22105|TENX_HUMAN Tenascin-X precursor (TN-X) (Hexabrachion-like protein)
Length = 4289
Score = 37.4 bits (85), Expect = 0.037
Identities = 48/206 (23%), Positives = 71/206 (34%), Gaps = 26/206 (12%)
Frame = +3
Query: 42 GICKNNGQCYSRYCY------KVDAETKICSKDSKFKGKVGGGGPCKNDF-FCASNFCHE 200
G C G+C C D ++ C +D + G G C+N C + + E
Sbjct: 406 GDCNQRGRCEDGRCVCWPGYTGTDCGSRACPRDCR------GRGRCENGVCVCNAGYSGE 459
Query: 201 INGIKYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGG--- 371
G++ C + + G G C+ GY G + C + + G G
Sbjct: 460 DCGVRSCPGDCR------GRGRCESGRCMCWPGY---TGRD--CGTRACPGDCRGRGRCV 508
Query: 372 -GPCVRD-----WQCYSRYCFNNGKLRYCAKDGKQFGEVGISGP----------CRKNKD 503
G CV + C SR C + + +DG + G SG CR
Sbjct: 509 DGRCVCNPGFTGEDCGSRRCPGDCRGHGLCEDGVCVCDAGYSGEDCSTRSCPGGCRGRGQ 568
Query: 504 CSSGRCVSMKTKLQDGTESKVKVCQN 581
C GRCV G + V+ C N
Sbjct: 569 CLDGRCVCEDG--YSGEDCGVRQCPN 592
Score = 36.6 bits (83), Expect = 0.063
Identities = 39/165 (23%), Positives = 59/165 (35%), Gaps = 1/165 (0%)
Frame = +3
Query: 33 GGGGICKNNGQCYSRYCYK-VDAETKICSKDSKFKGKVGGGGPCKNDFFCASNFCHEING 209
G GG C + G+C Y D T+ C +D + +G+ G + C + + + G
Sbjct: 347 GEGGRCVD-GRCVCWPGYTGEDCSTRTCPRDCRGRGRCEDG-----ECICDTGYSGDDCG 400
Query: 210 IKYCVQESKQKIGYIGEGALCKNHHSCYSGYCVKVGNEGFCAKESMKSEKVGGGGPCVRD 389
++ C + C C G CV G+ + G C RD
Sbjct: 401 VRSCPGD-------------CNQRGRCEDGRCVCW--PGYTGTDC-------GSRACPRD 438
Query: 390 WQCYSRYCFNNGKLRYCAKDGKQFGEVGISGPCRKNKDCSSGRCV 524
+ R C N + G+ G G CR C SGRC+
Sbjct: 439 CRGRGR-CENGVCVCNAGYSGEDCGVRSCPGDCRGRGRCESGRCM 482
Database: Non-redundant SwissProt sequences
Posted date: Dec 6, 2005 7:40 AM
Number of letters in database: 68,354,980
Number of sequences in database: 184,735
Database: swissprot.01
Posted date: Dec 6, 2005 8:18 AM
Number of letters in database: 66,202,850
Number of sequences in database: 184,431
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 81,852,694
Number of Sequences: 369166
Number of extensions: 1929922
Number of successful extensions: 5730
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 5164
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5677
length of database: 68,354,980
effective HSP length: 107
effective length of database: 48,588,335
effective search space used: 5782011865
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)