Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= Dr_sW_021_O18
(793 letters)
Database: Non-redundant SwissProt sequences
184,735 sequences; 68,354,980 total letters
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|O75095|EGFL3_HUMAN Multiple EGF-like-domain protein 3 pr...    38   0.028
sp|O88281|EGFL3_RAT Multiple EGF-like-domain protein 3 prec...    35   0.18
sp|Q8VHS2|CRUM1_MOUSE Crumbs protein homolog 1 precursor          34   0.52
sp|Q3LI77|KR134_HUMAN Keratin-associated protein 13-4             33   0.89
sp|Q80V70|EGFL3_MOUSE Multiple EGF-like-domain protein 3          32   2.0
sp|O18735|ERBB2_CANFA Receptor tyrosine-protein kinase erbB...    31   4.4
sp|Q60553|ERBB2_MESAU Receptor tyrosine-protein kinase erbB...    31   4.4
sp|P34853|NU4M_APILI NADH-ubiquinone oxidoreductase chain 4...    30   7.5
sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5...    30   7.5
sp|P59222|SREC2_MOUSE Scavenger receptor class F member 2 p...    30   9.8
>sp|O75095|EGFL3_HUMAN Multiple EGF-like-domain protein 3 precursor (Multiple epidermal
growth factor-like domains 6)
Length = 1229
Score = 38.1 bits (87), Expect = 0.028
Identities = 25/84 (29%), Positives = 35/84 (41%), Gaps = 1/84 (1%)
Frame = +2
Query: 77 GGNCEHPSVNFTFG-GVYQVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTS 253
G +CE +G G ++C PA ++ +P TG+ C GF RC
Sbjct: 661 GEDCEADCPEGRWGLGCQEIC--PACQHAARCDPETGACLCLPGFVGS-------RCQDV 711
Query: 254 CSGWWFWRHCDTRCSYFTTYWCYP 325
C W+ C TRCS C+P
Sbjct: 712 CPAGWYGPSCQTRCSCANDGHCHP 735
Score = 32.3 bits (72), Expect = 1.5
Identities = 25/82 (30%), Positives = 32/82 (39%), Gaps = 7/82 (8%)
Frame = +2
Query: 170 NPLTGSNSCPKGFSSVKLHSGLIRCDTSCSGWWFWRHCDTRCSYFTTYWCYPNAVGENHN 349
+P TG CP G++ K C + C WF C RCS C P A +
Sbjct: 907 DPHTGRCLCPAGWTGDK-------CQSPCLRGWFGEACAQRCS------CPPGAACHHVT 953
Query: 350 -------GYQFGGIKQGNCPIG 394
G+ G +QG CP G
Sbjct: 954 GACRCPPGFTGSGCEQG-CPPG 974
>sp|O88281|EGFL3_RAT Multiple EGF-like-domain protein 3 precursor (Multiple epidermal
growth factor-like domains 6)
Length = 1574
Score = 35.4 bits (80), Expect = 0.18
Identities = 21/69 (30%), Positives = 28/69 (40%)
Frame = +2
Query: 119 GVYQVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTSCSGWWFWRHCDTRCS 298
G ++C PA + NP TG+ C GF RC +CS W+ C RC+
Sbjct: 785 GCQEIC--PACEHGASCNPETGTCLCLPGFVGS-------RCQDTCSAGWYGTGCQIRCA 835
Query: 299 YFTTYWCYP 325
C P
Sbjct: 836 CANDGHCDP 844
Score = 31.6 bits (70), Expect = 2.6
Identities = 24/83 (28%), Positives = 32/83 (38%), Gaps = 1/83 (1%)
Frame = +2
Query: 77 GGNCEHPSVNFTFG-GVYQVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTS 253
G C+ V+ TFG + C G S V TG+ CP G+ C+ +
Sbjct: 1116 GDKCQSSCVSGTFGVHCEEHCACRKGASCHHV---TGACFCPPGWRGP-------HCEQA 1165
Query: 254 CSGWWFWRHCDTRCSYFTTYWCY 322
C WF C RC T C+
Sbjct: 1166 CPRGWFGEACAQRCLCPTNASCH 1188
Score = 31.2 bits (69), Expect = 3.4
Identities = 26/79 (32%), Positives = 29/79 (36%), Gaps = 4/79 (5%)
Frame = +2
Query: 170 NPLTGSNSCPKGFSSVKLHSGLIRCDTSCSGWWFWRHCDTRCSYFTTYWCYPNAVGENHN 349
NP GS SC GF RC C +F C RC+ C P GE
Sbjct: 669 NPKDGSCSCKAGFQGE-------RCQAECESGFFGPGCRHRCTCQPGVACDP-VSGECRT 720
Query: 350 ----GYQFGGIKQGNCPIG 394
GYQ Q CP+G
Sbjct: 721 QCPPGYQGEDCGQ-ECPVG 738
Score = 30.8 bits (68), Expect = 4.4
Identities = 32/109 (29%), Positives = 40/109 (36%), Gaps = 3/109 (2%)
Frame = +2
Query: 77 GGNCEHPSVNFTFG-GVYQVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTS 253
G +CE FG Q C P S V TG CP GF+ + C+ +
Sbjct: 1159 GPHCEQACPRGWFGEACAQRCLCPTNASCHHV---TGECRCPPGFTG-------LSCEQA 1208
Query: 254 CSGWWFWRHCDTRCSYFTTYW-CYP-NAVGENHNGYQFGGIKQGNCPIG 394
C F + C+ C W C P + V GY G Q CP G
Sbjct: 1209 CQPGTFGKDCEHLCQCPGETWACDPASGVCTCAAGYHGTGCLQ-RCPSG 1256
>sp|Q8VHS2|CRUM1_MOUSE Crumbs protein homolog 1 precursor
Length = 1405
Score = 33.9 bits (76), Expect = 0.52
Identities = 55/226 (24%), Positives = 83/226 (36%), Gaps = 16/226 (7%)
Frame = +2
Query: 23 KGCLDFTSPNFNSDANVDGGNCEHPSV-NFTFGGVYQVCDGPAGPSYSQVNPLTGSNSCP 199
K C D P F+S + P NF +C P P YS +N T +NSC
Sbjct: 64 KDCEDLKDPCFSSPCQGIATCVKIPGEGNF-------LCQCP--PGYSGLNCETATNSCG 114
Query: 200 KGFSSVKLHSGLIRCDTS-----CSGWWFWRHCDTRCSYFTTYWCYPNAVGENH-NGYQF 361
++ H G R D C + R C+T + + C+ A+ ++ NGY
Sbjct: 115 ---GNLCQHGGTCRKDPEHPVCICPPGYAGRFCETDHNECASSPCHNGAMCQDGINGYSC 171
Query: 362 GGIKQGNCPIGYISLKVGLSVEICVTIDNDPNNPFAIKFGGLFSC-------SVGNPLAK 520
C GY L V+ CV+ D N + G ++C V L
Sbjct: 172 ------FCVPGYQGRHCDLEVDECVS-DPCKNEAVCLNEIGRYTCVCPQEFSGVNCELEI 224
Query: 521 EFVKGKPKLSSSKMMDLV-YWQKTCAPGYI-SHIASIEQGCQISYC 652
+ + +P L + D + CAPG++ H C+ C
Sbjct: 225 DECRSQPCLHGATCQDAPGGYSCDCAPGFLGEHCELSVNECESQPC 270
>sp|Q3LI77|KR134_HUMAN Keratin-associated protein 13-4
Length = 160
Score = 33.1 bits (74), Expect = 0.89
Identities = 27/96 (28%), Positives = 39/96 (40%)
Frame = +2
Query: 128 QVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTSCSGWWFWRHCDTRCSYFT 307
+ C PA S P T CP C T+CSG +R R +
Sbjct: 53 KTCWEPASCQKSCYRPRTSILCCP--------------CQTTCSGSLGFRSSSCRSQGYG 98
Query: 308 TYWCYPNAVGENHNGYQFGGIKQGNCPIGYISLKVG 415
+ CY ++G +G++F +K G C G+ SL G
Sbjct: 99 SRCCY--SLGNGSSGFRF--LKYGGC--GFPSLSYG 128
>sp|Q80V70|EGFL3_MOUSE Multiple EGF-like-domain protein 3
Length = 656
Score = 32.0 bits (71), Expect = 2.0
Identities = 31/125 (24%), Positives = 46/125 (36%), Gaps = 1/125 (0%)
Frame = +2
Query: 26 GCLDFTSPNFNSDANVDGGNCEHPSVNFTFG-GVYQVCDGPAGPSYSQVNPLTGSNSCPK 202
G D + + A G C+ P V+ FG + C G + V TG+ CP
Sbjct: 181 GTCDRLTGHCRCPAGWTGDKCQSPCVSGMFGVHCEEHCACRKGATCHHV---TGACLCPP 237
Query: 203 GFSSVKLHSGLIRCDTSCSGWWFWRHCDTRCSYFTTYWCYPNAVGENHNGYQFGGIKQGN 382
G+ C+ +C WF C RC C P A + +G + +
Sbjct: 238 GWRGS-------HCEQACPRGWFGEACAQRCH------CPPGASCHHVSG-------ECH 277
Query: 383 CPIGY 397
CP G+
Sbjct: 278 CPPGF 282
>sp|O18735|ERBB2_CANFA Receptor tyrosine-protein kinase erbB-2 precursor (p185erbB2)
(C-erbB-2)
Length = 1259
Score = 30.8 bits (68), Expect = 4.4
Identities = 31/119 (26%), Positives = 42/119 (35%), Gaps = 1/119 (0%)
Frame = +2
Query: 80 GNCEHPSVNFTFGGVYQVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTSCS 259
G+C+ + GG + C GP C G + K HS + C
Sbjct: 210 GDCQSLTRTVCAGGCAR-CKGPQPTDCCH-------EQCAAGCTGPK-HSDCLACLHFNH 260
Query: 260 GWWFWRHCDTRCSYFT-TYWCYPNAVGENHNGYQFGGIKQGNCPIGYISLKVGLSVEIC 433
HC +Y T T+ PN G Y FG +CP Y+S VG +C
Sbjct: 261 SGICELHCPALVTYNTDTFESMPNPEGR----YTFGASCVTSCPYNYLSTDVGSCTLVC 315
>sp|Q60553|ERBB2_MESAU Receptor tyrosine-protein kinase erbB-2 precursor (p185erbB2)
(C-erbB-2) (NEU proto-oncogene)
Length = 1254
Score = 30.8 bits (68), Expect = 4.4
Identities = 18/53 (33%), Positives = 23/53 (43%), Gaps = 1/53 (1%)
Frame = +2
Query: 278 HCDTRCSYFT-TYWCYPNAVGENHNGYQFGGIKQGNCPIGYISLKVGLSVEIC 433
HC +Y T T+ PN G Y FG CP Y+S +VG +C
Sbjct: 267 HCPALVTYNTDTFESMPNPEGR----YTFGASCVTTCPYNYLSTEVGSCTLVC 315
>sp|P34853|NU4M_APILI NADH-ubiquinone oxidoreductase chain 4 (NADH dehydrogenase subunit
4)
Length = 447
Score = 30.0 bits (66), Expect = 7.5
Identities = 13/34 (38%), Positives = 17/34 (50%)
Frame = -1
Query: 475 FNCKWIVWIVVYCHTYFNTQSNL**NITNWAISL 374
FN WI WI ++C+ FN S +T W L
Sbjct: 49 FNLNWIDWIYIFCNLSFNMYSYGLIMLTLWIFGL 82
>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 precursor (Cysteine
protease-related 5)
Length = 344
Score = 30.0 bits (66), Expect = 7.5
Identities = 31/126 (24%), Positives = 42/126 (33%), Gaps = 12/126 (9%)
Frame = +2
Query: 47 PNFNSDANV----DGGNCEHPSVNFTFGGVYQVCDGPAGPSYSQVNPLTGSNSCPKGFSS 214
PN S N+ D G+C + F + D S VN L S
Sbjct: 93 PNCMSINNIRDQSDCGSC------WAFAAAEAISDRTCIASNGAVNTLLSSEDL------ 140
Query: 215 VKLHSGLIRCDTSCSG--------WWFWRHCDTRCSYFTTYWCYPNAVGENHNGYQFGGI 370
+ +G+ C C G WW T SY T + C P ++ G G+
Sbjct: 141 LSCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAP--CGETVNGV 198
Query: 371 KQGNCP 388
K CP
Sbjct: 199 KWPACP 204
>sp|P59222|SREC2_MOUSE Scavenger receptor class F member 2 precursor (Scavenger receptor
expressed by endothelial cells 2 protein) (SREC-II)
Length = 833
Score = 29.6 bits (65), Expect = 9.8
Identities = 20/74 (27%), Positives = 30/74 (40%)
Frame = +2
Query: 131 VCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTSCSGWWFWRHCDTRCSYFTT 310
VC+G + S ++V G C G+ CDT C ++ C RCS
Sbjct: 71 VCEGNSTCSENEVCVRPGECRCRHGYFGAN-------CDTKCPRQFWGPDCKERCS---- 119
Query: 311 YWCYPNAVGENHNG 352
C+P+ E+ G
Sbjct: 120 --CHPHGQCEDVTG 131
Database: Non-redundant SwissProt sequences
Posted date: Dec 6, 2005 7:40 AM
Number of letters in database: 68,354,980
Number of sequences in database: 184,735
Database: swissprot.01
Posted date: Dec 6, 2005 8:18 AM
Number of letters in database: 66,202,850
Number of sequences in database: 184,431
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 96,537,708
Number of Sequences: 369166
Number of extensions: 2183571
Number of successful extensions: 4922
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4701
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4915
length of database: 68,354,980
effective HSP length: 109
effective length of database: 48,218,865
effective search space used: 7425705210
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
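
Note on the statistics above: the bit scores in this report are consistent with the standard Karlin-Altschul conversion from raw scores, S' = (Lambda * S - ln K) / ln 2. The following is a minimal Python sketch of that conversion, not part of the BLAST output itself. It assumes that the gapped Lambda and K apply to the alignment scores and the ungapped pair applies to the S1 cutoff. The function and constant names are illustrative only.

```python
import math

def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lam * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)

# Values copied from the statistics footer of this report.
GAPPED_LAMBDA, GAPPED_K = 0.267, 0.0410      # "Gapped" Lambda / K
UNGAPPED_LAMBDA, UNGAPPED_K = 0.318, 0.134   # ungapped Lambda / K

# Top hit (EGFL3_HUMAN): raw score 87, reported as 38.1 bits.
top_hit_bits = bit_score(87, GAPPED_LAMBDA, GAPPED_K)

# S1 cutoff: raw score 41, reported as 21.7 bits
# (assumed here to use the ungapped parameters).
s1_bits = bit_score(41, UNGAPPED_LAMBDA, UNGAPPED_K)

print(f"top hit: {top_hit_bits:.1f} bits")
print(f"S1:      {s1_bits:.1f} bits")
```

The same function can be applied to any "Score = ... bits (N)" line in the report by passing the raw score N in parentheses. Small differences in the last digit are possible because Lambda and K are printed with limited precision.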