Planarian EST Database


Dr_sW_010_H21

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_010_H21
         (771 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|O75095|EGFL3_HUMAN  Multiple EGF-like-domain protein 3 pr...    38   0.027
sp|O88281|EGFL3_RAT  Multiple EGF-like-domain protein 3 prec...    35   0.17 
sp|Q3LI77|KR134_HUMAN  Keratin-associated protein 13-4             33   0.86 
sp|Q8VHS2|CRUM1_MOUSE  Crumbs protein homolog 1 precursor          33   0.86 
sp|Q80V70|EGFL3_MOUSE  Multiple EGF-like-domain protein 3          31   3.3  
sp|O18735|ERBB2_CANFA  Receptor tyrosine-protein kinase erbB...    31   4.3  
sp|Q60553|ERBB2_MESAU  Receptor tyrosine-protein kinase erbB...    31   4.3  
sp|P34853|NU4M_APILI  NADH-ubiquinone oxidoreductase chain 4...    30   7.3  
sp|P43509|CPR5_CAEEL  Cathepsin B-like cysteine proteinase 5...    30   7.3  
sp|P59222|SREC2_MOUSE  Scavenger receptor class F member 2 p...    30   9.5  
>sp|O75095|EGFL3_HUMAN Multiple EGF-like-domain protein 3 precursor (Multiple epidermal
           growth factor-like domains 6)
          Length = 1229

 Score = 38.1 bits (87), Expect = 0.027
 Identities = 25/84 (29%), Positives = 35/84 (41%), Gaps = 1/84 (1%)
 Frame = +3

Query: 48  GGNCEHPSVNFTFG-GVYQVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTS 224
           G +CE       +G G  ++C  PA    ++ +P TG+  C  GF          RC   
Sbjct: 661 GEDCEADCPEGRWGLGCQEIC--PACQHAARCDPETGACLCLPGFVGS-------RCQDV 711

Query: 225 CSGWWFWRHCDTRCSYFTTYWCYP 296
           C   W+   C TRCS      C+P
Sbjct: 712 CPAGWYGPSCQTRCSCANDGHCHP 735

 Score = 32.3 bits (72), Expect = 1.5
 Identities = 25/82 (30%), Positives = 32/82 (39%), Gaps = 7/82 (8%)
 Frame = +3

Query: 141  NPLTGSNSCPKGFSSVKLHSGLIRCDTSCSGWWFWRHCDTRCSYFTTYWCYPNAVGENHN 320
            +P TG   CP G++  K       C + C   WF   C  RCS      C P A   +  
Sbjct: 907  DPHTGRCLCPAGWTGDK-------CQSPCLRGWFGEACAQRCS------CPPGAACHHVT 953

Query: 321  -------GYQFGGIKQGNCPIG 365
                   G+   G +QG CP G
Sbjct: 954  GACRCPPGFTGSGCEQG-CPPG 974
>sp|O88281|EGFL3_RAT Multiple EGF-like-domain protein 3 precursor (Multiple epidermal
           growth factor-like domains 6)
          Length = 1574

 Score = 35.4 bits (80), Expect = 0.17
 Identities = 21/69 (30%), Positives = 28/69 (40%)
 Frame = +3

Query: 90  GVYQVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTSCSGWWFWRHCDTRCS 269
           G  ++C  PA    +  NP TG+  C  GF          RC  +CS  W+   C  RC+
Sbjct: 785 GCQEIC--PACEHGASCNPETGTCLCLPGFVGS-------RCQDTCSAGWYGTGCQIRCA 835

Query: 270 YFTTYWCYP 296
                 C P
Sbjct: 836 CANDGHCDP 844

 Score = 31.6 bits (70), Expect = 2.5
 Identities = 24/83 (28%), Positives = 32/83 (38%), Gaps = 1/83 (1%)
 Frame = +3

Query: 48   GGNCEHPSVNFTFG-GVYQVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTS 224
            G  C+   V+ TFG    + C    G S   V   TG+  CP G+           C+ +
Sbjct: 1116 GDKCQSSCVSGTFGVHCEEHCACRKGASCHHV---TGACFCPPGWRGP-------HCEQA 1165

Query: 225  CSGWWFWRHCDTRCSYFTTYWCY 293
            C   WF   C  RC   T   C+
Sbjct: 1166 CPRGWFGEACAQRCLCPTNASCH 1188

 Score = 31.2 bits (69), Expect = 3.3
 Identities = 26/79 (32%), Positives = 29/79 (36%), Gaps = 4/79 (5%)
 Frame = +3

Query: 141 NPLTGSNSCPKGFSSVKLHSGLIRCDTSCSGWWFWRHCDTRCSYFTTYWCYPNAVGENHN 320
           NP  GS SC  GF          RC   C   +F   C  RC+      C P   GE   
Sbjct: 669 NPKDGSCSCKAGFQGE-------RCQAECESGFFGPGCRHRCTCQPGVACDP-VSGECRT 720

Query: 321 ----GYQFGGIKQGNCPIG 365
               GYQ     Q  CP+G
Sbjct: 721 QCPPGYQGEDCGQ-ECPVG 738

 Score = 30.8 bits (68), Expect = 4.3
 Identities = 32/109 (29%), Positives = 40/109 (36%), Gaps = 3/109 (2%)
 Frame = +3

Query: 48   GGNCEHPSVNFTFG-GVYQVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTS 224
            G +CE       FG    Q C  P   S   V   TG   CP GF+        + C+ +
Sbjct: 1159 GPHCEQACPRGWFGEACAQRCLCPTNASCHHV---TGECRCPPGFTG-------LSCEQA 1208

Query: 225  CSGWWFWRHCDTRCSYFTTYW-CYP-NAVGENHNGYQFGGIKQGNCPIG 365
            C    F + C+  C      W C P + V     GY   G  Q  CP G
Sbjct: 1209 CQPGTFGKDCEHLCQCPGETWACDPASGVCTCAAGYHGTGCLQ-RCPSG 1256
>sp|Q3LI77|KR134_HUMAN Keratin-associated protein 13-4
          Length = 160

 Score = 33.1 bits (74), Expect = 0.86
 Identities = 27/96 (28%), Positives = 39/96 (40%)
 Frame = +3

Query: 99  QVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTSCSGWWFWRHCDTRCSYFT 278
           + C  PA    S   P T    CP              C T+CSG   +R    R   + 
Sbjct: 53  KTCWEPASCQKSCYRPRTSILCCP--------------CQTTCSGSLGFRSSSCRSQGYG 98

Query: 279 TYWCYPNAVGENHNGYQFGGIKQGNCPIGYISLKVG 386
           +  CY  ++G   +G++F  +K G C  G+ SL  G
Sbjct: 99  SRCCY--SLGNGSSGFRF--LKYGGC--GFPSLSYG 128
>sp|Q8VHS2|CRUM1_MOUSE Crumbs protein homolog 1 precursor
          Length = 1405

 Score = 33.1 bits (74), Expect = 0.86
 Identities = 46/189 (24%), Positives = 72/189 (38%), Gaps = 15/189 (7%)
 Frame = +3

Query: 102 VCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTS-----CSGWWFWRHCDTRC 266
           +C  P  P YS +N  T +NSC     ++  H G  R D       C   +  R C+T  
Sbjct: 94  LCQCP--PGYSGLNCETATNSCG---GNLCQHGGTCRKDPEHPVCICPPGYAGRFCETDH 148

Query: 267 SYFTTYWCYPNAVGENH-NGYQFGGIKQGNCPIGYISLKVGLSVEICVTIDNDPNNPFAI 443
           +   +  C+  A+ ++  NGY         C  GY      L V+ CV+ D   N    +
Sbjct: 149 NECASSPCHNGAMCQDGINGYSC------FCVPGYQGRHCDLEVDECVS-DPCKNEAVCL 201

Query: 444 KFGGLFSC-------SVGNPLAKEFVKGKPKLSSSKMMDLV-YWQKTCAPGYI-SHIASI 596
              G ++C        V   L  +  + +P L  +   D    +   CAPG++  H    
Sbjct: 202 NEIGRYTCVCPQEFSGVNCELEIDECRSQPCLHGATCQDAPGGYSCDCAPGFLGEHCELS 261

Query: 597 EQGCQISYC 623
              C+   C
Sbjct: 262 VNECESQPC 270
>sp|Q80V70|EGFL3_MOUSE Multiple EGF-like-domain protein 3
          Length = 656

 Score = 31.2 bits (69), Expect = 3.3
 Identities = 28/108 (25%), Positives = 41/108 (37%), Gaps = 1/108 (0%)
 Frame = +3

Query: 48  GGNCEHPSVNFTFG-GVYQVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTS 224
           G  C+ P V+  FG    + C    G +   V   TG+  CP G+           C+ +
Sbjct: 198 GDKCQSPCVSGMFGVHCEEHCACRKGATCHHV---TGACLCPPGWRGS-------HCEQA 247

Query: 225 CSGWWFWRHCDTRCSYFTTYWCYPNAVGENHNGYQFGGIKQGNCPIGY 368
           C   WF   C  RC       C P A   + +G       + +CP G+
Sbjct: 248 CPRGWFGEACAQRCH------CPPGASCHHVSG-------ECHCPPGF 282
>sp|O18735|ERBB2_CANFA Receptor tyrosine-protein kinase erbB-2 precursor (p185erbB2)
           (C-erbB-2)
          Length = 1259

 Score = 30.8 bits (68), Expect = 4.3
 Identities = 31/119 (26%), Positives = 42/119 (35%), Gaps = 1/119 (0%)
 Frame = +3

Query: 51  GNCEHPSVNFTFGGVYQVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTSCS 230
           G+C+  +     GG  + C GP                C  G +  K HS  + C     
Sbjct: 210 GDCQSLTRTVCAGGCAR-CKGPQPTDCCH-------EQCAAGCTGPK-HSDCLACLHFNH 260

Query: 231 GWWFWRHCDTRCSYFT-TYWCYPNAVGENHNGYQFGGIKQGNCPIGYISLKVGLSVEIC 404
                 HC    +Y T T+   PN  G     Y FG     +CP  Y+S  VG    +C
Sbjct: 261 SGICELHCPALVTYNTDTFESMPNPEGR----YTFGASCVTSCPYNYLSTDVGSCTLVC 315
>sp|Q60553|ERBB2_MESAU Receptor tyrosine-protein kinase erbB-2 precursor (p185erbB2)
           (C-erbB-2) (NEU proto-oncogene)
          Length = 1254

 Score = 30.8 bits (68), Expect = 4.3
 Identities = 18/53 (33%), Positives = 23/53 (43%), Gaps = 1/53 (1%)
 Frame = +3

Query: 249 HCDTRCSYFT-TYWCYPNAVGENHNGYQFGGIKQGNCPIGYISLKVGLSVEIC 404
           HC    +Y T T+   PN  G     Y FG      CP  Y+S +VG    +C
Sbjct: 267 HCPALVTYNTDTFESMPNPEGR----YTFGASCVTTCPYNYLSTEVGSCTLVC 315
>sp|P34853|NU4M_APILI NADH-ubiquinone oxidoreductase chain 4 (NADH dehydrogenase subunit
           4)
          Length = 447

 Score = 30.0 bits (66), Expect = 7.3
 Identities = 13/34 (38%), Positives = 17/34 (50%)
 Frame = -2

Query: 446 FNCKWIVWIVVYCHTYFNTQSNL**NITNWAISL 345
           FN  WI WI ++C+  FN  S     +T W   L
Sbjct: 49  FNLNWIDWIYIFCNLSFNMYSYGLIMLTLWIFGL 82
>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 precursor (Cysteine
           protease-related 5)
          Length = 344

 Score = 30.0 bits (66), Expect = 7.3
 Identities = 31/126 (24%), Positives = 42/126 (33%), Gaps = 12/126 (9%)
 Frame = +3

Query: 18  PNFNSDANV----DGGNCEHPSVNFTFGGVYQVCDGPAGPSYSQVNPLTGSNSCPKGFSS 185
           PN  S  N+    D G+C      + F     + D     S   VN L  S         
Sbjct: 93  PNCMSINNIRDQSDCGSC------WAFAAAEAISDRTCIASNGAVNTLLSSEDL------ 140

Query: 186 VKLHSGLIRCDTSCSG--------WWFWRHCDTRCSYFTTYWCYPNAVGENHNGYQFGGI 341
           +   +G+  C   C G        WW      T  SY T + C P ++     G    G+
Sbjct: 141 LSCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAP--CGETVNGV 198

Query: 342 KQGNCP 359
           K   CP
Sbjct: 199 KWPACP 204
>sp|P59222|SREC2_MOUSE Scavenger receptor class F member 2 precursor (Scavenger receptor
           expressed by endothelial cells 2 protein) (SREC-II)
          Length = 833

 Score = 29.6 bits (65), Expect = 9.5
 Identities = 20/74 (27%), Positives = 30/74 (40%)
 Frame = +3

Query: 102 VCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTSCSGWWFWRHCDTRCSYFTT 281
           VC+G +  S ++V    G   C  G+           CDT C   ++   C  RCS    
Sbjct: 71  VCEGNSTCSENEVCVRPGECRCRHGYFGAN-------CDTKCPRQFWGPDCKERCS---- 119

Query: 282 YWCYPNAVGENHNG 323
             C+P+   E+  G
Sbjct: 120 --CHPHGQCEDVTG 131
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 93,367,256
Number of Sequences: 369166
Number of extensions: 2104298
Number of successful extensions: 4756
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4549
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4750
length of database: 68,354,980
effective HSP length: 108
effective length of database: 48,403,600
effective search space used: 7163732800
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
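
Note on the statistics above: the bit scores in this report can be reproduced from the raw scores shown in parentheses (for example 87 in "Score = 38.1 bits (87)") using the gapped Lambda and K values, and the effective database length follows from the database size and the effective HSP length. The Python sketch below is only an illustration under those assumptions. The function names are hypothetical and not part of BLAST, and the E-value function is the plain Karlin-Altschul estimate, which lands close to, but not exactly on, the Expect values printed in the report.

```python
import math

# Values copied from the report's statistics block.
LAMBDA_GAPPED = 0.267        # "Gapped" Lambda
K_GAPPED = 0.0410            # "Gapped" K
SEARCH_SPACE = 7163732800    # "effective search space used"


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_db_length(total_letters, num_sequences, hsp_length):
    """Database letters minus one effective HSP length per sequence."""
    return total_letters - num_sequences * hsp_length


def expect_value(raw_score, search_space=SEARCH_SPACE,
                 lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Plain Karlin-Altschul estimate: K * search_space * exp(-lambda*S).

    Approximate only: the printed Lambda and K are rounded, and the
    report's Expect values do not match this estimate exactly.
    """
    return k * search_space * math.exp(-lam * raw_score)


if __name__ == "__main__":
    # Raw scores taken from the alignments in this report.
    for raw in (87, 80, 65):
        print(raw, round(bit_score(raw), 1))
    print(effective_db_length(68354980, 184735, 108))
```

With the report's numbers, raw scores 87, 80 and 65 come out at about 38.1, 35.4 and 29.6 bits, matching the alignments above, and the effective database length evaluates to 48,403,600, as printed.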