Planarian EST Database


Dr_sW_024_A24

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_024_A24
         (553 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|O75095|EGFL3_HUMAN  Multiple EGF-like-domain protein 3 pr...    38   0.014
sp|O88281|EGFL3_RAT  Multiple EGF-like-domain protein 3 prec...    35   0.094
sp|Q3LI77|KR134_HUMAN  Keratin-associated protein 13-4             33   0.46 
sp|Q80V70|EGFL3_MOUSE  Multiple EGF-like-domain protein 3          31   1.8  
sp|O18735|ERBB2_CANFA  Receptor tyrosine-protein kinase erbB...    31   2.3  
sp|Q60553|ERBB2_MESAU  Receptor tyrosine-protein kinase erbB...    31   2.3  
sp|Q8VHS2|CRUM1_MOUSE  Crumbs protein homolog 1 precursor          30   3.0  
sp|P34853|NU4M_APILI  NADH-ubiquinone oxidoreductase chain 4...    30   3.9  
sp|P06494|ERBB2_RAT  Receptor tyrosine-protein kinase erbB-2...    30   3.9  
sp|P59222|SREC2_MOUSE  Scavenger receptor class F member 2 p...    30   5.1  
>sp|O75095|EGFL3_HUMAN Multiple EGF-like-domain protein 3 precursor (Multiple epidermal
           growth factor-like domains 6)
          Length = 1229

 Score = 38.1 bits (87), Expect = 0.014
 Identities = 25/84 (29%), Positives = 35/84 (41%), Gaps = 1/84 (1%)
 Frame = +1

Query: 1   GGNCEHPSVNFTFG-GVYQVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTS 177
           G +CE       +G G  ++C  PA    ++ +P TG+  C  GF          RC   
Sbjct: 661 GEDCEADCPEGRWGLGCQEIC--PACQHAARCDPETGACLCLPGFVGS-------RCQDV 711

Query: 178 CSGWWFWRHCDTRCSYFTTYWCYP 249
           C   W+   C TRCS      C+P
Sbjct: 712 CPAGWYGPSCQTRCSCANDGHCHP 735

 Score = 32.3 bits (72), Expect = 0.79
 Identities = 25/82 (30%), Positives = 32/82 (39%), Gaps = 7/82 (8%)
 Frame = +1

Query: 94   NPLTGSNSCPKGFSSVKLHSGLIRCDTSCSGWWFWRHCDTRCSYFTTYWCYPNAVGENHN 273
            +P TG   CP G++  K       C + C   WF   C  RCS      C P A   +  
Sbjct: 907  DPHTGRCLCPAGWTGDK-------CQSPCLRGWFGEACAQRCS------CPPGAACHHVT 953

Query: 274  -------GYQFGGIKQGNCPIG 318
                   G+   G +QG CP G
Sbjct: 954  GACRCPPGFTGSGCEQG-CPPG 974
>sp|O88281|EGFL3_RAT Multiple EGF-like-domain protein 3 precursor (Multiple epidermal
           growth factor-like domains 6)
          Length = 1574

 Score = 35.4 bits (80), Expect = 0.094
 Identities = 21/69 (30%), Positives = 28/69 (40%)
 Frame = +1

Query: 43  GVYQVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTSCSGWWFWRHCDTRCS 222
           G  ++C  PA    +  NP TG+  C  GF          RC  +CS  W+   C  RC+
Sbjct: 785 GCQEIC--PACEHGASCNPETGTCLCLPGFVGS-------RCQDTCSAGWYGTGCQIRCA 835

Query: 223 YFTTYWCYP 249
                 C P
Sbjct: 836 CANDGHCDP 844

 Score = 31.6 bits (70), Expect = 1.4
 Identities = 24/83 (28%), Positives = 32/83 (38%), Gaps = 1/83 (1%)
 Frame = +1

Query: 1    GGNCEHPSVNFTFG-GVYQVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTS 177
            G  C+   V+ TFG    + C    G S   V   TG+  CP G+           C+ +
Sbjct: 1116 GDKCQSSCVSGTFGVHCEEHCACRKGASCHHV---TGACFCPPGWRGP-------HCEQA 1165

Query: 178  CSGWWFWRHCDTRCSYFTTYWCY 246
            C   WF   C  RC   T   C+
Sbjct: 1166 CPRGWFGEACAQRCLCPTNASCH 1188

 Score = 31.2 bits (69), Expect = 1.8
 Identities = 26/79 (32%), Positives = 29/79 (36%), Gaps = 4/79 (5%)
 Frame = +1

Query: 94  NPLTGSNSCPKGFSSVKLHSGLIRCDTSCSGWWFWRHCDTRCSYFTTYWCYPNAVGENHN 273
           NP  GS SC  GF          RC   C   +F   C  RC+      C P   GE   
Sbjct: 669 NPKDGSCSCKAGFQGE-------RCQAECESGFFGPGCRHRCTCQPGVACDP-VSGECRT 720

Query: 274 ----GYQFGGIKQGNCPIG 318
               GYQ     Q  CP+G
Sbjct: 721 QCPPGYQGEDCGQ-ECPVG 738

 Score = 30.8 bits (68), Expect = 2.3
 Identities = 32/109 (29%), Positives = 40/109 (36%), Gaps = 3/109 (2%)
 Frame = +1

Query: 1    GGNCEHPSVNFTFG-GVYQVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTS 177
            G +CE       FG    Q C  P   S   V   TG   CP GF+        + C+ +
Sbjct: 1159 GPHCEQACPRGWFGEACAQRCLCPTNASCHHV---TGECRCPPGFTG-------LSCEQA 1208

Query: 178  CSGWWFWRHCDTRCSYFTTYW-CYP-NAVGENHNGYQFGGIKQGNCPIG 318
            C    F + C+  C      W C P + V     GY   G  Q  CP G
Sbjct: 1209 CQPGTFGKDCEHLCQCPGETWACDPASGVCTCAAGYHGTGCLQ-RCPSG 1256
>sp|Q3LI77|KR134_HUMAN Keratin-associated protein 13-4
          Length = 160

 Score = 33.1 bits (74), Expect = 0.46
 Identities = 27/96 (28%), Positives = 39/96 (40%)
 Frame = +1

Query: 52  QVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTSCSGWWFWRHCDTRCSYFT 231
           + C  PA    S   P T    CP              C T+CSG   +R    R   + 
Sbjct: 53  KTCWEPASCQKSCYRPRTSILCCP--------------CQTTCSGSLGFRSSSCRSQGYG 98

Query: 232 TYWCYPNAVGENHNGYQFGGIKQGNCPIGYISLKVG 339
           +  CY  ++G   +G++F  +K G C  G+ SL  G
Sbjct: 99  SRCCY--SLGNGSSGFRF--LKYGGC--GFPSLSYG 128
>sp|Q80V70|EGFL3_MOUSE Multiple EGF-like-domain protein 3
          Length = 656

 Score = 31.2 bits (69), Expect = 1.8
 Identities = 28/108 (25%), Positives = 41/108 (37%), Gaps = 1/108 (0%)
 Frame = +1

Query: 1   GGNCEHPSVNFTFG-GVYQVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTS 177
           G  C+ P V+  FG    + C    G +   V   TG+  CP G+           C+ +
Sbjct: 198 GDKCQSPCVSGMFGVHCEEHCACRKGATCHHV---TGACLCPPGWRGS-------HCEQA 247

Query: 178 CSGWWFWRHCDTRCSYFTTYWCYPNAVGENHNGYQFGGIKQGNCPIGY 321
           C   WF   C  RC       C P A   + +G       + +CP G+
Sbjct: 248 CPRGWFGEACAQRCH------CPPGASCHHVSG-------ECHCPPGF 282
>sp|O18735|ERBB2_CANFA Receptor tyrosine-protein kinase erbB-2 precursor (p185erbB2)
           (C-erbB-2)
          Length = 1259

 Score = 30.8 bits (68), Expect = 2.3
 Identities = 31/119 (26%), Positives = 42/119 (35%), Gaps = 1/119 (0%)
 Frame = +1

Query: 4   GNCEHPSVNFTFGGVYQVCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTSCS 183
           G+C+  +     GG  + C GP                C  G +  K HS  + C     
Sbjct: 210 GDCQSLTRTVCAGGCAR-CKGPQPTDCCH-------EQCAAGCTGPK-HSDCLACLHFNH 260

Query: 184 GWWFWRHCDTRCSYFT-TYWCYPNAVGENHNGYQFGGIKQGNCPIGYISLKVGLSVEIC 357
                 HC    +Y T T+   PN  G     Y FG     +CP  Y+S  VG    +C
Sbjct: 261 SGICELHCPALVTYNTDTFESMPNPEGR----YTFGASCVTSCPYNYLSTDVGSCTLVC 315
>sp|Q60553|ERBB2_MESAU Receptor tyrosine-protein kinase erbB-2 precursor (p185erbB2)
           (C-erbB-2) (NEU proto-oncogene)
          Length = 1254

 Score = 30.8 bits (68), Expect = 2.3
 Identities = 18/53 (33%), Positives = 23/53 (43%), Gaps = 1/53 (1%)
 Frame = +1

Query: 202 HCDTRCSYFT-TYWCYPNAVGENHNGYQFGGIKQGNCPIGYISLKVGLSVEIC 357
           HC    +Y T T+   PN  G     Y FG      CP  Y+S +VG    +C
Sbjct: 267 HCPALVTYNTDTFESMPNPEGR----YTFGASCVTTCPYNYLSTEVGSCTLVC 315
>sp|Q8VHS2|CRUM1_MOUSE Crumbs protein homolog 1 precursor
          Length = 1405

 Score = 30.4 bits (67), Expect = 3.0
 Identities = 34/128 (26%), Positives = 52/128 (40%), Gaps = 6/128 (4%)
 Frame = +1

Query: 55  VCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTS-----CSGWWFWRHCDTRC 219
           +C  P  P YS +N  T +NSC     ++  H G  R D       C   +  R C+T  
Sbjct: 94  LCQCP--PGYSGLNCETATNSCG---GNLCQHGGTCRKDPEHPVCICPPGYAGRFCETDH 148

Query: 220 SYFTTYWCYPNAVGENH-NGYQFGGIKQGNCPIGYISLKVGLSVEICVTIDNDPNNPFAI 396
           +   +  C+  A+ ++  NGY         C  GY      L V+ CV+ D   N    +
Sbjct: 149 NECASSPCHNGAMCQDGINGYSC------FCVPGYQGRHCDLEVDECVS-DPCKNEAVCL 201

Query: 397 KFGGLFSC 420
              G ++C
Sbjct: 202 NEIGRYTC 209
>sp|P34853|NU4M_APILI NADH-ubiquinone oxidoreductase chain 4 (NADH dehydrogenase subunit
           4)
          Length = 447

 Score = 30.0 bits (66), Expect = 3.9
 Identities = 13/34 (38%), Positives = 17/34 (50%)
 Frame = -2

Query: 399 FNCKWIVWIVVYCHTYFNTQSNL**NITNWAISL 298
           FN  WI WI ++C+  FN  S     +T W   L
Sbjct: 49  FNLNWIDWIYIFCNLSFNMYSYGLIMLTLWIFGL 82
>sp|P06494|ERBB2_RAT Receptor tyrosine-protein kinase erbB-2 precursor (p185erbB2)
           (C-erbB-2) (NEU proto-oncogene) (Epidermal growth factor
           receptor-related protein)
          Length = 1257

 Score = 30.0 bits (66), Expect = 3.9
 Identities = 27/105 (25%), Positives = 36/105 (34%), Gaps = 9/105 (8%)
 Frame = +1

Query: 205 CDTRCSYFTTYWCYPNAVGENHNG-YQFGGIKQGNCPIGYISLKVGLSVEICVTIDNDPN 381
           C+  C    TY         N  G Y FG      CP  Y+S +VG    +C      PN
Sbjct: 265 CELHCPALVTYNTDTFESMHNPEGRYTFGASCVTTCPYNYLSTEVGSCTLVC-----PPN 319

Query: 382 NPFAIKFGGLFSCS--------VGNPLAKEFVRGKPKLSSSKMMD 492
           N       G   C         V   L  E +RG   ++S  + +
Sbjct: 320 NQEVTAEDGTQRCEKCSKPCARVCYGLGMEHLRGARAITSDNVQE 364
>sp|P59222|SREC2_MOUSE Scavenger receptor class F member 2 precursor (Scavenger receptor
           expressed by endothelial cells 2 protein) (SREC-II)
          Length = 833

 Score = 29.6 bits (65), Expect = 5.1
 Identities = 20/74 (27%), Positives = 30/74 (40%)
 Frame = +1

Query: 55  VCDGPAGPSYSQVNPLTGSNSCPKGFSSVKLHSGLIRCDTSCSGWWFWRHCDTRCSYFTT 234
           VC+G +  S ++V    G   C  G+           CDT C   ++   C  RCS    
Sbjct: 71  VCEGNSTCSENEVCVRPGECRCRHGYFGAN-------CDTKCPRQFWGPDCKERCS---- 119

Query: 235 YWCYPNAVGENHNG 276
             C+P+   E+  G
Sbjct: 120 --CHPHGQCEDVTG 131
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 69,523,528
Number of Sequences: 369166
Number of extensions: 1510634
Number of successful extensions: 3490
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3352
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3484
length of database: 68,354,980
effective HSP length: 104
effective length of database: 49,142,540
effective search space used: 3882260660
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
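
Note on the statistics above: the bit scores in the alignments can be reproduced from the raw scores (the numbers in parentheses after "bits") and the gapped Lambda and K values in this footer, using the standard Karlin-Altschul relations. The Python sketch below is only an illustration, not part of the BLAST output. The function names are hypothetical, and it assumes the rounded footer values (Lambda = 0.267, K = 0.0410, effective search space = 3882260660) are the ones that apply to every hit.

```python
import math

# Gapped Karlin-Altschul parameters, copied from the report footer (rounded there).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
# "effective search space used" from the report footer.
SEARCH_SPACE = 3_882_260_660


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score S to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=SEARCH_SPACE, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Approximate E-value: K * search_space * exp(-lambda * S)."""
    return k * search_space * math.exp(-lam * raw_score)


if __name__ == "__main__":
    # Raw scores taken from the first three HSPs listed in the report.
    for raw in (87, 80, 74):
        print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.3g}")
```

With these inputs the bit scores round to 38.1, 35.4 and 33.1, matching the report. The E-values come out near, but not identical to, the reported ones (roughly 0.013 against 0.014 for the top hit), most likely because the footer prints Lambda and K to only three significant figures.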