Planaria EST Database


DrC_01879

BLASTX 2.2.13 [Nov-27-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= DrC_01879
         (875 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|O57590|SURF4_FUGRU  Surfeit locus protein 4                    163   7e-40
sp|Q64310|SURF4_MOUSE  Surfeit locus protein 4                    158   2e-38
sp|O15260|SURF4_HUMAN  Surfeit locus protein 4                    157   3e-38
sp|Q18864|SURF4_CAEEL  Surfeit locus protein 4 homolog            154   2e-37
sp|O18405|SURF4_DROME  Surfeit locus protein 4 homolog            140   5e-33
sp|O74559|SURF4_SCHPO  Surfeit locus protein 4 homolog             94   6e-19
sp|P53337|YG5Y_YEAST  Hypothetical 35.0 kDa protein in BGL2-...    91   3e-18
sp|O45731|YFC7_CAEEL  Hypothetical protein T02E1.7 in chromo...    83   1e-15
sp|P0AD48|YPHA_SHIFL  Inner membrane protein yphA >gi|825925...    39   0.025
sp|P04372|COX2_TRYBB  Cytochrome c oxidase subunit 2 (Cytoch...    33   1.0  
>sp|O57590|SURF4_FUGRU Surfeit locus protein 4
          Length = 269

 Score =  163 bits (412), Expect = 7e-40
 Identities = 78/136 (57%), Positives = 103/136 (75%)
 Frame = +3

Query: 3   SNEKVKSYLAGLPNTGINQPKQYMLLGGRLLVVLIFITLIHWDGGFFYMIQTLANLVLIL 182
           S  + KS  AG+P+ G + PKQYM LGGR+L+VL+F+TL+H+D  FF ++Q L    LI+
Sbjct: 134 SRSEGKSMFAGVPSMGESSPKQYMQLGGRVLLVLMFMTLLHFDFNFFSILQNLVGTALII 193

Query: 183 MVAIGYKTKLCAIILVLWLTTMNFWYNNFWYYIDRELLWDFLKYDFFQTWSVIGGLLLIV 362
           +VAIG+KTKL A+ LVLWL  +N ++N FW     + + DFLKYDFFQT SVIGGLLL+V
Sbjct: 194 LVAIGFKTKLAALTLVLWLLVINVYFNAFWTIPAYKPMHDFLKYDFFQTTSVIGGLLLVV 253

Query: 363 AYGPGGVSIDDYKKDW 410
           A GPGGVS+D+ KK+W
Sbjct: 254 ALGPGGVSMDEKKKEW 269
>sp|Q64310|SURF4_MOUSE Surfeit locus protein 4
          Length = 269

 Score =  158 bits (399), Expect = 2e-38
 Identities = 75/136 (55%), Positives = 101/136 (74%)
 Frame = +3

Query: 3   SNEKVKSYLAGLPNTGINQPKQYMLLGGRLLVVLIFITLIHWDGGFFYMIQTLANLVLIL 182
           S  + KS  AG+P    + PKQYM LGGR+L+VL+F+TL+H+D  FF +IQ +    L++
Sbjct: 134 SRSEGKSMFAGVPTMRESSPKQYMQLGGRVLLVLMFMTLLHFDASFFSIIQNIVGTALMI 193

Query: 183 MVAIGYKTKLCAIILVLWLTTMNFWYNNFWYYIDRELLWDFLKYDFFQTWSVIGGLLLIV 362
           +VAIG+KTKL A+ LV+WL  +N ++N FW     + + DFLKYDFFQT SVIGGLLL+V
Sbjct: 194 LVAIGFKTKLAALTLVVWLFAINVYFNAFWTIPVYKPMHDFLKYDFFQTMSVIGGLLLVV 253

Query: 363 AYGPGGVSIDDYKKDW 410
           A GPGGVS+D+ KK+W
Sbjct: 254 ALGPGGVSMDEKKKEW 269
>sp|O15260|SURF4_HUMAN Surfeit locus protein 4
          Length = 269

 Score =  157 bits (398), Expect = 3e-38
 Identities = 74/136 (54%), Positives = 101/136 (74%)
 Frame = +3

Query: 3   SNEKVKSYLAGLPNTGINQPKQYMLLGGRLLVVLIFITLIHWDGGFFYMIQTLANLVLIL 182
           S  + KS  AG+P    + PKQYM LGGR+L+VL+F+TL+H+D  FF ++Q +    L++
Sbjct: 134 SRSEGKSMFAGVPTMRESSPKQYMQLGGRVLLVLMFMTLLHFDASFFSIVQNIVGTALMI 193

Query: 183 MVAIGYKTKLCAIILVLWLTTMNFWYNNFWYYIDRELLWDFLKYDFFQTWSVIGGLLLIV 362
           +VAIG+KTKL A+ LV+WL  +N ++N FW     + + DFLKYDFFQT SVIGGLLL+V
Sbjct: 194 LVAIGFKTKLAALTLVVWLFAINVYFNAFWTIPVYKPMHDFLKYDFFQTMSVIGGLLLVV 253

Query: 363 AYGPGGVSIDDYKKDW 410
           A GPGGVS+D+ KK+W
Sbjct: 254 ALGPGGVSMDEKKKEW 269
>sp|Q18864|SURF4_CAEEL Surfeit locus protein 4 homolog
          Length = 277

 Score =  154 bits (390), Expect = 2e-37
 Identities = 71/137 (51%), Positives = 98/137 (71%), Gaps = 1/137 (0%)
 Frame = +3

Query: 3   SNEKVKSYLAGLPNTG-INQPKQYMLLGGRLLVVLIFITLIHWDGGFFYMIQTLANLVLI 179
           + E+  S  AG+P  G  N+PK YMLL GR+L++ +F++L+H++  F  +++ +    LI
Sbjct: 141 TQEEKASLFAGVPTMGDSNKPKSYMLLAGRVLLIFMFMSLMHFEMSFMQVLEIVVGFALI 200

Query: 180 LMVAIGYKTKLCAIILVLWLTTMNFWYNNFWYYIDRELLWDFLKYDFFQTWSVIGGLLLI 359
            +V+IGYKTKL AI+LV+WL  +N W N +W         DF+KYDFFQT SVIGGLLL+
Sbjct: 201 TLVSIGYKTKLSAIVLVIWLFGLNLWLNAWWTIPSDRFYRDFMKYDFFQTMSVIGGLLLV 260

Query: 360 VAYGPGGVSIDDYKKDW 410
           +AYGPGGVS+DDYKK W
Sbjct: 261 IAYGPGGVSVDDYKKRW 277
>sp|O18405|SURF4_DROME Surfeit locus protein 4 homolog
          Length = 270

 Score =  140 bits (353), Expect = 5e-33
 Identities = 67/131 (51%), Positives = 96/131 (73%)
 Frame = +3

Query: 18  KSYLAGLPNTGINQPKQYMLLGGRLLVVLIFITLIHWDGGFFYMIQTLANLVLILMVAIG 197
           +S  AG+P+ G N+PK +M L GR+L+  +FITLI ++   + +IQ +   +L+++V +G
Sbjct: 140 RSLFAGVPSMGENKPKNFMQLAGRILLAFMFITLIRFELSVWQVIQDIIGSILMVLVVLG 199

Query: 198 YKTKLCAIILVLWLTTMNFWYNNFWYYIDRELLWDFLKYDFFQTWSVIGGLLLIVAYGPG 377
           YKTKL A+ILV  LT +N ++N +W     + L DFLKYDFFQT SVIGGLL+IV+ GPG
Sbjct: 200 YKTKLSALILVALLTILNLYHNAWWTIPSYKPLRDFLKYDFFQTLSVIGGLLMIVSLGPG 259

Query: 378 GVSIDDYKKDW 410
           GVS+D++KK W
Sbjct: 260 GVSMDEHKKKW 270
>sp|O74559|SURF4_SCHPO Surfeit locus protein 4 homolog
          Length = 302

 Score = 93.6 bits (231), Expect = 6e-19
 Identities = 50/128 (39%), Positives = 74/128 (57%), Gaps = 3/128 (2%)
 Frame = +3

Query: 30  AGLPNTGINQPKQYMLLGGRLLVVLIFITLIHWDGGFFYMIQTLANLVLIL---MVAIGY 200
           AGLP    +  + Y  L GR+L++ +F+ L+  +G      + L +++ +    MV IG+
Sbjct: 173 AGLPAVSEHNKRTYFQLAGRVLLIFMFLGLLAKEGSGISWTRILVHILSVTACAMVVIGF 232

Query: 201 KTKLCAIILVLWLTTMNFWYNNFWYYIDRELLWDFLKYDFFQTWSVIGGLLLIVAYGPGG 380
           K K  A +LVL L+  NF  N+FW         DF +YDFFQT S++GGLL +V  GPG 
Sbjct: 233 KAKFFAAVLVLILSVANFIINSFWSVpreSPYRDFYRYDFFQTLSIVGGLLYLVNTGPGK 292

Query: 381 VSIDDYKK 404
            S+D+ KK
Sbjct: 293 FSVDEKKK 300
>sp|P53337|YG5Y_YEAST Hypothetical 35.0 kDa protein in BGL2-ZUO1 intergenic region
          Length = 310

 Score = 91.3 bits (225), Expect = 3e-18
 Identities = 50/121 (41%), Positives = 76/121 (62%)
 Frame = +3

Query: 42  NTGINQPKQYMLLGGRLLVVLIFITLIHWDGGFFYMIQTLANLVLILMVAIGYKTKLCAI 221
           N+  ++ K Y+L  GR+L+VL+FI    +   +F ++ T+   +     AIGYKTK  +I
Sbjct: 194 NSKNDKAKGYLLFAGRILIVLMFIAFT-FSKSWFTVVLTIIGTICF---AIGYKTKFASI 249

Query: 222 ILVLWLTTMNFWYNNFWYYIDRELLWDFLKYDFFQTWSVIGGLLLIVAYGPGGVSIDDYK 401
           +L L LT  N   NN+W+Y + +   DFLKY+F+Q  S+IGGLLL+   G G +S+D+ K
Sbjct: 250 MLGLILTFYNITLNNYWFYNNTKR--DFLKYEFYQNLSIIGGLLLVTNTGAGELSVDEKK 307

Query: 402 K 404
           K
Sbjct: 308 K 308
>sp|O45731|YFC7_CAEEL Hypothetical protein T02E1.7 in chromosome I
          Length = 269

 Score = 82.8 bits (203), Expect = 1e-15
 Identities = 41/113 (36%), Positives = 69/113 (61%)
 Frame = +3

Query: 72  MLLGGRLLVVLIFITLIHWDGGFFYMIQTLANLVLILMVAIGYKTKLCAIILVLWLTTMN 251
           +L   R+ + L+ I+++H+D  +  ++  + +  +++ V +G+KT++ + +L  WL   N
Sbjct: 160 LLAACRVCLNLMLISMVHFDMSYTRILLCIISYGMMIFVWLGFKTRMMSFMLATWLFAYN 219

Query: 252 FWYNNFWYYIDRELLWDFLKYDFFQTWSVIGGLLLIVAYGPGGVSIDDYKKDW 410
              N+FW   D EL    ++YDFFQT S IGGLLL++  GPG  S D+ KK W
Sbjct: 220 IVLNDFWNK-DAEL--HIIRYDFFQTLSAIGGLLLLIHTGPGEFSFDELKKKW 269
>sp|P0AD48|YPHA_SHIFL Inner membrane protein yphA
 sp|P0AD47|YPHA_ECOLI Inner membrane protein yphA
          Length = 140

 Score = 38.5 bits (88), Expect = 0.025
 Identities = 31/128 (24%), Positives = 60/128 (46%), Gaps = 21/128 (16%)
 Frame = +3

Query: 72  MLLGGRLLVVLIFIT-----LIHWDGGFFYM------IQTLANLVLILM-------VAIG 197
           +LL  R+ VVLIFI      ++ +DG   YM      +  LA ++ ++M       + +G
Sbjct: 16  LLLIARIAVVLIFIIFGFPKMMGFDGTVQYMASLGAPMPMLAAIIAVVMEVPAAILIVLG 75

Query: 198 YKTKLCAIILVLWLTTMNFWYNNFWYYIDRELLWDFLKYDFFQTW---SVIGGLLLIVAY 368
           + T+  A++ + +        +++W     ++  D +  +    W   S+ G  LL+   
Sbjct: 76  FFTRPLAVLFIFYTLGTAVIGHHYW-----DMTGDAVGPNMINFWKNVSIAGAFLLLAIT 130

Query: 369 GPGGVSID 392
           GPG +S+D
Sbjct: 131 GPGAISLD 138
>sp|P04372|COX2_TRYBB Cytochrome c oxidase subunit 2 (Cytochrome c oxidase polypeptide
           II)
          Length = 210

 Score = 33.1 bits (74), Expect = 1.0
 Identities = 19/74 (25%), Positives = 37/74 (50%), Gaps = 1/74 (1%)
 Frame = +3

Query: 141 FYMIQTLANLVLILMVAIGYKTKLCAIILVLWLTTMNFWYNNFWYYIDRELLWDFLKYDF 320
           F+MI  + ++++++  +I     +CA+I+   LT      NN +        WDF+   F
Sbjct: 7   FWMIFLMDSIIVLISFSIFLSVWICALIIATVLTVTKI--NNIY------CTWDFISSKF 58

Query: 321 FQT-WSVIGGLLLI 359
             T W V+G + ++
Sbjct: 59  IDTYWFVLGMMFIL 72
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 89,432,188
Number of Sequences: 369166
Number of extensions: 1681271
Number of successful extensions: 3934
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3825
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3929
length of database: 68,354,980
effective HSP length: 110
effective length of database: 48,034,130
effective search space used: 8694177530
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)