Planaria EST Database


DrC_02228

BLASTX 2.2.13 [Nov-27-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= DrC_02228
         (707 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q91VH6|CB004_MOUSE  Protein C2orf4 homolog                     286   5e-77
sp|Q9Y316|CB004_HUMAN  Protein C2orf4 (C21orf19-like protein)     285   6e-77
sp|Q22915|YC4P_CAEEL  Hypothetical UPF0103 protein C37C3.8 i...   216   6e-56
sp|Q10212|YAY4_SCHPO  Hypothetical UPF0103 protein C4H3.04c ...   170   3e-42
sp|P47085|YJX8_YEAST  Hypothetical UPF0103 protein YJR008w        144   3e-34
sp|O59292|Y1626_PYRHO  Hypothetical UPF0103 protein PH1626         91   4e-18
sp|O67039|Y890_AQUAE  Hypothetical UPF0103 protein AQ_890          88   2e-17
sp|Q9V189|Y539_PYRAB  Hypothetical UPF0103 protein PYRAB05390      87   6e-17
sp|Q8U0F2|YG38_PYRFU  Hypothetical UPF0103 protein PF1638          86   7e-17
sp|Q9HLJ1|Y237_THEAC  Hypothetical UPF0103 protein Ta0237          77   6e-14
>sp|Q91VH6|CB004_MOUSE Protein C2orf4 homolog
          Length = 297

 Score =  286 bits (731), Expect = 5e-77
 Identities = 135/235 (57%), Positives = 166/235 (70%)
 Frame = +3

Query: 3   GSWYXXXXXXXXXXXXXXXXXXKISHSPARAIITPHAGYTYSGSTAGFAYKQIDPSQIER 182
           GSWY                  + +  PARAII PHAGYTY GS A  AYKQ+DPS   R
Sbjct: 14  GSWYTASGPQLNAQLEGWLSQVQSTKRPARAIIAPHAGYTYCGSCAAHAYKQVDPSVTRR 73

Query: 183 VFILGPSHHVSYGENCVLSNFDEYETPFYNLPIDKKIYSELLATNNFGQAKCNHDEDEHS 362
           +FILGPSHHV     C LS+ D Y TP Y+L ID+KIY EL  T  F +     DEDEHS
Sbjct: 74  IFILGPSHHVPLSR-CALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHS 132

Query: 363 LEMQLPYVAKIMESRKGQFTIIPIIVGNLSPTHEEKYGKILAKYMLDKSNLFVISSDFCH 542
           +EM LPY AK MES K +FTIIP++VG LS + E+++GK+ +KY+ D SNLFV+SSDFCH
Sbjct: 133 IEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCH 192

Query: 543 WGKRFRYTYYDEKFGEIWKSIENLDRMGMDAVESLDPEKFNSYLQQYHNTICGRH 707
           WG+RFRY+YYDE  GEI++SIE+LD+MGM  +E LDP  F++YL++YHNTICGRH
Sbjct: 193 WGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRH 247
>sp|Q9Y316|CB004_HUMAN Protein C2orf4 (C21orf19-like protein)
          Length = 297

 Score =  285 bits (730), Expect = 6e-77
 Identities = 135/235 (57%), Positives = 166/235 (70%)
 Frame = +3

Query: 3   GSWYXXXXXXXXXXXXXXXXXXKISHSPARAIITPHAGYTYSGSTAGFAYKQIDPSQIER 182
           GSWY                  + +  PARAII PHAGYTY GS A  AYKQ+DPS   R
Sbjct: 14  GSWYTASGPQLNAQLEGWLSQVQSTKRPARAIIAPHAGYTYCGSCAAHAYKQVDPSITRR 73

Query: 183 VFILGPSHHVSYGENCVLSNFDEYETPFYNLPIDKKIYSELLATNNFGQAKCNHDEDEHS 362
           +FILGPSHHV     C LS+ D Y TP Y+L ID+KIY EL  T  F +     DEDEHS
Sbjct: 74  IFILGPSHHVPLSR-CALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHS 132

Query: 363 LEMQLPYVAKIMESRKGQFTIIPIIVGNLSPTHEEKYGKILAKYMLDKSNLFVISSDFCH 542
           +EM LPY AK MES K +FTIIP++VG LS + E+++GK+ +KY+ D SNLFV+SSDFCH
Sbjct: 133 IEMHLPYTAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCH 192

Query: 543 WGKRFRYTYYDEKFGEIWKSIENLDRMGMDAVESLDPEKFNSYLQQYHNTICGRH 707
           WG+RFRY+YYDE  GEI++SIE+LD+MGM  +E LDP  F++YL++YHNTICGRH
Sbjct: 193 WGQRFRYSYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRH 247
>sp|Q22915|YC4P_CAEEL Hypothetical UPF0103 protein C37C3.8 in chromosome V
          Length = 350

 Score =  216 bits (549), Expect = 6e-56
 Identities = 104/208 (50%), Positives = 146/208 (70%), Gaps = 1/208 (0%)
 Frame = +3

Query: 87  ARAIITPHAGYTYSGSTAGFAYKQIDPSQIERVFILGPSHHVSYGENCVLSNFDEYETPF 266
           ARA+I+PHAGY+Y G TA +A+KQ+  S +ERVFILGPSH V+    C ++   +Y TP 
Sbjct: 93  ARALISPHAGYSYCGETAAYAFKQVVSSAVERVFILGPSHVVALN-GCAITTCSKYRTPL 151

Query: 267 YNLPIDKKIYSELLATNNFGQAKCNHDEDEHSLEMQLPYVAKIMESRKGQFTIIPIIVGN 446
            +L +D KI  EL AT +F       +E EHS+EMQLP++AK+M S++  +TI+P++VG+
Sbjct: 152 GDLIVDHKINEELRATRHFDLMDRRDEESEHSIEMQLPFIAKVMGSKR--YTIVPVLVGS 209

Query: 447 LSPTHEEKYGKILAKYMLDKSNLFVISSDFCHWGKRFRYTYYDEKFG-EIWKSIENLDRM 623
           L  + ++ YG I A YM D  NLFVISSDFCHWG+RF ++ YD      I++ I N+D+ 
Sbjct: 210 LPGSRQQTYGNIFAHYMEDPRNLFVISSDFCHWGERFSFSPYDRHSSIPIYEQITNMDKQ 269

Query: 624 GMDAVESLDPEKFNSYLQQYHNTICGRH 707
           GM A+E+L+P  FN YL++  NTICGR+
Sbjct: 270 GMSAIETLNPAAFNDYLKKTQNTICGRN 297
>sp|Q10212|YAY4_SCHPO Hypothetical UPF0103 protein C4H3.04c in chromosome I
          Length = 309

 Score =  170 bits (431), Expect = 3e-42
 Identities = 91/227 (40%), Positives = 135/227 (59%), Gaps = 21/227 (9%)
 Frame = +3

Query: 90  RAIITPHAGYTYSGSTAGFAYKQIDPSQIERVFILGPSHHVSYGENCVLSNFDEYETPFY 269
           R +I+PHAGY YSG  A   ++Q+D S+I+RVF+ GPSHH+ +   C++S      TP  
Sbjct: 40  RFVISPHAGYMYSGKVASQGFQQLDFSKIQRVFVFGPSHHI-FTRKCLVSRASICSTPLG 98

Query: 270 NLPIDKKIYSELLATNN-FGQAKCNHDEDEHSLEMQLPYVA--KIMESRKGQFTIIPIIV 440
           +L +D+ +  +L+A++N F     + DE EHSLEMQ P +A   + +   G+  I+PI++
Sbjct: 99  DLKVDEDLCQKLVASDNSFDSMTLDVDESEHSLEMQFPLLAFHLLKQGCLGKVKIVPIMI 158

Query: 441 GNLSPTHEEKYGKILAKYMLDKSNLFVISSDFCHWGKRFRYTYYDEKFGE---------- 590
           G L+ T      K L++Y+ D+SN FVISSDFCHWG+RF YT Y     +          
Sbjct: 159 GALTSTTMMAAAKFLSQYIKDESNSFVISSDFCHWGRRFGYTLYLNDTNQLEDAVLKYKR 218

Query: 591 --------IWKSIENLDRMGMDAVESLDPEKFNSYLQQYHNTICGRH 707
                   I++SI NLD +GM  +E+   + F+ YL+   NTICGR+
Sbjct: 219 RGGPTSPKIYESISNLDHIGMKIIETKSSDDFSEYLKTTQNTICGRY 265
>sp|P47085|YJX8_YEAST Hypothetical UPF0103 protein YJR008w
          Length = 338

 Score =  144 bits (362), Expect = 3e-34
 Identities = 88/246 (35%), Positives = 135/246 (54%), Gaps = 41/246 (16%)
 Frame = +3

Query: 87  ARAIITPHAGYTYSGSTAGFAYKQIDPSQ-IERVFILGPSHHVSYGENCVLSNFDEYETP 263
           AR II PHAGY Y G T  ++Y  +D ++ ++R+FILGPSHH+ +    ++S F E ETP
Sbjct: 40  ARIIICPHAGYRYCGPTMAYSYASLDLNRNVKRIFILGPSHHIYFKNQILVSAFSELETP 99

Query: 264 FYNLPIDKKIYSELLATNNFGQAK-----CNHDED--EHSLEMQLPYVAKIMESRK---G 413
             NL +D  +   L+        K      +HD D  EHSLEMQLP + + ++ R+    
Sbjct: 100 LGNLKVDTDLCKTLIQKEYPENGKKLFKPMDHDTDMAEHSLEMQLPMLVETLKWREISLD 159

Query: 414 QFTIIPIIVGNLSPTHEEKYGKILAKYMLDKSNLFVISSDFCHWGKRFRYTYY---DEKF 584
              + P++V + S   +   G IL++Y+ D +NLF++SSDFCHWG+RF+YT Y    E+ 
Sbjct: 160 TVKVFPMMVSHNSVDVDRCIGNILSEYIKDPNNLFIVSSDFCHWGRRFQYTGYVGSKEEL 219

Query: 585 GE-----------------------IWKSIENLDRMGMDAV-ESLDPEKFNS---YLQQY 683
            +                       IW+SIE +DR  M  + ++ + E++++   YL+  
Sbjct: 220 NDAIQEETEVEMLTARSKLSHHQVPIWQSIEIMDRYAMKTLSDTPNGERYDAWKQYLEIT 279

Query: 684 HNTICG 701
            NTICG
Sbjct: 280 GNTICG 285
>sp|O59292|Y1626_PYRHO Hypothetical UPF0103 protein PH1626
          Length = 291

 Score = 90.5 bits (223), Expect = 4e-18
 Identities = 52/203 (25%), Positives = 95/203 (46%), Gaps = 2/203 (0%)
 Frame = +3

Query: 99  ITPHAGYTYSGSTAGFAYKQIDPSQIERVFILGPSHHVSYGENCVLSNFDEYETPFYNLP 278
           + PHAGY +SG TA   YK I    +  VF++   +H   G    L    E+ TP  ++ 
Sbjct: 42  VAPHAGYVFSGFTASRTYKAIYEDGLPEVFVIFGPNHTGLGSPIALYPEGEWITPMGSIK 101

Query: 279 IDKKIYSELLATNNFGQAKCNHDEDEHSLEMQLPYVAKIMESRKGQFTIIPIIVGNLSPT 458
           +D K   E++  +          + EHS+E+QLP++  I E    +  I+PI +G     
Sbjct: 102 VDSKFAKEIVKRSGIADLDDLAHKYEHSIEVQLPFIQYIAEKAGVEVKIVPITLGIQDEE 161

Query: 459 HEEKYGKIL--AKYMLDKSNLFVISSDFCHWGKRFRYTYYDEKFGEIWKSIENLDRMGMD 632
                G+ +  A   L +  + + S+DF H+G  + Y  +  +  E+   + + D   + 
Sbjct: 162 VSRSLGRSIFEASTSLGRDTIIIASTDFMHYGSFYGYVPFRGRPEELPNMVRDWDMRIIR 221

Query: 633 AVESLDPEKFNSYLQQYHNTICG 701
            +   D +   S +++ ++T+CG
Sbjct: 222 RILDFDLDGMFSEIREMNHTMCG 244
>sp|O67039|Y890_AQUAE Hypothetical UPF0103 protein AQ_890
          Length = 267

 Score = 88.2 bits (217), Expect = 2e-17
 Identities = 64/208 (30%), Positives = 106/208 (50%), Gaps = 4/208 (1%)
 Frame = +3

Query: 90  RAIITPHAGYTYSGSTAGFAYKQIDPSQIERVFILGPSHHVSYGENCVLSNFDEYETPFY 269
           +AI+ PHAGY YSG TA   YK+I+    E+V +LGP +H   G+   + + D +ETP+ 
Sbjct: 39  KAILVPHAGYIYSGKTACEVYKRIEIP--EKVVLLGP-NHTGLGKPISVYSGDAWETPYG 95

Query: 270 NLPIDKKIYSELLATNNFGQAKCNHDE----DEHSLEMQLPYVAKIMESRKGQFTIIPII 437
            + ID ++  ++L          N DE     EHSLE+QLP++ +     + +F I+PI+
Sbjct: 96  VVEIDGELREKILK-----YPYANPDEYAHLYEHSLEVQLPFLQRY---ARREFKILPIV 147

Query: 438 VGNLSPTHEEKYGKILAKYMLDKSNLFVISSDFCHWGKRFRYTYYDEKFGEIWKSIENLD 617
           V  +     + +G+ L + + ++  L VISSD  H+                 +     D
Sbjct: 148 VTFVEYEVAKDFGRFLGEVLKEEDALIVISSDMSHYVPA--------------EEARKKD 193

Query: 618 RMGMDAVESLDPEKFNSYLQQYHNTICG 701
            + + A+E L+ E+      QY+ T+CG
Sbjct: 194 EILISAMERLNTEELYFKAVQYNITMCG 221
>sp|Q9V189|Y539_PYRAB Hypothetical UPF0103 protein PYRAB05390
          Length = 291

 Score = 86.7 bits (213), Expect = 6e-17
 Identities = 50/203 (24%), Positives = 93/203 (45%), Gaps = 2/203 (0%)
 Frame = +3

Query: 99  ITPHAGYTYSGSTAGFAYKQIDPSQIERVFILGPSHHVSYGENCVLSNFDEYETPFYNLP 278
           + PHAGY +SG TA   YK I    +   F++   +H   G    +    ++ TP   + 
Sbjct: 42  VAPHAGYVFSGYTASRTYKAIYEDGLPETFVIFGPNHTGLGSPIAVYPEGDWVTPLGKVK 101

Query: 279 IDKKIYSELLATNNFGQAKCNHDEDEHSLEMQLPYVAKIMESRKGQFTIIPIIVGNLSPT 458
           ID ++  E++  +          + EHS+E+QLP++  I E     F I+PI +G     
Sbjct: 102 IDSELAKEIVKLSKIADLDDLAHKYEHSIEVQLPFIQYIAEKAGTDFRIVPITLGIQDED 161

Query: 459 HEEKYGKIL--AKYMLDKSNLFVISSDFCHWGKRFRYTYYDEKFGEIWKSIENLDRMGMD 632
             E  G+ +  A   L +  + + S+DF H+G  + Y  +  +  E+   ++  D   + 
Sbjct: 162 VSEALGRAVFEAAEALGRDVIVIASTDFMHYGSFYGYVPFRGRANELPNMVKEWDMRIIR 221

Query: 633 AVESLDPEKFNSYLQQYHNTICG 701
            +   D +     +++  +T+CG
Sbjct: 222 RILDFDLKGMFEEIREMDHTMCG 244
>sp|Q8U0F2|YG38_PYRFU Hypothetical UPF0103 protein PF1638
          Length = 292

 Score = 86.3 bits (212), Expect = 7e-17
 Identities = 51/203 (25%), Positives = 92/203 (45%), Gaps = 2/203 (0%)
 Frame = +3

Query: 99  ITPHAGYTYSGSTAGFAYKQIDPSQIERVFILGPSHHVSYGENCVLSNFDEYETPFYNLP 278
           + PHAGY +SG TA   YK I    +  VF++   +H   G    +    E+ETP   + 
Sbjct: 42  VAPHAGYIFSGYTASRTYKAIYEDGLPEVFVILGPNHTGLGSPIAVYPKGEWETPLGRIK 101

Query: 279 IDKKIYSELLATNNFGQAKCNHDEDEHSLEMQLPYVAKIMESRKGQFTIIPIIVGNLSPT 458
           +D+K+   +   +          + EHS+E+QLP++  + E       I+PI +G     
Sbjct: 102 VDEKLARRITELSEIADLDDLAHKYEHSIEVQLPFIQYLAELSGKDVKIVPITLGIQDEE 161

Query: 459 HEEKYGKIL--AKYMLDKSNLFVISSDFCHWGKRFRYTYYDEKFGEIWKSIENLDRMGMD 632
                GK +  A   L +  + + S+DF H+G+ + Y  +  +  E+   ++  D   + 
Sbjct: 162 VSYALGKAIYEASQELGRDIVVIASTDFMHYGEFYGYVPFRARADELPNLVKEWDMRVIR 221

Query: 633 AVESLDPEKFNSYLQQYHNTICG 701
            +   D E     +   ++T+CG
Sbjct: 222 RILDFDVEGMFEEINAMNHTMCG 244
>sp|Q9HLJ1|Y237_THEAC Hypothetical UPF0103 protein Ta0237
          Length = 268

 Score = 76.6 bits (187), Expect = 6e-14
 Identities = 55/201 (27%), Positives = 95/201 (47%)
 Frame = +3

Query: 99  ITPHAGYTYSGSTAGFAYKQIDPSQIERVFILGPSHHVSYGENCVLSNFDEYETPFYNLP 278
           + PHAG  YSG TA ++Y+ I+ S +    I+GP+H         L    E+ TP  +  
Sbjct: 41  VVPHAGIIYSGRTAMYSYRAIEKSAVRDFVIIGPNHR-PLTPYASLYPEGEWSTPLGDAL 99

Query: 279 IDKKIYSELLATNNFGQAKCNHDEDEHSLEMQLPYVAKIMESRKGQFTIIPIIVGNLSPT 458
           I+ ++   L   +N+          EHS+E+Q+P++  +       F  +P+I+G+    
Sbjct: 100 INDRMAEALYRDSNYIVKDEESHLMEHSVEVQIPFLQYLFGD---GFRFVPVILGDQEID 156

Query: 459 HEEKYGKILAKYMLDKSNLFVISSDFCHWGKRFRYTYYDEKFGEIWKSIENLDRMGMDAV 638
                G+ + K  ++   +F+ SSDF H+              E  K +E  D   + A+
Sbjct: 157 VARDIGEAIMK--IEDPFIFIASSDFTHY--------------EDAKRVEKKDMDLISAI 200

Query: 639 ESLDPEKFNSYLQQYHNTICG 701
            +LD +KF S L++ + T CG
Sbjct: 201 LTLDLDKFYSVLEKENVTACG 221
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 82,934,560
Number of Sequences: 369166
Number of extensions: 1732901
Number of successful extensions: 4310
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4164
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4272
length of database: 68,354,980
effective HSP length: 107
effective length of database: 48,588,335
effective search space used: 6219306880
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)