Planarian EST Database


Dr_sW_020_I24

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_020_I24
         (690 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|O88767|PARK7_RAT  Protein DJ-1 (Contraception-associated ...   129   6e-30
sp|Q99LX0|PARK7_MOUSE  Protein DJ-1                               129   9e-30
sp|Q99497|PARK7_HUMAN  Protein DJ-1 (Oncogene DJ1)                127   2e-29
sp|Q10356|YDB3_SCHPO  Hypothetical protein C22E12.03c in chr...   109   8e-24
sp|Q46948|THIJ_ECOLI  Protein thiJ                                102   1e-21
sp|P55880|THIJ_SALTY  Protein thiJ                                 99   1e-20
sp|O06006|YRAA_BACSU  Hypothetical protein yraA                    93   8e-19
sp|Q5HN59|Y1413_STAEQ  Hypothetical protein SERP1413               87   4e-17
sp|Q49YS0|Y918_STAS1  Hypothetical protein SSP0918                 86   7e-17
sp|Q4L7I2|Y1084_STAHJ  Hypothetical protein SH1084                 86   9e-17
>sp|O88767|PARK7_RAT Protein DJ-1 (Contraception-associated protein 1) (CAP1 protein)
           (Fertility protein SP22)
          Length = 189

 Score =  129 bits (325), Expect = 6e-30
 Identities = 71/181 (39%), Positives = 105/181 (58%), Gaps = 2/181 (1%)
 Frame = +3

Query: 54  IIVANGSEDIEVVATSDVLSRGGIDVVIVGMEGSGNVKLAHKTVIVVEKSLADVKTT-LY 230
           +I+A G+E++E V   D++ R GI V + G+ G   V+ +   VI  + SL + KT   Y
Sbjct: 8   VILAKGAEEMETVIPVDIMRRAGIKVTVAGLAGKDPVQCSRDVVICPDTSLEEAKTQGPY 67

Query: 231 DALIIPGGMGGVKALSANENVKEMLYNHQKNKKVIGAICAGPLVLSAFGVAAGSTLTSYP 410
           D +++PGG  G + LS +  VKE+L   +  K +I AICAGP  L A  V  G  +TS+P
Sbjct: 68  DVVVLPGGNLGAQNLSESALVKEILKEQENRKGLIAAICAGPTALLAHEVGFGCKVTSHP 127

Query: 411 SVKDQ-LTSADYKYKEERVVVDRNLVTSRGPGTAVEFGLKLVEVLKGADAVKGLAEQMLV 587
             KD+ +  + Y Y E RV  D  ++TSRGPGT+ EF L +VE L G D    +   +++
Sbjct: 128 LAKDKMMNGSHYSYSESRVEKDGLILTSRGPGTSFEFALAIVEALSGKDMANQVKAPLVL 187

Query: 588 K 590
           K
Sbjct: 188 K 188
>sp|Q99LX0|PARK7_MOUSE Protein DJ-1
          Length = 189

 Score =  129 bits (323), Expect = 9e-30
 Identities = 71/181 (39%), Positives = 105/181 (58%), Gaps = 2/181 (1%)
 Frame = +3

Query: 54  IIVANGSEDIEVVATSDVLSRGGIDVVIVGMEGSGNVKLAHKTVIVVEKSLADVKTT-LY 230
           +I+A G+E++E V   DV+ R GI V + G+ G   V+ +   +I  + SL D KT   Y
Sbjct: 8   VILAKGAEEMETVIPVDVMRRAGIKVTVAGLAGKDPVQCSRDVMICPDTSLEDAKTQGPY 67

Query: 231 DALIIPGGMGGVKALSANENVKEMLYNHQKNKKVIGAICAGPLVLSAFGVAAGSTLTSYP 410
           D +++PGG  G + LS +  VKE+L   +  K +I AICAGP  L A  V  G  +T++P
Sbjct: 68  DVVVLPGGNLGAQNLSESPMVKEILKEQESRKGLIAAICAGPTALLAHEVGFGCKVTTHP 127

Query: 411 SVKDQ-LTSADYKYKEERVVVDRNLVTSRGPGTAVEFGLKLVEVLKGADAVKGLAEQMLV 587
             KD+ +  + Y Y E RV  D  ++TSRGPGT+ EF L +VE L G D    +   +++
Sbjct: 128 LAKDKMMNGSHYSYSESRVEKDGLILTSRGPGTSFEFALAIVEALVGKDMANQVKAPLVL 187

Query: 588 K 590
           K
Sbjct: 188 K 188
>sp|Q99497|PARK7_HUMAN Protein DJ-1 (Oncogene DJ1)
          Length = 189

 Score =  127 bits (320), Expect = 2e-29
 Identities = 70/181 (38%), Positives = 104/181 (57%), Gaps = 2/181 (1%)
 Frame = +3

Query: 54  IIVANGSEDIEVVATSDVLSRGGIDVVIVGMEGSGNVKLAHKTVIVVEKSLADVKTT-LY 230
           +I+A G+E++E V   DV+ R GI V + G+ G   V+ +   VI  + SL D K    Y
Sbjct: 8   VILAKGAEEMETVIPVDVMRRAGIKVTVAGLAGKDPVQCSRDVVICPDASLEDAKKEGPY 67

Query: 231 DALIIPGGMGGVKALSANENVKEMLYNHQKNKKVIGAICAGPLVLSAFGVAAGSTLTSYP 410
           D +++PGG  G + LS +  VKE+L   +  K +I AICAGP  L A  +  GS +T++P
Sbjct: 68  DVVVLPGGNLGAQNLSESAAVKEILKEQENRKGLIAAICAGPTALLAHEIGFGSKVTTHP 127

Query: 411 SVKDQ-LTSADYKYKEERVVVDRNLVTSRGPGTAVEFGLKLVEVLKGADAVKGLAEQMLV 587
             KD+ +    Y Y E RV  D  ++TSRGPGT+ EF L +VE L G +    +   +++
Sbjct: 128 LAKDKMMNGGHYTYSENRVEKDGLILTSRGPGTSFEFALAIVEALNGKEVAAQVKAPLVL 187

Query: 588 K 590
           K
Sbjct: 188 K 188
>sp|Q10356|YDB3_SCHPO Hypothetical protein C22E12.03c in chromosome I
          Length = 191

 Score =  109 bits (272), Expect = 8e-24
 Identities = 68/180 (37%), Positives = 100/180 (55%), Gaps = 8/180 (4%)
 Frame = +3

Query: 39  MVTVCIIVANGSEDIEVVATSDVLSRGGI--DVVIVGMEGSGNVKLAHKTVIVVEKSLAD 212
           MV VC+ VA+G+++IE  A   +  R  I  D V VG      VK++    +   +S  +
Sbjct: 1   MVKVCLFVADGTDEIEFSAPWGIFKRAEIPIDSVYVGENKDRLVKMSRDVEMYANRSYKE 60

Query: 213 VKTT-----LYDALIIPGGMGGVKALSANENVKEMLYN-HQKNKKVIGAICAGPLVLSAF 374
           + +       YD  IIPGG  G K LS    V++++   ++K  K IG ICAG L     
Sbjct: 61  IPSADDFAKQYDIAIIPGGGLGAKTLSTTPFVQQVVKEFYKKPNKWIGMICAGTLTAKTS 120

Query: 375 GVAAGSTLTSYPSVKDQLTSADYKYKEERVVVDRNLVTSRGPGTAVEFGLKLVEVLKGAD 554
           G+     +T +PSV+ QL    YKY ++ VV++ NL+TS+GPGTA+ FGLKL+E +   D
Sbjct: 121 GLP-NKQITGHPSVRGQLEEGGYKYLDQPVVLEENLITSQGPGTAMLFGLKLLEQVASKD 179
>sp|Q46948|THIJ_ECOLI Protein thiJ
          Length = 196

 Score =  102 bits (253), Expect = 1e-21
 Identities = 56/183 (30%), Positives = 106/183 (57%), Gaps = 5/183 (2%)
 Frame = +3

Query: 54  IIVANGSEDIEVVATSDVLSRGGIDVVIVGMEGSGNVKL--AHKTVIVVEKSLADVKTTL 227
           + +A GSE+ E V T D+L RGGI V    +   GN+ +  +    ++ +  L +V    
Sbjct: 7   VCLAPGSEETEAVTTIDLLVRGGIKVTTASVASDGNLAITCSRGVKLLADAPLVEVADGE 66

Query: 228 YDALIIPGGMGGVKALSANENVKEMLYNHQKNKKVIGAICAGPL-VLSAFGVAAGSTLTS 404
           YD +++PGG+ G +    +  + E +    ++ +++ AICA P  VL    +     +T 
Sbjct: 67  YDVIVLPGGIKGAECFRDSTLLVETVKQFHRSGRIVAAICAAPATVLVPHDIFPIGNMTG 126

Query: 405 YPSVKDQLTSADYKYKEERVVVDRN--LVTSRGPGTAVEFGLKLVEVLKGADAVKGLAEQ 578
           +P++KD++ +   ++ ++RVV D    L+TS+GPGTA++FGLK++++L G +    +A Q
Sbjct: 127 FPTLKDKIPAE--QWLDKRVVWDARVKLLTSQGPGTAIDFGLKIIDLLVGREKAHEVASQ 184

Query: 579 MLV 587
           +++
Sbjct: 185 LVM 187
>sp|P55880|THIJ_SALTY Protein thiJ
          Length = 196

 Score = 98.6 bits (244), Expect = 1e-20
 Identities = 55/183 (30%), Positives = 105/183 (57%), Gaps = 5/183 (2%)
 Frame = +3

Query: 54  IIVANGSEDIEVVATSDVLSRGGIDVVIVGMEGSGNVKL--AHKTVIVVEKSLADVKTTL 227
           + +A GSE+ E V T D+L RGGI V    +   GN+ +  +    ++ +  L +V    
Sbjct: 7   VCLAPGSEETEAVTTIDLLVRGGIHVTTASVASDGNLTIVCSRGVKLLADAPLVEVADGD 66

Query: 228 YDALIIPGGMGGVKALSANENVKEMLYNHQKNKKVIGAICAGPL-VLSAFGVAAGSTLTS 404
           YD +++PGG+ G +    +  + E +    ++ +++ AICA    VL    +     +T 
Sbjct: 67  YDIIVLPGGIKGAECFRDSPLLVETVKQFHRSGRIVAAICAAAATVLVPHDIFPIGNMTG 126

Query: 405 YPSVKDQLTSADYKYKEERVVVDRN--LVTSRGPGTAVEFGLKLVEVLKGADAVKGLAEQ 578
           +P++KD++ +   ++ ++RVV D    L+TS+GPGTA++FGLK++++L G +    +A Q
Sbjct: 127 FPALKDKIPAE--QWLDKRVVWDARVKLLTSQGPGTAIDFGLKIIDLLAGREKAHEVASQ 184

Query: 579 MLV 587
           +++
Sbjct: 185 LVM 187
>sp|O06006|YRAA_BACSU Hypothetical protein yraA
          Length = 169

 Score = 92.8 bits (229), Expect = 8e-19
 Identities = 54/166 (32%), Positives = 84/166 (50%)
 Frame = +3

Query: 48  VCIIVANGSEDIEVVATSDVLSRGGIDVVIVGMEGSGNVKLAHKTVIVVEKSLADVKTTL 227
           + ++V +  EDIE  +        G  VV + +E    V   H   + ++K+++DV  + 
Sbjct: 5   IAVLVTDQFEDIEYTSPVKAYEEAGYSVVAIDLEAGKEVTGKHGEKVKIDKAISDVDASD 64

Query: 228 YDALIIPGGMGGVKALSANENVKEMLYNHQKNKKVIGAICAGPLVLSAFGVAAGSTLTSY 407
           +DAL+IPGG      L A++   E      +NKK + AIC GP VL    +  G  +T Y
Sbjct: 65  FDALLIPGGFSP-DLLRADDRPGEFAKAFVENKKPVFAICHGPQVLIDTDLLKGKDITGY 123

Query: 408 PSVKDQLTSADYKYKEERVVVDRNLVTSRGPGTAVEFGLKLVEVLK 545
            S++  L +A   YK+  VVV  N+VTSR P     F  + + +LK
Sbjct: 124 RSIRKDLINAGANYKDAEVVVSHNIVTSRTPDDLEAFNRESLNLLK 169
>sp|Q5HN59|Y1413_STAEQ Hypothetical protein SERP1413
          Length = 172

 Score = 87.0 bits (214), Expect = 4e-17
 Identities = 57/167 (34%), Positives = 83/167 (49%), Gaps = 1/167 (0%)
 Frame = +3

Query: 48  VCIIVANGSEDIEVVATSDVLSRGGIDVVIVGMEGSGNVKLAHKTVIVVEKSLADVKTTL 227
           V II+A+  EDIE+ +  + L   G +  ++G   +  V   H   + V+ S+AD K   
Sbjct: 5   VAIILADEFEDIELTSPKEALENAGFETEVIGDTANHEVVGKHGEKVTVDVSIADAKPEN 64

Query: 228 YDALIIPGGMGGVKALSANENVKEMLYNH-QKNKKVIGAICAGPLVLSAFGVAAGSTLTS 404
           YDAL+IPGG          E        +  KN     AIC GPLVL       G T+T 
Sbjct: 65  YDALLIPGGFSPDHLRGDEEGRYGTFAKYFTKNDVPTFAICHGPLVLVDTDDLKGRTITG 124

Query: 405 YPSVKDQLTSADYKYKEERVVVDRNLVTSRGPGTAVEFGLKLVEVLK 545
             +V+  L++A     +E VVVD N+VTSR P    +F  ++V+ L+
Sbjct: 125 VINVRKDLSNAGANVVDESVVVDNNIVTSRVPDDLDDFNREIVKKLE 171
>sp|Q49YS0|Y918_STAS1 Hypothetical protein SSP0918
          Length = 172

 Score = 86.3 bits (212), Expect = 7e-17
 Identities = 55/166 (33%), Positives = 83/166 (50%), Gaps = 1/166 (0%)
 Frame = +3

Query: 48  VCIIVANGSEDIEVVATSDVLSRGGIDVVIVGMEGSGNVKLAHKTVIVVEKSLADVKTTL 227
           V II+ N  EDIE+ +  + +   G + V++G + +  V   H T + V+ S+AD K   
Sbjct: 5   VAIILTNEFEDIELTSPKEAIEEAGHETVVIGDQANSEVVGKHGTKVAVDVSIADAKPED 64

Query: 228 YDALIIPGGMGGVKALSANENVKEMLYNH-QKNKKVIGAICAGPLVLSAFGVAAGSTLTS 404
           +D L+IPGG          E        +  KN     AIC GP +L       G TLT+
Sbjct: 65  FDGLLIPGGFSPDHLRGDAEGRYGTFAKYFTKNDVPAFAICHGPQILIDTDDLNGRTLTA 124

Query: 405 YPSVKDQLTSADYKYKEERVVVDRNLVTSRGPGTAVEFGLKLVEVL 542
             +V+  L +A  +  +E VVVD+N+VTSR P    +F  ++V  L
Sbjct: 125 VLNVRKDLANAGAQVVDESVVVDKNIVTSRTPDDLDDFNREIVNQL 170
>sp|Q4L7I2|Y1084_STAHJ Hypothetical protein SH1084
          Length = 172

 Score = 85.9 bits (211), Expect = 9e-17
 Identities = 55/167 (32%), Positives = 84/167 (50%), Gaps = 1/167 (0%)
 Frame = +3

Query: 48  VCIIVANGSEDIEVVATSDVLSRGGIDVVIVGMEGSGNVKLAHKTVIVVEKSLADVKTTL 227
           V II++N  EDIE+ +  + +   G +  I+G   +  V   H   ++V+ S+AD K   
Sbjct: 5   VAIILSNEFEDIELTSPKEAIEEAGFETEIIGDTANAEVVGKHGEKVIVDVSIADAKPED 64

Query: 228 YDALIIPGGMGGVKALSANENVKEMLYNH-QKNKKVIGAICAGPLVLSAFGVAAGSTLTS 404
           YD L+IPGG          E        +  KN     AIC GP +L       G TLT+
Sbjct: 65  YDGLLIPGGFSPDHLRGDAEGRYGTFAKYFTKNDVPAFAICHGPQILIDTDDLNGRTLTA 124

Query: 405 YPSVKDQLTSADYKYKEERVVVDRNLVTSRGPGTAVEFGLKLVEVLK 545
             +V+  L++A     +E VVVD+N+VTSR P    +F  ++V+ L+
Sbjct: 125 VLNVRKDLSNAGANVVDESVVVDKNIVTSRTPDDLDDFNREIVKQLQ 171
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 70,688,536
Number of Sequences: 369166
Number of extensions: 1318571
Number of successful extensions: 4300
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4136
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4275
length of database: 68,354,980
effective HSP length: 107
effective length of database: 48,588,335
effective search space used: 5927776870
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
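
Note on the scores: each alignment header above gives a bit score followed by the raw score in parentheses, for example "129 bits (325)". The two are related by the standard Karlin-Altschul conversion S' = (Lambda * S - ln K) / ln 2. The sketch below applies that formula with the gapped Lambda and K printed in the statistics block. The function name bit_score is a hypothetical helper, not part of BLAST, and using the gapped parameters for these alignments is an assumption. Scores of 100 bits or more are printed without decimals in the report and appear to be truncated, so only the hits with a printed decimal can be compared closely.

```python
import math

# Gapped Karlin-Altschul parameters copied from the statistics block above.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw score S to bits: S' = (lambda * S - ln K) / ln 2.

    Hypothetical helper for illustration; not a BLAST API.
    """
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, bit score as printed in the report) for a few of the hits above.
reported = [(325, 129), (272, 109), (244, 98.6), (229, 92.8), (211, 85.9)]

for raw, printed in reported:
    print(f"raw {raw}: computed {bit_score(raw):.1f} bits, report prints {printed}")
```

The computed values agree with the printed bit scores to within rounding. E-values are not reproduced here, since they also depend on the effective search space used for each comparison.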