Planarian EST Database


Dr_sW_025_I23

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_025_I23
         (801 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|O95396|MOCS3_HUMAN  Molybdenum cofactor synthesis protein...   285   1e-76
sp|Q9ZNW0|MOC3_ARATH  Molybdenum cofactor synthesis protein ...   246   7e-65
sp|P38820|YHR1_YEAST  Hypothetical 49.4 kDa protein in CDC12...   215   1e-55
sp|Q09810|YABA_SCHPO  Hypothetical protein C2G11.10c in chro...   206   5e-53
sp|P12282|MOEB_ECOLI  Molybdopterin biosynthesis protein moeB     139   1e-32
sp|Q56067|MOEB_SALTY  Molybdopterin biosynthesis protein moeB     135   2e-31
sp|P51335|YCXH_PORPU  Hypothetical 43.4 kDa protein in rpl9-...   133   6e-31
sp|P30138|THIF_ECOLI  Adenylyltransferase thiF                    123   5e-28
sp|P45211|MOEB_HAEIN  Molybdopterin biosynthesis protein moeB     117   3e-26
sp|P18500|HESA_ANASP  Protein hesA                                 88   3e-17
>sp|O95396|MOCS3_HUMAN Molybdenum cofactor synthesis protein 3 (Molybdopterin synthase
           sulfurylase) (MPT synthase sulfurylase)
          Length = 460

 Score =  285 bits (728), Expect = 1e-76
 Identities = 130/266 (48%), Positives = 186/266 (69%)
 Frame = +3

Query: 3   IGLVDHDIVDESNLHRQICHKESKIGXXXXXXXXXXXXMLNSSINIVHHKVLINSENAID 182
           +GLVD+D+V+ SNL RQ+ H E+  G             LNS++  V +   +    A+D
Sbjct: 109 LGLVDYDVVEMSNLARQVLHGEALAGQAKAFSAAASLRRLNSAVECVPYTQALTPATALD 168

Query: 183 IIKNYDVVLDCTDNVVTRYLINDCCVILKKPLVSASALGLEGQLTVYNYNNGPCYRCLFP 362
           +++ YDVV DC+DNV TRYL+ND CV+  +PLVSASAL  EGQ+TVY+Y+ GPCYRC+FP
Sbjct: 169 LVRRYDVVADCSDNVPTRYLVNDACVLAGRPLVSASALRFEGQITVYHYDGGPCYRCIFP 228

Query: 363 VPPPANTVTNCNEGGVLGTVPGMLGTLQATETIKIITQIGSTYSGKMLLYDAESGNFRNI 542
            PPPA TVTNC +GGVLG V G+LG LQA E +KI   +G +YSG +LL+DA  G+FR+I
Sbjct: 229 QPPPAETVTNCADGGVLGVVTGVLGCLQALEVLKIAAGLGPSYSGSLLLFDALRGHFRSI 288

Query: 543 KLRPRNNNCEVCGDNPSIREPIDYQKFCNAKPSDACGGKLSILSPCDRISVSEYSSILET 722
           +LR R  +C  CG+ P++ + +DY+ FC +  +D C   L +LSP +R+SV++Y  +L++
Sbjct: 289 RLRSRRLDCAACGERPTVTDLLDYEAFCGSSATDKC-RSLQLLSPEERVSVTDYKRLLDS 347

Query: 723 NQPHILIDVRPQVQIDTCRFTNAIQL 800
              H+L+DVRPQV++D CR  +A+ +
Sbjct: 348 GAFHLLLDVRPQVEVDICRLPHALHI 373
>sp|Q9ZNW0|MOC3_ARATH Molybdenum cofactor synthesis protein 3 (Molybdopterin synthase
           sulfurylase) (MPT synthase sulfurylase)
          Length = 464

 Score =  246 bits (627), Expect = 7e-65
 Identities = 116/255 (45%), Positives = 166/255 (65%), Gaps = 4/255 (1%)
 Frame = +3

Query: 3   IGLVDHDIVDESNLHRQICHKESKIGXXXXXXXXXXXXMLNSSINIVHHKVLINSENAID 182
           +G++DHD+V+ +N+HRQI H E+ IG             +NS+I +  +   + + NA++
Sbjct: 118 LGIIDHDVVELNNMHRQIIHTEAFIGHPKVKSAAAACRSINSTIKVDEYVEALRTSNALE 177

Query: 183 IIKNYDVVLDCTDNVVTRYLINDCCVILKKPLVSASALGLEGQLTVYNYNNGPCYRCLFP 362
           I+  YD+++D TDN  +RY+I+DCCV+L KPLVS +ALG+EGQLTVYN+N GPCYRCLFP
Sbjct: 178 ILSQYDIIVDATDNPPSRYMISDCCVLLGKPLVSGAALGMEGQLTVYNHNGGPCYRCLFP 237

Query: 363 VPPPANTVTNCNEGGVLGTVPGMLGTLQATETIKIITQIGSTYSGKMLLYDAESGNFRNI 542
            PPP +    C++ GVLG VPG++G LQA ETIK+ + +G   S +MLL+DA S   R +
Sbjct: 238 TPPPTSACQRCSDSGVLGVVPGVIGCLQALETIKLASLVGEPLSERMLLFDALSARMRIV 297

Query: 543 KLRPRNNNCEVCGDNPSIR----EPIDYQKFCNAKPSDACGGKLSILSPCDRISVSEYSS 710
           K+R R++ C VCGDN S      +  DY+ F          G L++L    RIS  E+  
Sbjct: 298 KIRGRSSQCTVCGDNSSFNKQTFKDFDYEDFTQ---FPLFAGPLNLLPAESRISSKEFKE 354

Query: 711 ILETNQPHILIDVRP 755
           IL+  + H+L+DVRP
Sbjct: 355 ILQKKEQHVLLDVRP 369
>sp|P38820|YHR1_YEAST Hypothetical 49.4 kDa protein in CDC12-ORC6 intergenic region
          Length = 440

 Score =  215 bits (548), Expect = 1e-55
 Identities = 115/279 (41%), Positives = 162/279 (58%), Gaps = 13/279 (4%)
 Frame = +3

Query: 3   IGLVDHDIVDESNLHRQICHKESKIGXXXXXXXXXXXXMLNSSINIVHHKVLINSENAID 182
           IG+VD+D+V+ SNLHRQ+ H  S++G             LN  IN+V + V +NS NA D
Sbjct: 94  IGIVDNDVVETSNLHRQVLHDSSRVGMLKCESARQYITKLNPHINVVTYPVRLNSSNAFD 153

Query: 183 IIKNYDVVLDCTDNVVTRYLINDCCVILKKPLVSASALGLEGQLTVYNYNN-GPCYRCLF 359
           I K Y+ +LDCTD+ +TRYL++D  V L   +VSAS LG EGQLT+ N+NN GPCYRC +
Sbjct: 154 IFKGYNYILDCTDSPLTRYLVSDVAVNLGITVVSASGLGTEGQLTILNFNNIGPCYRCFY 213

Query: 360 PVPPPANTVTNCNEGGVLGTVPGMLGTLQATETIKIITQI--GSTYSGKMLLYDA-ESGN 530
           P PPP N VT+C EGGV+G   G++GT+ A ET+K+I  I     +S  ++LY      +
Sbjct: 214 PTPPPPNAVTSCQEGGVIGPCIGLVGTMMAVETLKLILGIYTNENFSPFLMLYSGFPQQS 273

Query: 531 FRNIKLRPRNNNCEVCGDNPSI------REPIDYQKFCNAKPSDACGGKLSILSPCDRIS 692
            R  K+R R   C  CG N +I      +  I+Y+ FC A+  + C        P +RIS
Sbjct: 274 LRTFKMRGRQEKCLCCGKNRTITKEAIEKGEINYELFCGARNYNVC-------EPDERIS 326

Query: 693 VSEYSSILETNQ---PHILIDVRPQVQIDTCRFTNAIQL 800
           V  +  I + ++    HI +DVRP    +   F  A+ +
Sbjct: 327 VDAFQRIYKDDEFLAKHIFLDVRPSHHYEISHFPEAVNI 365
>sp|Q09810|YABA_SCHPO Hypothetical protein C2G11.10c in chromosome I
          Length = 401

 Score =  206 bits (525), Expect = 5e-53
 Identities = 117/271 (43%), Positives = 162/271 (59%), Gaps = 12/271 (4%)
 Frame = +3

Query: 3   IGLVDHDIVDESNLHRQICHKESKIGXXXXXXXXXXXXMLNSSINIVHHKVLINSENAID 182
           +G++D D+VD+SNLHRQI H  SK G             LN ++ I  +    ++ N   
Sbjct: 70  LGIMDGDVVDKSNLHRQIIHSTSKQGMHKAISAKQFLEDLNPNVIINTYLEFASASNLFS 129

Query: 183 IIKNYDVVLDCTDNVVTRYLINDCCVILKKPLVSASALGLEGQLTVYNYNNGPCYRCLFP 362
           II+ YDVVLDCTDN  TRYLI+D CV+L +PLVSASAL LEGQL +YNY NGPCYRC+FP
Sbjct: 130 IIEQYDVVLDCTDNQYTRYLISDTCVLLGRPLVSASALKLEGQLCIYNYCNGPCYRCMFP 189

Query: 363 VPPPANTVTNCNEGGVLGTVPGMLGTLQATETIKIITQIG----STYSGKMLLYDA-ESG 527
            P P   V +C + G+LG V G +GT+QA ET+K+I  I       +   MLL+ A +  
Sbjct: 190 NPTP--VVASCAKSGILGPVVGTMGTMQALETVKLILHINGIKKDQFDPYMLLFHAFKVP 247

Query: 528 NFRNIKLRPRNNNCEVCGDNPSI------REPIDYQKFCNAKPSDACGGKLSILSPCDRI 689
            +++I++RPR  +C+ CG N  +        P +Y   C+  P+ +       L+P  RI
Sbjct: 248 QWKHIRIRPRQQSCKACGPNKMLSREFMESSPKEYTTICDYVPTLS-----KQLAPIRRI 302

Query: 690 SVSEYSSILETNQPHI-LIDVRPQVQIDTCR 779
           S  +  +++ET+ PHI  +DVR  VQ   CR
Sbjct: 303 SALDLKNLIETS-PHITFLDVREPVQFGICR 332
>sp|P12282|MOEB_ECOLI Molybdopterin biosynthesis protein moeB
          Length = 249

 Score =  139 bits (349), Expect = 1e-32
 Identities = 77/192 (40%), Positives = 107/192 (55%), Gaps = 1/192 (0%)
 Frame = +3

Query: 9   LVDHDIVDESNLHRQICHKESKIGXXXXXXXXXXXXMLNSSINIVHHKVLINSENAIDII 188
           L+D D V  SNL RQ  H ++ +G             +N  I I     L++      +I
Sbjct: 60  LLDFDTVSLSNLQRQTLHSDATVGQPKVESARDALTRINPHIAITPVNALLDDAELAALI 119

Query: 189 KNYDVVLDCTDNVVTRYLINDCCVILKKPLVSASALGLEGQLTVYNYNNG-PCYRCLFPV 365
             +D+VLDCTDNV  R  +N  C   K PLVS +A+ +EGQ+TV+ Y +G PCYRCL  +
Sbjct: 120 AEHDLVLDCTDNVAVRNQLNAGCFAAKVPLVSGAAIRMEGQITVFTYQDGEPCYRCLSRL 179

Query: 366 PPPANTVTNCNEGGVLGTVPGMLGTLQATETIKIITQIGSTYSGKMLLYDAESGNFRNIK 545
                    C E GV+  + G++G+LQA E IK++   G   SGK+++YDA +  FR +K
Sbjct: 180 --FGENALTCVEAGVMAPLIGVIGSLQAMEAIKMLAGYGKPASGKIVMYDAMTCQFREMK 237

Query: 546 LRPRNNNCEVCG 581
           L  RN  CEVCG
Sbjct: 238 LM-RNPGCEVCG 248
>sp|Q56067|MOEB_SALTY Molybdopterin biosynthesis protein moeB
          Length = 249

 Score =  135 bits (339), Expect = 2e-31
 Identities = 74/192 (38%), Positives = 105/192 (54%), Gaps = 1/192 (0%)
 Frame = +3

Query: 9   LVDHDIVDESNLHRQICHKESKIGXXXXXXXXXXXXMLNSSINIVHHKVLINSENAIDII 188
           L+D D V  SNL RQ  H ++ +G             +N  I I      ++ +    +I
Sbjct: 60  LLDFDTVSVSNLQRQTLHSDATVGQPKVESARDALARINPHITITPVNARLDDDAMTSLI 119

Query: 189 KNYDVVLDCTDNVVTRYLINDCCVILKKPLVSASALGLEGQLTVYNY-NNGPCYRCLFPV 365
             + +VLDCTDNV  R  +N  C   K PL+S +A+ +EGQ+TV+ Y  N PCYRCL  +
Sbjct: 120 AGHSLVLDCTDNVSVRNQLNAGCYTAKVPLISGAAIRMEGQVTVFTYRENEPCYRCLSRL 179

Query: 366 PPPANTVTNCNEGGVLGTVPGMLGTLQATETIKIITQIGSTYSGKMLLYDAESGNFRNIK 545
                    C E GV+  + G++G+LQA E IK++   G   SGK+++YDA +  FR +K
Sbjct: 180 --FGENALTCVEAGVMAPLIGVIGSLQAMEAIKLLAHYGQPASGKIVMYDAMTCQFREMK 237

Query: 546 LRPRNNNCEVCG 581
           L  RN  CEVCG
Sbjct: 238 LM-RNPGCEVCG 248
>sp|P51335|YCXH_PORPU Hypothetical 43.4 kDa protein in rpl9-rpl11 intergenic region
           (ORF382)
          Length = 382

 Score =  133 bits (334), Expect = 6e-31
 Identities = 88/270 (32%), Positives = 133/270 (49%), Gaps = 4/270 (1%)
 Frame = +3

Query: 3   IGLVDHDIVDESNLHRQICHKESKIGXXXXXXXXXXXXMLNSSINIVHHKVLINSENAID 182
           IG+VD+DI+D SNL RQI +  + IG             +N + N+      + S NAI+
Sbjct: 65  IGIVDNDIIDISNLQRQILYTVNDIGLSKAYIAKKKILEINPTCNVQIFNTRLQSINAIE 124

Query: 183 IIKNYDVVLDCTDNVVTRYLINDCCVILKKPLVSASALGLEGQLTVYNYNNGPCYRCLF- 359
           II+ YD+++D TDN  +RY+I+D C+ L K  +  +    EGQ++ +NY  GP YR    
Sbjct: 125 IIRQYDIIIDGTDNFGSRYIISDSCLELNKIHIYGAIFQFEGQVSTFNYQGGPKYRDFHN 184

Query: 360 PVPPPANTVTNCNEGGVLGTVPGMLGTLQATETIKIITQIGSTYSGKMLLYDAESGNFRN 539
            +    N    C+  GVLG +PG++GTLQATE IKII    S  SG +L Y+A + +F  
Sbjct: 185 NIETENNPEDTCSNAGVLGLLPGIIGTLQATEAIKIILGYKSVLSGIILKYNAMTISFEK 244

Query: 540 IKLRPRNNNCEVCGDNPSIREPIDYQKFCNAKPSDACGGKL--SILSPCDRISVSEYSSI 713
            K                    I + +F  ++P       L  +   P   I V E  + 
Sbjct: 245 FK--------------------IIHTQFILSQPKKKIKSLLVGNSSYPVQEIDVIELQNE 284

Query: 714 LETNQ-PHILIDVRPQVQIDTCRFTNAIQL 800
           L  N   +I++DVR + + +      A+ L
Sbjct: 285 LYRNSFKYIILDVRSKEEYEESHLDKAVNL 314
>sp|P30138|THIF_ECOLI Adenylyltransferase thiF
          Length = 251

 Score =  123 bits (309), Expect = 5e-28
 Identities = 78/193 (40%), Positives = 103/193 (53%), Gaps = 2/193 (1%)
 Frame = +3

Query: 9   LVDHDIVDESNLHRQICHKESKIGXXXXXXXXXXXXMLNSSINIVHHKVLINSENAIDII 188
           L D D V  SNL RQI      I              LN  I +   +  +  E   D +
Sbjct: 57  LADDDDVHLSNLQRQILFTTEDIDRPKSQVSQQRLTQLNPDIQLTALQQRLTGEALKDAV 116

Query: 189 KNYDVVLDCTDNVVTRYLINDCCVILKKPLVSASALGLEGQLTVYN--YNNGPCYRCLFP 362
              DVVLDCTDN+ TR  IN  CV L  PL++ASA+G  GQL V    +  G CYRCL+P
Sbjct: 117 ARADVVLDCTDNMATRQEINAACVALNTPLITASAVGFGGQLMVLTPPWEQG-CYRCLWP 175

Query: 363 VPPPANTVTNCNEGGVLGTVPGMLGTLQATETIKIITQIGSTYSGKMLLYDAESGNFRNI 542
                    NC   GV+G V G++GTLQA E IK+++ I  T +G++ L+D +S  +R++
Sbjct: 176 --DNQEPERNCRTAGVVGPVVGVMGTLQALEAIKLLSGI-ETPAGELRLFDGKSSQWRSL 232

Query: 543 KLRPRNNNCEVCG 581
            LR R + C VCG
Sbjct: 233 ALR-RASGCPVCG 244
>sp|P45211|MOEB_HAEIN Molybdopterin biosynthesis protein moeB
          Length = 243

 Score =  117 bits (294), Expect = 3e-26
 Identities = 67/181 (37%), Positives = 101/181 (55%), Gaps = 1/181 (0%)
 Frame = +3

Query: 9   LVDHDIVDESNLHRQICHKESKIGXXXXXXXXXXXXMLNSSINIVHHKVLINSENAIDII 188
           L+D D V  SNL RQ+ H ++++              +N  INI      ++ E   +II
Sbjct: 60  LLDFDTVSLSNLQRQVLHCDARLNMPKVESAKIALEQINPHINIETINAKLDEEKLAEII 119

Query: 189 KNYDVVLDCTDNVVTRYLINDCCVILKKPLVSASALGLEGQLTVYNYN-NGPCYRCLFPV 365
            ++D+VLDCTDNV  R  ++  C  +K PL+S +A+ +EGQ++V+ Y  N P YR L  +
Sbjct: 120 PHFDIVLDCTDNVEIRNQLDRQCNHMKVPLISGAAIRMEGQVSVFTYEPNTPTYRDLSKL 179

Query: 366 PPPANTVTNCNEGGVLGTVPGMLGTLQATETIKIITQIGSTYSGKMLLYDAESGNFRNIK 545
                 V +C E GVL  + G++G +QA E IK+  +IG    G++L+ D  S N R IK
Sbjct: 180 --FRQNVLSCVEAGVLAPIVGIVGCIQALEAIKVRLKIGKNLCGRLLMIDGFSMNIREIK 237

Query: 546 L 548
           L
Sbjct: 238 L 238
>sp|P18500|HESA_ANASP Protein hesA
          Length = 252

 Score = 87.8 bits (216), Expect = 3e-17
 Identities = 56/183 (30%), Positives = 83/183 (45%)
 Frame = +3

Query: 39  NLHRQICHKESKIGXXXXXXXXXXXXMLNSSINIVHHKVLINSENAIDIIKNYDVVLDCT 218
           +++RQ+   +  +G             +N  I I      I SEN   ++++ D+ LDC 
Sbjct: 55  DMNRQVLMTDDWVGKPRVFKAKETLQAINPDIQIETIHDYITSENVDSLVQSADMALDCA 114

Query: 219 DNVVTRYLINDCCVILKKPLVSASALGLEGQLTVYNYNNGPCYRCLFPVPPPANTVTNCN 398
            N   R L+N  CV  +KP+V A+  G+E  LT       PC  C+FP  P  +      
Sbjct: 115 HNFTERDLLNSACVRWRKPMVEAAMDGMEAYLTTIIPGVTPCLSCIFPEKPDWDR----R 170

Query: 399 EGGVLGTVPGMLGTLQATETIKIITQIGSTYSGKMLLYDAESGNFRNIKLRPRNNNCEVC 578
              VLG V G L  L A E IK+IT        ++L  D     F   +L  R+ +C VC
Sbjct: 171 GFSVLGAVSGTLACLTALEAIKLITGFSQPLLSQLLTIDLNRMEFAKRRLY-RDRSCPVC 229

Query: 579 GDN 587
           G++
Sbjct: 230 GND 232
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 84,337,353
Number of Sequences: 369166
Number of extensions: 1651819
Number of successful extensions: 4773
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4562
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4746
length of database: 68,354,980
effective HSP length: 109
effective length of database: 48,218,865
effective search space used: 7570361805
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
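
Note: the bit scores and E-values in this report can be reproduced approximately from the gapped Lambda and K values and the "effective search space used" figure listed above. The sketch below assumes the standard Karlin-Altschul relations, bits = (Lambda * raw - ln K) / ln 2 and E = search_space * 2^(-bits). The function and constant names are illustrative only and are not part of BLAST. Because the report prints Lambda and K to three significant figures, results agree with the report only to within rounding.

```python
import math

# Gapped Karlin-Altschul parameters copied from the statistics block of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
# "effective search space used" from the same block.
EFFECTIVE_SEARCH_SPACE = 7570361805


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw scores are the parenthesized numbers on the "Score =" lines:
    # MOCS3_HUMAN (728), MOC3_ARATH (627), HESA_ANASP (216).
    for name, raw in [("MOCS3_HUMAN", 728), ("MOC3_ARATH", 627), ("HESA_ANASP", 216)]:
        bits = bit_score(raw)
        print(f"{name}: raw={raw} bits={bits:.1f} E={expect_value(bits):.0e}")
```

For the top hit (raw score 728) this gives roughly 285 bits and an E-value on the order of 1e-76, consistent with the values reported for sp|O95396|MOCS3_HUMAN.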