Planarian EST Database


Dr_sW_002_M20

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_002_M20
         (524 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P41606|RPOC2_PINTH  DNA-directed RNA polymerase beta'' ch...    34   0.25 
sp|Q85X62|RPOC2_PINKO  DNA-directed RNA polymerase beta'' ch...    33   0.42 
sp|Q04351|Y3709_CLOAB  Hypothetical protein CAC3709                32   0.93 
sp|Q8TRB8|RPOA2_METAC  DNA-directed RNA polymerase subunit A''     31   1.6  
sp|O33845|DPOL_THEAG  DNA polymerase (Pol Tfu) [Contains: Ta...    30   2.7  
sp|P84329|HFW1_DROPS  Halfway protein (Singed wings protein)       30   3.5  
sp|Q48514|TRA3_LEPBO  Transposase for insertion sequence ele...    30   4.6  
sp|P22967|ACET_MOUSE  Angiotensin-converting enzyme, testis-...    30   4.6  
sp|P09470|ACE_MOUSE  Angiotensin-converting enzyme, somatic ...    30   4.6  
sp|Q8RXF1|SF3A1_ARATH  Probable splicing factor 3 subunit 1        29   6.0  
>sp|P41606|RPOC2_PINTH DNA-directed RNA polymerase beta'' chain (PEP) (Plastid-encoded RNA
           polymerase beta'' subunit) (RNA polymerase beta''
           subunit)
          Length = 1224

 Score = 33.9 bits (76), Expect = 0.25
 Identities = 21/77 (27%), Positives = 42/77 (54%), Gaps = 1/77 (1%)
 Frame = +3

Query: 111 FNGQVLHEDYNSRYYNIRDGSVIQIVTGNLTNSINLIVEVQSIR-PNHNALYIKVDQNIT 287
           FNG++   +        R+G    +   NL+ +I+   +VQ++  P  + L ++ DQ + 
Sbjct: 373 FNGKIEFNENLVYPTRTRNGHPAYLCHNNLSITIDGQDQVQNLTIPPQSLLLVQNDQYVE 432

Query: 288 TEQLLSEIRTQISFIKD 338
           +EQL++E+R + S  K+
Sbjct: 433 SEQLIAEVRARTSSFKE 449
>sp|Q85X62|RPOC2_PINKO DNA-directed RNA polymerase beta'' chain (PEP) (Plastid-encoded RNA
           polymerase beta'' subunit) (RNA polymerase beta''
           subunit)
          Length = 1209

 Score = 33.1 bits (74), Expect = 0.42
 Identities = 20/77 (25%), Positives = 42/77 (54%), Gaps = 1/77 (1%)
 Frame = +3

Query: 111 FNGQVLHEDYNSRYYNIRDGSVIQIVTGNLTNSINLIVEVQSIR-PNHNALYIKVDQNIT 287
           FNG++   +        R+G    +   NL+ +I+   +VQ++  P  + L ++ DQ + 
Sbjct: 373 FNGKIEFNENLVYPTRTRNGHPAYLCHNNLSITIDGQNQVQNLTIPPQSLLLVQNDQYVE 432

Query: 288 TEQLLSEIRTQISFIKD 338
           +EQ+++E+R + S  K+
Sbjct: 433 SEQIIAEVRARTSSFKE 449
>sp|Q04351|Y3709_CLOAB Hypothetical protein CAC3709
          Length = 1498

 Score = 32.0 bits (71), Expect = 0.93
 Identities = 24/84 (28%), Positives = 43/84 (51%), Gaps = 10/84 (11%)
 Frame = +3

Query: 48  LIKSLKALWRDKYNLEIESLHFNG------QVLHEDYNSRYY----NIRDGSVIQIVTGN 197
           LIK++  +  DKY+ E E LHF          + E YNSR Y    + +D  V+ + TGN
Sbjct: 306 LIKAVAEI-SDKYDEETEVLHFRNPSPEKLAEMIETYNSRIYERMASDKDFLVVTLGTGN 364

Query: 198 LTNSINLIVEVQSIRPNHNALYIK 269
             +++++  ++     +  +L +K
Sbjct: 365 QASNLSVDSDINDRDNDEKSLRVK 388
>sp|Q8TRB8|RPOA2_METAC DNA-directed RNA polymerase subunit A''
          Length = 397

 Score = 31.2 bits (69), Expect = 1.6
 Identities = 28/115 (24%), Positives = 50/115 (43%), Gaps = 8/115 (6%)
 Frame = +3

Query: 54  KSLKALWRDKYNLEIESLHFNGQVLHEDYNSRYYNIRDGSVIQIV--------TGNLTNS 209
           + L  L +  +N+ ++ +    +V+       Y    +GS ++ V        T   TN+
Sbjct: 208 RELLQLAKSIHNVTLKGIEGIKRVVVRKEGEEYTLYTEGSALRDVLQFEGVDKTRTSTNN 267

Query: 210 INLIVEVQSIRPNHNALYIKVDQNITTEQLLSEIRTQISFIKDSAYLTCRGQILQ 374
           IN I EV  I    NA+  +    +  + L  +IR  I  + D   +TC G++ Q
Sbjct: 268 INEIYEVLGIEAARNAIIKEATDTLREQGLTVDIR-HIMLVAD--LMTCDGEVKQ 319
>sp|O33845|DPOL_THEAG DNA polymerase (Pol Tfu) [Contains: Tag pol-1 intein (Tsp-TY pol-1)
            (Intein I); Tag pol-2 intein (Tsp-TY pol-2) (Intein II);
            Tag pol-3 intein (Tsp-TY pol-3) (Intein III)]
          Length = 1829

 Score = 30.4 bits (67), Expect = 2.7
 Identities = 22/80 (27%), Positives = 38/80 (47%), Gaps = 1/80 (1%)
 Frame = +3

Query: 51   IKSLKALWRDKYNLEIESLHFN-GQVLHEDYNSRYYNIRDGSVIQIVTGNLTNSINLIVE 227
            IK +KAL R KY  E   +  N G+ +H       + IR+G + +I  G      +LI+ 
Sbjct: 918  IKKVKALIRHKYKGEAYEVELNSGRKIHITRGHSLFTIRNGKIKEI-WGEEVKVGDLIIV 976

Query: 228  VQSIRPNHNALYIKVDQNIT 287
             + ++ N     I + + I+
Sbjct: 977  PKKVKLNEKEAVINIPELIS 996
>sp|P84329|HFW1_DROPS Halfway protein (Singed wings protein)
          Length = 587

 Score = 30.0 bits (66), Expect = 3.5
 Identities = 26/87 (29%), Positives = 40/87 (45%)
 Frame = +3

Query: 177 IQIVTGNLTNSINLIVEVQSIRPNHNALYIKVDQNITTEQLLSEIRTQISFIKDSAYLTC 356
           + +  GN+T  IN         P H+AL      NI+   + SEI +++   KD  +L  
Sbjct: 226 LAVTDGNITRLINAF-------PRHSALKCL---NISNNNI-SEIPSRM--FKDVPHLEF 272

Query: 357 RGQILQSASSLAHQNINNNDTIVVLMR 437
            G    + S + H N N N T+ + MR
Sbjct: 273 FGMSRNNLSLVPHHNQNKNITVDIRMR 299
>sp|Q48514|TRA3_LEPBO Transposase for insertion sequence element IS1533
          Length = 346

 Score = 29.6 bits (65), Expect = 4.6
 Identities = 16/57 (28%), Positives = 27/57 (47%), Gaps = 5/57 (8%)
 Frame = +3

Query: 9   EVNRFNETLDSSKLIKSLKAL-----WRDKYNLEIESLHFNGQVLHEDYNSRYYNIR 164
           ++ R  + L    L K +K L     W   +   + +LHFN ++L E +N  Y  +R
Sbjct: 141 DLGRNRQRLMKFLLRKDIKLLPTTKYWTVSHYKWLNNLHFNNEILQETFNDYYSRVR 197
>sp|P22967|ACET_MOUSE Angiotensin-converting enzyme, testis-specific isoform precursor
           (ACE-T) (Dipeptidyl carboxypeptidase I) (Kininase II)
           [Contains: Angiotensin-converting enzyme,
           testis-specific isoform, soluble form]
          Length = 732

 Score = 29.6 bits (65), Expect = 4.6
 Identities = 24/96 (25%), Positives = 44/96 (45%), Gaps = 5/96 (5%)
 Frame = +3

Query: 36  DSSKLIKSLKALWRDKYNLEIESLHFNGQVLHEDYNSRYYNIRDGSVIQIVTGNL----- 200
           +S  L + L+ L+++   L +    +  + LH  Y S Y N+ DG +   + GN+     
Sbjct: 255 ESDNLEQDLEKLYQELQPLYLNLHAYVRRSLHRHYGSEYINL-DGPIPAHLLGNMWAQTW 313

Query: 201 TNSINLIVEVQSIRPNHNALYIKVDQNITTEQLLSE 308
           +N  +L+    S  PN +A    + Q  T  ++  E
Sbjct: 314 SNIYDLVAPFPS-APNIDATEAMIKQGWTPRRIFKE 348
>sp|P09470|ACE_MOUSE Angiotensin-converting enzyme, somatic isoform precursor (Dipeptidyl
            carboxypeptidase I) (Kininase II) [Contains:
            Angiotensin-converting enzyme, somatic isoform, soluble
            form]
          Length = 1312

 Score = 29.6 bits (65), Expect = 4.6
 Identities = 24/96 (25%), Positives = 44/96 (45%), Gaps = 5/96 (5%)
 Frame = +3

Query: 36   DSSKLIKSLKALWRDKYNLEIESLHFNGQVLHEDYNSRYYNIRDGSVIQIVTGNL----- 200
            +S  L + L+ L+++   L +    +  + LH  Y S Y N+ DG +   + GN+     
Sbjct: 835  ESDNLEQDLEKLYQELQPLYLNLHAYVRRSLHRHYGSEYINL-DGPIPAHLLGNMWAQTW 893

Query: 201  TNSINLIVEVQSIRPNHNALYIKVDQNITTEQLLSE 308
            +N  +L+    S  PN +A    + Q  T  ++  E
Sbjct: 894  SNIYDLVAPFPS-APNIDATEAMIKQGWTPRRIFKE 928
>sp|Q8RXF1|SF3A1_ARATH Probable splicing factor 3 subunit 1
          Length = 785

 Score = 29.3 bits (64), Expect = 6.0
 Identities = 21/93 (22%), Positives = 40/93 (43%)
 Frame = +3

Query: 171 SVIQIVTGNLTNSINLIVEVQSIRPNHNALYIKVDQNITTEQLLSEIRTQISFIKDSAYL 350
           + I++   N  +   + + VQS+  N  +L  K+   I       ++  +  F+KD+   
Sbjct: 703 ATIRVSKPNENDGQFMEITVQSLSENVGSLKEKIAGEIQIPANKQKLSGKAGFLKDNM-- 760

Query: 351 TCRGQILQSASSLAHQNINNNDTIVVLMRTRGG 449
                      SLAH N+   + + + +R RGG
Sbjct: 761 -----------SLAHYNVGAGEILTLSLRERGG 782
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 59,213,201
Number of Sequences: 369166
Number of extensions: 1099944
Number of successful extensions: 2648
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 2590
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2648
length of database: 68,354,980
effective HSP length: 103
effective length of database: 49,327,275
effective search space used: 3502236525
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
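
Note on the scores above: the bit scores in the alignment headers can be recomputed from the raw scores in parentheses and the gapped Lambda and K printed in this footer, using the standard Karlin-Altschul relation S' = (Lambda*S - ln K) / ln 2. The Python sketch below does that, and also estimates an E-value as search_space * 2^(-S'). The function and constant names are illustrative and not part of BLAST. Because the printed parameters are rounded and BLASTX applies its own length adjustments, the estimated E-value should only be expected to land near the reported one, not match it exactly.

```python
import math

# Values copied from this report's footer (gapped statistics).
LAMBDA_GAPPED = 0.267        # "Gapped Lambda"
K_GAPPED = 0.0410            # "Gapped K"
SEARCH_SPACE = 3502236525    # "effective search space used"


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to a bit score (Karlin-Altschul)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=SEARCH_SPACE):
    """Approximate E-value: search space times 2 to the minus bit score."""
    return search_space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores taken from the first three alignments in the report.
    for raw in (76, 74, 71):
        print(raw, round(bit_score(raw), 1), expect_value(raw))
```

For the top hit (raw score 76) the recomputed bit score rounds to the 33.9 bits shown in the report; the approximate E-value comes out in the same range as the reported 0.25.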