Planarian EST Database


Dr_sW_004_H05

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_004_H05
         (942 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P27401|POL_SFV3L  Pol polyprotein [Contains: Protease ; R...    75   3e-13
sp|P23074|POL_SFV1  Pol polyprotein [Contains: Protease ; Re...    74   5e-13
sp|P14350|POL_FOAMV  Pol polyprotein [Contains: Reverse tran...    74   8e-13
sp|P10394|POL4_DROME  Retrovirus-related Pol polyprotein fro...    69   1e-11
sp|Q05654|RT21_SCHPO  Retrotransposable element Tf2 155 kDa ...    52   2e-06
sp|Q9C0R2|RT22_SCHPO  Retrotransposable element Tf2 155 kDa ...    52   2e-06
sp|Q9UR07|RT23_SCHPO  Retrotransposable element Tf2 155 kDa ...    52   2e-06
sp|P21414|POL_GALV  Pol polyprotein [Contains: Protease ; Re...    50   7e-06
sp|P31792|POL_FENV1  Pol polyprotein [Contains: Reverse tran...    50   1e-05
sp|P10272|POL_BAEVM  Pol polyprotein [Contains: Protease ; R...    49   3e-05
>sp|P27401|POL_SFV3L Pol polyprotein [Contains: Protease ; Reverse
            transcriptase/ribonuclease H (RT); Integrase (IN)]
          Length = 1157

 Score = 74.7 bits (182), Expect = 3e-13
 Identities = 45/183 (24%), Positives = 85/183 (46%)
 Frame = +2

Query: 2    IISECQACQKHKVLTVKTKEETASLQPSVMVADIYVNICGPLKEARQMRYILGIIDQCSK 181
            +I +C+ C      T+         +P       +++  GPL  +    ++L ++D  + 
Sbjct: 856  VIRQCKQCLVTNAATLAAPPILRPERPVKPFDKFFIDYIGPLPPSNGYLHVLVVVDSMTG 915

Query: 182  YIVLTAIRRQDENTVQRVISNNWLLKFGCPKRIQMDCGRSFESKAMLQFAKRWNIELCFS 361
            ++ L   +    +   + +  N L     PK I  D G +F S     +AK   I+L FS
Sbjct: 916  FVWLYPTKAPSTSATVKAL--NMLTSIAVPKVIHSDQGAAFTSATFADWAKNKGIQLEFS 973

Query: 362  SPYHHNTNVQIERQFKTVRDLLNTTLEERGISNWTEILPEI*FALNSTWQKSINTSPTNF 541
            +PYH  ++ ++ER+   ++ LL   L  R  + W ++LP +  ALN+++  S   +P   
Sbjct: 974  TPYHPQSSGKVERKNSDIKRLLTKLLVGRP-AKWYDLLPVVQLALNNSYSPSSKYTPHQL 1032

Query: 542  VFG 550
            +FG
Sbjct: 1033 LFG 1035
>sp|P23074|POL_SFV1 Pol polyprotein [Contains: Protease ; Reverse
            transcriptase/ribonuclease H (RT); Integrase (IN)]
          Length = 1161

 Score = 74.3 bits (181), Expect = 5e-13
 Identities = 44/182 (24%), Positives = 85/182 (46%)
 Frame = +2

Query: 5    ISECQACQKHKVLTVKTKEETASLQPSVMVADIYVNICGPLKEARQMRYILGIIDQCSKY 184
            I +C+ C       + +      ++P       Y++  GPL  +    ++L ++D  + +
Sbjct: 855  IRQCKQCLVTNATNLTSPPILRPVKPLKPFDKFYIDYIGPLPPSNGYLHVLVVVDSMTGF 914

Query: 185  IVLTAIRRQDENTVQRVISNNWLLKFGCPKRIQMDCGRSFESKAMLQFAKRWNIELCFSS 364
            + L   +    +   + +  N L     PK +  D G +F S     +AK   I+L FS+
Sbjct: 915  VWLYPTKAPSTSATVKAL--NMLTSIAIPKVLHSDQGAAFTSSTFADWAKEKGIQLEFST 972

Query: 365  PYHHNTNVQIERQFKTVRDLLNTTLEERGISNWTEILPEI*FALNSTWQKSINTSPTNFV 544
            PYH  ++ ++ER+   ++ LL   L  R  + W ++LP +  ALN+++  S   +P   +
Sbjct: 973  PYHPQSSGKVERKNSDIKRLLTKLLIGRP-AKWYDLLPVVQLALNNSYSPSSKYTPHQLL 1031

Query: 545  FG 550
            FG
Sbjct: 1032 FG 1033
>sp|P14350|POL_FOAMV Pol polyprotein [Contains: Reverse transcriptase/ribonuclease H (RT);
            Integrase (IN)]
          Length = 886

 Score = 73.6 bits (179), Expect = 8e-13
 Identities = 49/186 (26%), Positives = 86/186 (46%), Gaps = 3/186 (1%)
 Frame = +2

Query: 2    IISECQACQKHKVLTVKTKEETASLQPS---VMVADIYVNICGPLKEARQMRYILGIIDQ 172
            ++ +   CQ+  +     K     L+P          +++  GPL  ++   Y+L ++D 
Sbjct: 643  VVKQLGRCQQCLITNASNKASGPILRPDRPQKPFDKFFIDYIGPLPPSQGYLYVLVVVDG 702

Query: 173  CSKYIVLTAIRRQDENTVQRVISNNWLLKFGCPKRIQMDCGRSFESKAMLQFAKRWNIEL 352
             + +  L   +    +T   V S N L     PK I  D G +F S    ++AK   I L
Sbjct: 703  MTGFTWLYPTKAP--STSATVKSLNVLTSIAIPKVIHSDQGAAFTSSTFAEWAKERGIHL 760

Query: 353  CFSSPYHHNTNVQIERQFKTVRDLLNTTLEERGISNWTEILPEI*FALNSTWQKSINTSP 532
             FS+PYH  +  ++ER+   ++ LL   L  R  + W ++LP +  ALN+T+   +  +P
Sbjct: 761  EFSTPYHPQSGSKVERKNSDIKRLLTKLLVGRP-TKWYDLLPVVQLALNNTYSPVLKYTP 819

Query: 533  TNFVFG 550
               +FG
Sbjct: 820  HQLLFG 825
>sp|P10394|POL4_DROME Retrovirus-related Pol polyprotein from transposon 412 [Contains:
            Protease ; Reverse transcriptase ; Endonuclease]
          Length = 1237

 Score = 69.3 bits (168), Expect = 1e-11
 Identities = 46/184 (25%), Positives = 80/184 (43%), Gaps = 1/184 (0%)
 Frame = +2

Query: 5    ISECQACQKHKVLTVKTKEETASLQPSVMVADIYVNICGPL-KEARQMRYILGIIDQCSK 181
            + +CQ CQK K         T +  P      + V+  GPL K      Y + +I   +K
Sbjct: 936  VRKCQKCQKAKTTKHTKTPMTITETPEHAFDRVVVDTIGPLPKSENGNEYAVTLICDLTK 995

Query: 182  YIVLTAIRRQDENTVQRVISNNWLLKFGCPKRIQMDCGRSFESKAMLQFAKRWNIELCFS 361
            Y+V   I  +   TV + I  +++LK+G  K    D G  +++  +    K   I+   S
Sbjct: 996  YLVAIPIANKSAKTVAKAIFESFILKYGPMKTFITDMGTEYKNSIITDLCKYLKIKNITS 1055

Query: 362  SPYHHNTNVQIERQFKTVRDLLNTTLEERGISNWTEILPEI*FALNSTWQKSINTSPTNF 541
            + +HH T   +ER  +T+ + + + +     ++W   L    +  N+T     N  P   
Sbjct: 1056 TAHHHQTVGVVERSHRTLNEYIRSYI-STDKTDWDVWLQYFVYCFNTTQSMVHNYCPYEL 1114

Query: 542  VFGK 553
            VFG+
Sbjct: 1115 VFGR 1118
>sp|Q05654|RT21_SCHPO Retrotransposable element Tf2 155 kDa protein type 1
          Length = 1333

 Score = 52.0 bits (123), Expect = 2e-06
 Identities = 58/287 (20%), Positives = 104/287 (36%), Gaps = 38/287 (13%)
 Frame = +2

Query: 5    ISECQACQKHKVLTVKTKEETASLQPSVMVAD-IYVNICGPLKEARQMRYILGIIDQCSK 181
            +  C  CQ +K    K       + PS    + + ++    L E+     +  ++D+ SK
Sbjct: 954  VQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSK 1013

Query: 182  YIVLTAIRRQ-DENTVQRVISNNWLLKFGCPKRIQMDCGRSFESKAMLQFAKRWNIELCF 358
              +L    +        R+     +  FG PK I  D    F S+    FA ++N  + F
Sbjct: 1014 MAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKF 1073

Query: 359  SSPYHHNTNVQIERQFKTVRDLLNTTLEERGISNWTEILPEI*FALNSTWQKSINTSPTN 538
            S PY   T+ Q ER  +TV  LL         + W + +  +  + N+    +   +P  
Sbjct: 1074 SLPYRPQTDGQTERTNQTVEKLLRCVCSTHP-NTWVDHISLVQQSYNNAIHSATQMTPFE 1132

Query: 539  FVFG-----K*IARENWN*QTSEKT-------------------------EMK*ESRRSF 628
             V         +   +++ +T E +                         +MK +    F
Sbjct: 1133 IVHRYSPALSPLELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEF 1192

Query: 629  KVGERVLVR------TENRNKYQKRFEGPYQIIKKMHDRRYLLQRDD 751
            + G+ V+V+          NK    F GP+ +++K     Y L   D
Sbjct: 1193 QPGDLVMVKRTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPD 1239
>sp|Q9C0R2|RT22_SCHPO Retrotransposable element Tf2 155 kDa protein type 2
          Length = 1333

 Score = 52.0 bits (123), Expect = 2e-06
 Identities = 58/287 (20%), Positives = 104/287 (36%), Gaps = 38/287 (13%)
 Frame = +2

Query: 5    ISECQACQKHKVLTVKTKEETASLQPSVMVAD-IYVNICGPLKEARQMRYILGIIDQCSK 181
            +  C  CQ +K    K       + PS    + + ++    L E+     +  ++D+ SK
Sbjct: 954  VQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSK 1013

Query: 182  YIVLTAIRRQ-DENTVQRVISNNWLLKFGCPKRIQMDCGRSFESKAMLQFAKRWNIELCF 358
              +L    +        R+     +  FG PK I  D    F S+    FA ++N  + F
Sbjct: 1014 MAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKF 1073

Query: 359  SSPYHHNTNVQIERQFKTVRDLLNTTLEERGISNWTEILPEI*FALNSTWQKSINTSPTN 538
            S PY   T+ Q ER  +TV  LL         + W + +  +  + N+    +   +P  
Sbjct: 1074 SLPYRPQTDGQTERTNQTVEKLLRCVCSTHP-NTWVDHISLVQQSYNNAIHSATQMTPFE 1132

Query: 539  FVFG-----K*IARENWN*QTSEKT-------------------------EMK*ESRRSF 628
             V         +   +++ +T E +                         +MK +    F
Sbjct: 1133 IVHRYSPALSPLELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEF 1192

Query: 629  KVGERVLVR------TENRNKYQKRFEGPYQIIKKMHDRRYLLQRDD 751
            + G+ V+V+          NK    F GP+ +++K     Y L   D
Sbjct: 1193 QPGDLVMVKRTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPD 1239
>sp|Q9UR07|RT23_SCHPO Retrotransposable element Tf2 155 kDa protein type 3
          Length = 1333

 Score = 52.0 bits (123), Expect = 2e-06
 Identities = 58/287 (20%), Positives = 104/287 (36%), Gaps = 38/287 (13%)
 Frame = +2

Query: 5    ISECQACQKHKVLTVKTKEETASLQPSVMVAD-IYVNICGPLKEARQMRYILGIIDQCSK 181
            +  C  CQ +K    K       + PS    + + ++    L E+     +  ++D+ SK
Sbjct: 954  VQNCHTCQINKSRNHKPYGPLQPIPPSERPWESLSMDFITALPESSGYNALFVVVDRFSK 1013

Query: 182  YIVLTAIRRQ-DENTVQRVISNNWLLKFGCPKRIQMDCGRSFESKAMLQFAKRWNIELCF 358
              +L    +        R+     +  FG PK I  D    F S+    FA ++N  + F
Sbjct: 1014 MAILVPCTKSITAEQTARMFDQRVIAYFGNPKEIIADNDHIFTSQTWKDFAHKYNFVMKF 1073

Query: 359  SSPYHHNTNVQIERQFKTVRDLLNTTLEERGISNWTEILPEI*FALNSTWQKSINTSPTN 538
            S PY   T+ Q ER  +TV  LL         + W + +  +  + N+    +   +P  
Sbjct: 1074 SLPYRPQTDGQTERTNQTVEKLLRCVCSTHP-NTWVDHISLVQQSYNNAIHSATQMTPFE 1132

Query: 539  FVFG-----K*IARENWN*QTSEKT-------------------------EMK*ESRRSF 628
             V         +   +++ +T E +                         +MK +    F
Sbjct: 1133 IVHRYSPALSPLELPSFSDKTDENSQETIQVFQTVKEHLNTNNIKMKKYFDMKIQEIEEF 1192

Query: 629  KVGERVLVR------TENRNKYQKRFEGPYQIIKKMHDRRYLLQRDD 751
            + G+ V+V+          NK    F GP+ +++K     Y L   D
Sbjct: 1193 QPGDLVMVKRTKTGFLHKSNKLAPSFAGPFYVLQKSGPNNYELDLPD 1239
>sp|P21414|POL_GALV Pol polyprotein [Contains: Protease ; Reverse
            transcriptase/ribonuclease H (RT); Integrase (IN)]
          Length = 1165

 Score = 50.4 bits (119), Expect = 7e-06
 Identities = 55/266 (20%), Positives = 99/266 (37%), Gaps = 30/266 (11%)
 Frame = +2

Query: 2    IISECQACQKHKVLTVKTKEETASLQPSVMVADIYVNICGPLKEARQ-MRYILGIIDQCS 178
            + S+CQAC     +T  T  ET   Q        +      +K  R   +Y+L  ID  S
Sbjct: 848  VTSQCQACAMTNAVT--TYRETGKRQRGDRPGVYWEVDFTEIKPGRYGNKYLLVFIDTFS 905

Query: 179  KYIVLTAIRRQDENTVQRVISNNWLLKFGCPKRIQMDCGRSFESKAMLQFAKRWNIELCF 358
             ++     + +    V + I    L +FG PK +  D G +F ++     A +  I    
Sbjct: 906  GWVEAFPTKTETALIVCKKILEEILPRFGIPKVLGSDNGPAFVAQVSQGLATQLGINWKL 965

Query: 359  SSPYHHNTNVQIERQFKTVRDLLNTTLEERGISNWTEILPEI*FALNSTWQKSINTSPTN 538
               Y   ++ Q+ER  +T+++ L     E G  +W  +LP       +T       +P  
Sbjct: 966  HCAYRPQSSGQVERMNRTIKETLTKLALETGGKDWVTLLPLALLRARNT-PGRFGLTPYE 1024

Query: 539  FVFG-----------------------------K*IARENWN*QTSEKTEMK*ESRRSFK 631
             ++G                             + +  + W+                F+
Sbjct: 1025 ILYGGPPPILESGETLGPDDRFLPVLFTHLKALEIVRTQIWDQIKEVYKPGTVTIPHPFQ 1084

Query: 632  VGERVLVRTENRNKYQKRFEGPYQII 709
            VG++VLVR    +  + R++GPY ++
Sbjct: 1085 VGDQVLVRRHRPSSLEPRWKGPYLVL 1110
>sp|P31792|POL_FENV1 Pol polyprotein [Contains: Reverse transcriptase/ribonuclease H (RT);
            Integrase (IN)]
          Length = 1046

 Score = 49.7 bits (117), Expect = 1e-05
 Identities = 36/160 (22%), Positives = 66/160 (41%), Gaps = 2/160 (1%)
 Frame = +2

Query: 2    IISECQACQKHKVLTVKTKE--ETASLQPSVMVADIYVNICGPLKEARQMRYILGIIDQC 175
            + S C+ CQ+      +  E   T   +P V     +  +          +Y+L  +D  
Sbjct: 733  VTSACKVCQQVNAGATRVPEGKRTRGNRPGVYWEIDFTEV---KPHYAGYKYLLVFVDTF 789

Query: 176  SKYIVLTAIRRQDENTVQRVISNNWLLKFGCPKRIQMDCGRSFESKAMLQFAKRWNIELC 355
            S ++     R++  + V + I      +FG PK I  D G +F S+     A+   I   
Sbjct: 790  SGWVEAYPTRQETAHMVAKKILEEIFPRFGLPKVIGSDNGPAFVSQVSQGLARTLGINWK 849

Query: 356  FSSPYHHNTNVQIERQFKTVRDLLNTTLEERGISNWTEIL 475
                Y   ++ Q+ER  +T+++ L     E G+ +W  +L
Sbjct: 850  LHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWRRLL 889
>sp|P10272|POL_BAEVM Pol polyprotein [Contains: Protease ; Reverse
            transcriptase/ribonuclease H (RT); Integrase (IN)]
          Length = 1189

 Score = 48.5 bits (114), Expect = 3e-05
 Identities = 28/110 (25%), Positives = 51/110 (46%)
 Frame = +2

Query: 146  RYILGIIDQCSKYIVLTAIRRQDENTVQRVISNNWLLKFGCPKRIQMDCGRSFESKAMLQ 325
            +Y+L  +D  S ++     R++  + V + I      +FG PK I  D G +F S+    
Sbjct: 923  KYLLVFVDTFSGWVEAFPTRQETAHIVAKKILEEIFPRFGLPKVIGSDNGPAFVSQVSQG 982

Query: 326  FAKRWNIELCFSSPYHHNTNVQIERQFKTVRDLLNTTLEERGISNWTEIL 475
             A+   I       Y   ++ Q+ER  +T+++ L     E G+ +W  +L
Sbjct: 983  LARILGINWKLHCAYRPQSSGQVERMNRTIKETLTKLTLETGLKDWRRLL 1032
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 103,211,246
Number of Sequences: 369166
Number of extensions: 2053769
Number of successful extensions: 5179
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4969
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5170
length of database: 68,354,980
effective HSP length: 110
effective length of database: 48,034,130
effective search space used: 9750928390
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)