Planarian EST Database


Dr_sW_003_L19

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_003_L19
         (767 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q9BSH4|U082_HUMAN  UPF0082 protein PRO0477                      87   4e-17
sp|Q8K0Z7|U082_MOUSE  UPF0082 protein                              84   4e-16
sp|Q8U9K1|Y1C7_AGRT5  Hypothetical UPF0082 protein Atu3727/A...    77   4e-14
sp|Q838A9|Y663_ENTFA  Hypothetical UPF0082 protein EF0663          77   7e-14
sp|Q8Y6Z5|YF35_LISMO  Hypothetical UPF0082 protein lmo1535         76   9e-14
sp|Q65GN7|Y2909_BACLD  Hypothetical UPF0082 protein BLi02909...    75   1e-13
sp|P62037|Y904_LACJO  Hypothetical UPF0082 protein LJ0904          75   1e-13
sp|Q71ZD5|Y1554_LISMF  Hypothetical UPF0082 protein LMOf2365...    75   3e-13
sp|Q5QYV3|Y1088_IDILO  Hypothetical UPF0082 protein IL1088         75   3e-13
sp|Q8YPC2|Y4276_ANASP  Hypothetical UPF0082 protein All4276        74   3e-13
>sp|Q9BSH4|U082_HUMAN UPF0082 protein PRO0477
          Length = 297

 Score = 87.4 bits (215), Expect = 4e-17
 Identities = 59/219 (26%), Positives = 122/219 (55%), Gaps = 4/219 (1%)
 Frame = +2

Query: 116 AGHSHWQNVKQTKDMKDKIKCDNAMLISRLMLNA-IKIGDGIRDPKLNSHLASVIDRARS 292
           AGH+ W  V+  K  KD    + + + S+L LN  + + +G  +P+ NS+LA++++  RS
Sbjct: 58  AGHNKWSKVRHIKGPKD---VERSRIFSKLCLNIRLAVKEGGPNPEHNSNLANILEVCRS 114

Query: 293 LNVSMAALEKILSKNETVDPYFI-EYQAPGGIFVVIESCNKHIVKEKSTIVSVVKKYGFK 469
            ++  + +E  L   ++ D Y + E + PGG  ++IE+ +    K ++ I  ++ K G  
Sbjct: 115 KHMPKSTIETALKMEKSKDTYLLYEGRGPGGSSLLIEALSNSSHKCQADIRHILNKNGGV 174

Query: 470 LAPTSFLKVAFSYKGEVSCKPDNNILS--NEDQGLEVALEVGADEVIYAKDEETDSMLYK 643
           +A  +  + +F  KG +  + ++      N ++ LE+A+E GA++V   +DEE +  ++K
Sbjct: 175 MAVGA--RHSFDKKGVIVVEVEDREKKAVNLERALEMAIEAGAEDVKETEDEE-ERNVFK 231

Query: 644 FFCDPNNITQIKSALQNKKLEVVNSTDYLVPLNRVKITD 760
           F CD +++ Q++  L +  L  V+     +P ++V++ +
Sbjct: 232 FICDASSLHQVRKKLDSLGLCSVSCALEFIPNSKVQLAE 270
>sp|Q8K0Z7|U082_MOUSE UPF0082 protein
          Length = 294

 Score = 84.0 bits (206), Expect = 4e-16
 Identities = 61/219 (27%), Positives = 117/219 (53%), Gaps = 4/219 (1%)
 Frame = +2

Query: 116 AGHSHWQNVKQTKDMKDKIKCDNAMLISRLMLNA-IKIGDGIRDPKLNSHLASVIDRARS 292
           AGH+ W  V+  K  KD    + + + S+L L+  + + +G  +P+ NS LA++++  RS
Sbjct: 55  AGHNKWSKVRHIKGPKDM---ERSRIFSKLTLSIRLAVKEGGPNPENNSSLANILELCRS 111

Query: 293 LNVSMAALEKILSKNETVDPYFI-EYQAPGGIFVVIESCNKHIVKEKSTIVSVVKKYGFK 469
            N+  + +E  L   +    Y + E + PGG  ++IE+ +    K    I  ++ K G  
Sbjct: 112 KNMPKSTIESALKTEKNKGIYLLYEGRGPGGSSLLIEALSNSGPKCHLDIKYILNKNGGM 171

Query: 470 LAPTSFLKVAFSYKGEVSCKPDNNILS--NEDQGLEVALEVGADEVIYAKDEETDSMLYK 643
           +A  +  +  F  KG V    ++      N ++ LE+A+E GA++V  A+DEE +  L+K
Sbjct: 172 MAEGA--RHFFDKKGVVVVGVEDREKKAVNLERALELAIEAGAEDVKEAEDEE-EKNLFK 228

Query: 644 FFCDPNNITQIKSALQNKKLEVVNSTDYLVPLNRVKITD 760
           F CD +++ Q++  L +  L  V+ +   +P ++V++ +
Sbjct: 229 FICDASSLHQVRKKLDSLGLCPVSCSMEFIPHSKVQLAE 267
>sp|Q8U9K1|Y1C7_AGRT5 Hypothetical UPF0082 protein Atu3727/AGR_L_2215
          Length = 248

 Score = 77.4 bits (189), Expect = 4e-14
 Identities = 56/221 (25%), Positives = 103/221 (46%), Gaps = 5/221 (2%)
 Frame = +2

Query: 116 AGHSHWQNVKQTKDMKDKIKCDNAMLISRLMLNAIKIGDGIRDPKLNSHLASVIDRARSL 295
           AGHS ++N+   K  +D ++      ++R +  A K G  + DP +N+ L   I  A++ 
Sbjct: 2   AGHSQFKNIMHRKGKQDSVRSKMFSKLAREITVAAKTG--MPDPNMNARLRLAIQNAKAQ 59

Query: 296 NVSMAALEKILSK-----NETVDPYFIEYQAPGGIFVVIESCNKHIVKEKSTIVSVVKKY 460
           ++    +E+ + K     +E  D    E   PGG+ VV+E+   +  +  S + S+  K 
Sbjct: 60  SMPKDNIERAIKKASGADSENYDEVRYEGYGPGGVAVVVEALTDNRNRTASNVRSIFTKA 119

Query: 461 GFKLAPTSFLKVAFSYKGEVSCKPDNNILSNEDQGLEVALEVGADEVIYAKDEETDSMLY 640
           G  L  T  +  +F   GE++ K +   + + D+ +E A+E GAD+V  ++D  T     
Sbjct: 120 GGALGETGSVSFSFDRVGEITYKAE---VGDADKVMEAAIEAGADDVESSEDGHT----- 171

Query: 641 KFFCDPNNITQIKSALQNKKLEVVNSTDYLVPLNRVKITDE 763
              C    + ++  AL+    E  +      P N V + +E
Sbjct: 172 -IICGFEAMNEVSKALEGVLGEAESVKAIWKPQNTVPVDEE 211
>sp|Q838A9|Y663_ENTFA Hypothetical UPF0082 protein EF0663
          Length = 242

 Score = 76.6 bits (187), Expect = 7e-14
 Identities = 53/222 (23%), Positives = 100/222 (45%), Gaps = 6/222 (2%)
 Frame = +2

Query: 116 AGHSHWQNVKQTKDMKDKIKCDNAMLISRLMLNAIKIGDGIRDPKLNSHLASVIDRARSL 295
           +GHS W N++  K+ +D  +      +SR +  A K G    DP +N  L   +D+A+S 
Sbjct: 2   SGHSKWSNIQGRKNAQDAKRGKIFQKVSREIYMAAKAGG--PDPAMNPALRLAVDKAKSA 59

Query: 296 NVSMAALEKILSK------NETVDPYFIEYQAPGGIFVVIESCNKHIVKEKSTIVSVVKK 457
           N+    + + + K       E  D    E   PGG+ V++ +   +  +  + +     +
Sbjct: 60  NMPNDNIARAIKKASSAGEGEHYDEVTYEGYGPGGVAVLVHALTDNRNRTATNVRVAFTR 119

Query: 458 YGFKLAPTSFLKVAFSYKGEVSCKPDNNILSNEDQGLEVALEVGADEVIYAKDEETDSML 637
            G  L  T  +   F  KG +  K +++ +  ED  LEV LE G +++      ET   +
Sbjct: 120 NGGSLGETGSVNYMFDRKGYIVIKREDHAI-EEDDMLEVVLEAGGEDI------ETSPEV 172

Query: 638 YKFFCDPNNITQIKSALQNKKLEVVNSTDYLVPLNRVKITDE 763
           ++ +  P + T ++ AL+     +  +   +VP   + + DE
Sbjct: 173 FEIYTAPEDFTAVRDALEQAGYSLAQAELTMVPQTLLTLNDE 214
>sp|Q8Y6Z5|YF35_LISMO Hypothetical UPF0082 protein lmo1535
          Length = 241

 Score = 76.3 bits (186), Expect = 9e-14
 Identities = 54/198 (27%), Positives = 95/198 (47%), Gaps = 6/198 (3%)
 Frame = +2

Query: 116 AGHSHWQNVKQTKDMKDKIKCDNAMLISRLMLNAIKIGDGIRDPKLNSHLASVIDRARSL 295
           +GHS W N++  K+ +D  +      ++R +  A K G    DP LN  L  V+D+A+++
Sbjct: 2   SGHSKWNNIQGRKNAQDSKRSKVFQKLAREIFVAAKKGP---DPSLNPSLRLVMDKAKAV 58

Query: 296 NVSMAALEKILSK------NETVDPYFIEYQAPGGIFVVIESCNKHIVKEKSTIVSVVKK 457
           N+    +++ + K       E  D    E  APGGI V++ +   +  +  + +     K
Sbjct: 59  NMPNDNIKRAIDKASGNTSGENYDEVTYEGYAPGGIAVLVHALTDNKNRTSTNVRVAFNK 118

Query: 458 YGFKLAPTSFLKVAFSYKGEVSCKPDNNILSNEDQGLEVALEVGADEVIYAKDEETDSML 637
            G  L  T  +   F  KG +    +   +  E+  LE A+E GAD+V  ++D      +
Sbjct: 119 NGGSLGETGSVSYMFDRKGYLVILREGLTVDEEEFMLE-AIEAGADDVEVSED------V 171

Query: 638 YKFFCDPNNITQIKSALQ 691
           ++ F DP   +++K ALQ
Sbjct: 172 FEIFTDPATFSEVKEALQ 189
>sp|Q65GN7|Y2909_BACLD Hypothetical UPF0082 protein BLi02909/BL01150
          Length = 240

 Score = 75.5 bits (184), Expect = 1e-13
 Identities = 49/223 (21%), Positives = 112/223 (50%), Gaps = 6/223 (2%)
 Frame = +2

Query: 116 AGHSHWQNVKQTKDMKDKIKCDNAMLISRLMLNAIKIGDGIRDPKLNSHLASVIDRARSL 295
           AGHS W+N+++ K+ +D  +    M +++ +  A K  +G  DP+ N+ L  VID+A++ 
Sbjct: 2   AGHSKWKNIQRRKNAQDAKRGKIFMKLAKEIYVAAK--EGGPDPESNASLRLVIDKAKNA 59

Query: 296 NVSMAALEKILSK------NETVDPYFIEYQAPGGIFVVIESCNKHIVKEKSTIVSVVKK 457
           N+    +++ + K       ++ +    E   P G+ V+++    +  +  +++ +   K
Sbjct: 60  NMPNDNIDRAIKKASGSQDGKSYEEITYEGYGPSGVAVMVKCLTDNKNRTATSVRTAFSK 119

Query: 458 YGFKLAPTSFLKVAFSYKGEVSCKPDNNILSNEDQGLEVALEVGADEVIYAKDEETDSML 637
            G  L  T  +   F  KG ++ + ++  +  E+  LEV ++ GA+E+      ET   L
Sbjct: 120 NGGSLGETGCVSYMFDRKGYIAIEREDLEIDEEEFMLEV-IDAGAEEL------ETSEEL 172

Query: 638 YKFFCDPNNITQIKSALQNKKLEVVNSTDYLVPLNRVKITDEM 766
           ++ + +P    ++K +L+ +  ++  S   +VP    ++ + +
Sbjct: 173 FEIYTEPEQFEEVKKSLEERGYKLATSEITMVPQTYAEVDEAL 215
>sp|P62037|Y904_LACJO Hypothetical UPF0082 protein LJ0904
          Length = 243

 Score = 75.5 bits (184), Expect = 1e-13
 Identities = 55/218 (25%), Positives = 99/218 (45%), Gaps = 5/218 (2%)
 Frame = +2

Query: 116 AGHSHWQNVKQTKDMKDKIKCDNAMLISRLMLNAIKIGDGIRDPKLNSHLASVIDRARSL 295
           +GHS W N++  K+ +D  +      +SR +  A K G    DP  N  L  V+D+AR+ 
Sbjct: 2   SGHSKWHNIQGRKNAQDAKRGKVFQKLSREIYMAAKSGGP--DPSGNPTLRMVMDKARAA 59

Query: 296 NVSMAALEKILSK-----NETVDPYFIEYQAPGGIFVVIESCNKHIVKEKSTIVSVVKKY 460
           N+    +E+ + K     +E  D    E  APGG+ V++E+   +  +  S +     + 
Sbjct: 60  NMPKTNIERAIKKAEGNSDEHYDEITYEGYAPGGVAVLVEALTDNKNRTASDVRVAFTRN 119

Query: 461 GFKLAPTSFLKVAFSYKGEVSCKPDNNILSNEDQGLEVALEVGADEVIYAKDEETDSMLY 640
           G  L  T  +   F  KG +         ++EDQ L   ++ G D++      ET    +
Sbjct: 120 GGSLGATGSVAYMFDRKGYLVIDRSTTD-ADEDQVLLDVMDAGGDDL------ETSDDAF 172

Query: 641 KFFCDPNNITQIKSALQNKKLEVVNSTDYLVPLNRVKI 754
           + + DP   T ++ AL+    ++ N+   ++P N   +
Sbjct: 173 EIYTDPKQFTAVRDALEKAGYKLANAELTMIPQNTTPV 210
>sp|Q71ZD5|Y1554_LISMF Hypothetical UPF0082 protein LMOf2365_1554
          Length = 241

 Score = 74.7 bits (182), Expect = 3e-13
 Identities = 54/198 (27%), Positives = 94/198 (47%), Gaps = 6/198 (3%)
 Frame = +2

Query: 116 AGHSHWQNVKQTKDMKDKIKCDNAMLISRLMLNAIKIGDGIRDPKLNSHLASVIDRARSL 295
           +GHS W N++  K+ +D  +      ++R +  A K G    DP LN  L  V+D+A+++
Sbjct: 2   SGHSKWNNIQGRKNAQDSKRSKVFQKLAREIFVAAKKGP---DPNLNPSLRLVMDKAKAV 58

Query: 296 NVSMAALEKILSK------NETVDPYFIEYQAPGGIFVVIESCNKHIVKEKSTIVSVVKK 457
           N+    +++ + K       E  D    E  APGGI V++ +   +  +  + +     K
Sbjct: 59  NMPNDNIKRAIDKAAGNTSGENYDEVTYEGYAPGGIAVLVHALTDNKNRTSTNVRVAFNK 118

Query: 458 YGFKLAPTSFLKVAFSYKGEVSCKPDNNILSNEDQGLEVALEVGADEVIYAKDEETDSML 637
            G  L  T  +   F  KG +    +   +  E+  LE A+E GAD+V  ++D      +
Sbjct: 119 NGGSLGETGSVSYMFDRKGYLVILREGLDVDEEEFMLE-AIEAGADDVEVSED------V 171

Query: 638 YKFFCDPNNITQIKSALQ 691
           ++ F DP    ++K ALQ
Sbjct: 172 FEIFTDPATFPEVKEALQ 189
>sp|Q5QYV3|Y1088_IDILO Hypothetical UPF0082 protein IL1088
          Length = 249

 Score = 74.7 bits (182), Expect = 3e-13
 Identities = 60/223 (26%), Positives = 99/223 (44%), Gaps = 7/223 (3%)
 Frame = +2

Query: 116 AGHSHWQNVKQTKDMKDKIKCDNAMLISRLMLN-AIKIGDGIRDPKLNSHLASVIDRARS 292
           AGHS W N+K  K  +D        + ++L+    +   +G  DP+ N  L + ID+A S
Sbjct: 2   AGHSKWSNIKHRKAAQD---AKRGKIFTKLIREITVSAREGGGDPETNPRLRAAIDKALS 58

Query: 293 LNVSMAALEKILSK------NETVDPYFIEYQAPGGIFVVIESCNKHIVKEKSTIVSVVK 454
            N+    ++  + +       + VD    E   P G+ V++E    +  +  S +     
Sbjct: 59  NNMKRDTIDTAVKRGSGDLEGDNVDELTYEGYGPSGVAVLLECMTDNRNRTVSDVRHAFS 118

Query: 455 KYGFKLAPTSFLKVAFSYKGEVSCKPDNNILSNEDQGLEVALEVGADEVIYAKDEETDSM 634
           K G  L     +   F+ KG +S          E+Q +E ALE GA++++ A DE +  +
Sbjct: 119 KLGGNLGTDGSVAYLFNKKGVISYSDG----VTEEQVMEPALEAGAEDIL-AYDEGSIDV 173

Query: 635 LYKFFCDPNNITQIKSALQNKKLEVVNSTDYLVPLNRVKITDE 763
           +      P N   +K AL  K LE  N+    VP  RV + +E
Sbjct: 174 I----TSPENFGAVKDALDAKGLEASNAEVTQVPDTRVDLDEE 212
>sp|Q8YPC2|Y4276_ANASP Hypothetical UPF0082 protein All4276
          Length = 252

 Score = 74.3 bits (181), Expect = 3e-13
 Identities = 56/223 (25%), Positives = 105/223 (47%), Gaps = 8/223 (3%)
 Frame = +2

Query: 116 AGHSHWQNVKQTKDMKDKIKCDNAMLISRLMLNAIKIGDGIRDPKLNSHLASVIDRARSL 295
           AGHS W N+K+ K + D  K      +SR ++ A +   G+ DP  N  L + ID+A++ 
Sbjct: 2   AGHSKWANIKRQKAVVDAKKGKTFTQLSRAIILAAR--SGVPDPSGNFQLRTAIDKAKAA 59

Query: 296 NVSMAALEKILSK--------NETVDPYFIEYQAPGGIFVVIESCNKHIVKEKSTIVSVV 451
            +    +E+ ++K        N +++    E   PGG+ ++IE+   +  +  + +    
Sbjct: 60  GIPNDNIERAIAKGAGTFGGDNASLEEIRYEGYGPGGVAILIEALTDNRNRTAADLRVAF 119

Query: 452 KKYGFKLAPTSFLKVAFSYKGEVSCKPDNNILSNEDQGLEVALEVGADEVIYAKDEETDS 631
            K G  L  T  +   F  KG   C     +  +EDQ LE +LE GA+     +DE  + 
Sbjct: 120 SKNGGNLGETGCVSWMFDQKG--VCVVSGVV--DEDQLLEASLEGGAESYEMTEDETAE- 174

Query: 632 MLYKFFCDPNNITQIKSALQNKKLEVVNSTDYLVPLNRVKITD 760
                F +  N+  +   L+++  +V ++    +P N +++T+
Sbjct: 175 ----VFTEVANLEILNQTLKDQGFKVTDAELRWIPSNHLEVTE 213
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 74,102,733
Number of Sequences: 369166
Number of extensions: 1318465
Number of successful extensions: 3810
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3634
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3712
length of database: 68,354,980
effective HSP length: 108
effective length of database: 48,403,600
effective search space used: 7115329200
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
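
The statistics above can be cross-checked against the alignment headers. The following is a minimal sketch, not part of the BLAST output: it converts a raw score to a bit score using the gapped Lambda and K reported above, and rebuilds the effective search space from the database size and effective HSP length. The function names are hypothetical. The rule used for the effective query length (translated length, 767 // 3, minus the effective HSP length) is an assumption inferred from the reported numbers. Because Lambda and K are printed in rounded form, the recomputed E-value should only be expected to agree with the report to within the same order of magnitude.

```python
import math

# Values copied from this report (gapped statistics and footer).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
QUERY_NT = 767             # query length in nucleotides
DB_LETTERS = 68_354_980    # "length of database"
DB_SEQS = 184_735          # sequences in the SwissProt database
EFF_HSP_LEN = 108          # "effective HSP length"


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_db_length(letters=DB_LETTERS, seqs=DB_SEQS, hsp_len=EFF_HSP_LEN):
    """Database length minus one effective HSP length per sequence."""
    return letters - seqs * hsp_len


def effective_query_length(query_nt=QUERY_NT, hsp_len=EFF_HSP_LEN):
    """Assumed rule: translated query length minus the effective HSP length."""
    return query_nt // 3 - hsp_len


def search_space():
    """Effective query length times effective database length."""
    return effective_query_length() * effective_db_length()


def expect_value(raw_score):
    """Approximate E-value: search space * 2 ** (-bit score)."""
    return search_space() * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores 215, 206 and 189 are the first three hits in this report.
    for raw in (215, 206, 189):
        print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.1e}")
    print("effective length of database:", effective_db_length())
    print("effective search space:", search_space())
```

Under these assumptions the recomputed effective database length and effective search space reproduce the footer values exactly, and the bit scores for raw scores 215, 206 and 189 round to the values shown in the alignment headers.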