Planarian EST Database


Dr_sW_005_C12

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_005_C12
         (933 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|O08730|GLYG_RAT  Glycogenin-1                                  322   7e-88
sp|Q9R062|GLYG_MOUSE  Glycogenin-1                                320   4e-87
sp|P46976|GLYG_HUMAN  Glycogenin-1                                317   2e-86
sp|P13280|GLYG_RABIT  Glycogenin-1                                316   5e-86
sp|O15488|GLYG2_HUMAN  Glycogenin-2 (GN-2) (GN2)                  282   1e-75
sp|P47011|GLG2_YEAST  Glycogen synthesis initiator protein GLG2   123   6e-28
sp|P36143|GLG1_YEAST  Glycogen synthesis initiator protein GLG1    84   7e-16
sp|Q9Y761|GNT1A_KLULA  Glucose N-acetyltransferase 1-A (N-ac...    42   0.002
sp|Q6CT96|GNT1B_KLULA  Glucose N-acetyltransferase 1-B (N-ac...    37   0.080
sp|Q09680|YA0C_SCHPO  Hypothetical protein C5H10.12c in chro...    36   0.18 
>sp|O08730|GLYG_RAT Glycogenin-1
          Length = 333

 Score =  322 bits (826), Expect = 7e-88
 Identities = 165/313 (52%), Positives = 205/313 (65%), Gaps = 14/313 (4%)
 Frame = +1

Query: 37  ALGALTLAMSLKHSGTQKELVIMITDNISEAMRNVLSQVFNHIEMVNVLDSNDVINLGLL 216
           A GAL L  SLK   T +  V++ +  +S++MR VL  VF+ + MV+VLDS D  +L L+
Sbjct: 16  AKGALVLGSSLKQHRTTRRTVVLASPQVSDSMRKVLETVFDEVIMVDVLDSGDSAHLTLM 75

Query: 217 ERPELGITFTKLHCWRLVQYSKCVFMDADTLVIQNIDDLFEREELSAAPDPGWPDCFNSG 396
           +RPELGIT TKLHCW L QYSKCVFMDADTLV+ NIDDLFEREELSAAPDPGWPDCFNSG
Sbjct: 76  KRPELGITLTKLHCWSLTQYSKCVFMDADTLVLSNIDDLFEREELSAAPDPGWPDCFNSG 135

Query: 397 VFVYKPSIDTYVELLQFAIDKGSFDGGDQGLLNMFFSNWSTKDIKHHLPFVYNCVSQAFY 576
           VFVY+PSI+TY +LL  A ++GSFDGGDQGLLN +FS W+T DI  HLPFVYN  S + Y
Sbjct: 136 VFVYQPSIETYNQLLHLASEQGSFDGGDQGLLNTYFSGWATTDITKHLPFVYNLSSLSIY 195

Query: 577 SYLPAYTHFRADIKVLHFIGPHKPWHHTFDSDTGTV--IFQEGCGQNYESLQLWWTTFIA 750
           SYLPA+  F  + KV+HF+G  KPW++T++  T +V    Q+    + E L LWW TF  
Sbjct: 196 SYLPAFKAFGKNAKVVHFLGRTKPWNYTYNPQTKSVKCESQDPIVSHPEFLNLWWDTFTT 255

Query: 751 YTKPLLHDE-----------MGGVCGRMASLDVSSFVP-HEGKFSPKSHQQAWEHGIIDY 894
              PLL              M  V G ++ L      P  +   S +  ++ WE G  DY
Sbjct: 256 NVLPLLQHHGLVKDAGSYLMMEHVTGALSDLSFGEAPPASQPSLSSEERKERWEQGQADY 315

Query: 895 RGLDRFENIKMHL 933
            G D F+NIK  L
Sbjct: 316 MGADSFDNIKRKL 328
>sp|Q9R062|GLYG_MOUSE Glycogenin-1
          Length = 333

 Score =  320 bits (820), Expect = 4e-87
 Identities = 164/313 (52%), Positives = 205/313 (65%), Gaps = 14/313 (4%)
 Frame = +1

Query: 37  ALGALTLAMSLKHSGTQKELVIMITDNISEAMRNVLSQVFNHIEMVNVLDSNDVINLGLL 216
           A GAL L  SLK   T + +V++ +  +S++MR VL  VF+ + MV+VLDS D  +L L+
Sbjct: 16  AKGALVLGSSLKQHRTTRRMVVLTSPQVSDSMRKVLETVFDDVIMVDVLDSGDSAHLTLM 75

Query: 217 ERPELGITFTKLHCWRLVQYSKCVFMDADTLVIQNIDDLFEREELSAAPDPGWPDCFNSG 396
           +RPELGIT TKLHCW L QYSKCVFMDADTLV+ NIDDLFEREELSAAPDPGWPDCFNSG
Sbjct: 76  KRPELGITLTKLHCWSLTQYSKCVFMDADTLVLSNIDDLFEREELSAAPDPGWPDCFNSG 135

Query: 397 VFVYKPSIDTYVELLQFAIDKGSFDGGDQGLLNMFFSNWSTKDIKHHLPFVYNCVSQAFY 576
           VFVY+PSI+TY +LL  A ++GSFDGGDQGLLN +FS W+T DI  HLPFVYN  S + Y
Sbjct: 136 VFVYQPSIETYNQLLHLASEQGSFDGGDQGLLNTYFSGWATTDITKHLPFVYNLSSISIY 195

Query: 577 SYLPAYTHFRADIKVLHFIGPHKPWHHTFDSDTGTV--IFQEGCGQNYESLQLWWTTFIA 750
           SYLPA+  F  + KV+HF+G  KPW++T++  T +V    Q+    + E L LWW TF  
Sbjct: 196 SYLPAFKAFGKNAKVVHFLGRTKPWNYTYNPQTKSVNCDSQDPTVSHPEFLNLWWDTFTT 255

Query: 751 YTKPLLHDE-----------MGGVCGRMASLDVSSF-VPHEGKFSPKSHQQAWEHGIIDY 894
              PLL              M  V G ++ L         +   S +  ++ WE G  DY
Sbjct: 256 NVLPLLQHHGLVKDASSYLMMEHVSGALSDLSFGEAPAAPQPSMSSEERKERWEQGQADY 315

Query: 895 RGLDRFENIKMHL 933
            G D F+NIK  L
Sbjct: 316 MGADSFDNIKRKL 328
>sp|P46976|GLYG_HUMAN Glycogenin-1
          Length = 350

 Score =  317 bits (813), Expect = 2e-86
 Identities = 166/331 (50%), Positives = 207/331 (62%), Gaps = 32/331 (9%)
 Frame = +1

Query: 37   ALGALTLAMSLKHSGTQKELVIMITDNISEAMRNVLSQVFNHIEMVNVLDSNDVINLGLL 216
            A GAL L  SLK   T + LV++ T  +S++MR VL  VF+ + MV+VLDS D  +L L+
Sbjct: 16   AKGALVLGSSLKQHRTTRRLVVLATPQVSDSMRKVLETVFDEVIMVDVLDSGDSAHLTLM 75

Query: 217  ERPELGITFTKLHCWRLVQYSKCVFMDADTLVIQNIDDLFEREELSAAPDPGWPDCFNSG 396
            +RPELG+T TKLHCW L QYSKCVFMDADTLV+ NIDDLF+REELSAAPDPGWPDCFNSG
Sbjct: 76   KRPELGVTLTKLHCWSLTQYSKCVFMDADTLVLANIDDLFDREELSAAPDPGWPDCFNSG 135

Query: 397  VFVYKPSIDTYVELLQFAIDKGSFDGGDQGLLNMFFSNWSTKDIKHHLPFVYNCVSQAFY 576
            VFVY+PS++TY +LL  A ++GSFDGGDQG+LN FFS+W+T DI+ HLPF+YN  S + Y
Sbjct: 136  VFVYQPSVETYNQLLHLASEQGSFDGGDQGILNTFFSSWATTDIRKHLPFIYNLSSISIY 195

Query: 577  SYLPAYTHFRADIKVLHFIGPHKPWHHTFDSDTGTVIFQEGCGQNY---ESLQLWWTTFI 747
            SYLPA+  F A  KV+HF+G  KPW++T+D  T +V   E    N    E L LWW  F 
Sbjct: 196  SYLPAFKVFGASAKVVHFLGRVKPWNYTYDPKTKSV-KSEAHDPNMTHPEFLILWWNIFT 254

Query: 748  AYTKPLLHD-------------------EMGGVCGRMASLDVSSFVPH----------EG 840
                PLL                      +   CG     DVS  + H          + 
Sbjct: 255  TNVLPLLQQFGLVKDTCSYVNVLSDLVYTLAFSCGFCRKEDVSGAISHLSLGEIPAMAQP 314

Query: 841  KFSPKSHQQAWEHGIIDYRGLDRFENIKMHL 933
              S +  ++ WE G  DY G D F+NIK  L
Sbjct: 315  FVSSEERKERWEQGQADYMGADSFDNIKRKL 345
>sp|P13280|GLYG_RABIT Glycogenin-1
          Length = 333

 Score =  316 bits (810), Expect = 5e-86
 Identities = 159/313 (50%), Positives = 205/313 (65%), Gaps = 14/313 (4%)
 Frame = +1

Query: 37  ALGALTLAMSLKHSGTQKELVIMITDNISEAMRNVLSQVFNHIEMVNVLDSNDVINLGLL 216
           A GAL L  SLK   T + L ++ T  +S+ MR  L  VF+ +  V++LDS D  +L L+
Sbjct: 16  AKGALVLGSSLKQHRTSRRLAVLTTPQVSDTMRKALEIVFDEVITVDILDSGDSAHLTLM 75

Query: 217 ERPELGITFTKLHCWRLVQYSKCVFMDADTLVIQNIDDLFEREELSAAPDPGWPDCFNSG 396
           +RPELG+T TKLHCW L QYSKCVFMDADTLV+ NIDDLFEREELSAAPDPGWPDCFNSG
Sbjct: 76  KRPELGVTLTKLHCWSLTQYSKCVFMDADTLVLANIDDLFEREELSAAPDPGWPDCFNSG 135

Query: 397 VFVYKPSIDTYVELLQFAIDKGSFDGGDQGLLNMFFSNWSTKDIKHHLPFVYNCVSQAFY 576
           VFVY+PS++TY +LL  A ++GSFDGGDQGLLN FF++W+T DI+ HLPF+YN  S + Y
Sbjct: 136 VFVYQPSVETYNQLLHVASEQGSFDGGDQGLLNTFFNSWATTDIRKHLPFIYNLSSISIY 195

Query: 577 SYLPAYTHFRADIKVLHFIGPHKPWHHTFDSDTGTVIFQ--EGCGQNYESLQLWWTTFIA 750
           SYLPA+  F A+ KV+HF+G  KPW++T+D+ T +V  +  +    + + L +WW  F  
Sbjct: 196 SYLPAFKAFGANAKVVHFLGQTKPWNYTYDTKTKSVRSEGHDPTMTHPQFLNVWWDIFTT 255

Query: 751 YTKPLLHD--EMGGVCGRMASLDVSSFVPH----------EGKFSPKSHQQAWEHGIIDY 894
              PLL     +   C      DVS  V H          +   S +  ++ WE G  DY
Sbjct: 256 SVVPLLQQFGLVQDTCSYQHVEDVSGAVSHLSLGETPATTQPFVSSEERKERWEQGQADY 315

Query: 895 RGLDRFENIKMHL 933
            G D F+NIK  L
Sbjct: 316 MGADSFDNIKKKL 328
>sp|O15488|GLYG2_HUMAN Glycogenin-2 (GN-2) (GN2)
          Length = 501

 Score =  282 bits (721), Expect = 1e-75
 Identities = 135/275 (49%), Positives = 179/275 (65%), Gaps = 20/275 (7%)
 Frame = +1

Query: 43  GALTLAMSLKHSGTQKELVIMITDNISEAMRNVLSQVFNHIEMVNVLDSNDVINLGLLER 222
           GAL L  SL+     ++LV++IT  +S  +R +LS+VF+ +  VN++DS D I+L  L+R
Sbjct: 51  GALVLGQSLRRHRLTRKLVVLITPQVSSLLRVILSKVFDEVIEVNLIDSADYIHLAFLKR 110

Query: 223 PELGITFTKLHCWRLVQYSKCVFMDADTLVIQNIDDLFEREELSAAPDPGWPDCFNSGVF 402
           PELG+T TKLHCW L  YSKCVF+DADTLV+ N+D+LF+R E SAAPDPGWPDCFNSGVF
Sbjct: 111 PELGLTLTKLHCWTLTHYSKCVFLDADTLVLSNVDELFDRGEFSAAPDPGWPDCFNSGVF 170

Query: 403 VYKPSIDTYVELLQFAIDKGSFDGGDQGLLNMFFSNWSTKDIKHHLPFVYNCVSQAFYSY 582
           V++PS+ T+  LLQ A++ GSFDG DQGLLN FF NWST DI  HLPF+YN  S   Y+Y
Sbjct: 171 VFQPSLHTHKLLLQHAMEHGSFDGADQGLLNSFFRNWSTTDIHKHLPFIYNLSSNTMYTY 230

Query: 583 LPAYTHFRADIKVLHFIGPHKPWHHTFDSDTGTVIFQEGCGQNYES---LQLWWTTFIAY 753
            PA+  F +  KV+HF+G  KPW++ ++  +G+V+ Q     +      L LWWT +   
Sbjct: 231 SPAFKQFGSSAKVVHFLGSMKPWNYKYNPQSGSVLEQGSVSSSQHQAAFLHLWWTVYQNN 290

Query: 754 TKP-----------------LLHDEMGGVCGRMAS 807
             P                 L H ++GG C   AS
Sbjct: 291 VLPLYKSVQAGEARASPGHTLCHSDVGGPCADSAS 325

 Score = 31.2 bits (69), Expect = 4.4
 Identities = 13/31 (41%), Positives = 18/31 (58%)
 Frame = +1

Query: 841 KFSPKSHQQAWEHGIIDYRGLDRFENIKMHL 933
           + SP+  ++ WE G IDY G D F  I+  L
Sbjct: 466 ELSPEEERRKWEEGRIDYMGKDAFARIQEKL 496
>sp|P47011|GLG2_YEAST Glycogen synthesis initiator protein GLG2
          Length = 380

 Score =  123 bits (309), Expect = 6e-28
 Identities = 88/259 (33%), Positives = 124/259 (47%), Gaps = 40/259 (15%)
 Frame = +1

Query: 16  LLQMMNIALGALTLAMSL----KHSGTQKELVIMIT-------DNISEAMRNVLSQVFNH 162
           LL   +   GALTLA  L    KH+  + E+ + +        D        ++  +F  
Sbjct: 10  LLYSRDYLPGALTLAYQLQKLLKHAVVEDEITLCLLIEKKLFGDEFKPQEIALIRSLFKE 69

Query: 163 IEMVNVLDSNDV------INLGLLERPELGITFTKLHCWRLVQYSKCVFMDADTLVIQNI 324
           I ++  L   +        NL LL+RPEL  T  K   W LVQ+ + +F+DADTL +   
Sbjct: 70  IIIIEPLKDQEKSIEKNKANLELLKRPELSHTLLKARLWELVQFDQVLFLDADTLPLNK- 128

Query: 325 DDLFE---------REELSAAPDPGWPDCFNSGVFVYKPSIDTYVELLQFAIDKGSFDGG 477
            + FE         R +++A PD GWPD FN+GV +  P +D    L  F I   S DG 
Sbjct: 129 -EFFEILRLYPEQTRFQIAAVPDIGWPDMFNTGVLLLIPDLDMATSLQDFLIKTVSIDGA 187

Query: 478 DQGLLNMFFS---NWSTKDIKH---------HLPFVYNCVSQAF-YSYLPAYTHFRADIK 618
           DQG+ N FF+   N+S K++ H          LPF YN     + Y   PA   F+  I+
Sbjct: 188 DQGIFNQFFNPICNYS-KEVLHKVSPLMEWIRLPFTYNVTMPNYGYQSSPAMNFFQQHIR 246

Query: 619 VLHFIGPHKPW-HHTFDSD 672
           ++HFIG  KPW  +T D D
Sbjct: 247 LIHFIGTFKPWSRNTTDYD 265
>sp|P36143|GLG1_YEAST Glycogen synthesis initiator protein GLG1
          Length = 480

 Score = 83.6 bits (205), Expect = 7e-16
 Identities = 45/112 (40%), Positives = 59/112 (52%), Gaps = 10/112 (8%)
 Frame = +1

Query: 346 ELSAAPDPGWPDCFNSGVFVYKPSIDTYVELLQFAIDKGSFDGGDQGLLNMFFS-NWSTK 522
           ++ A  D GWPD FNSGV +  P  DT   L  +  +  S DG DQG+LN FF+ N  T 
Sbjct: 8   QVGAIADIGWPDMFNSGVMMLIPDADTASVLQNYIFENTSIDGSDQGILNQFFNQNCCTD 67

Query: 523 DIKH--------HLPFVYN-CVSQAFYSYLPAYTHFRADIKVLHFIGPHKPW 651
           ++           L F YN  +    Y   PA  +F+  IK++HFIG HKPW
Sbjct: 68  ELVKDSFSREWVQLSFTYNVTIPNLGYQSSPAMNYFKPSIKLIHFIGKHKPW 119
>sp|Q9Y761|GNT1A_KLULA Glucose N-acetyltransferase 1-A (N-acetylglucosaminyltransferase A)
          Length = 460

 Score = 42.0 bits (97), Expect = 0.002
 Identities = 25/95 (26%), Positives = 51/95 (53%), Gaps = 5/95 (5%)
 Frame = +1

Query: 67  LKHSGTQKELVIMITDNISEAMRNVLSQVFNHIEMVNVLDSNDVI----NLGLLERPELG 234
           L  SGTQ +LV+++   ++E   +    V   +     +  N ++    N+ L +     
Sbjct: 118 LHESGTQAKLVMLVAKELTELPED--DSVTRMLAQFKEISDNCIVKPVENIVLSQGSAQW 175

Query: 235 IT-FTKLHCWRLVQYSKCVFMDADTLVIQNIDDLF 336
           +T  TKL  + +V+Y + V+ D+D+++ +N+D+LF
Sbjct: 176 MTSMTKLRVFGMVEYKRIVYFDSDSIITRNMDELF 210
>sp|Q6CT96|GNT1B_KLULA Glucose N-acetyltransferase 1-B (N-acetylglucosaminyltransferase B)
          Length = 453

 Score = 37.0 bits (84), Expect = 0.080
 Identities = 25/102 (24%), Positives = 48/102 (47%), Gaps = 12/102 (11%)
 Frame = +1

Query: 67  LKHSGTQKELVIMITDNIS---------EAMRNVLSQVFNHI---EMVNVLDSNDVINLG 210
           L  SG++ +L+ ++TD +          EA+ N +  V + +   E+ +V+  ND     
Sbjct: 107 LNDSGSKAKLLALVTDTLVNKSKENKEVEALLNKIKSVSDRVAVTEVGSVIQPND----- 161

Query: 211 LLERPELGITFTKLHCWRLVQYSKCVFMDADTLVIQNIDDLF 336
                    + TKL  + L  Y + ++MD D ++   +D+LF
Sbjct: 162 ---HTPWSKSLTKLAIFNLTDYERIIYMDNDAIIHDKMDELF 200
>sp|Q09680|YA0C_SCHPO Hypothetical protein C5H10.12c in chromosome I
          Length = 371

 Score = 35.8 bits (81), Expect = 0.18
 Identities = 13/33 (39%), Positives = 24/33 (72%)
 Frame = +1

Query: 241 FTKLHCWRLVQYSKCVFMDADTLVIQNIDDLFE 339
           F+KL  +  +Q+ K   +D+D L+++NIDD+F+
Sbjct: 161 FSKLRIFEQIQFDKICVIDSDILIMKNIDDIFD 193
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 113,908,281
Number of Sequences: 369166
Number of extensions: 2441706
Number of successful extensions: 5869
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 5680
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5857
length of database: 68,354,980
effective HSP length: 110
effective length of database: 48,034,130
effective search space used: 9606826000
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
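
Note on the statistics above: the bit scores and Expect values in this report can be approximately re-derived from the gapped Lambda and K and the "effective search space used" line. The sketch below assumes the standard Karlin-Altschul relations (bit score S' = (Lambda*S - ln K) / ln 2, and E = search space * 2^(-S')). The function names are illustrative and are not part of BLAST. Because Lambda and K are printed here in rounded form, the results should land close to the reported values but will not match them exactly.

```python
import math

# Values copied from the report footer (gapped parameters).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 9606826000  # "effective search space used"


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=SEARCH_SPACE):
    """Expected number of chance alignments with at least this bit score."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Raw scores are the parenthesized numbers on the "Score =" lines,
    # e.g. 826 for GLYG_RAT and 81 for YA0C_SCHPO.
    for name, raw in [("GLYG_RAT", 826), ("YA0C_SCHPO", 81)]:
        bits = bit_score(raw)
        print(f"{name}: raw={raw} bits={bits:.1f} E={expect_value(bits):.2g}")
```

Comparing the printed values with the "Score =" lines above (322 bits, Expect = 7e-88 for GLYG_RAT; 35.8 bits, Expect = 0.18 for YA0C_SCHPO) gives a quick sanity check when parsing reports like this one.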