Planaria EST Database


DrC_00792

BLASTX 2.2.13 [Nov-27-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= DrC_00792
         (987 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q566L7|FA61B_XENTR  Protein FAM61B homolog                      60   7e-09
sp|Q8CGC4|FA61B_MOUSE  Protein FAM61B                              55   2e-07
sp|Q9BX40|FA61B_HUMAN  Protein FAM61B                              54   5e-07
sp|Q8K2F8|FA61A_MOUSE  Protein FAM61A                              46   2e-04
sp|Q8ND56|FA61A_HUMAN  Protein FAM61A (Possible alpha synucl...    46   2e-04
sp|Q5R4R4|FA61A_PONPY  Protein FAM61A                              46   2e-04
sp|Q9HGL3|SUM2_SCHPO  Protein sum2                                 45   3e-04
sp|P45978|SCD6_YEAST  Protein SCD6                                 44   5e-04
sp|Q92804|RBP56_HUMAN  TATA-binding protein associated facto...    34   0.56 
sp|P25870|CLH_DICDI  Clathrin heavy chain                          31   4.7  
>sp|Q566L7|FA61B_XENTR Protein FAM61B homolog
          Length = 382

 Score = 60.5 bits (145), Expect = 7e-09
 Identities = 52/173 (30%), Positives = 71/173 (41%), Gaps = 15/173 (8%)
 Frame = +3

Query: 366 NNPKMKKKPTRENDASKKDETFKEDYDFETANAIIAKELAALNIS-----QXXXXXXXXX 530
           N  + + +PT   + + K   F+ D+DFETANA   +E            +         
Sbjct: 228 NRSRGQNRPTTVKENAIK---FEGDFDFETANAQFNREELDKEFKDKLNFKDEKPEKEGE 284

Query: 531 XXXXXGQLSENSDAG----------YYDKTKSFFDNISSEMLDRANGKSVRPMNRKEERK 680
                G  ++NSD            YYD++KSFFDNISSEM  R            EERK
Sbjct: 285 EKTDSGVETQNSDGNPEEDPLGPNIYYDRSKSFFDNISSEMKSRRT-------TWAEERK 337

Query: 681 INVETFGVAEAXXXXXXXXXXXXXXXXHYSKSIRGSNFGGTYRGNSNEMTFRR 839
           +N ETFGV+                     + +RG +F G +RG     T RR
Sbjct: 338 LNTETFGVS--------------------GRFLRGRSFRGGFRGGRGSATPRR 370
>sp|Q8CGC4|FA61B_MOUSE Protein FAM61B
          Length = 385

 Score = 55.5 bits (132), Expect = 2e-07
 Identities = 51/171 (29%), Positives = 69/171 (40%), Gaps = 13/171 (7%)
 Frame = +3

Query: 366 NNPKMKKKPTRENDASKKDETFKEDYDFETANAIIAKE------LAALNISQXXXXXXXX 527
           N  + + +PT   + + K   F+ D+DFE+ANA   +E         LN           
Sbjct: 231 NRSRGQNRPTNVKENTIK---FEGDFDFESANAQFNREELDKEFKKKLNFKDDKADKGEE 287

Query: 528 XXXXXXGQLSENSDAG-------YYDKTKSFFDNISSEMLDRANGKSVRPMNRKEERKIN 686
                  Q  E +          YYDK+KSFFDNISSE+       S R     EERK+N
Sbjct: 288 KDPAVMTQSEETAAEEDLLGPNCYYDKSKSFFDNISSEL-----KTSSRRTTWAEERKLN 342

Query: 687 VETFGVAEAXXXXXXXXXXXXXXXXHYSKSIRGSNFGGTYRGNSNEMTFRR 839
            ETFGV+                     + +RG +  G +RG     T RR
Sbjct: 343 TETFGVS--------------------GRFLRGRSSRGGFRGGRGNGTTRR 373
>sp|Q9BX40|FA61B_HUMAN Protein FAM61B
          Length = 385

 Score = 54.3 bits (129), Expect = 5e-07
 Identities = 51/171 (29%), Positives = 69/171 (40%), Gaps = 13/171 (7%)
 Frame = +3

Query: 366 NNPKMKKKPTRENDASKKDETFKEDYDFETANAIIAKE------LAALNISQXXXXXXXX 527
           N  + + +PT   + + K   F+ D+DFE+ANA   +E         LN           
Sbjct: 231 NRSRGQNRPTNVKENTIK---FEGDFDFESANAQFNREELDKEFKKKLNFKDDKAEKGEE 287

Query: 528 XXXXXXGQLSENSDAG-------YYDKTKSFFDNISSEMLDRANGKSVRPMNRKEERKIN 686
                  Q +E            YYDK+KSFFDNISSE+       S R     EERK+N
Sbjct: 288 KDLAVVTQSAEAPAEEDLLGPNCYYDKSKSFFDNISSEL-----KTSSRRTTWAEERKLN 342

Query: 687 VETFGVAEAXXXXXXXXXXXXXXXXHYSKSIRGSNFGGTYRGNSNEMTFRR 839
            ETFGV+                     + +RG +  G +RG     T RR
Sbjct: 343 TETFGVS--------------------GRFLRGRSSRGGFRGGRGNGTTRR 373
>sp|Q8K2F8|FA61A_MOUSE Protein FAM61A
          Length = 462

 Score = 45.8 bits (107), Expect = 2e-04
 Identities = 36/113 (31%), Positives = 50/113 (44%), Gaps = 21/113 (18%)
 Frame = +3

Query: 429 FKEDYDFETANAIIAKELAALNIS----------QXXXXXXXXXXXXXXGQLSENSDAG- 575
           F++D+DFE+ANA   KE                 +              G  ++NS+   
Sbjct: 291 FEKDFDFESANAQFNKEEIDREFHNKLKLKEDKLEKQEKPVNGEDKGDSGVDTQNSEGNA 350

Query: 576 ----------YYDKTKSFFDNISSEMLDRANGKSVRPMNRKEERKINVETFGV 704
                     YYDKTKSFFDNIS +     + +  RP    EER++N ETFG+
Sbjct: 351 DEEDPLGPNCYYDKTKSFFDNISCD-----DNRERRP-TWAEERRLNAETFGI 397
>sp|Q8ND56|FA61A_HUMAN Protein FAM61A (Possible alpha synuclein-binding protein)
           (AlphaSNBP)
          Length = 463

 Score = 45.8 bits (107), Expect = 2e-04
 Identities = 36/113 (31%), Positives = 50/113 (44%), Gaps = 21/113 (18%)
 Frame = +3

Query: 429 FKEDYDFETANAIIAKELAALNIS----------QXXXXXXXXXXXXXXGQLSENSDAG- 575
           F++D+DFE+ANA   KE                 +              G  ++NS+   
Sbjct: 292 FEKDFDFESANAQFNKEEIDREFHNKLKLKEDKLEKQEKPVNGEDKGDSGVDTQNSEGNA 351

Query: 576 ----------YYDKTKSFFDNISSEMLDRANGKSVRPMNRKEERKINVETFGV 704
                     YYDKTKSFFDNIS +     + +  RP    EER++N ETFG+
Sbjct: 352 DEEDPLGPNCYYDKTKSFFDNISCD-----DNRERRP-TWAEERRLNAETFGI 398
>sp|Q5R4R4|FA61A_PONPY Protein FAM61A
          Length = 463

 Score = 45.8 bits (107), Expect = 2e-04
 Identities = 36/113 (31%), Positives = 50/113 (44%), Gaps = 21/113 (18%)
 Frame = +3

Query: 429 FKEDYDFETANAIIAKELAALNIS----------QXXXXXXXXXXXXXXGQLSENSDAG- 575
           F++D+DFE+ANA   KE                 +              G  ++NS+   
Sbjct: 292 FEKDFDFESANAQFNKEEIDREFHNKLKLKEDKLEKQEKPVNGEDKGDSGVDTQNSEGNA 351

Query: 576 ----------YYDKTKSFFDNISSEMLDRANGKSVRPMNRKEERKINVETFGV 704
                     YYDKTKSFFDNIS +     + +  RP    EER++N ETFG+
Sbjct: 352 DEEDPLGPNCYYDKTKSFFDNISCD-----DNRERRP-TWAEERRLNAETFGI 398
>sp|Q9HGL3|SUM2_SCHPO Protein sum2
          Length = 426

 Score = 45.1 bits (105), Expect = 3e-04
 Identities = 35/119 (29%), Positives = 54/119 (45%)
 Frame = +3

Query: 351 SNSYDNNPKMKKKPTRENDASKKDETFKEDYDFETANAIIAKELAALNISQXXXXXXXXX 530
           S + + +   K+ PT + DAS      + ++DF+TAN         L             
Sbjct: 280 SQTVETSGPSKEVPTTQPDASAAKP--RTEFDFQTANQKFQSMKDDL------------- 324

Query: 531 XXXXXGQLSENSDAGYYDKTKSFFDNISSEMLDRANGKSVRPMNRKEERKINVETFGVA 707
                G+  E ++  +Y   +SFFDNIS E  ++    + R   R  ER +N+ETFGVA
Sbjct: 325 ---LKGKNDEEAEE-FYKPKQSFFDNISCESKEKGMEAADRRALRDRERSLNMETFGVA 379
>sp|P45978|SCD6_YEAST Protein SCD6
          Length = 349

 Score = 44.3 bits (103), Expect = 5e-04
 Identities = 35/123 (28%), Positives = 55/123 (44%), Gaps = 6/123 (4%)
 Frame = +3

Query: 363 DNNPKMKKKPTRE-NDASKKDETFK-----EDYDFETANAIIAKELAALNISQXXXXXXX 524
           +N PK ++   R  N   + +  FK     ED+DF++ NA   K   + ++ +       
Sbjct: 173 ENEPKYQRNKNRSSNRPPQSNRNFKVDIPNEDFDFQSNNAKFTKG-DSTDVEKEKELESA 231

Query: 525 XXXXXXXGQLSENSDAGYYDKTKSFFDNISSEMLDRANGKSVRPMNRKEERKINVETFGV 704
                      + SD  +Y+K  SFFD IS+      N      M  +EE+ +NV+TFG 
Sbjct: 232 VHK-------QDESDEQFYNKKSSFFDTISTSTETNTN------MRWQEEKMLNVDTFGQ 278

Query: 705 AEA 713
           A A
Sbjct: 279 ASA 281
>sp|Q92804|RBP56_HUMAN TATA-binding protein associated factor 2N (RNA-binding protein 56)
           (TAFII68) (TAF(II)68)
          Length = 592

 Score = 34.3 bits (77), Expect = 0.56
 Identities = 26/114 (22%), Positives = 42/114 (36%), Gaps = 7/114 (6%)
 Frame = +3

Query: 135 GNNYRASCDYGQSRDYYHMKQTGXXXXXXXXXXXXXXXXVNKLRGFESS--ETGRDYSYS 308
           G NY     YGQS+  Y     G                  + +  ESS  + GR  SY 
Sbjct: 47  GQNYSGYSSYGQSQSGYSQSYGGYENQKQSSYSQQPYNNQGQQQNMESSGSQGGRAPSYD 106

Query: 309 Q-----YYDYKPRNGYEKYSNSYDNNPKMKKKPTRENDASKKDETFKEDYDFET 455
           Q        Y  ++GY+++  SYD      ++    +   +   + +E+Y   T
Sbjct: 107 QPDYGQQDSYDQQSGYDQHQGSYDEQSNYDQQHDSYSQNQQSYHSQRENYSHHT 160
>sp|P25870|CLH_DICDI Clathrin heavy chain
          Length = 1694

 Score = 31.2 bits (69), Expect = 4.7
 Identities = 19/61 (31%), Positives = 30/61 (49%)
 Frame = -1

Query: 357 N*NTFHIHYEVYNHNIGSKNNHGQFLKIRSPVIYLHPYYCNTDLDGNHCDIYLFVSCDNN 178
           N N F   +E   + I + N  GQ L +      + PY CNT    N+ ++ + ++C NN
Sbjct: 299 NENIFVTAFEESTNGIIAVNRKGQVLSVSIDDKNIIPYICNT---LNNLELAISMACKNN 355

Query: 177 L 175
           L
Sbjct: 356 L 356
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 93,682,913
Number of Sequences: 369166
Number of extensions: 1724626
Number of successful extensions: 5198
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4945
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5174
length of database: 68,354,980
effective HSP length: 111
effective length of database: 47,849,395
effective search space used: 10383318715
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)

Cluster detail

DrC_00792

  1. Dr_sW_020_G17
  2. Dr_sW_016_D15