Planaria EST Database


DrC_02602

BLASTX 2.2.13 [Nov-27-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= DrC_02602
         (741 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q5R8E4|FA82B_PONPY  Protein FAM82B                             149   1e-35
sp|Q96DB5|FA82B_HUMAN  Protein FAM82B                             148   2e-35
sp|Q9DCV4|FA82B_MOUSE  Protein FAM82B                             147   2e-35
sp|Q07963|UBR2_YEAST  Ubiquitin-protein ligase E3 UBR2 (Ubiq...    34   0.36 
sp|O75934|BCAS2_HUMAN  Breast carcinoma amplified sequence 2...    33   0.61 
sp|Q5RAX7|BCAS2_PONPY  Breast carcinoma amplified sequence 2...    33   0.61 
sp|P70718|6PGD_ACTAC  6-phosphogluconate dehydrogenase, deca...    31   4.0  
sp|P35601|RFC1_MOUSE  Activator 1 140 kDa subunit (Replicati...    30   6.8  
sp|P55517|Y4JQ_RHISN  Hypothetical 115.9 kDa protein y4jQ          30   8.8  
>sp|Q5R8E4|FA82B_PONPY Protein FAM82B
          Length = 314

 Score =  149 bits (375), Expect = 1e-35
 Identities = 72/164 (43%), Positives = 105/164 (64%), Gaps = 1/164 (0%)
 Frame = +2

Query: 251 EYKSPELMWRNARALREMANNTKDKAEHKNLL-YEGLEIAKNCLEIDENNSGCNKWYAYL 427
           E +  EL+WR ARA R++A  ++   E K LL YE LE AK  LE +E++   +KWYA  
Sbjct: 111 ESEDAELLWRLARASRDVAQLSRTSEEEKKLLVYEALEYAKRALEKNESSFAAHKWYAIC 170

Query: 428 LDLTGRVEGIKKRIENSYAVKKHLDIASEKSNGTDTGALHALGVWCYEIAALSWLNRKLA 607
           L   G  EGIK +I N+Y +K+H + A E  N  D  ++H +G+WCY  A + W  R++A
Sbjct: 171 LSDVGDYEGIKAKIANAYIIKEHFEKAIEL-NPKDATSIHLMGIWCYTFAEMPWYQRRIA 229

Query: 608 STFFATPPTSTYQESLNYLMAAEKISPGFYSQNYLYIGKVFEKI 739
              FATPP+STY+++L+Y   AE++ P FYS+N L +GK + K+
Sbjct: 230 KMLFATPPSSTYEKALSYFHRAEQVDPNFYSKNLLLLGKTYLKL 273
>sp|Q96DB5|FA82B_HUMAN Protein FAM82B
          Length = 314

 Score =  148 bits (373), Expect = 2e-35
 Identities = 72/164 (43%), Positives = 104/164 (63%), Gaps = 1/164 (0%)
 Frame = +2

Query: 251 EYKSPELMWRNARALREMANNTKDKAEHKNLL-YEGLEIAKNCLEIDENNSGCNKWYAYL 427
           E +  EL+WR ARA R++A  ++   E K LL YE LE AK  LE +E++   +KWYA  
Sbjct: 111 ESEDAELLWRLARASRDVAQLSRTSEEEKKLLVYEALEYAKRALEKNESSFASHKWYAIC 170

Query: 428 LDLTGRVEGIKKRIENSYAVKKHLDIASEKSNGTDTGALHALGVWCYEIAALSWLNRKLA 607
           L   G  EGIK +I N+Y +K+H + A E  N  D  ++H +G+WCY  A + W  R++A
Sbjct: 171 LSDVGDYEGIKAKIANAYIIKEHFEKAIEL-NPKDATSIHLMGIWCYTFAEMPWYQRRIA 229

Query: 608 STFFATPPTSTYQESLNYLMAAEKISPGFYSQNYLYIGKVFEKI 739
              FATPP+STY+++L Y   AE++ P FYS+N L +GK + K+
Sbjct: 230 KMLFATPPSSTYEKALGYFHRAEQVDPNFYSKNLLLLGKTYLKL 273
>sp|Q9DCV4|FA82B_MOUSE Protein FAM82B
          Length = 305

 Score =  147 bits (372), Expect = 2e-35
 Identities = 71/164 (43%), Positives = 103/164 (62%), Gaps = 1/164 (0%)
 Frame = +2

Query: 251 EYKSPELMWRNARALREMANNTKDKAEHKNLL-YEGLEIAKNCLEIDENNSGCNKWYAYL 427
           E +  EL+WR ARA R++A  +K   E K +L YE L+ AK  LE  E++S  +KWYA  
Sbjct: 106 ESEDGELLWRLARASRDIAQLSKTSEEEKKVLVYEALDYAKRALEKKESSSAAHKWYAIC 165

Query: 428 LDLTGRVEGIKKRIENSYAVKKHLDIASEKSNGTDTGALHALGVWCYEIAALSWLNRKLA 607
           +   G  EGIK +I N+Y +K+H + A E  N  D  ++H +G+WCY  A + W  R++A
Sbjct: 166 ISDVGDYEGIKVKIANAYVIKEHFEKAIEL-NPKDATSIHLMGIWCYTFAEMPWYQRRIA 224

Query: 608 STFFATPPTSTYQESLNYLMAAEKISPGFYSQNYLYIGKVFEKI 739
              FA PP+STY+E+L Y   AE++ P FYS+N L +GK + K+
Sbjct: 225 KVLFANPPSSTYEEALRYFHKAEEVDPNFYSKNLLLLGKTYLKL 268
>sp|Q07963|UBR2_YEAST Ubiquitin-protein ligase E3 UBR2 (Ubiquitin-protein ligase E3
            component N-recognin-1 homolog)
          Length = 1872

 Score = 34.3 bits (77), Expect = 0.36
 Identities = 34/131 (25%), Positives = 60/131 (45%), Gaps = 6/131 (4%)
 Frame = +2

Query: 278  RNARALREMANNTKDKA----EHKNLLYEGLEIAKNCLEIDENNSGCNKWYAYLLDLTGR 445
            R  + + +M  N   +A    EH  L+   L ++K  L+ DE ++ C+K+   +  L G+
Sbjct: 1698 RIQQVIYDMVQNINTRAYPSPEHIQLIELPLNLSKFSLDNDEISNKCDKYEIAVCLLCGQ 1757

Query: 446  VEGIKKRIENSYAVKKHL--DIASEKSNGTDTGALHALGVWCYEIAALSWLNRKLASTFF 619
                K  I+ S A++ +L  +      NG +  +  A GV+        +L+     TF+
Sbjct: 1758 ----KCHIQKSIALQGYLQGECTDHMRNGCEITS--AYGVFLMTGTNAIYLSYGKRGTFY 1811

Query: 620  ATPPTSTYQES 652
            A P  S Y E+
Sbjct: 1812 AAPYLSKYGET 1822
>sp|O75934|BCAS2_HUMAN Breast carcinoma amplified sequence 2 (DNA amplified in mammary
           carcinoma 1 protein) (Spliceosome-associated protein SPF
           27)
 sp|Q9D287|BCAS2_MOUSE Breast carcinoma amplified sequence 2 homolog (DNA amplified in
           mammary carcinoma 1 protein)
          Length = 225

 Score = 33.5 bits (75), Expect = 0.61
 Identities = 21/79 (26%), Positives = 39/79 (49%), Gaps = 1/79 (1%)
 Frame = +2

Query: 290 ALREMANNTKDKAEHKNLLYEGLEIAKNCLEIDENNSGCNKWYAYLLDLTGRVEGIKKRI 469
           A +E  NN+  + EH+ +  E LE+         +  GCN W  Y  +L   +E  +K +
Sbjct: 102 AWQECVNNSMAQLEHQAVRIENLELM--------SQHGCNAWKVYNENLVHMIEHAQKEL 153

Query: 470 ENSYAVKKHL-DIASEKSN 523
           +    ++KH+ D+  ++ N
Sbjct: 154 QK---LRKHIQDLNWQRKN 169
>sp|Q5RAX7|BCAS2_PONPY Breast carcinoma amplified sequence 2 homolog
          Length = 226

 Score = 33.5 bits (75), Expect = 0.61
 Identities = 21/79 (26%), Positives = 39/79 (49%), Gaps = 1/79 (1%)
 Frame = +2

Query: 290 ALREMANNTKDKAEHKNLLYEGLEIAKNCLEIDENNSGCNKWYAYLLDLTGRVEGIKKRI 469
           A +E  NN+  + EH+ +  E LE+         +  GCN W  Y  +L   +E  +K +
Sbjct: 103 AWQECVNNSMAQLEHQAVRIENLELM--------SQHGCNAWKVYNENLVHMIEHAQKEL 154

Query: 470 ENSYAVKKHL-DIASEKSN 523
           +    ++KH+ D+  ++ N
Sbjct: 155 QK---LRKHIQDLNWQRKN 170
>sp|P70718|6PGD_ACTAC 6-phosphogluconate dehydrogenase, decarboxylating
          Length = 484

 Score = 30.8 bits (68), Expect = 4.0
 Identities = 22/70 (31%), Positives = 34/70 (48%)
 Frame = +2

Query: 329 EHKNLLYEGLEIAKNCLEIDENNSGCNKWYAYLLDLTGRVEGIKKRIENSYAVKKHLDIA 508
           E    L EG+ ++ + L+   N     +  +YL+D+T  + G K   + S  V K LD A
Sbjct: 201 EAYQFLKEGVGLSDDELQATFNEWRNTELDSYLIDITADILGYKD-ADGSRLVDKVLDTA 259

Query: 509 SEKSNGTDTG 538
            +K  G  TG
Sbjct: 260 GQKGTGKWTG 269
>sp|P35601|RFC1_MOUSE Activator 1 140 kDa subunit (Replication factor C large subunit) (A1
            140 kDa subunit) (RF-C 140 kDa subunit) (Activator 1
            large subunit) (A1-P145) (Differentiation-specific
            element binding protein) (ISRE-binding protein)
          Length = 1131

 Score = 30.0 bits (66), Expect = 6.8
 Identities = 50/190 (26%), Positives = 74/190 (38%), Gaps = 34/190 (17%)
 Frame = +2

Query: 260  SPELMWRNARALRE---MANNTKDKAEHKNL--LYEGLEIAKNCLEIDENNSGCNKWYAY 424
            +P +  R+A  + E   MA N +D+   + L  L +  +I   C+  D N+        Y
Sbjct: 692  APSVSARHALIMDEVDGMAGN-EDRGGIQELIGLIKHTKIPIICMCNDRNHPKIRSLVHY 750

Query: 425  LLDLT---GRVEGIKKRIENSYAVKKHLDIASEKSNGTDTGA-------LHALGVWCYEI 574
              DL     RVE IK  +  S A K+ L I     N    GA       LH L +WC + 
Sbjct: 751  CFDLRFQRPRVEQIKSAML-SIAFKEGLKIPPPAMNEIILGANQDVRQVLHNLSMWCAQS 809

Query: 575  AALSWLNRKLAS----------TFFATPPTSTYQESLNYLMAAEK---------ISPGFY 697
             AL++   K  S           F  T       E   ++   +K         I+P F 
Sbjct: 810  KALTYDQAKADSQRAKKDIRLGPFDVTRKVFAAGEETAHMSLMDKSDLFFHDYSIAPLFV 869

Query: 698  SQNYLYIGKV 727
             +NYL++  V
Sbjct: 870  QENYLHVKPV 879
>sp|P55517|Y4JQ_RHISN Hypothetical 115.9 kDa protein y4jQ
          Length = 1039

 Score = 29.6 bits (65), Expect = 8.8
 Identities = 22/93 (23%), Positives = 43/93 (46%), Gaps = 1/93 (1%)
 Frame = +2

Query: 212 EQCDLFKNRYHSDEYKSPELMWRNARALREMANNTKD-KAEHKNLLYEGLEIAKNCLEID 388
           +Q +   N + SD+  S   +    R   E+ ++T D +A  K  L E +E ++  + + 
Sbjct: 250 DQYEFLANLFTSDQSLSKPDVLPFGRDFVEILSSTGDWRASPKAALAERVEPSRFTIPL- 308

Query: 389 ENNSGCNKWYAYLLDLTGRVEGIKKRIENSYAV 487
                 N+W  YLLD  G ++G+    +  + +
Sbjct: 309 -----LNRWCGYLLDSIGILDGVSSAPDREFDI 336
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 73,286,697
Number of Sequences: 369166
Number of extensions: 1311500
Number of successful extensions: 3147
Number of sequences better than 10.0: 9
Number of HSP's better than 10.0 without gapping: 3081
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3138
length of database: 68,354,980
effective HSP length: 108
effective length of database: 48,403,600
effective search space used: 6679696800
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)