Planarian EST Database


Dr_sW_019_H08

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_019_H08
         (837 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P16356|RPB1_CAEEL  DNA-directed RNA polymerase II largest...    39   0.014
sp|P35074|RPB1_CAEBR  DNA-directed RNA polymerase II largest...    39   0.018
sp|P04052|RPB1_DROME  DNA-directed RNA polymerase II largest...    35   0.26 
sp|Q25434|FP1_MYTCO  Adhesive plaque matrix protein precurso...    33   0.97 
sp|O15018|PDZK3_HUMAN  PDZ domain containing protein 3 (PDZ ...    33   0.97 
sp|P20823|HNF1A_HUMAN  Hepatocyte nuclear factor 1-alpha (HN...    33   1.3  
sp|P35085|CBPA_DICDI  Calcium-binding protein                      33   1.3  
sp|P35084|RPB1_DICDI  DNA-directed RNA polymerase II largest...    32   1.7  
sp|O14686|MLL2_HUMAN  Myeloid/lymphoid or mixed-lineage leuk...    32   1.7  
sp|P56945|BCA1_HUMAN  CRK-associated substrate (p130Cas) (Br...    32   1.7  
>sp|P16356|RPB1_CAEEL DNA-directed RNA polymerase II largest subunit
          Length = 1852

 Score = 39.3 bits (90), Expect = 0.014
 Identities = 40/134 (29%), Positives = 48/134 (35%), Gaps = 2/134 (1%)
 Frame = +1

Query: 37   YIPGPGAYKPESTIKKVYSK-APEYPFGIRHMERRTDDTPGPNVYS-VDPMLSRTVRSQK 210
            Y P    Y P S     YS  +P Y  G  +       +P    YS   P  S T     
Sbjct: 1689 YSPTSPTYSPTSP---TYSPTSPSYESGGGYSPSSPKYSPSSPTYSPTSPSYSPTSPQYS 1745

Query: 211  PSAPMISMSSRSKIGGFHEDLQKTPGPGTYKVTDPGVFKSRYPMYSMTSRNPMPGDTTQK 390
            P++P  S SS +           TP   TY  T P  F S  P YS TS    P   +  
Sbjct: 1746 PTSPQYSPSSPTY----------TPSSPTYNPTSPRGFSS--PQYSPTSPTYSPTSPSYT 1793

Query: 391  PGPGAYSAELVTFT 432
            P    YS    T+T
Sbjct: 1794 PSSPQYSPTSPTYT 1807
>sp|P35074|RPB1_CAEBR DNA-directed RNA polymerase II largest subunit
          Length = 1853

 Score = 38.9 bits (89), Expect = 0.018
 Identities = 39/134 (29%), Positives = 48/134 (35%), Gaps = 2/134 (1%)
 Frame = +1

Query: 37   YIPGPGAYKPESTIKKVYS-KAPEYPFGIRHMERRTDDTPGPNVYS-VDPMLSRTVRSQK 210
            Y P    Y P S   + YS  +P+Y             +P    YS   P  S T     
Sbjct: 1700 YSPTSPTYSPTSPSYEGYSPSSPKY-------------SPSSPTYSPTSPSYSPTSPQYS 1746

Query: 211  PSAPMISMSSRSKIGGFHEDLQKTPGPGTYKVTDPGVFKSRYPMYSMTSRNPMPGDTTQK 390
            P++P  S SS +           TP   TY  T P  F S  P YS TS    P   +  
Sbjct: 1747 PTSPQYSPSSPTY----------TPSSPTYNPTSPRAFSS--PQYSPTSPTYSPTSPSYT 1794

Query: 391  PGPGAYSAELVTFT 432
            P    YS    T+T
Sbjct: 1795 PSSPQYSPTSPTYT 1808

 Score = 35.8 bits (81), Expect = 0.15
 Identities = 36/129 (27%), Positives = 48/129 (37%), Gaps = 1/129 (0%)
 Frame = +1

Query: 28   SLDYIPGPGAYKPESTIKKVYSKAPEYPFGIRHMERRTDDTPGPNVYS-VDPMLSRTVRS 204
            S  Y P   +Y P S      S +P  P       R +  +P    YS   P  S T  +
Sbjct: 1648 SPSYSPTSPSYSPTSP-----SYSPTSPSYSPSSPRYSPTSP---TYSPTSPTYSPTSPT 1699

Query: 205  QKPSAPMISMSSRSKIGGFHEDLQKTPGPGTYKVTDPGVFKSRYPMYSMTSRNPMPGDTT 384
              P++P  S +S S  G      + +P   TY  T P  +    P YS TS    P   T
Sbjct: 1700 YSPTSPTYSPTSPSYEGYSPSSPKYSPSSPTYSPTSPS-YSPTSPQYSPTSPQYSPSSPT 1758

Query: 385  QKPGPGAYS 411
              P    Y+
Sbjct: 1759 YTPSSPTYN 1767
>sp|P04052|RPB1_DROME DNA-directed RNA polymerase II largest subunit
          Length = 1887

 Score = 35.0 bits (79), Expect = 0.26
 Identities = 41/161 (25%), Positives = 67/161 (41%), Gaps = 4/161 (2%)
 Frame = +1

Query: 4    LYSRPK--SLSLDYIPGPGAYKPESTIKKVYSK-APEYPFGIRHMERRTDDTPGPNVYSV 174
            LY+ P+  S + ++ P    Y P S+    YS  +P Y   ++     +    G N+YS 
Sbjct: 1611 LYASPRYASTTPNFNPQSTGYSPSSS---GYSPTSPVYSPTVQFQSSPSFAGSGSNIYSP 1667

Query: 175  DPMLSRTVRSQKPSAPMISMSSRSKIGGFHEDLQKTPGPGTYKVTDPGVFKSRYPMYSMT 354
                S +  +  P++P  S +S S           +P   +Y  T P  +    P YS T
Sbjct: 1668 GNAYSPSSSNYSPNSPSYSPTSPSY----------SPSSPSYSPTSP-CYSPTSPSYSPT 1716

Query: 355  SRNPMPGDTTQKPGPGAYSAELVTFTRPGAPKFT-FGIRHS 474
            S N  P   +  P    YSA       P +P ++  G+++S
Sbjct: 1717 SPNYTPVTPSYSPTSPNYSAS--PQYSPASPAYSQTGVKYS 1755

 Score = 30.8 bits (68), Expect = 4.8
 Identities = 39/154 (25%), Positives = 50/154 (32%), Gaps = 15/154 (9%)
 Frame = +1

Query: 37   YIPGPGAYKPESTIKKVYSKAPEYPFGIRHMERRTDDTPGPNVYS-VDPMLSRTVRSQKP 213
            Y P    Y P S        +P+Y             TPG   YS   P  S T     P
Sbjct: 1754 YSPTSPTYSPPSPSYDGSPGSPQY-------------TPGSPQYSPASPKYSPTSPLYSP 1800

Query: 214  SAPMISMSSRSKIGGFHEDLQKTPGPGTYKVTDPGV-------------FKSRYPMYSMT 354
            S+P  S S+           Q +P   TY  T P               +    P Y+ T
Sbjct: 1801 SSPQHSPSN-----------QYSPTGSTYSATSPRYSPNMSIYSPSSTKYSPTSPTYTPT 1849

Query: 355  SRNPMPGDTTQKP-GPGAYSAELVTFTRPGAPKF 453
            +RN  P      P  P  YS     ++ P +P F
Sbjct: 1850 ARNYSPTSPMYSPTAPSHYSPTSPAYS-PSSPTF 1882
>sp|Q25434|FP1_MYTCO Adhesive plaque matrix protein precursor (Foot protein 1) (MCFP1)
          Length = 872

 Score = 33.1 bits (74), Expect = 0.97
 Identities = 38/128 (29%), Positives = 45/128 (35%), Gaps = 11/128 (8%)
 Frame = +1

Query: 49  PGAYKPESTIKKVYSKAPEYPFGIRHMERRTDDTPGPNVYSVDPMLSRTVRSQKPSAPMI 228
           P  YKP+ T    Y   P YP               P  Y   P    T   QKPS P I
Sbjct: 279 PPTYKPKVTYPPTYKPKPSYP------PTYKPKITYPPTYKPKPSYP-TPYKQKPSYPPI 331

Query: 229 SMSSRSKIGGFHEDLQKTPGPGTY--KVTDPGVFKSR--YP-------MYSMTSRNPMPG 375
             S  S    +     K   P TY  K+T P  +K +  YP        YS T +  +  
Sbjct: 332 YKSKSSYPTSYK---SKKTYPPTYKPKITYPPTYKPKPSYPPSYKPKKTYSPTYKPKITY 388

Query: 376 DTTQKPGP 399
             T KP P
Sbjct: 389 PPTYKPKP 396

 Score = 31.2 bits (69), Expect = 3.7
 Identities = 34/117 (29%), Positives = 41/117 (35%), Gaps = 2/117 (1%)
 Frame = +1

Query: 49  PGAYKPESTIKKVYSKAPEYPFGIRHMERRTDDTPGPNVYSVDPMLSRTVRSQKPSAPMI 228
           P  YKP+ T    Y   P YP               P  Y   P    T   QKPS P I
Sbjct: 459 PPTYKPKITYPPTYKPKPSYP------PTYKPKITYPPTYKRKPSYP-TPYKQKPSYPPI 511

Query: 229 SMSSRSKIGGFHEDLQKTPGPGTY--KVTDPGVFKSRYPMYSMTSRNPMPGDTTQKP 393
             S  S    +     K   P TY  K+T P  +K + P Y  + +       T KP
Sbjct: 512 YKSKSSYPTSYK---SKKTYPPTYKPKITYPPTYKPK-PSYPPSYKPKTTYPPTYKP 564

 Score = 30.8 bits (68), Expect = 4.8
 Identities = 37/132 (28%), Positives = 50/132 (37%), Gaps = 9/132 (6%)
 Frame = +1

Query: 49  PGAYKPESTIKKVYSKAPEYPFGIR----HMERRTDDTPGPNVYSVDPMLSRTVRSQKPS 216
           P +YKP++T    Y     YP   +    +          P  Y   P    T   QKPS
Sbjct: 549 PPSYKPKTTYPPTYKPKIRYPPTYKPKASYPPTYKPKITYPPTYKPKPSYP-TPYKQKPS 607

Query: 217 APMISMSSRSKIGGFHEDLQKTPGPGTY--KVTDPGVFKSRYPMYSMTSRNPMPGDTTQK 390
            P I  S  S    +     K   P TY  K+T P  +K + P Y  + R  +    T K
Sbjct: 608 YPPIYKSKSSYPTAYK---SKKTYPPTYKPKITYPPTYKPK-PSYPPSYRPKITYPPTYK 663

Query: 391 PG---PGAYSAE 417
           P    P AY ++
Sbjct: 664 PKKSYPQAYKSK 675
>sp|O15018|PDZK3_HUMAN PDZ domain containing protein 3 (PDZ domain containing protein 2)
            (Activated in prostate cancer protein)
          Length = 2839

 Score = 33.1 bits (74), Expect = 0.97
 Identities = 28/101 (27%), Positives = 41/101 (40%), Gaps = 2/101 (1%)
 Frame = +1

Query: 151  PGPNVYSVDP-MLSRTVRSQKPSAPMISM-SSRSKIGGFHEDLQKTPGPGTYKVTDPGVF 324
            P P+  SVD   +SR     +P++P ++   +RS +   HE    +P PG      P   
Sbjct: 1247 PDPSKTSVDTGQVSRPENPSQPASPRVAKCKARSPVRLPHEG---SPSPGEKAAAPPDYS 1303

Query: 325  KSRYPMYSMTSRNPMPGDTTQKPGPGAYSAELVTFTRPGAP 447
            K+R    + T  N       +  GPGA          PG P
Sbjct: 1304 KTRSASETSTPHNTRRVAALRGAGPGAEGMTPAGAVLPGDP 1344
>sp|P20823|HNF1A_HUMAN Hepatocyte nuclear factor 1-alpha (HNF-1A) (Liver-specific
           transcription factor LF-B1) (LFB1) (Transcription factor
           1) (TCF-1)
          Length = 631

 Score = 32.7 bits (73), Expect = 1.3
 Identities = 38/165 (23%), Positives = 67/165 (40%), Gaps = 11/165 (6%)
 Frame = +1

Query: 43  PGPGAYKPESTIKKVYSKA--PEYPFGIRHMERRTDDTP-------GPNVYSVDPMLSRT 195
           PGPG   P  +   +   A  P    G+R+ +  T +T        GP V    P+   +
Sbjct: 293 PGPGPALPAHSSPGLPPPALSPSKVHGVRYGQPATSETAEVPSSSGGPLVTVSTPLHQVS 352

Query: 196 VRSQKPSAPMISMSSR--SKIGGFHEDLQKTPGPGTYKVTDPGVFKSRYPMYSMTSRNPM 369
               +PS  ++S  ++  S  GG    +       + + T PG+ +    +   +    +
Sbjct: 353 PTGLEPSHSLLSTEAKLVSAAGGPLPPVSTLTALHSLEQTSPGLNQQPQNLIMAS----L 408

Query: 370 PGDTTQKPGPGAYSAELVTFTRPGAPKFTFGIRHSQYKGEMIVNN 504
           PG  T   GPG  ++   TFT  GA     G+  +Q +   ++N+
Sbjct: 409 PGVMTI--GPGEPASLGPTFTNTGASTLVIGLASTQAQSVPVINS 451
>sp|P35085|CBPA_DICDI Calcium-binding protein
          Length = 467

 Score = 32.7 bits (73), Expect = 1.3
 Identities = 24/64 (37%), Positives = 27/64 (42%), Gaps = 6/64 (9%)
 Frame = +1

Query: 274 QKTPG----PGTYKVTDPGVFKSRYPMYSMTSRNPMPGDTTQKP--GPGAYSAELVTFTR 435
           Q TPG    PG Y    PG   S  P Y  T +   PG   Q P   PG Y  +     +
Sbjct: 34  QSTPGAPGAPGQYPPQQPGAPGSNLPPYPGTQQPGAPGAPGQYPPQQPGQYPPQ-----Q 88

Query: 436 PGAP 447
           PGAP
Sbjct: 89  PGAP 92
>sp|P35084|RPB1_DICDI DNA-directed RNA polymerase II largest subunit
          Length = 902

 Score = 32.3 bits (72), Expect = 1.7
 Identities = 37/142 (26%), Positives = 51/142 (35%), Gaps = 7/142 (4%)
 Frame = +1

Query: 28   SLDYIPGPGAYKPES-----TIKKVYSKAPEY-PFGIRHMERRTDDTPGPNVYS-VDPML 186
            S  Y P   +Y P S     T       +P Y P    +       +P    YS   P  
Sbjct: 760  SPSYSPTSPSYSPTSPFYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSY 819

Query: 187  SRTVRSQKPSAPMISMSSRSKIGGFHEDLQKTPGPGTYKVTDPGVFKSRYPMYSMTSRNP 366
            S T  S  P++P  S +S S           +P   +Y  T P  +    P YS +S + 
Sbjct: 820  SPTSPSYSPTSPSYSPTSPSY----------SPTSPSYSPTSPS-YSPTSPSYSPSSPSY 868

Query: 367  MPGDTTQKPGPGAYSAELVTFT 432
             P   +  P   +YS    TFT
Sbjct: 869  SPSSPSYSPSSPSYSPSSPTFT 890
>sp|O14686|MLL2_HUMAN Myeloid/lymphoid or mixed-lineage leukemia protein 2 (ALL1-related
            protein)
          Length = 5262

 Score = 32.3 bits (72), Expect = 1.7
 Identities = 31/134 (23%), Positives = 49/134 (36%), Gaps = 23/134 (17%)
 Frame = +1

Query: 154  GPNVYSVDPMLSRTVRSQKPSAPMISMSSRSKIGGFHEDLQKTPGPGTYKVTDPG----- 318
            GP  +  D  LSR      PS+  + ++SR  +GG     Q+ P PG+  +         
Sbjct: 2487 GPGSFPSDDRLSRPPPPATPSS--MDVNSRQLVGGSQAFYQRAPYPGSLPLQQQQQQLWQ 2544

Query: 319  ------------VFKSRYP------MYSMTSRNPMPGDTTQKPGPGAYSAELVTFTRPGA 444
                           +R+P      +      +P+ G +T+ PGPG           P  
Sbjct: 2545 QQQATAATSMRFAMSARFPSTPGPELGRQALGSPLAGISTRLPGPGE------PVPGPAG 2598

Query: 445  PKFTFGIRHSQYKG 486
            P     +RH+  KG
Sbjct: 2599 PAQFIELRHNVQKG 2612
>sp|P56945|BCA1_HUMAN CRK-associated substrate (p130Cas) (Breast cancer anti-estrogen
           resistance 1 protein)
          Length = 870

 Score = 32.3 bits (72), Expect = 1.7
 Identities = 24/94 (25%), Positives = 36/94 (38%), Gaps = 4/94 (4%)
 Frame = +1

Query: 151 PGPNVYSVDPMLSRTVRSQKPSAPMISMSSRSKIGGFHEDLQKTPGPGTYKVTDPGVFKS 330
           P P      PML  T + Q  S  ++   S+++ G     L + PGP     + P   K 
Sbjct: 92  PAPPASQYTPMLPNTYQPQPDSVYLVPTPSKAQQG-----LYQVPGPSPQFQSPPA--KQ 144

Query: 331 RYPMYSMTSRNPMPGDTTQ----KPGPGAYSAEL 420
                  T  +P P   T      PGPG  + ++
Sbjct: 145 TSTFSKQTPHHPFPSPATDLYQVPPGPGGPAQDI 178
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 103,728,254
Number of Sequences: 369166
Number of extensions: 2425341
Number of successful extensions: 6539
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 6119
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 6479
length of database: 68,354,980
effective HSP length: 109
effective length of database: 48,218,865
effective search space used: 8148988185
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
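
Note on the statistics above: the bit scores in the alignments can be approximately recomputed from the raw scores (the number in parentheses after "bits") using the gapped Lambda and K values listed in this report. The sketch below assumes the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, and the usual estimate E = search_space * 2^(-bits). The function names are illustrative and are not part of BLAST. Because the report prints Lambda and K in rounded form, recomputed E-values only approximate the reported ones.

```python
import math

# Gapped Karlin-Altschul parameters and search space, copied from this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 8148988185


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score (hypothetical helper)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def e_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE):
    """Estimate the expected number of chance hits scoring at least raw_score."""
    return search_space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores taken from the alignments in this report.
    for raw in (90, 89, 79, 74, 68):
        print(f"raw={raw}  bits={bit_score(raw):.1f}  E~{e_value(raw):.3g}")
```

For the top hit (raw score 90), this gives 39.3 bits, matching the reported value. The recomputed E-value comes out near the reported 0.014 but not identical to it, which is expected given the rounded parameters.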