Planarian EST Database


Dr_sW_018_C01

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_018_C01
         (785 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|O75717|WDHD1_HUMAN  WD repeat and HMG-box DNA binding pro...    50   9e-06
sp|P59328|WDHD1_MOUSE  WD repeat and HMG-box DNA binding pro...    48   3e-05
sp|Q99KP6|PRP19_MOUSE  PRP19/PSO4 homolog (Nuclear matrix pr...    48   3e-05
sp|Q9UMS4|PRP19_HUMAN  PRP19/PSO4 homolog (Nuclear matrix pr...    48   3e-05
sp|P42841|YN57_YEAST  Hypothetical 53.1 kDa Trp-Asp repeats ...    47   6e-05
sp|Q9JMJ4|PRP19_RAT  PRP19/PSO4 homolog (Neuronal differenti...    47   6e-05
sp|Q09866|YAG1_SCHPO  Hypothetical WD-repeat protein C12G12....    45   3e-04
sp|Q8N3Y1|FBXW8_HUMAN  F-box/WD-repeat protein 8 (F-box and ...    44   4e-04
sp|Q8C7V3|UTP15_MOUSE  U3 small nucleolar RNA-associated pro...    44   5e-04
sp|Q96DI7|WDR57_HUMAN  WD-repeat protein 57 (Prp8-binding pr...    44   7e-04
>sp|O75717|WDHD1_HUMAN WD repeat and HMG-box DNA binding protein 1 (Acidic nucleoplasmic
           DNA-binding protein 1) (And-1)
          Length = 1129

 Score = 49.7 bits (117), Expect = 9e-06
 Identities = 18/60 (30%), Positives = 37/60 (61%)
 Frame = +1

Query: 514 LLAGTRKGEIAIIDIRQNKIQNIFQAHDSCITAIKLDPLQYYFVSSATDGSIKIWQLSNQ 693
           + AG+    + I+D+  +  Q  F+ HD+ + ++  DP   +  S++ DGS+++WQ+S+Q
Sbjct: 109 IAAGSSDFLVKIVDVMDSSQQKTFRGHDAPVLSLSFDPKDIFLASASCDGSVRVWQISDQ 168
>sp|P59328|WDHD1_MOUSE WD repeat and HMG-box DNA binding protein 1 (Acidic nucleoplasmic
           DNA-binding protein 1) (And-1)
          Length = 970

 Score = 48.1 bits (113), Expect = 3e-05
 Identities = 16/60 (26%), Positives = 36/60 (60%)
 Frame = +1

Query: 514 LLAGTRKGEIAIIDIRQNKIQNIFQAHDSCITAIKLDPLQYYFVSSATDGSIKIWQLSNQ 693
           + AG+    + ++D+  N  Q  F+ HD+ + ++  DP   +  S++ DG++++W +S+Q
Sbjct: 109 IAAGSSDFLVKVVDVMDNSQQQTFRGHDAPVLSLSFDPKDIFLASASCDGTVRVWNISDQ 168
>sp|Q99KP6|PRP19_MOUSE PRP19/PSO4 homolog (Nuclear matrix protein 200) (Nuclear matrix
           protein SNEV)
          Length = 504

 Score = 47.8 bits (112), Expect = 3e-05
 Identities = 57/228 (25%), Positives = 92/228 (40%), Gaps = 22/228 (9%)
 Frame = +1

Query: 160 NKFGMGDSEGNFSLFHFSYADNSRAQYFQTRCPH-KGVLDFCFINSSSLVATAGSSA--- 327
           NK   G ++ N  +F     D S  Q   T   H K V    F  S  LV +A   A   
Sbjct: 235 NKILTGGADKNVVVF-----DKSTEQILATLKGHTKKVTSVVFHPSQELVFSASPDATIR 289

Query: 328 LHGLPSVNQLHAVGGGNFDTNSVNI-SMWDVLLPSGRNQVRNLSDPQLA----------I 474
           +  +P+ + +  V         +++ +  D LL S  +Q    SD Q             
Sbjct: 290 IWSVPNTSCVQVVRAHESAVTGLSLHATGDYLLSSSDDQYWAFSDIQTGRVLTKVTDETS 349

Query: 475 GVVALCYVSRYELLL--AGTRKGEIAIIDIRQNKIQNIFQAHDSCITAIKLDPLQYYFVS 648
           G    C     + L+   GT   +I I D+++      F  H   IT+I      YY  +
Sbjct: 350 GCSLTCAQFHPDGLIFGTGTMDSQIKIWDLKERTNVANFPGHSGPITSIAFSENGYYLAT 409

Query: 649 SATDGSIKIWQLSNQELFETFRSE---PIKASVFSKGD--LAVKGLDI 777
           +A D S+K+W L   + F+T + +    +K+ +F +    LA+ G D+
Sbjct: 410 AADDSSVKLWDLRKLKNFKTLQLDNNFEVKSLIFDQSGTYLALGGTDV 457
>sp|Q9UMS4|PRP19_HUMAN PRP19/PSO4 homolog (Nuclear matrix protein 200) (hPso4)
          Length = 504

 Score = 47.8 bits (112), Expect = 3e-05
 Identities = 57/228 (25%), Positives = 92/228 (40%), Gaps = 22/228 (9%)
 Frame = +1

Query: 160 NKFGMGDSEGNFSLFHFSYADNSRAQYFQTRCPH-KGVLDFCFINSSSLVATAGSSA--- 327
           NK   G ++ N  +F     D S  Q   T   H K V    F  S  LV +A   A   
Sbjct: 235 NKILTGGADKNVVVF-----DKSSEQILATLKGHTKKVTSVVFHPSQDLVFSASPDATIR 289

Query: 328 LHGLPSVNQLHAVGGGNFDTNSVNI-SMWDVLLPSGRNQVRNLSDPQLA----------I 474
           +  +P+ + +  V         +++ +  D LL S  +Q    SD Q             
Sbjct: 290 IWSVPNASCVQVVRAHESAVTGLSLHATGDYLLSSSDDQYWAFSDIQTGRVLTKVTDETS 349

Query: 475 GVVALCYVSRYELLL--AGTRKGEIAIIDIRQNKIQNIFQAHDSCITAIKLDPLQYYFVS 648
           G    C     + L+   GT   +I I D+++      F  H   IT+I      YY  +
Sbjct: 350 GCSLTCAQFHPDGLIFGTGTMDSQIKIWDLKERTNVANFPGHSGPITSIAFSENGYYLAT 409

Query: 649 SATDGSIKIWQLSNQELFETFRSE---PIKASVFSKGD--LAVKGLDI 777
           +A D S+K+W L   + F+T + +    +K+ +F +    LA+ G D+
Sbjct: 410 AADDSSVKLWDLRKLKNFKTLQLDNNFEVKSLIFDQSGTYLALGGTDV 457
>sp|P42841|YN57_YEAST Hypothetical 53.1 kDa Trp-Asp repeats containing protein in
           HXT14-PHA2 intergenic region
          Length = 465

 Score = 47.0 bits (110), Expect = 6e-05
 Identities = 34/126 (26%), Positives = 58/126 (46%), Gaps = 12/126 (9%)
 Frame = +1

Query: 412 DVLLPS---GRNQVRNL-------SDPQLAIGVVALCYVSRYELLLAGTRKGEIAIIDIR 561
           D++ P+   GR++V NL       S  ++   + A+ +      L+  T  GE ++ +  
Sbjct: 64  DIMPPNAYRGRDRVINLPSKFTHLSSNKVKHVIPAIQWTPEGRRLVVATYSGEFSLWNAS 123

Query: 562 QNKIQNIFQAHDSCITAIKLDPLQYYFVSSATDGSIKIWQ--LSNQELFETFRSEPIKAS 735
               + + QAHDS +T +K      + +S   DG IKIWQ   S  +  +   +E I+  
Sbjct: 124 SFTFETLMQAHDSAVTTMKYSHDSDWMISGDADGMIKIWQPNFSMVKEIDAAHTESIRDM 183

Query: 736 VFSKGD 753
            FS  D
Sbjct: 184 AFSSND 189

 Score = 37.7 bits (86), Expect = 0.036
 Identities = 18/51 (35%), Positives = 29/51 (56%), Gaps = 3/51 (5%)
 Frame = +1

Query: 535 GEIAIIDIRQN---KIQNIFQAHDSCITAIKLDPLQYYFVSSATDGSIKIW 678
           G +   D+ QN    I  I  AHD CIT++  +P+ + F ++A D +I+ W
Sbjct: 327 GSLKHFDLLQNLNEPILTIPYAHDKCITSLSYNPVGHIFATAAKDRTIRFW 377
>sp|Q9JMJ4|PRP19_RAT PRP19/PSO4 homolog (Neuronal differentiation-related gene protein)
          Length = 504

 Score = 47.0 bits (110), Expect = 6e-05
 Identities = 57/228 (25%), Positives = 91/228 (39%), Gaps = 22/228 (9%)
 Frame = +1

Query: 160 NKFGMGDSEGNFSLFHFSYADNSRAQYFQTRCPH-KGVLDFCFINSSSLVATAGSSA--- 327
           NK   G ++ N  +F     D S  Q   T   H K V    F  S  LV +A   A   
Sbjct: 235 NKILTGGADKNVVVF-----DKSTEQILATLKGHTKKVTSVVFHPSQELVFSASPDATIR 289

Query: 328 LHGLPSVNQLHAVGGGNFDTNSVNI-SMWDVLLPSGRNQVRNLSDPQLA----------I 474
           +  +P+ + +  V         +++ +  D LL S  +Q    SD Q             
Sbjct: 290 IWSVPNTSCVQVVRAHESAVTGLSLHATGDYLLSSSDDQYWAFSDIQTGRVLTKVTDETS 349

Query: 475 GVVALCYVSRYELLL--AGTRKGEIAIIDIRQNKIQNIFQAHDSCITAIKLDPLQYYFVS 648
           G    C     + L+   GT    I I D+++      F  H   IT+I      YY  +
Sbjct: 350 GCSLTCAQFHPDGLIFGTGTMDSHIKIWDLKERTNVANFPGHSGPITSIAFSENGYYLAT 409

Query: 649 SATDGSIKIWQLSNQELFETFRSE---PIKASVFSKGD--LAVKGLDI 777
           +A D S+K+W L   + F+T + +    +K+ +F +    LA+ G D+
Sbjct: 410 AADDSSVKLWDLRKLKNFKTLQLDNNFEVKSLIFDQSGTYLALGGTDV 457
>sp|Q09866|YAG1_SCHPO Hypothetical WD-repeat protein C12G12.01c in chromosome I
          Length = 905

 Score = 44.7 bits (104), Expect = 3e-04
 Identities = 42/180 (23%), Positives = 69/180 (38%), Gaps = 1/180 (0%)
 Frame = +1

Query: 142 NFDANGNKFGMGDSEGNFSLFHFSYADNSRAQYFQTRCPHKGVLDFCFINSSSLVATAGS 321
           +F  N    G G S GN SL    + +      FQ                 ++V    +
Sbjct: 52  SFHGNNEILGAGSSTGNISLLSVKHPE------FQ-----------------AVVTPGYA 88

Query: 322 SALHGLPSVNQLHAVGGGNFDTNSVNISMWDVLLPSGRNQVRNLSDPQLAIGVVALCYVS 501
              + L      H V G     N  ++ +WD+       +   L       GV ++CY  
Sbjct: 89  RPCNSLAFSETEHLVAGFAKSRNESSLKLWDLNSLLSDPKSSPLMQSSTLDGVSSVCYKK 148

Query: 502 RYELLLAGTRKGEIAIIDIRQNKIQNIFQAHDSCITAIKLDPLQ-YYFVSSATDGSIKIW 678
              LLL G+    + IID RQ ++ ++   +    + I +DP    YF +++ DG I I+
Sbjct: 149 DTPLLLTGSTSRSVHIIDTRQ-QLDSVSSVNTQYYSNIVVDPFSPNYFAANSYDGDIAIF 207
>sp|Q8N3Y1|FBXW8_HUMAN F-box/WD-repeat protein 8 (F-box and WD-40 domain protein 8) (F-box
           only protein 29)
          Length = 598

 Score = 44.3 bits (103), Expect = 4e-04
 Identities = 23/83 (27%), Positives = 42/83 (50%), Gaps = 1/83 (1%)
 Frame = +1

Query: 511 LLLAGTRKGEIAIIDIRQNKIQNIFQAHDSCITAIKLDPLQYYFVSSATDGSIKIWQL-S 687
           L+++G   G + I D+R   I     AH   ++A+++D   +  VS   +G + +W    
Sbjct: 447 LMVSGNMDGRVRIHDLRSGNIALSLSAHQLRVSAVQMD--DWKIVSGGEEGLVSVWDYRM 504

Query: 688 NQELFETFRSEPIKASVFSKGDL 756
           NQ+L+E +   P++   FS   L
Sbjct: 505 NQKLWEVYSGHPVQHISFSSHSL 527
>sp|Q8C7V3|UTP15_MOUSE U3 small nucleolar RNA-associated protein 15 homolog
           (Src-associated protein SAW)
          Length = 528

 Score = 43.9 bits (102), Expect = 5e-04
 Identities = 26/91 (28%), Positives = 44/91 (48%), Gaps = 4/91 (4%)
 Frame = +1

Query: 508 ELLLAGTRKGEIAIIDIRQNKIQNIFQAHDSCITAIKLDPLQYYFVSSATDGSIKIWQLS 687
           +LL+AG+  G + + DI        F+ H   +  +      Y+ VS A D ++K+W + 
Sbjct: 93  QLLVAGSEDGVVQLFDINGRAPLRQFEGHTKAVHTVDFTADNYHVVSGADDYTVKLWDIP 152

Query: 688 NQELFETFR--SEPIKASVFSK--GDLAVKG 768
           N +   TF+  S+ ++    SK   DL V G
Sbjct: 153 NSKEILTFKEHSDYVRCGCASKLNPDLFVTG 183
>sp|Q96DI7|WDR57_HUMAN WD-repeat protein 57 (Prp8-binding protein) (hPRP8BP) (U5
           snRNP-specific 40 kDa protein) (38 kDa splicing factor)
          Length = 357

 Score = 43.5 bits (101), Expect = 7e-04
 Identities = 20/95 (21%), Positives = 46/95 (48%)
 Frame = +1

Query: 400 ISMWDVLLPSGRNQVRNLSDPQLAIGVVALCYVSRYELLLAGTRKGEIAIIDIRQNKIQN 579
           + +WD+   +     +N         V+A+ +    + +++G    +I + D+RQNK+  
Sbjct: 176 VKLWDIRKKAAIQTFQNTYQ------VLAVTFNDTSDQIISGGIDNDIKVWDLRQNKLTY 229

Query: 580 IFQAHDSCITAIKLDPLQYYFVSSATDGSIKIWQL 684
             + H   +T + L     Y +S+A D ++++W +
Sbjct: 230 TMRGHADSVTGLSLSSEGSYLLSNAMDNTVRVWDV 264

 Score = 34.3 bits (77), Expect = 0.40
 Identities = 21/77 (27%), Positives = 35/77 (45%), Gaps = 2/77 (2%)
 Frame = +1

Query: 490 CYVSRY--ELLLAGTRKGEIAIIDIRQNKIQNIFQAHDSCITAIKLDPLQYYFVSSATDG 663
           CY +R   +L+  G+  G + + DIR+      FQ +   + A+  +      +S   D 
Sbjct: 157 CYPARRGPQLVCTGSDDGTVKLWDIRKKAAIQTFQ-NTYQVLAVTFNDTSDQIISGGIDN 215

Query: 664 SIKIWQLSNQELFETFR 714
            IK+W L   +L  T R
Sbjct: 216 DIKVWDLRQNKLTYTMR 232
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 86,242,654
Number of Sequences: 369166
Number of extensions: 1727180
Number of successful extensions: 5590
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4854
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5547
length of database: 68,354,980
effective HSP length: 108
effective length of database: 48,403,600
effective search space used: 7405750800
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)