Planarian EST Database


Dr_sW_021_A01

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_021_A01
         (865 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q9V7P1|WDR50_DROME  Hypothetical WD-repeat protein l(2)k0...   105   1e-22
sp|Q5SSI6|WDR50_MOUSE  WD-repeat protein 50                       103   5e-22
sp|Q9Y5J1|WDR50_HUMAN  WD-repeat protein 50                       100   5e-21
sp|Q9FMU5|WDR50_ARATH  WD-repeat protein At5g14050                 98   3e-20
sp|P78750|UTP18_SCHPO  Probable U3 small nucleolar RNA-assoc...    83   9e-16
sp|P42000|WDR50_CAEEL  Hypothetical WD-repeat protein B0280....    74   7e-13
sp|P40362|UTP18_YEAST  U3 small nucleolar RNA-associated pro...    70   1e-11
sp|Q8BU03|PWP2_MOUSE  Periodic tryptophan protein 2 homolog        41   0.004
sp|Q5ABA6|ATG18_CANAL  Autophagy-related protein 18                41   0.004
sp|P73594|Y1409_SYNY3  Hypothetical WD-repeat protein slr1409      40   0.006

>sp|Q9V7P1|WDR50_DROME Hypothetical WD-repeat protein l(2)k07824
          Length = 506

 Score =  105 bits (263), Expect = 1e-22
 Identities = 69/213 (32%), Positives = 111/213 (52%), Gaps = 2/213 (0%)
 Frame = +3

Query: 96  IRRLEVSPAGDYMVFSSENSSIEVVSTQTFLVCHKLQVSGAVKSMAFSHNGKYLHVFEVQ 275
           + R EVSP G ++V + +  +I +++ +T  + H  +  G VK   +S + K + V    
Sbjct: 295 MHRFEVSPCGKFIVTAGKFGAIHLLTAKTNELLHSFKQEGKVKGFTWSSDSKRILVCGST 354

Query: 276 GQISVFEFKENIFSRPVFLRRWTDYGCVNAVKITISPDNKYVACGS-SGVVNIYKFTDTW 452
             +SV   ++N+    +F+    D GC++   I +SP+ + +A GS  GVVNIY +   +
Sbjct: 355 SNVSVLNLRQNLIEH-IFM----DDGCIHGESIQLSPNQRLLATGSQEGVVNIYDYESIF 409

Query: 453 QTKFPKPIKEIQYQEGRINQLLFNNSSKILAVGYCLTNDMNNGRFKLIHMDSLTVFSNFP 632
            +K P+P K        I  L FN+SS++LA+  C +   N   FKL H  S TV+SNFP
Sbjct: 410 ASKAPQPEKRFMNLRTAITDLQFNHSSELLAM--CSSEAPN--AFKLAHFPSATVYSNFP 465

Query: 633 GP-VGTSRPTSLSFSPNDKYLAVGRFNGKVTTF 728
                    TS++FSP+  +LA      +V  F
Sbjct: 466 AQNEKVGFVTSMAFSPHSSFLAFATKGKQVPLF 498

>sp|Q5SSI6|WDR50_MOUSE WD-repeat protein 50
          Length = 552

 Score =  103 bits (258), Expect = 5e-22
 Identities = 69/218 (31%), Positives = 113/218 (51%), Gaps = 3/218 (1%)
 Frame = +3

Query: 84  KTSNIRRLEVSPAGDYMVFSSENSSIEVVSTQTFLVCHKLQVSGAVKSMAFSHNGKYLHV 263
           K   +++ EVSP G +++ S       ++S +T  +   ++++G + +  FS + K ++ 
Sbjct: 336 KEKTVKQFEVSPDGSFLLISGIAGFSHLLSMKTKELIGSMKINGRIAASTFSSDSKRIYT 395

Query: 264 FEVQGQISVFEFKENIFSRPVFLRRWTDYGCVNAVKITISPDNKYVACGS-SGVVNIYKF 440
           +   G++ V++    + SR   + R+ D G +  + I  S + +YVACGS SGVVNIY  
Sbjct: 396 YSENGEVYVWD----VNSRKC-MNRFLDEGSLCGLSIAASKNGQYVACGSKSGVVNIYNQ 450

Query: 441 TDTWQTKFPKPIKEIQYQEGRINQLLFNNSSKILAVGYCLTNDMNNGRFKLIHMDSLTVF 620
               Q   PKPIK I      +  L FN +++ILAV    +  M     +L+H+ S TVF
Sbjct: 451 DSCLQQTNPKPIKAIMNLVTGVTSLAFNPTTEILAVA---SRKMKEA-VRLVHLPSCTVF 506

Query: 621 SNFP--GPVGTSRPTSLSFSPNDKYLAVGRFNGKVTTF 728
           SNFP       SR  ++ FSP   Y A+G   G+   +
Sbjct: 507 SNFPVFKKSTLSRVQTMDFSPRGGYFALGNEKGRALMY 544

>sp|Q9Y5J1|WDR50_HUMAN WD-repeat protein 50
          Length = 556

 Score =  100 bits (249), Expect = 5e-21
 Identities = 67/218 (30%), Positives = 111/218 (50%), Gaps = 3/218 (1%)
 Frame = +3

Query: 84  KTSNIRRLEVSPAGDYMVFSSENSSIEVVSTQTFLVCHKLQVSGAVKSMAFSHNGKYLHV 263
           K   +R  EVSP G +++ +     + +++ +T  +   ++++G V +  FS + K ++ 
Sbjct: 340 KEKIVRSFEVSPDGSFLLINGIAGYLHLLAMKTKELIGSMKINGRVAASTFSSDSKKVYA 399

Query: 264 FEVQGQISVFEFKENIFSRPVFLRRWTDYGCVNAVKITISPDNKYVACGSS-GVVNIYKF 440
               G++ V++    + SR   L R+ D G +  + I  S + +YVACGS+ GVVNIY  
Sbjct: 400 SSGDGEVYVWD----VNSRKC-LNRFVDEGSLYGLSIATSRNGQYVACGSNCGVVNIYNQ 454

Query: 441 TDTWQTKFPKPIKEIQYQEGRINQLLFNNSSKILAVGYCLTNDMNNGRFKLIHMDSLTVF 620
               Q   PKPIK I      +  L FN +++ILA+     ++      +L+H+ S TVF
Sbjct: 455 DSCLQETNPKPIKAIMNLVTGVTSLTFNPTTEILAI----ASEKMKEAVRLVHLPSCTVF 510

Query: 621 SNFP--GPVGTSRPTSLSFSPNDKYLAVGRFNGKVTTF 728
           SNFP       S   ++ FSP   Y A+G   GK   +
Sbjct: 511 SNFPVIKNKNISHVHTMDFSPRSGYFALGNEKGKALMY 548

>sp|Q9FMU5|WDR50_ARATH WD-repeat protein At5g14050
          Length = 546

 Score = 98.2 bits (243), Expect = 3e-20
 Identities = 70/214 (32%), Positives = 112/214 (52%), Gaps = 2/214 (0%)
 Frame = +3

Query: 93  NIRRLEVSPAGDYMVFSSENSSIEVVSTQTFLVCHKLQVSGAVKSMAFSHNGKYLHVFEV 272
           ++   EVS   + + F      I +VST+T  +   L+++G+V+S+AFS +GK+L     
Sbjct: 335 SLEYFEVSQDSNTIAFVGNEGYILLVSTKTKELIGTLKMNGSVRSLAFSEDGKHLLSSGG 394

Query: 273 QGQISVFEFKENIFSRPVFLRRWTDYGCVNAVKITISPDNKYVACGSS-GVVNIYKFTDT 449
            GQ+ V++ +         L +  D G      +  S +    A G+  G+VNIYK ++ 
Sbjct: 395 DGQVYVWDLR-----TMKCLYKGVDEGSTCGTSLCSSLNGALFASGTDRGIVNIYKKSEF 449

Query: 450 WQTKFPKPIKEIQYQEGRINQLLFNNSSKILAVGYCLTNDMNNGRFKLIHMDSLTVFSNF 629
              K  KPIK +     +I+ + FN+ ++ILA+     + MN    KL+H+ SLTVFSN+
Sbjct: 450 VGGK-RKPIKTVDNLTSKIDFMKFNHDAQILAI----VSTMNKNSVKLVHVPSLTVFSNW 504

Query: 630 PGPVGTSR-PTSLSFSPNDKYLAVGRFNGKVTTF 728
           P P  T   P  L FSP   ++A+G   GKV  +
Sbjct: 505 PPPNSTMHYPRCLDFSPGSGFMAMGNAAGKVLLY 538

>sp|P78750|UTP18_SCHPO Probable U3 small nucleolar RNA-associated protein 18 (U3
           snoRNA-associated protein 18)
          Length = 556

 Score = 83.2 bits (204), Expect = 9e-16
 Identities = 65/250 (26%), Positives = 111/250 (44%), Gaps = 11/250 (4%)
 Frame = +3

Query: 3   RLVIFTNRSYYKVWHITNGNITDHNISKT-------SNIRRLEVSPAGDYMVFSSENSSI 161
           R++    R Y  +W + +  +    +S+         ++ R  V P G Y+     +  I
Sbjct: 275 RVIAAGRRKYMYIWDLESAQV--QKVSRMYGQENFQPSMERFHVDPTGKYIALEGRSGHI 332

Query: 162 EVVSTQTFLVCHKLQVSGAVKSMAFSHNGKYLHVFEVQGQISVFEFKENIFSRPVFLRRW 341
            ++   T       ++ G +  + F+ +G  + V     ++  F    N+  R V +RRW
Sbjct: 333 NLLHALTGQFATSFKIEGVLSDVLFTSDGSEMLVLSYGAEVWHF----NVEQRSV-VRRW 387

Query: 342 TDYGCVNAVKITISPDNKYVACGS-SGVVNIYKFTDTWQTKFPKPIKEIQYQEGRINQLL 518
                V+     + P NKY+A GS SG+VNIY    +     PKP+  +      IN + 
Sbjct: 388 QVQDGVSTTHFCLDPSNKYLAIGSKSGIVNIYDLQTSNADAAPKPVTTLDNITFSINSMS 447

Query: 519 FNNSSKILAVGYCLTNDMNNGRFKLIHMDSLTVFSNFP---GPVGTSRPTSLSFSPNDKY 689
           F+  S++LA+      D      +L+H+ S +VF N+P    P+G  R T L+F    + 
Sbjct: 448 FSQDSQVLAIASRGKKD----TLRLVHVPSFSVFRNWPTSATPLG--RVTCLAFGKGGE- 500

Query: 690 LAVGRFNGKV 719
           L VG   G+V
Sbjct: 501 LCVGNEAGRV 510

>sp|P42000|WDR50_CAEEL Hypothetical WD-repeat protein B0280.9 in chromosome III
          Length = 429

 Score = 73.6 bits (179), Expect = 7e-13
 Identities = 54/222 (24%), Positives = 100/222 (45%), Gaps = 4/222 (1%)
 Frame = +3

Query: 75  NISKTSNIRRLEVSPAGDYMVFSSENSSIEVVSTQTFLVCHKLQVSGAVKSMAF--SHNG 248
           N      IR   +S    ++  +  NS I V+   +      + +      + F  SH+ 
Sbjct: 207 NTVPKQGIRLFAISHDSQFLAIAGHNSHIYVLHATSMEHITTISLPANASEIKFFPSHSR 266

Query: 249 KYLHVFEVQGQISVFEFKENIFSRPVFLRRWTDYGCVNAVKITISPDNKYVACGS-SGVV 425
           +   + E  GQI +      +         +TD G V+   + IS    Y A GS +G+V
Sbjct: 267 EIWIICET-GQIVIANI--GLPGTKSSQHTFTDDGAVHGTTLAISQHGDYFATGSDTGIV 323

Query: 426 NIYKFTDTWQTKFPKPIKEIQYQEGRINQLLFNNSSKILAVGYCLTNDMNNGRFKLIHMD 605
           N+Y   D   +  P+P+  +      ++ + FN+ ++++A+     +++ +   +L+H+ 
Sbjct: 324 NVYSGNDCRNSTNPRPLFNVSNLVTAVSSIAFNSDAQLMAI----CSNVKDNHLRLVHVA 379

Query: 606 SLTVFSNFPGPVG-TSRPTSLSFSPNDKYLAVGRFNGKVTTF 728
           S T F NFP   G  +    + FSPN  Y+AVG  +G++  F
Sbjct: 380 SQTTFKNFPERNGKVTHARCVEFSPNGGYMAVGNDDGRLHVF 421

>sp|P40362|UTP18_YEAST U3 small nucleolar RNA-associated protein 18 (U3 snoRNA-associated
            protein 18)
          Length = 594

 Score = 69.7 bits (169), Expect = 1e-11
 Identities = 58/238 (24%), Positives = 104/238 (43%), Gaps = 26/238 (10%)
 Frame = +3

Query: 84   KTSNIRRLEVSPAGDYMVFSSENSSIEVVSTQTFLVCHKLQVSGAVKSMAFSHN----GK 251
            K ++++  + +     ++    N  I ++ + + L     ++ G +      +     GK
Sbjct: 359  KVAHLQNSQTNSVHGIVLLQGNNGWINILHSTSGLWLMGCKIEGVITDFCIDYQPISRGK 418

Query: 252  Y---LHVFEVQGQISVFEFKENIFSRPVFLRRWTDYGCVNAVKITIS------------- 383
            +   L      G++  F+  +N       +RRW D G V   KI +              
Sbjct: 419  FRTILIAVNAYGEVWEFDLNKNGH----VIRRWKDQGGVGITKIQVGGGTTTTCPALQIS 474

Query: 384  --PDNKYVACGS-SGVVNIYKFTDTWQTKFPKPIKEIQYQEGRINQLLFNNSSKILAVGY 554
                N+++A GS SG VN+Y   +   +  P P+  +      I+ L F+   +IL    
Sbjct: 475  KIKQNRWLAVGSESGFVNLYDRNNAMTSSTPTPVAALDQLTTTISNLQFSPDGQIL---- 530

Query: 555  CLTNDMNNGRFKLIHMDSLTVFSNFPG---PVGTSRPTSLSFSPNDKYLAVGRFNGKV 719
            C+ +       +L+H+ S +VFSN+P    P+G  + TS++FSP+   LAVG   GKV
Sbjct: 531  CMASRAVKDALRLVHLPSCSVFSNWPTSGTPLG--KVTSVAFSPSGGLLAVGNEQGKV 586

>sp|Q8BU03|PWP2_MOUSE Periodic tryptophan protein 2 homolog
          Length = 919

 Score = 41.2 bits (95), Expect = 0.004
 Identities = 46/212 (21%), Positives = 84/212 (39%), Gaps = 6/212 (2%)
 Frame = +3

Query: 105 LEVSPAGDYMVFSSENSSIEVVSTQT-FLVCHKLQVSGAVKSMAFSHNGKYLHVFEVQGQ 281
           L  SP G Y+V   ++  ++V +T + F      + S  V  + F+  G  +    + G 
Sbjct: 379 LAYSPDGQYIVTGGDDGKVKVWNTLSGFCFVTLTEHSSGVTGVTFTTTGHVIVTSSLDGT 438

Query: 282 ISVFEFKENIFSRPVFLRRWTDYGCVNAVKITISPDNKYVACGSSGVVNIYKFTDTWQTK 461
           +  F+       R     R T + CV      +    + V+ G+     I+     W  +
Sbjct: 439 VRAFDLHRYRNFRTFTSPRPTQFSCV-----AVDSSGEIVSAGAQDSFEIF----VWSMQ 489

Query: 462 FPKPIKEIQYQEGRINQLLFNNSSKILAVGYC-----LTNDMNNGRFKLIHMDSLTVFSN 626
             + +  +   EG ++ L FN    ILA         L +  ++ R K    ++LT+   
Sbjct: 490 TGRLLDVLSGHEGPVSGLCFNPMKSILASASWDKTVRLWDMFDSWRTK----ETLTL--- 542

Query: 627 FPGPVGTSRPTSLSFSPNDKYLAVGRFNGKVT 722
                 TS   +++F P+   LAV   N ++T
Sbjct: 543 ------TSDALAVTFRPDGAELAVATLNSQIT 568

 Score = 30.4 bits (67), Expect = 6.7
 Identities = 18/71 (25%), Positives = 33/71 (46%)
 Frame = +3

Query: 39  VWHITNGNITDHNISKTSNIRRLEVSPAGDYMVFSSENSSIEVVSTQTFLVCHKLQVSGA 218
           V+ + N       ++   NI+ + +SP G   +   E  +  +VS     V H     G+
Sbjct: 39  VFDLKNNRSNTLPLATKYNIKCVGLSPDGRLAIIVDEGGAALLVSLVCRSVLHHFHFKGS 98

Query: 219 VKSMAFSHNGK 251
           V S++FS +G+
Sbjct: 99  VHSVSFSPDGR 109

>sp|Q5ABA6|ATG18_CANAL Autophagy-related protein 18
          Length = 558

 Score = 41.2 bits (95), Expect = 0.004
 Identities = 44/155 (28%), Positives = 71/155 (45%), Gaps = 15/155 (9%)
 Frame = +3

Query: 51  TNG--NITDHNISKTSNIRRLEVSPAGDYMVFSSENSSIEVVSTQTFLVCHKLQVSGAVK 224
           TNG  N T +NIS  SN      +  GD ++F+   +S++ +S    +  HK      + 
Sbjct: 207 TNGGSNSTQNNISSVSNTP----NRVGDVIIFNL--TSLQPISV---IEAHK----STIA 253

Query: 225 SMAFSHNGKYL----------HVFEVQGQISVFEFKENIFSRPVFLRRWTDYGCVNAVKI 374
           SMAFS+NG YL           +FEV     +++F+   +   ++  R+           
Sbjct: 254 SMAFSNNGLYLATASDKGTIVRIFEVATGTKLYQFRRGTYPTKIYSLRF----------- 302

Query: 375 TISPDNKYV-ACGSSGVVNIYKF--TDTWQTKFPK 470
             S D+KYV A  SS  V+I++    +  +TK  K
Sbjct: 303 --SADDKYVLATSSSLTVHIFRLGEEEALETKHKK 335
>sp|P73594|Y1409_SYNY3 Hypothetical WD-repeat protein slr1409
          Length = 326

 Score = 40.4 bits (93), Expect = 0.006
 Identities = 21/86 (24%), Positives = 43/86 (50%)
 Frame = +3

Query: 36  KVWHITNGNITDHNISKTSNIRRLEVSPAGDYMVFSSENSSIEVVSTQTFLVCHKLQVSG 215
           K+W++    I ++ +  T  +   +  P G+++  +S++ +I        L+     V+ 
Sbjct: 237 KLWNLAGELIHEYKVVPTGWVNSAQFYPKGEWLATASDDGTIRFWQKDGQLIYELPLVNA 296

Query: 216 AVKSMAFSHNGKYLHVFEVQGQISVF 293
            + S++FS +GK L     QGQ+ VF
Sbjct: 297 RLTSLSFSPDGKQLAATSSQGQVWVF 322

 Score = 32.3 bits (72), Expect = 1.8
 Identities = 21/68 (30%), Positives = 32/68 (47%)
 Frame = +3

Query: 525 NSSKILAVGYCLTNDMNNGRFKLIHMDSLTVFSNFPGPVGTSRPTSLSFSPNDKYLAVGR 704
           NS++    G  L    ++G  +    D   ++     P+  +R TSLSFSP+ K LA   
Sbjct: 258 NSAQFYPKGEWLATASDDGTIRFWQKDGQLIYEL---PLVNARLTSLSFSPDGKQLAATS 314

Query: 705 FNGKVTTF 728
             G+V  F
Sbjct: 315 SQGQVWVF 322

  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 94,244,403
Number of Sequences: 369166
Number of extensions: 1855640
Number of successful extensions: 5256
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4939
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5220
length of database: 68,354,980
effective HSP length: 109
effective length of database: 48,218,865
effective search space used: 8582957970
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
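
The statistics footer can be cross-checked against the scores reported above. The following is a minimal sketch, not part of the BLAST output, assuming the standard Karlin-Altschul conversion from a raw score S to bits, (lambda * S - ln K) / ln 2, with the gapped Lambda and K printed in this report. The function and variable names are illustrative only. Because Lambda is printed to three decimals, the check is limited to hits whose bit score is shown with a decimal place. The last lines redo the database-length arithmetic; reading the quotient as the effective query length is an assumption, since the report does not print that value.

```python
import math

# Gapped Karlin-Altschul parameters, copied from the report footer.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, bit score as printed in the report) for several alignments.
reported = [(243, 98.2), (204, 83.2), (179, 73.6), (169, 69.7), (95, 41.2), (93, 40.4)]
for raw, printed in reported:
    print(f"raw {raw:3d}: computed {bit_score(raw):6.2f} bits, report shows {printed}")

# Length figures, also copied from the footer.
db_letters = 68_354_980
db_sequences = 184_735
effective_hsp_length = 109
search_space = 8_582_957_970

# Each database sequence is shortened by the effective HSP length.
effective_db_length = db_letters - db_sequences * effective_hsp_length

# Assumption: search space = effective query length * effective database length.
effective_query_length = search_space // effective_db_length

print("effective length of database:", effective_db_length)
print("implied effective query length:", effective_query_length)
```

Under these assumptions the computed database length matches the "effective length of database" line, and the search space divides evenly by it.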