Planarian EST Database


Dr_sW_003_D09

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_003_D09
         (628 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P20585|MSH3_HUMAN  DNA mismatch repair protein Msh3 (Dive...    34   0.27 
sp|P20641|VN02_VACCC  Protein N2                                   32   1.7  
sp|P14357|VN02_VACCV  Protein N2                                   31   3.0  
sp|Q9UBT6|POLK_HUMAN  DNA polymerase kappa (DINB protein) (D...    30   3.9  
sp|Q5HU00|ISPE_CAMJR  4-diphosphocytidyl-2-C-methyl-D-erythr...    30   5.0  
sp|O76014|K1H7_HUMAN  Keratin, type I cuticular Ha7 (Hair ke...    30   5.0  
sp|Q9US08|SPO3_SCHPO  Sporulation-specific protein 3               30   5.0  
sp|P38677|CMLE_NEUCR  Carboxy-cis,cis-muconate cyclase (3-ca...    30   6.6  
sp|P34019|VN02_VARV  Protein N2                                    30   6.6  
sp|P38346|YB9O_YEAST  Hypothetical 61.3 kDa protein in MRPL3...    30   6.6  
>sp|P20585|MSH3_HUMAN DNA mismatch repair protein Msh3 (Divergent upstream protein) (DUP)
           (Mismatch repair protein 1) (MRP1)
          Length = 1137

 Score = 34.3 bits (77), Expect = 0.27
 Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 3/100 (3%)
 Frame = +3

Query: 267 QKFLMCVSL--KLKTELHYLIHD-NASIHNDVLTTTECHKILKLPPYSPFLNPIENYFGT 437
           Q+F + V     LK+E   +I   N+ I +D+L T     IL++P     L+P+E+Y   
Sbjct: 636 QEFFLIVKTLYHLKSEFQAIIPAVNSHIQSDLLRTV----ILEIPE---LLSPVEHYLKI 688

Query: 438 IKSKLRKKLTNFDFGSNQSKFGTIKNTVDEIMAVEKEINL 557
           +  +  K     +   + S F  IK   DEI  V  EI +
Sbjct: 689 LNEQAAKVGDKTELFKDLSDFPLIKKRKDEIQGVIDEIRM 728
>sp|P20641|VN02_VACCC Protein N2
          Length = 175

 Score = 31.6 bits (70), Expect = 1.7
 Identities = 28/132 (21%), Positives = 52/132 (39%), Gaps = 7/132 (5%)
 Frame = +3

Query: 246 SVTQSIYQKFLMCVSLKLKTELHYLIHDNASIHNDVLTTTECHKILKLPPYSPF-LNPIE 422
           ++   I +   MC+     + +  +++   +++ D L  T+CH I +   Y    ++   
Sbjct: 32  NIMDCINRHINMCIQRTYSSSIIAILNRFLTMNKDELNNTQCHIIKEFMTYEQMAIDHYG 91

Query: 423 NYFGTIKSKLRKKLTNFDFGSNQSKFGTIKNT------VDEIMAVEKEINLSKYFHHIKK 584
            Y   I  ++RK+            F  IK T      VD +  V+K I      +  K 
Sbjct: 92  EYVNAILYQIRKRPNQH---HTIDLFKKIKRTPYDTFKVDPVEFVKKVIGFVSILNKYKP 148

Query: 585 FYPDCLYMNNIF 620
            Y   LY N ++
Sbjct: 149 VYSYVLYENVLY 160
>sp|P14357|VN02_VACCV Protein N2
          Length = 175

 Score = 30.8 bits (68), Expect = 3.0
 Identities = 28/131 (21%), Positives = 51/131 (38%), Gaps = 6/131 (4%)
 Frame = +3

Query: 246 SVTQSIYQKFLMCVSLKLKTELHYLIHDNASIHNDVLTTTECHKILKLPPYSPFLNPIEN 425
           ++   I +   MC+     + +  ++     ++ D L  T+CH I +   Y      I++
Sbjct: 32  NIMDCINRHINMCIQRTYSSSIIAILDRFLMMNKDELNNTQCHIIKEFMTYEQM--AIDH 89

Query: 426 YFGTIKSKLRKKLTNFDFGSNQSKFGTIKNT------VDEIMAVEKEINLSKYFHHIKKF 587
           Y G + + L +     +       F  IK T      VD +  V+K I      +  K  
Sbjct: 90  YGGYVNAILYQIRKRPNQHHTIDLFKRIKRTRYDTFKVDPVEFVKKVIGFVSILNKYKPV 149

Query: 588 YPDCLYMNNIF 620
           Y   LY N ++
Sbjct: 150 YSYVLYENVLY 160
>sp|Q9UBT6|POLK_HUMAN DNA polymerase kappa (DINB protein) (DINP)
          Length = 870

 Score = 30.4 bits (67), Expect = 3.9
 Identities = 31/155 (20%), Positives = 57/155 (36%), Gaps = 31/155 (20%)
 Frame = +3

Query: 180 PNISIIMAIRKNGPIHYKVLQGSVTQS------------IYQKFLMCVSLKLKTELHYLI 323
           PN   +M   K+ PI      G VT+             +YQ+  +   L  +T  HY +
Sbjct: 337 PNRQAVMDFIKDLPIRKVSGIGKVTEKMLKALGIITCTELYQQRALLSLLFSETSWHYFL 396

Query: 324 HDNASIHNDVLTTTECHKILKLPPYSPFLNPIENYFGTIKS------------------- 446
           H +  + +  LT     K + +      +N  E  +   +                    
Sbjct: 397 HISLGLGSTHLTRDGERKSMSVERTFSEINKAEEQYSLCQELCSELAQDLQKERLKGRTV 456

Query: 447 KLRKKLTNFDFGSNQSKFGTIKNTVDEIMAVEKEI 551
            ++ K  NF+  +  S   ++ +T +EI A+ KE+
Sbjct: 457 TIKLKNVNFEVKTRASTVSSVVSTAEEIFAIAKEL 491
>sp|Q5HU00|ISPE_CAMJR 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase (CMK)
           (4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol
           kinase)
 sp|Q9PNJ0|ISPE_CAMJE 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase (CMK)
           (4-(cytidine-5'-diphospho)-2-C-methyl-D-erythritol
           kinase)
          Length = 255

 Score = 30.0 bits (66), Expect = 5.0
 Identities = 18/52 (34%), Positives = 28/52 (53%), Gaps = 3/52 (5%)
 Frame = +3

Query: 441 KSKLRKKLTNFD---FGSNQSKFGTIKNTVDEIMAVEKEINLSKYFHHIKKF 587
           K+ +  KLT FD   +   +S+F  +K+  DE+  V+KE +  K F  I  F
Sbjct: 6   KANIFLKLTGFDSRKYHLLESRFILLKDVFDELELVDKESDSKKEFEIISNF 57
>sp|O76014|K1H7_HUMAN Keratin, type I cuticular Ha7 (Hair keratin, type I Ha7)
          Length = 449

 Score = 30.0 bits (66), Expect = 5.0
 Identities = 25/93 (26%), Positives = 38/93 (40%)
 Frame = +3

Query: 144 LPGYLQVPSQRGPNISIIMAIRKNGPIHYKVLQGSVTQSIYQKFLMCVSLKLKTELHYLI 323
           LPG   +P     NI I  A  KN       L G   +++  KFL         ++  L 
Sbjct: 79  LPGTCHIPG----NIGICGAYGKN------TLNGHEKETM--KFLNDRLANYLEKVRQLE 126

Query: 324 HDNASIHNDVLTTTECHKILKLPPYSPFLNPIE 422
            +NA +   +L  ++CH+    P Y  +   IE
Sbjct: 127 QENAELETTLLERSKCHESTVCPDYQSYFRTIE 159
>sp|Q9US08|SPO3_SCHPO Sporulation-specific protein 3
          Length = 1028

 Score = 30.0 bits (66), Expect = 5.0
 Identities = 36/155 (23%), Positives = 68/155 (43%), Gaps = 12/155 (7%)
 Frame = +3

Query: 183  NISIIMAIRKNGPIHYKVLQGSVTQSIYQKFLMCVSLKLKTELHYLIHDNASIHNDVLTT 362
            NI I   ++++G I Y+V    + + IY   L   + +    LH+    + +   + L  
Sbjct: 699  NIFISKPLKRSGSI-YEVFSDGIGRVIYGSVLDYEATRNNILLHFGFEIHCAPSEEELIE 757

Query: 363  TE-----CHKILKLPPYSPFLNPIENYFGTI------KSKLRKKLTNFDFGSNQSKFGTI 509
             E      H +L    YS   N I  + GT       K+++ K++  +    N+    +I
Sbjct: 758  REEAFKDFHNMLSFK-YSA--NEIYEFCGTHSRAEVHKNEVLKRMAYYLIDENKEI--SI 812

Query: 510  KNTVDEIMAVEKEINLSKYFHHI-KKFYPDCLYMN 611
               +  +  VE   + ++Y + + + FYPD LY+N
Sbjct: 813  LRRILSLRIVEPTTSFTRYEYDLWRTFYPDVLYLN 847
>sp|P38677|CMLE_NEUCR Carboxy-cis,cis-muconate cyclase (3-carboxy-cis,cis-muconate
           lactonizing enzyme) (CMLE)
          Length = 366

 Score = 29.6 bits (65), Expect = 6.6
 Identities = 23/88 (26%), Positives = 39/88 (44%), Gaps = 9/88 (10%)
 Frame = +3

Query: 345 NDVLTTTECHKIL--KLPPYSPFLNPIENY--FGTIKS-----KLRKKLTNFDFGSNQSK 497
           ND  T T    +L  K PPY+ + NP   +  +G + S     KL K + N+++  N   
Sbjct: 89  NDADTNTRAIFLLAAKQPPYAVYANPFYKFAGYGNVFSVSETGKLEKNVQNYEYQENTGI 148

Query: 498 FGTIKNTVDEIMAVEKEINLSKYFHHIK 581
            G + +   E      ++  +K + H K
Sbjct: 149 HGMVFDPT-ETYLYSADLTANKLWTHRK 175
>sp|P34019|VN02_VARV Protein N2
          Length = 177

 Score = 29.6 bits (65), Expect = 6.6
 Identities = 28/131 (21%), Positives = 51/131 (38%), Gaps = 6/131 (4%)
 Frame = +3

Query: 246 SVTQSIYQKFLMCVSLKLKTELHYLIHDNASIHNDVLTTTECHKILKLPPYSPFLNPIEN 425
           ++   I +   MC+     + +  ++     ++ D L  T+CH I +   Y      I++
Sbjct: 32  NIINCINRHINMCLQHTYSSSIIAILDRFLMMNKDELNNTQCHIIKEFMTYEQM--AIDH 89

Query: 426 YFGTIKSKLRKKLTNFDFGSNQSKFGTIKNT------VDEIMAVEKEINLSKYFHHIKKF 587
           Y G + + L +     +       F  IK T      VD +  V+K I      +  K  
Sbjct: 90  YGGYVNAILYQIRKRPNQHHTIDLFKKIKRTRYDTFKVDPVEFVKKVIGFVSILNKYKPV 149

Query: 588 YPDCLYMNNIF 620
           Y   LY N ++
Sbjct: 150 YSYVLYENVLY 160
>sp|P38346|YB9O_YEAST Hypothetical 61.3 kDa protein in MRPL37-RIF1 intergenic region
          Length = 545

 Score = 29.6 bits (65), Expect = 6.6
 Identities = 22/83 (26%), Positives = 34/83 (40%), Gaps = 3/83 (3%)
 Frame = +3

Query: 144 LPGYLQVPSQRGPNISIIMAIRKNGPIHYKVLQGSVTQSIYQKFLMCVSLKLKTELHYLI 323
           L GYL  PS+  P+I +++          K  + S+    Y+ FL       K+   Y+ 
Sbjct: 448 LDGYLSEPSRYAPSIDVLL---------LKCFRDSIILPYYESFLHTNDGASKSFQRYIF 498

Query: 324 HD---NASIHNDVLTTTECHKIL 383
            +   N     D LT  +C  IL
Sbjct: 499 SEEEQNGVTEEDKLTLLQCFGIL 521
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 70,901,137
Number of Sequences: 369166
Number of extensions: 1404461
Number of successful extensions: 3811
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3700
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3811
length of database: 68,354,980
effective HSP length: 106
effective length of database: 48,773,070
effective search space used: 4974853140
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)