Planarian EST Database


Dr_sW_006_N16

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_006_N16
         (696 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P36124|SET3_YEAST  SET domain protein 3                         55   2e-07
sp|Q9BVI0|PHF20_HUMAN  PHD finger protein 20 (Hepatocellular...    51   3e-06
sp|Q8BLG0|PHF20_MOUSE  PHD finger protein 20 (Hepatocellular...    51   3e-06
sp|Q10362|YDBB_SCHPO  Hypothetical protein C22E12.11c in chr...    50   7e-06
sp|P42948|SET4_YEAST  SET domain protein 4                         45   2e-04
sp|Q9C810|Y1342_ARATH  PHD finger protein At1g33420                44   4e-04
sp|Q9Y7V2|YCHD_SCHPO  Protein C645.13 in chromosome III            42   0.001
sp|O44757|LIN59_CAEEL  Protein lin-59 (Abnormal cell lineage...    40   0.005
sp|Q680Q4|SIZ1_ARATH  Sumoylation ligase E3 (SUMO E3 ligase)...    38   0.029
sp|P36106|YKA5_YEAST  Hypothetical 67.9 kDa protein RPL14A-A...    37   0.038
>sp|P36124|SET3_YEAST SET domain protein 3
          Length = 751

 Score = 55.1 bits (131), Expect = 2e-07
 Identities = 21/56 (37%), Positives = 34/56 (60%), Gaps = 4/56 (7%)
 Frame = +1

Query: 151 MVKCPCGGNKDDGIMIMCEICKNWQHSICFAIIEEDLVPEVHICDTC----ADLNL 306
           ++ C C  N DDG  I C+ C  WQH+IC+ I +  + P+ ++C++C     D+NL
Sbjct: 117 IITCICDLNDDDGFTIQCDHCNRWQHAICYGIKDIGMAPDDYLCNSCDpreVDINL 172
>sp|Q9BVI0|PHF20_HUMAN PHD finger protein 20 (Hepatocellular carcinoma-associated antigen
           58) (Glioma-expressed antigen 2) (Transcription factor
           TZP) (Novel zinc finger protein)
          Length = 1012

 Score = 51.2 bits (121), Expect = 3e-06
 Identities = 20/49 (40%), Positives = 31/49 (63%)
 Frame = +1

Query: 151 MVKCPCGGNKDDGIMIMCEICKNWQHSICFAIIEEDLVPEVHICDTCAD 297
           +V+C C   +++  MI CE C+ WQH +C  ++EE+ VPE + C  C D
Sbjct: 652 VVRCICEVQEENDFMIQCEECQCWQHGVCMGLLEEN-VPEKYTCYVCQD 699
>sp|Q8BLG0|PHF20_MOUSE PHD finger protein 20 (Hepatocellular carcinoma-associated antigen
           58 homolog)
          Length = 1010

 Score = 51.2 bits (121), Expect = 3e-06
 Identities = 20/49 (40%), Positives = 31/49 (63%)
 Frame = +1

Query: 151 MVKCPCGGNKDDGIMIMCEICKNWQHSICFAIIEEDLVPEVHICDTCAD 297
           +V+C C   +++  MI CE C+ WQH +C  ++EE+ VPE + C  C D
Sbjct: 655 VVRCICEVQEENDFMIQCEECQCWQHGVCMGLLEEN-VPEKYTCYVCQD 702
>sp|Q10362|YDBB_SCHPO Hypothetical protein C22E12.11c in chromosome I
          Length = 859

 Score = 49.7 bits (117), Expect = 7e-06
 Identities = 24/76 (31%), Positives = 39/76 (51%), Gaps = 1/76 (1%)
 Frame = +1

Query: 154 VKCPCGGNKDDGIMIMCEICKNWQHSICFAIIEEDLVPEVHICDTCADLNLELVPT-DVK 330
           ++C C    DDG  I CE C+ WQH++C   I+ + VPE + C+ C    ++      ++
Sbjct: 4   IRCVCPFEDDDGFTIQCESCEVWQHAVC-VNIDANNVPEKYFCEQCQPRPIDADKAHKIQ 62

Query: 331 LKNVNVEDRQETCLIR 378
           L  +  E+ Q   L R
Sbjct: 63  LARLQREEEQSRILSR 78
>sp|P42948|SET4_YEAST SET domain protein 4
          Length = 560

 Score = 45.1 bits (105), Expect = 2e-04
 Identities = 21/82 (25%), Positives = 40/82 (48%), Gaps = 1/82 (1%)
 Frame = +1

Query: 49  KRSEADSAKETPSKSGNISKMISTGGNAAHEEEFMVKCPCGGNKD-DGIMIMCEICKNWQ 225
           K+++ D++K + + + +  +   T    A + +    C CG +   D + I C  CK WQ
Sbjct: 126 KQNQDDNSKVSVTHNESSKENKITPSMRAEDNKPKNGCICGSSDSKDELFIQCNKCKTWQ 185

Query: 226 HSICFAIIEEDLVPEVHICDTC 291
           H +C+A  + D +    +C  C
Sbjct: 186 HKLCYAFKKSDPIKRDFVCKRC 207
>sp|Q9C810|Y1342_ARATH PHD finger protein At1g33420
          Length = 697

 Score = 43.9 bits (102), Expect = 4e-04
 Identities = 19/54 (35%), Positives = 27/54 (50%), Gaps = 1/54 (1%)
 Frame = +1

Query: 142 EEFMVKCPCGGNKDDG-IMIMCEICKNWQHSICFAIIEEDLVPEVHICDTCADL 300
           + + V C CG   DDG  M+ C+ C  W H+ C  I   D +P   +C  C +L
Sbjct: 600 DNWKVDCKCGTKDDDGERMLACDGCGVWHHTRCIGINNADALPSKFLCFRCIEL 653
>sp|Q9Y7V2|YCHD_SCHPO Protein C645.13 in chromosome III
          Length = 721

 Score = 42.4 bits (98), Expect = 0.001
 Identities = 17/50 (34%), Positives = 29/50 (58%), Gaps = 1/50 (2%)
 Frame = +1

Query: 145 EFMVKCPCGGNKDDG-IMIMCEICKNWQHSICFAIIEEDLVPEVHICDTC 291
           E +V+C C   +D G   + C+ C  WQH+ C  + ++D +PE + C+ C
Sbjct: 18  ETVVRCVCKSQEDIGDTWVQCDGCDCWQHASCVGLADKD-IPESYYCEVC 66
>sp|O44757|LIN59_CAEEL Protein lin-59 (Abnormal cell lineage protein 59)
          Length = 1312

 Score = 40.4 bits (93), Expect = 0.005
 Identities = 27/99 (27%), Positives = 45/99 (45%), Gaps = 12/99 (12%)
 Frame = +1

Query: 154  VKCPCGGNKDDGIMIMCEICKNWQHSIC--FAIIEEDLVPE---------VHICDTCADL 300
            V+C CG   ++G M+ C+ C  W H  C  + +   +   +          +ICD C + 
Sbjct: 968  VRCICGALDEEGTMVQCDTCHFWLHVDCCQYVVRSNEKAQKSKNPPSDDGEYICDFCTNK 1027

Query: 301  NLELVPT-DVKLKNVNVEDRQETCLIRRAIMYCRSFEVI 414
               L P+ DVKL     + R E C   R+++  R  +V+
Sbjct: 1028 QNGLRPSADVKLTE-QPDVRFENCDYYRSLINRRGIQVV 1065
>sp|Q680Q4|SIZ1_ARATH Sumoylation ligase E3 (SUMO E3 ligase) (AtSIZ1)
          Length = 884

 Score = 37.7 bits (86), Expect = 0.029
 Identities = 26/96 (27%), Positives = 41/96 (42%), Gaps = 12/96 (12%)
 Frame = +1

Query: 40  NNYKRSEADSAKETPSKSGNISKMISTGGNAAHEE-----EFMVKCPCGGNKDDGIMIMC 204
           + Y++ +   A +  SK G +S   S        E     E  V+C CG + +   MI C
Sbjct: 71  DTYRKMQVSGASDLASK-GQVSSDTSNLKVKGEPEDPFQPEIKVRCVCGNSLETDSMIQC 129

Query: 205 E--ICKNWQHSICFAIIEEDL-----VPEVHICDTC 291
           E   C  WQH  C  + ++ +     +PE   C+ C
Sbjct: 130 EDPRCHVWQHVGCVILPDKPMDGNPPLPESFYCEIC 165
>sp|P36106|YKA5_YEAST Hypothetical 67.9 kDa protein RPL14A-AUR1 intergenic region
          Length = 594

 Score = 37.4 bits (85), Expect = 0.038
 Identities = 28/107 (26%), Positives = 42/107 (39%), Gaps = 21/107 (19%)
 Frame = +1

Query: 52  RSEADSAKETPSKSGNISKMISTGGNAAHEE-----EFMVKCPCGGNKDD--------GI 192
           + + DSA E   KS +  +      N   +E     E  V+C CG N ++        G 
Sbjct: 34  KKKVDSAIEKNKKSDSSQEPRKDTENVRTDEVDEADEGYVRCLCGANNENYDAAEYSHGD 93

Query: 193 MIMCEICKNWQHSICF--------AIIEEDLVPEVHICDTCADLNLE 309
           M+ C+ C  WQH  C          ++ ED      +CD     +LE
Sbjct: 94  MVQCDGCDTWQHIKCMTDGKDTIDGLMSEDSKYYCELCDPSLYAHLE 140
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 73,342,468
Number of Sequences: 369166
Number of extensions: 1479832
Number of successful extensions: 3953
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3806
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3946
length of database: 68,354,980
effective HSP length: 107
effective length of database: 48,588,335
effective search space used: 6024953540
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)