Planarian EST Database


Dr_sW_022_O20

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_022_O20
         (563 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q9CR26|CF055_MOUSE  Protein C6orf55 homolog                    111   1e-24
sp|Q9NP79|CF055_HUMAN  Protein C6orf55 (Dopamine-responsive ...   110   2e-24
sp|Q06263|VTA1_YEAST  Vacuolar protein sorting-associated pr...    42   8e-04
sp|Q9BRQ0|PYGO2_HUMAN  Pygopus homolog 2                           41   0.002
sp|P10163|PR4S_HUMAN  Basic salivary proline-rich protein 4 ...    39   0.012
sp|Q50634|SECD_MYCTU  Protein-export membrane protein secD         39   0.012
sp|Q04118|PRB3_HUMAN  Basic salivary proline-rich protein 3 ...    37   0.026
sp|P30595|CHS2_RHIOL  Chitin synthase 2 (Chitin-UDP acetyl-g...    37   0.026
sp|P10162|PRB4L_HUMAN  Basic salivary proline-rich protein 4...    37   0.044
sp|Q6PB44|PTN23_MOUSE  Tyrosine-protein phosphatase, non-rec...    37   0.044
>sp|Q9CR26|CF055_MOUSE Protein C6orf55 homolog
          Length = 309

 Score =  111 bits (278), Expect = 1e-24
 Identities = 71/190 (37%), Positives = 103/190 (54%), Gaps = 26/190 (13%)
 Frame = +2

Query: 2   SGFLMDVLSVFGEVGEDIEKCRKYAKWKAVYINQCLKKGEIPHSGPI----DNDQEISDF 169
           +  L+DV++VFGE+ ++  K RKYA+WKA YI+ CLK GE P +GP+    +ND E ++ 
Sbjct: 121 ASLLIDVITVFGELTDENVKHRKYARWKATYIHNCLKNGETPQAGPVGIEEENDVEENE- 179

Query: 170 NFPTVTQPTRPPQGSSNV---PSSTQPNPKPRTQKPVGG----------------VDNES 292
           +    + PT+PPQ SS+    PS+  P      Q P G                   N  
Sbjct: 180 DVGATSLPTQPPQPSSSSAYDPSNLAPGSYSGIQIPPGAHAPANTPAEVPHSTGVTSNAV 239

Query: 293 KPINPTIASSYHPEPPPAAANS---HLTAEDFAHAEKLCKYAASALQYQDSPTALEYLTK 463
           +P   T+ ++   +P    A+     LT EDFA A+K CKYA SALQY+D  TA++ L K
Sbjct: 240 QPSPQTVPAAPAVDPDLYTASQGDIRLTPEDFARAQKYCKYAGSALQYEDVGTAVQNLQK 299

Query: 464 CVDLLKYGKK 493
            + LL  G++
Sbjct: 300 ALRLLTTGRE 309
>sp|Q9NP79|CF055_HUMAN Protein C6orf55 (Dopamine-responsive protein DRG-1)
          Length = 307

 Score =  110 bits (276), Expect = 2e-24
 Identities = 75/192 (39%), Positives = 100/192 (52%), Gaps = 28/192 (14%)
 Frame = +2

Query: 2   SGFLMDVLSVFGEVGEDIEKCRKYAKWKAVYINQCLKKGEIPHSGPI----DND-QEISD 166
           +  L+DV++VFGE+ ++  K RKYA+WKA YI+ CLK GE P +GP+    DND +E  D
Sbjct: 121 ASLLIDVITVFGELTDENVKHRKYARWKATYIHNCLKNGETPQAGPVGIEEDNDIEENED 180

Query: 167 FNFPTV-TQPTRPPQGS----SNVPSS------------TQPNPKPRTQKPVGGVDNESK 295
               ++ TQPT+P   S    SN+PS                N         G   N  +
Sbjct: 181 AGAASLPTQPTQPSSSSTYDPSNMPSGNYTGIQIPPGAHAPANTPAEVPHSTGVASNTIQ 240

Query: 296 PINPTIASSYHPEPPPAAANS------HLTAEDFAHAEKLCKYAASALQYQDSPTALEYL 457
           P   TI     P   PA  N+       LT EDFA A+K CKYA SALQY+D  TA++ L
Sbjct: 241 PTPQTI-----PAIDPALFNTISQGDVRLTPEDFARAQKYCKYAGSALQYEDVSTAVQNL 295

Query: 458 TKCVDLLKYGKK 493
            K + LL  G++
Sbjct: 296 QKALKLLTTGRE 307
>sp|Q06263|VTA1_YEAST Vacuolar protein sorting-associated protein VTA1 (VPS20-associated
           protein 1)
          Length = 330

 Score = 42.4 bits (98), Expect = 8e-04
 Identities = 24/60 (40%), Positives = 31/60 (51%), Gaps = 10/60 (16%)
 Frame = +2

Query: 329 PEPPPAAANSHLTAEDFA----------HAEKLCKYAASALQYQDSPTALEYLTKCVDLL 478
           P  P AA +   T ++              +KL KYA SAL Y+D PTA + LTK +DLL
Sbjct: 268 PSEPAAAEHKSYTKDELTKIMDRASKIEQIQKLAKYAISALNYEDLPTAKDELTKALDLL 327
>sp|Q9BRQ0|PYGO2_HUMAN Pygopus homolog 2
          Length = 406

 Score = 40.8 bits (94), Expect = 0.002
 Identities = 26/81 (32%), Positives = 38/81 (46%), Gaps = 5/81 (6%)
 Frame = +2

Query: 134 GPIDNDQEISDFNFPTVTQPT-RPPQGSSNVPSSTQPNPKPRTQKPVGGVDNESKPINPT 310
           GP    Q  +    P    P  RP QG  ++P +T P P P    P  G ++  KP+NP 
Sbjct: 207 GPPSLSQRFAQPGAPFGPSPLQRPGQGLPSLPPNTSPFPGPDPGFPGPGGEDGGKPLNPP 266

Query: 311 IASSYHPEP----PPAAANSH 361
            ++++  EP    P AA N +
Sbjct: 267 ASTAFPQEPHSGSPAAAVNGN 287
>sp|P10163|PR4S_HUMAN Basic salivary proline-rich protein 4 allele S precursor (Salivary
           proline-rich protein Po) (Parotid o protein) [Contains:
           Protein N1; Glycosylated protein A]
          Length = 247

 Score = 38.5 bits (88), Expect = 0.012
 Identities = 19/52 (36%), Positives = 23/52 (44%)
 Frame = +2

Query: 188 QPTRPPQGSSNVPSSTQPNPKPRTQKPVGGVDNESKPINPTIASSYHPEPPP 343
           +P  PPQ   N P    P  KP+   P GG  N  +P +P       P PPP
Sbjct: 182 KPQGPPQQEGNKPQGPPPPGKPQGPPPAGG--NPQQPQDPPAGKPQGPPPPP 231

 Score = 32.0 bits (71), Expect = 1.1
 Identities = 28/92 (30%), Positives = 34/92 (36%), Gaps = 6/92 (6%)
 Frame = +2

Query: 113 KGEIPHSG----PIDNDQEISDFNFPTVTQPTRPPQGSSNVPSSTQPNP-KPRTQKPVGG 277
           +G  PH G    P       S    P   +P RPP    N      P P KP    P GG
Sbjct: 90  QGPPPHPGKPERPPPQGGNQSQGTPPPPGKPERPPPQGGNQSHRPPPPPGKPERPPPQGG 149

Query: 278 VDNESKPINPTIASSYHPE-PPPAAANSHLTA 370
             ++  P +P       PE PPP   N   +A
Sbjct: 150 NQSQGPPPHPG-----KPEGPPPQEGNKSRSA 176

 Score = 30.8 bits (68), Expect = 2.4
 Identities = 18/53 (33%), Positives = 23/53 (43%), Gaps = 1/53 (1%)
 Frame = +2

Query: 200 PPQGSSNVPSSTQPNPKPRTQKPVGGVDNESKPINPTIASSYHPE-PPPAAAN 355
           PPQG +       P  KP  + P GG  ++  P +P       PE PPP   N
Sbjct: 61  PPQGGNQSQGPPPPPGKPEGRPPQGGNQSQGPPPHPG-----KPERPPPQGGN 108

 Score = 28.9 bits (63), Expect = 9.2
 Identities = 20/59 (33%), Positives = 24/59 (40%), Gaps = 10/59 (16%)
 Frame = +2

Query: 197 RPPQGSSNVPSSTQPNP-KPRTQKPVGGVDNESKPINPTIA---------SSYHPEPPP 343
           RPPQG  N      P+P KP    P GG  ++  P  P             S+ P PPP
Sbjct: 81  RPPQGG-NQSQGPPPHPGKPERPPPQGGNQSQGTPPPPGKPERPPPQGGNQSHRPPPPP 138
>sp|Q50634|SECD_MYCTU Protein-export membrane protein secD
          Length = 573

 Score = 38.5 bits (88), Expect = 0.012
 Identities = 40/144 (27%), Positives = 50/144 (34%), Gaps = 3/144 (2%)
 Frame = +2

Query: 44  GEDIEKCRKYAKWKAVYINQCLKKGEIPHSGPIDNDQEISDFNFPTVTQPTRPPQGSSNV 223
           G D  + R   +   +YI   L    +P     +  Q           QP  PP   S  
Sbjct: 101 GNDGSEARNLGQTARLYIRPVLNS--MPAQPAAEEPQPAPSAEPQPPGQPAAPPPAQSGA 158

Query: 224 PSSTQPNPKPRTQKPVGGVDNESKPINPTIASSYHPEPP---PAAANSHLTAEDFAHAEK 394
           P+S QP  +PR        D    P NPT  +S  P PP   PA       AE  A  +K
Sbjct: 159 PASPQPGAQPRPYPQ----DPAPSP-NPTSPASPPPAPPAEAPATDPRKDLAERIAQEKK 213

Query: 395 LCKYAASALQYQDSPTALEYLTKC 466
           L     S  QY          T+C
Sbjct: 214 L---RQSTNQYMQMVALQFQATRC 234
>sp|Q04118|PRB3_HUMAN Basic salivary proline-rich protein 3 precursor (Parotid salivary
           glycoprotein G1) (Proline-rich protein G1)
          Length = 309

 Score = 37.4 bits (85), Expect = 0.026
 Identities = 19/62 (30%), Positives = 24/62 (38%)
 Frame = +2

Query: 176 PTVTQPTRPPQGSSNVPSSTQPNPKPRTQKPVGGVDNESKPINPTIASSYHPEPPPAAAN 355
           P   +P  PP    N P    P  +P+   P GG  N  +P+ P       P PPP    
Sbjct: 241 PHPGKPQGPPPQEGNKPQRPPPPRRPQGPPPPGG--NPQQPLPPPAGKPQGPPPPPQGGR 298

Query: 356 SH 361
            H
Sbjct: 299 PH 300

 Score = 31.2 bits (69), Expect = 1.9
 Identities = 21/62 (33%), Positives = 24/62 (38%), Gaps = 2/62 (3%)
 Frame = +2

Query: 176 PTVTQPTRPPQGSSNVPSSTQPNP-KPRTQKPVGGVDNESKPINPTIASSYHPE-PPPAA 349
           P   +P  PP    N      P P KP  Q P GG  ++  P  P       PE PPP  
Sbjct: 52  PRPGKPEGPPPQGGNQSQGPPPRPGKPEGQPPQGGNQSQGPPPRPG-----KPEGPPPQG 106

Query: 350 AN 355
            N
Sbjct: 107 GN 108

 Score = 29.6 bits (65), Expect = 5.4
 Identities = 20/62 (32%), Positives = 24/62 (38%), Gaps = 2/62 (3%)
 Frame = +2

Query: 176 PTVTQPTRPPQGSSNVPSSTQPNP-KPRTQKPVGGVDNESKPINPTIASSYHPE-PPPAA 349
           P   +P  PP    N      P+P KP    P GG  ++  P  P       PE PPP  
Sbjct: 115 PRPGEPEGPPPQGGNQSQGPPPHPGKPEGPPPQGGNQSQGPPPRPG-----KPEGPPPQG 169

Query: 350 AN 355
            N
Sbjct: 170 GN 171

 Score = 29.3 bits (64), Expect = 7.0
 Identities = 26/87 (29%), Positives = 30/87 (34%), Gaps = 6/87 (6%)
 Frame = +2

Query: 113 KGEIPH----SGPIDNDQEISDFNFPTVTQPTRPPQGSSNVPSSTQPNP-KPRTQKPVGG 277
           +G  PH     GP       S    P   +P  PP    N      P P KP    P GG
Sbjct: 132 QGPPPHPGKPEGPPPQGGNQSQGPPPRPGKPEGPPPQGGNQSQGPPPRPGKPEGPPPQGG 191

Query: 278 VDNESKPINPTIASSYHPE-PPPAAAN 355
             ++  P  P       PE PPP   N
Sbjct: 192 NQSQGPPPRPG-----KPEGPPPQGGN 213
>sp|P30595|CHS2_RHIOL Chitin synthase 2 (Chitin-UDP acetyl-glucosaminyl transferase 2)
          Length = 858

 Score = 37.4 bits (85), Expect = 0.026
 Identities = 20/72 (27%), Positives = 29/72 (40%)
 Frame = +2

Query: 125 PHSGPIDNDQEISDFNFPTVTQPTRPPQGSSNVPSSTQPNPKPRTQKPVGGVDNESKPIN 304
           P   P   DQ   D   P +T P  PP      P        P  Q+P    +N   P++
Sbjct: 30  PFEDPYPEDQPHFDKQ-PLLTSPAYPPTQYPTSPPPPNFPGSPAVQQPYPPFNNNPSPVS 88

Query: 305 PTIASSYHPEPP 340
           P + + ++P PP
Sbjct: 89  PGVPAYFNPAPP 100
>sp|P10162|PRB4L_HUMAN Basic salivary proline-rich protein 4 allele L (Salivary
           proline-rich protein Po) (Parotid o protein) [Contains:
           Peptide P-D]
          Length = 276

 Score = 36.6 bits (83), Expect = 0.044
 Identities = 19/52 (36%), Positives = 22/52 (42%)
 Frame = +2

Query: 188 QPTRPPQGSSNVPSSTQPNPKPRTQKPVGGVDNESKPINPTIASSYHPEPPP 343
           +P  PPQ   N P    P  KP+   P GG  N  +P  P       P PPP
Sbjct: 211 KPQGPPQQEGNKPQGPPPPGKPQGPPPPGG--NPQQPQAPPAGKPQGPPPPP 260

 Score = 33.9 bits (76), Expect = 0.29
 Identities = 21/66 (31%), Positives = 27/66 (40%), Gaps = 10/66 (15%)
 Frame = +2

Query: 176 PTVTQPTRPPQGSSNVPSSTQPNP-KPRTQKPVGGVDNESKPINPTIA---------SSY 325
           PT  +P  PP    N    T P P KP  + P GG  ++  P +P             S+
Sbjct: 102 PTPGKPEGPPPQGGNQSQGTPPPPGKPEGRPPQGGNQSQGPPPHPGKPERPPPQGGNQSH 161

Query: 326 HPEPPP 343
            P PPP
Sbjct: 162 RPPPPP 167

 Score = 33.5 bits (75), Expect = 0.37
 Identities = 21/62 (33%), Positives = 27/62 (43%), Gaps = 2/62 (3%)
 Frame = +2

Query: 176 PTVTQPTRPPQGSSNVPSSTQPNP-KPRTQKPVGGVDNESKPINPTIASSYHPE-PPPAA 349
           P   +P RPP    N      P+P KP ++ P GG  ++  P  P       PE PPP  
Sbjct: 60  PHPGKPERPPPQGGNQSQGPPPHPGKPESRPPQGGHQSQGPPPTPG-----KPEGPPPQG 114

Query: 350 AN 355
            N
Sbjct: 115 GN 116

 Score = 30.8 bits (68), Expect = 2.4
 Identities = 18/53 (33%), Positives = 23/53 (43%), Gaps = 1/53 (1%)
 Frame = +2

Query: 200 PPQGSSNVPSSTQPNPKPRTQKPVGGVDNESKPINPTIASSYHPE-PPPAAAN 355
           PPQG +       P  KP  + P GG  ++  P +P       PE PPP   N
Sbjct: 27  PPQGGNQSQGPPPPPGKPEGRPPQGGNQSQGPPPHPG-----KPERPPPQGGN 74

 Score = 30.8 bits (68), Expect = 2.4
 Identities = 22/67 (32%), Positives = 27/67 (40%), Gaps = 2/67 (2%)
 Frame = +2

Query: 176 PTVTQPTRPPQGSSNVPSSTQPNP-KPRTQKPVGGVDNESKPINPTIASSYHPE-PPPAA 349
           P   +P RPP    N      P P KP    P GG  ++  P +P       PE PPP  
Sbjct: 144 PHPGKPERPPPQGGNQSHRPPPPPGKPERPPPQGGNQSQGPPPHPG-----KPEGPPPQE 198

Query: 350 ANSHLTA 370
            N   +A
Sbjct: 199 GNKSRSA 205

 Score = 29.6 bits (65), Expect = 5.4
 Identities = 20/57 (35%), Positives = 25/57 (43%), Gaps = 8/57 (14%)
 Frame = +2

Query: 197 RPPQGSSNVPSSTQPNP-KPRTQKPVGGVDNESKPINPTIASSYHPE-------PPP 343
           RPPQG  N      P+P KP    P GG  ++  P +P    S  P+       PPP
Sbjct: 47  RPPQGG-NQSQGPPPHPGKPERPPPQGGNQSQGPPPHPGKPESRPPQGGHQSQGPPP 102
>sp|Q6PB44|PTN23_MOUSE Tyrosine-protein phosphatase, non-receptor type 23
          Length = 1692

 Score = 36.6 bits (83), Expect = 0.044
 Identities = 25/89 (28%), Positives = 35/89 (39%), Gaps = 5/89 (5%)
 Frame = +2

Query: 122  IPHSGPIDNDQEISDFNFPTVTQPTRPPQGSSNVPSSTQPNPKPRTQKPVGGVDNESKPI 301
            +P +GP    Q           QP   PQ      S  QP P+P+ Q+P  G     +P+
Sbjct: 972  VPRTGPQAQAQPQPQPQPQPQPQPQPQPQPQPQSQSQPQPQPQPQPQRPAFGPQPTQQPL 1031

Query: 302  ---NPTIASSYHPE--PPPAAANSHLTAE 373
               +P +  S  P   PPP     H T +
Sbjct: 1032 PFQHPHLFPSQAPGILPPPPPTPYHFTPQ 1060
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 64,032,188
Number of Sequences: 369166
Number of extensions: 1339121
Number of successful extensions: 6925
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 5367
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 6524
length of database: 68,354,980
effective HSP length: 104
effective length of database: 49,142,540
effective search space used: 4078830820
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
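
Note on the statistics above: the bit scores and Expect values in this report can be approximately recomputed from the footer parameters with the standard Karlin-Altschul relations, bits = (Lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-bits). The following is a minimal sketch, assuming the gapped Lambda and K and the "effective search space used" printed above; the function names are illustrative and are not part of BLAST. Because the printed Lambda and K are rounded, the results should only be expected to agree with the report to about one significant figure.

```python
import math

# Values copied from the statistics footer of this report.
GAPPED_LAMBDA = 0.267               # "Gapped Lambda"
GAPPED_K = 0.0410                   # "Gapped K"
EFFECTIVE_SEARCH_SPACE = 4078830820 # "effective search space used"


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Expected number of chance alignments scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


# Raw scores are the numbers in parentheses on the "Score =" lines.
hits = [("CF055_MOUSE", 278), ("VTA1_YEAST", 98), ("PYGO2_HUMAN", 94)]
for name, raw in hits:
    bits = bit_score(raw)
    print(f"{name}: raw={raw} bits={bits:.1f} E={expect_value(bits):.1e}")
```

Comparing the printed values with the "Score" and "Expect" fields of the corresponding alignments is a quick sanity check that the footer parameters and the per-hit statistics belong to the same search.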