Planarian EST Database


Dr_sW_023_I24

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_023_I24
         (817 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q7TMR0|PCP_MOUSE  Lysosomal Pro-X carboxypeptidase precur...   190   5e-48
sp|Q5RBU7|PCP_PONPY  Lysosomal Pro-X carboxypeptidase precur...   181   3e-45
sp|P42785|PCP_HUMAN  Lysosomal Pro-X carboxypeptidase precur...   180   5e-45
sp|P34676|YO26_CAEEL  Putative serine protease Z688.6 precursor   165   1e-40
sp|Q9EPB1|DPP2_RAT  Dipeptidyl-peptidase II precursor (DPP I...   160   3e-39
sp|Q9UHL4|DPP2_HUMAN  Dipeptidyl-peptidase II precursor (DPP...   155   1e-37
sp|Q9ET22|DPP2_MOUSE  Dipeptidyl-peptidase II precursor (DPP...   154   2e-37
sp|P34610|PCP1_CAEEL  Putative serine protease pcp-1 precursor    137   4e-32
sp|Q9NQE7|TSSP_HUMAN  Thymus-specific serine protease precursor    80   9e-15
sp|Q9QXE5|TSSP_MOUSE  Thymus-specific serine protease precursor    75   3e-13
>sp|Q7TMR0|PCP_MOUSE Lysosomal Pro-X carboxypeptidase precursor (Prolylcarboxypeptidase)
           (PRCP) (Proline carboxypeptidase)
          Length = 491

 Score =  190 bits (482), Expect = 5e-48
 Identities = 96/223 (43%), Positives = 137/223 (61%), Gaps = 2/223 (0%)
 Frame = +2

Query: 155 QYETKYFPTYLDHFTYKNDSNKFLMKYLISTKNFVE-GNPILFYCGNEGSIELFANNSGF 331
           +Y   YF   +DHF +  D   F  +YL++ K++   G  ILFY GNEG I  F NN+GF
Sbjct: 45  KYSVLYFEQKVDHFGFA-DMRTFKQRYLVADKHWQRNGGSILFYTGNEGDIVWFCNNTGF 103

Query: 332 VWELGEQLSAIVVFAEHRFYGSTLPFGKKSYDSAQYFGYLNSEQXXXXXXXXXXXXKYNL 511
           +W++ E+L A++VFAEHR+YG +LPFG+ S+  +Q+  +L SEQ            +  +
Sbjct: 104 MWDVAEELKAMLVFAEHRYYGESLPFGQDSFKDSQHLNFLTSEQALADFAELIRHLEKTI 163

Query: 512 PGASHSPVIAFGGSYGGMLAAWFRQKYPNIVAGSLAASAPVLMVGNIENCSYPFQVLTKA 691
           PGA   PVIA GGSYGGMLAAWFR KYP+IV G+LAASAP+  +  +  C    +++T  
Sbjct: 164 PGAQGQPVIAIGGSYGGMLAAWFRMKYPHIVVGALAASAPIWQLDGMVPCGEFMKIVTND 223

Query: 692 YQTKGSDACVSNVRNVWPVIQQMNDS-LHLVNLSRIFHTCQPL 817
           ++ K    C  ++R  W VI +++ S   L +L+ I H C PL
Sbjct: 224 FR-KSGPYCSESIRKSWNVIDKLSGSGSGLQSLTNILHLCSPL 265
>sp|Q5RBU7|PCP_PONPY Lysosomal Pro-X carboxypeptidase precursor (Prolylcarboxypeptidase)
           (PRCP) (Proline carboxypeptidase)
          Length = 496

 Score =  181 bits (458), Expect = 3e-45
 Identities = 92/222 (41%), Positives = 132/222 (59%), Gaps = 2/222 (0%)
 Frame = +2

Query: 158 YETKYFPTYLDHFTYKNDSNKFLMKYLISTKNFVE-GNPILFYCGNEGSIELFANNSGFV 334
           Y   YF   +DHF + N    F  +YL++ K + + G  ILFY GNEG I  F NN+GF+
Sbjct: 48  YSVLYFQQKVDHFGF-NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGFM 106

Query: 335 WELGEQLSAIVVFAEHRFYGSTLPFGKKSYDSAQYFGYLNSEQXXXXXXXXXXXXKYNLP 514
           W++ E+L A++VFAEHR+YG +LPFG  ++  +++  +L SEQ            K  +P
Sbjct: 107 WDVAEELKAMLVFAEHRYYGESLPFGDNTFKDSRHLNFLTSEQALADFAELIKHLKRTIP 166

Query: 515 GASHSPVIAFGGSYGGMLAAWFRQKYPNIVAGSLAASAPVLMVGNIENCSYPFQVLTKAY 694
           GA + PVIA GGSYGGMLAAWFR KYP++V G+LAASAP+    ++  C    +++T  +
Sbjct: 167 GAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDF 226

Query: 695 QTKGSDACVSNVRNVWPVIQQM-NDSLHLVNLSRIFHTCQPL 817
           +  G   C  ++R  W  I ++ N    L  L+   H C PL
Sbjct: 227 RKSGPH-CSESIRRSWDAINRLSNTGSGLQWLTGALHLCSPL 267
>sp|P42785|PCP_HUMAN Lysosomal Pro-X carboxypeptidase precursor (Prolylcarboxypeptidase)
           (PRCP) (Proline carboxypeptidase) (Angiotensinase C)
           (Lysosomal carboxypeptidase C)
          Length = 496

 Score =  180 bits (456), Expect = 5e-45
 Identities = 92/222 (41%), Positives = 131/222 (59%), Gaps = 2/222 (0%)
 Frame = +2

Query: 158 YETKYFPTYLDHFTYKNDSNKFLMKYLISTKNFVE-GNPILFYCGNEGSIELFANNSGFV 334
           Y   YF   +DHF + N    F  +YL++ K + + G  ILFY GNEG I  F NN+GF+
Sbjct: 48  YSVLYFQQKVDHFGF-NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGFM 106

Query: 335 WELGEQLSAIVVFAEHRFYGSTLPFGKKSYDSAQYFGYLNSEQXXXXXXXXXXXXKYNLP 514
           W++ E+L A++VFAEHR+YG +LPFG  S+  +++  +L SEQ            K  +P
Sbjct: 107 WDVAEELKAMLVFAEHRYYGESLPFGDNSFKDSRHLNFLTSEQALADFAELIKHLKRTIP 166

Query: 515 GASHSPVIAFGGSYGGMLAAWFRQKYPNIVAGSLAASAPVLMVGNIENCSYPFQVLTKAY 694
           GA + PVIA GGSYGGMLAAWFR KYP++V G+LAASAP+    ++  C    +++T  +
Sbjct: 167 GAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDF 226

Query: 695 QTKGSDACVSNVRNVWPVIQQM-NDSLHLVNLSRIFHTCQPL 817
           +  G   C  ++   W  I ++ N    L  L+   H C PL
Sbjct: 227 RKSGPH-CSESIHRSWDAINRLSNTGSGLQWLTGALHLCSPL 267
>sp|P34676|YO26_CAEEL Putative serine protease Z688.6 precursor
          Length = 507

 Score =  165 bits (418), Expect = 1e-40
 Identities = 87/189 (46%), Positives = 119/189 (62%), Gaps = 3/189 (1%)
 Frame = +2

Query: 149 EFQYETKYFPTYLDHFTYKNDSNKFLMKYLISTKNFVEGNPILFYCGNEGSIELFANNSG 328
           +++YE  Y    +D F + ND  +F ++Y ++  ++  G PILFY GNEGS+E FA N+G
Sbjct: 38  KYKYEEGYLKAPIDPFAFTNDL-EFDLRYFLNIDHYETGGPILFYTGNEGSLEAFAENTG 96

Query: 329 FVWELGEQLSAIVVFAEHRFYGSTLPFGKKSYDSAQYFGYLNSEQXXXXXXXXXXXXK-Y 505
           F+W+L  +L A VVF EHRFYG + PF  +SY   ++ GYL+S+Q            K  
Sbjct: 97  FMWDLAPELKAAVVFVEHRFYGKSQPFKNESYTDIRHLGYLSSQQALADFALSVQFFKNE 156

Query: 506 NLPGASHSPVIAFGGSYGGMLAAWFRQKYPNIVAGSLAASAPVLMV--GNIENCSYPFQV 679
            + GA  S VIAFGGSYGGML+AWFR KYP+IV G++AASAPV      NI    Y F +
Sbjct: 157 KIKGAQKSAVIAFGGSYGGMLSAWFRIKYPHIVDGAIAASAPVFWFTDSNIPEDVYDF-I 215

Query: 680 LTKAYQTKG 706
           +T+A+   G
Sbjct: 216 VTRAFLDAG 224
>sp|Q9EPB1|DPP2_RAT Dipeptidyl-peptidase II precursor (DPP II) (Dipeptidyl
           aminopeptidase II) (Quiescent cell proline dipeptidase)
           (Dipeptidyl peptidase 7)
          Length = 500

 Score =  160 bits (406), Expect = 3e-39
 Identities = 97/228 (42%), Positives = 133/228 (58%), Gaps = 2/228 (0%)
 Frame = +2

Query: 140 SNLEFQYETKYFPTYLDHFTYKNDSNK-FLMKYLISTKNFVEGN-PILFYCGNEGSIELF 313
           S L+  +   YF  Y+DHF +++ SNK F  ++L+S K +  G  PI FY GNEG I   
Sbjct: 35  SVLDPDFRENYFEQYMDHFNFESFSNKTFGQRFLVSDKFWKMGEGPIFFYTGNEGDIWSL 94

Query: 314 ANNSGFVWELGEQLSAIVVFAEHRFYGSTLPFGKKSYDSAQYFGYLNSEQXXXXXXXXXX 493
           ANNSGF+ EL  Q  A++VFAEHR+YG +LPFG +S     Y   L  EQ          
Sbjct: 95  ANNSGFIVELAAQQEALLVFAEHRYYGKSLPFGVQSTQRG-YTQLLTVEQALADFAVLLQ 153

Query: 494 XXKYNLPGASHSPVIAFGGSYGGMLAAWFRQKYPNIVAGSLAASAPVLMVGNIENCSYPF 673
             ++NL G   +P IAFGGSYGGML+A+ R KYP++VAG+LAASAPV+ V  + N    F
Sbjct: 154 ALRHNL-GVQDAPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVIAVAGLGNPDQFF 212

Query: 674 QVLTKAYQTKGSDACVSNVRNVWPVIQQMNDSLHLVNLSRIFHTCQPL 817
           + +T  +  + S  C   VR+ +  I+ +        +S+ F TCQ L
Sbjct: 213 RDVTADFYGQ-SPKCAQAVRDAFQQIKDLFLQGAYDTISQNFGTCQSL 259
>sp|Q9UHL4|DPP2_HUMAN Dipeptidyl-peptidase II precursor (DPP II) (Dipeptidyl
           aminopeptidase II) (Quiescent cell proline dipeptidase)
           (Dipeptidyl peptidase 7)
          Length = 492

 Score =  155 bits (393), Expect = 1e-37
 Identities = 92/222 (41%), Positives = 131/222 (59%), Gaps = 2/222 (0%)
 Frame = +2

Query: 158 YETKYFPTYLDHFTYKNDSNK-FLMKYLISTKNFVEGN-PILFYCGNEGSIELFANNSGF 331
           ++ ++F   LDHF ++   NK F  ++L+S + +V G  PI FY GNEG +  FANNSGF
Sbjct: 31  FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWVRGEGPIFFYTGNEGDVWAFANNSGF 90

Query: 332 VWELGEQLSAIVVFAEHRFYGSTLPFGKKSYDSAQYFGYLNSEQXXXXXXXXXXXXKYNL 511
           V EL  +  A++VFAEHR+YG +LPFG +S     +   L  EQ            + +L
Sbjct: 91  VAELAAERGALLVFAEHRYYGKSLPFGAQSTQRG-HTELLTVEQALADFAELLRALRRDL 149

Query: 512 PGASHSPVIAFGGSYGGMLAAWFRQKYPNIVAGSLAASAPVLMVGNIENCSYPFQVLTKA 691
            GA  +P IAFGGSYGGML+A+ R KYP++VAG+LAASAPVL V  + + +  F+ +T  
Sbjct: 150 -GAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDVTAD 208

Query: 692 YQTKGSDACVSNVRNVWPVIQQMNDSLHLVNLSRIFHTCQPL 817
           ++ + S  C   VR  +  I+ +        +   F TCQPL
Sbjct: 209 FEGQ-SPKCTQGVREAFRQIKDLFLQGAYDTVRWEFGTCQPL 249
>sp|Q9ET22|DPP2_MOUSE Dipeptidyl-peptidase II precursor (DPP II) (Dipeptidyl
           aminopeptidase II) (Quiescent cell proline dipeptidase)
           (Dipeptidyl peptidase 7)
          Length = 506

 Score =  154 bits (390), Expect = 2e-37
 Identities = 94/226 (41%), Positives = 131/226 (57%), Gaps = 2/226 (0%)
 Frame = +2

Query: 146 LEFQYETKYFPTYLDHFTYKNDSNK-FLMKYLISTKNFVEGN-PILFYCGNEGSIELFAN 319
           L+  +   YF  Y+DHF +++  NK F  ++L+S K +  G  PI FY GNEG I  FAN
Sbjct: 37  LDPDFHENYFEQYMDHFNFESFGNKTFGQRFLVSDKFWKMGEGPIFFYTGNEGDIWSFAN 96

Query: 320 NSGFVWELGEQLSAIVVFAEHRFYGSTLPFGKKSYDSAQYFGYLNSEQXXXXXXXXXXXX 499
           NSGF+ EL  Q  A++VFAEHR+YG +LPFG +S     Y   L  EQ            
Sbjct: 97  NSGFMVELAAQQEALLVFAEHRYYGKSLPFGVQSTQRG-YTQLLTVEQALADFAVLLQAL 155

Query: 500 KYNLPGASHSPVIAFGGSYGGMLAAWFRQKYPNIVAGSLAASAPVLMVGNIENCSYPFQV 679
           + +L G   +P IAFGGSYGGML+A+ R KYP++VAG+LAASAPV+ V  + +    F+ 
Sbjct: 156 RQDL-GVHDAPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVAVAGLGDSYQFFRD 214

Query: 680 LTKAYQTKGSDACVSNVRNVWPVIQQMNDSLHLVNLSRIFHTCQPL 817
           +T  +  + S  C   VR+ +  I+ +        +S+ F TCQ L
Sbjct: 215 VTADFYGQ-SPKCAQAVRDAFQQIKDLFLQGAYDTISQNFGTCQSL 259
>sp|P34610|PCP1_CAEEL Putative serine protease pcp-1 precursor
          Length = 565

 Score =  137 bits (345), Expect = 4e-32
 Identities = 75/193 (38%), Positives = 111/193 (57%), Gaps = 7/193 (3%)
 Frame = +2

Query: 185 LDHFTYKNDSNKFLMKYLISTKNFVEGNPILFYCGNEGSIELFANNSGFVWELGEQLSAI 364
           LDHFT+  D+  F M+ + +   +  G PI FY GNEG +E F   +G +++L    +A 
Sbjct: 51  LDHFTW-GDTRTFDMRVMWNNTFYKPGGPIFFYTGNEGGLESFVTATGMMFDLAPMFNAS 109

Query: 365 VVFAEHRFYGSTLPFGKKSYDSAQYFGYLNSEQXXXXXXXXXXXXK-----YNLPGASHS 529
           ++FAEHRFYG T PFG +SY S    GYL SEQ            K     + +   + +
Sbjct: 110 IIFAEHRFYGQTQPFGNQSYASLANVGYLTSEQALADYAELLTELKRDNNQFKMTFPAAT 169

Query: 530 PVIAFGGSYGGMLAAWFRQKYPNIVAGSLAASAPVLMV--GNIENCSYPFQVLTKAYQTK 703
            VI+FGGSYGGML+AWFRQKYP+IV G+ A SAP++ +  G ++  ++   + ++ Y   
Sbjct: 170 QVISFGGSYGGMLSAWFRQKYPHIVKGAWAGSAPLIYMNGGGVDPGAFD-HITSRTYIDN 228

Query: 704 GSDACVSNVRNVW 742
           G +  +  + N W
Sbjct: 229 GCNRFI--LANAW 239
>sp|Q9NQE7|TSSP_HUMAN Thymus-specific serine protease precursor
          Length = 514

 Score = 79.7 bits (195), Expect = 9e-15
 Identities = 54/154 (35%), Positives = 79/154 (51%), Gaps = 1/154 (0%)
 Frame = +2

Query: 185 LDHFTYKNDSNKFLMKYLISTKNFV-EGNPILFYCGNEGSIELFANNSGFVWELGEQLSA 361
           LD F   +D   FL +Y ++ +++V +  PI  + G EGS+   +   G    L     A
Sbjct: 66  LDPFNV-SDRRSFLQRYWVNDQHWVGQDGPIFLHLGGEGSLGPGSVMRGHPAALAPAWGA 124

Query: 362 IVVFAEHRFYGSTLPFGKKSYDSAQYFGYLNSEQXXXXXXXXXXXXKYNLPGASHSPVIA 541
           +V+  EHRFYG ++P G    + AQ   +L+S                    +S SP I 
Sbjct: 125 LVISLEHRFYGLSIPAG--GLEMAQ-LRFLSSRLALADVVSARLALSRLFNISSSSPWIC 181

Query: 542 FGGSYGGMLAAWFRQKYPNIVAGSLAASAPVLMV 643
           FGGSY G LAAW R K+P+++  S+A+SAPV  V
Sbjct: 182 FGGSYAGSLAAWARLKFPHLIFASVASSAPVRAV 215
>sp|Q9QXE5|TSSP_MOUSE Thymus-specific serine protease precursor
          Length = 509

 Score = 74.7 bits (182), Expect = 3e-13
 Identities = 52/154 (33%), Positives = 77/154 (50%), Gaps = 1/154 (0%)
 Frame = +2

Query: 185 LDHFTYKNDSNKFLMKYLISTKNFV-EGNPILFYCGNEGSIELFANNSGFVWELGEQLSA 361
           LD F   +D   FL +Y ++ ++   +  P+  + G EGS+   +  +G    L     A
Sbjct: 65  LDPFN-ASDRRTFLQRYWVNDQHRTGQDVPVFLHIGGEGSLGPGSVMAGHPAALAPAWGA 123

Query: 362 IVVFAEHRFYGSTLPFGKKSYDSAQYFGYLNSEQXXXXXXXXXXXXKYNLPGASHSPVIA 541
           +V+  EHRFYG ++P G    D A    YL+S                 L  +S SP I 
Sbjct: 124 LVISLEHRFYGLSMPAG--GLDLA-LLRYLSSRHALADVASARQALSGLLNVSSSSPWIC 180

Query: 542 FGGSYGGMLAAWFRQKYPNIVAGSLAASAPVLMV 643
           FGGSY G LA W R K+P++V  ++A+SAP+  V
Sbjct: 181 FGGSYAGSLATWARLKFPHLVFAAVASSAPLSAV 214
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 87,783,416
Number of Sequences: 369166
Number of extensions: 1726794
Number of successful extensions: 4377
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4247
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4363
length of database: 68,354,980
effective HSP length: 109
effective length of database: 48,218,865
effective search space used: 7811456130
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)