Planarian EST Database


Dr_sW_022_C06

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_022_C06
         (698 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P53634|CATC_HUMAN  Dipeptidyl-peptidase I precursor (DPP-...   265   7e-71
sp|Q26563|CATC_SCHMA  Cathepsin C precursor                       261   2e-69
sp|P97821|CATC_MOUSE  Dipeptidyl-peptidase I precursor (DPP-...   260   3e-69
sp|P80067|CATC_RAT  Dipeptidyl-peptidase I precursor (DPP-I)...   257   2e-68
sp|Q60HG6|CATC_MACFA  Dipeptidyl-peptidase I precursor (DPP-...   257   2e-68
sp|O97578|CATC_CANFA  Dipeptidyl-peptidase I precursor (DPP-...   256   3e-68
sp|P25807|CPR1_CAEEL  Gut-specific cysteine proteinase precu...   117   3e-26
sp|P80884|ANAN_ANACO  Ananain precursor                           110   3e-24
sp|P43508|CPR4_CAEEL  Cathepsin B-like cysteine proteinase 4...   110   4e-24
sp|P92133|CATB3_GIALA  Cathepsin B-like CP3 precursor (Cathe...   110   4e-24
>sp|P53634|CATC_HUMAN Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 463

 Score =  265 bits (678), Expect = 7e-71
 Identities = 120/195 (61%), Positives = 151/195 (77%), Gaps = 1/195 (0%)
 Frame = +1

Query: 1   VLEARYRIRSNNTVRPILSPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYK 180
           +LEAR RI +NN+  PILSPQ+VV CS Y+QGC+GGFPYLIAGK+A+DFG+ +E+C PY 
Sbjct: 266 MLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYT 325

Query: 181 GMNGKCSTTKDCKRYFATNYKYIGGYYGATNEPLMRMELVKNGPIAVGFEVYDDFMSYSG 360
           G +  C   +DC RY+++ Y Y+GG+YG  NE LM++ELV +GP+AV FEVYDDF+ Y  
Sbjct: 326 GTDSPCKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKK 385

Query: 361 GVYHHNFGTRKLTTSKFGFNPFELTNHAVLVVGYG-ETSSGEKFWIVKNSWGNGWRENGY 537
           G+YHH       T  +  FNPFELTNHAVL+VGYG +++SG  +WIVKNSWG GW ENGY
Sbjct: 386 GIYHH-------TGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGY 438

Query: 538 FRIRRGNDECGIESL 582
           FRIRRG DEC IES+
Sbjct: 439 FRIRRGTDECAIESI 453
>sp|Q26563|CATC_SCHMA Cathepsin C precursor
          Length = 454

 Score =  261 bits (666), Expect = 2e-69
 Identities = 122/195 (62%), Positives = 149/195 (76%), Gaps = 2/195 (1%)
 Frame = +1

Query: 4   LEARYRIRSNNTVRPILSPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYKG 183
           LEAR R+ SN + +PILSPQ VV+CSPYS+GC+GGFP+LIAGK+ EDFG+ Q+   PY G
Sbjct: 256 LEARIRLVSNFSEQPILSPQTVVDCSPYSEGCNGGFPFLIAGKYGEDFGLPQKIVIPYTG 315

Query: 184 MN-GKCSTTKDCKRYFATNYKYIGGYYGATNEPLMRMELVKNGPIAVGFEVYDDFMSYSG 360
            + GKC+ +K+C RY+ T+Y YIGGYYGATNE LM++EL+ NGP  VGFEVY+DF  Y  
Sbjct: 316 EDTGKCTVSKNCTRYYTTDYSYIGGYYGATNEKLMQLELISNGPFPVGFEVYEDFQFYKE 375

Query: 361 GVYHHNFGTRKLTTSKFGFNPFELTNHAVLVVGYG-ETSSGEKFWIVKNSWGNGWRENGY 537
           G+YHH      + T  + FNPFELTNHAVL+VGYG +  SGE +W VKNSWG  W E GY
Sbjct: 376 GIYHHT----TVQTDHYNFNPFELTNHAVLLVGYGVDKLSGEPYWKVKNSWGVEWGEQGY 431

Query: 538 FRIRRGNDECGIESL 582
           FRI RG DECG+ESL
Sbjct: 432 FRILRGTDECGVESL 446
>sp|P97821|CATC_MOUSE Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 462

 Score =  260 bits (664), Expect = 3e-69
 Identities = 117/195 (60%), Positives = 151/195 (77%), Gaps = 1/195 (0%)
 Frame = +1

Query: 1   VLEARYRIRSNNTVRPILSPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYK 180
           +LEAR RI +NN+  PILSPQ+VV CSPY+QGCDGGFPYLIAGK+A+DFG+ +ESC PY 
Sbjct: 265 MLEARIRILTNNSQTPILSPQEVVSCSPYAQGCDGGFPYLIAGKYAQDFGVVEESCFPYT 324

Query: 181 GMNGKCSTTKDCKRYFATNYKYIGGYYGATNEPLMRMELVKNGPIAVGFEVYDDFMSYSG 360
             +  C   ++C RY++++Y Y+GG+YG  NE LM++ELVK+GP+AV FEV+DDF+ Y  
Sbjct: 325 AKDSPCKPRENCLRYYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHS 384

Query: 361 GVYHHNFGTRKLTTSKFGFNPFELTNHAVLVVGYG-ETSSGEKFWIVKNSWGNGWRENGY 537
           G+YHH       T     FNPFELTNHAVL+VGYG +  +G ++WI+KNSWG+ W E+GY
Sbjct: 385 GIYHH-------TGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGY 437

Query: 538 FRIRRGNDECGIESL 582
           FRIRRG DEC IES+
Sbjct: 438 FRIRRGTDECAIESI 452
>sp|P80067|CATC_RAT Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 462

 Score =  257 bits (657), Expect = 2e-68
 Identities = 117/195 (60%), Positives = 149/195 (76%), Gaps = 1/195 (0%)
 Frame = +1

Query: 1   VLEARYRIRSNNTVRPILSPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYK 180
           +LEAR RI +NN+  PILSPQ+VV CSPY+QGCDGGFPYLIAGK+A+DFG+ +E+C PY 
Sbjct: 265 MLEARIRILTNNSQTPILSPQEVVSCSPYAQGCDGGFPYLIAGKYAQDFGVVEENCFPYT 324

Query: 181 GMNGKCSTTKDCKRYFATNYKYIGGYYGATNEPLMRMELVKNGPIAVGFEVYDDFMSYSG 360
             +  C   ++C RY+++ Y Y+GG+YG  NE LM++ELVK+GP+AV FEV+DDF+ Y  
Sbjct: 325 ATDAPCKPKENCLRYYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHS 384

Query: 361 GVYHHNFGTRKLTTSKFGFNPFELTNHAVLVVGYG-ETSSGEKFWIVKNSWGNGWRENGY 537
           G+YHH       T     FNPFELTNHAVL+VGYG +  +G  +WIVKNSWG+ W E+GY
Sbjct: 385 GIYHH-------TGLSDPFNPFELTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGY 437

Query: 538 FRIRRGNDECGIESL 582
           FRIRRG DEC IES+
Sbjct: 438 FRIRRGTDECAIESI 452
>sp|Q60HG6|CATC_MACFA Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 463

 Score =  257 bits (656), Expect = 2e-68
 Identities = 116/195 (59%), Positives = 148/195 (75%), Gaps = 1/195 (0%)
 Frame = +1

Query: 1   VLEARYRIRSNNTVRPILSPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYK 180
           +LEAR RI +NN+  PILS Q+VV CS Y+QGC+GGFPYL AGK+A+DFG+ +E+C PY 
Sbjct: 266 MLEARIRILTNNSQTPILSSQEVVSCSQYAQGCEGGFPYLTAGKYAQDFGLVEEACFPYT 325

Query: 181 GMNGKCSTTKDCKRYFATNYKYIGGYYGATNEPLMRMELVKNGPIAVGFEVYDDFMSYSG 360
           G +  C   +DC RY+++ Y Y+GG+YG  NE LM++ELV +GP+AV FEVYDDF+ Y  
Sbjct: 326 GTDSPCKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDDFLHYQN 385

Query: 361 GVYHHNFGTRKLTTSKFGFNPFELTNHAVLVVGYG-ETSSGEKFWIVKNSWGNGWRENGY 537
           G+YHH       T  +  FNPFELTNHAVL+VGYG +++SG  +WIVKNSWG  W E+GY
Sbjct: 386 GIYHH-------TGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGY 438

Query: 538 FRIRRGNDECGIESL 582
           FRIRRG DEC IES+
Sbjct: 439 FRIRRGTDECAIESI 453
>sp|O97578|CATC_CANFA Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain 1;
           Dipeptidyl-peptidase I heavy chain 2;
           Dipeptidyl-peptidase I heavy chain 3;
           Dipeptidyl-peptidase I heavy chain 4;
           Dipeptidyl-peptidase I light chain]
          Length = 435

 Score =  256 bits (655), Expect = 3e-68
 Identities = 118/195 (60%), Positives = 151/195 (77%), Gaps = 1/195 (0%)
 Frame = +1

Query: 1   VLEARYRIRSNNTVRPILSPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYK 180
           +LEAR RI +NNT  PILSPQ++V CS Y+QGC+GGFPYLIAGK+A+DFG+ +E+C PY 
Sbjct: 239 MLEARIRILTNNTQTPILSPQEIVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYA 298

Query: 181 GMNGKCSTTKDCKRYFATNYKYIGGYYGATNEPLMRMELVKNGPIAVGFEVYDDFMSYSG 360
           G +  C    DC RY+++ Y Y+GG+YGA NE LM++ELV++GP+AV FEVYDDF  Y  
Sbjct: 299 GSDSPCK-PNDCFRYYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQK 357

Query: 361 GVYHHNFGTRKLTTSKFGFNPFELTNHAVLVVGYG-ETSSGEKFWIVKNSWGNGWRENGY 537
           G+Y+H       T  +  FNPFELTNHAVL+VGYG +++SG  +WIVKNSWG+ W E+GY
Sbjct: 358 GIYYH-------TGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGY 410

Query: 538 FRIRRGNDECGIESL 582
           FRIRRG DEC IES+
Sbjct: 411 FRIRRGTDECAIESI 425
>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase precursor
          Length = 329

 Score =  117 bits (293), Expect = 3e-26
 Identities = 73/217 (33%), Positives = 110/217 (50%), Gaps = 22/217 (10%)
 Frame = +1

Query: 1   VLEARYRIRSNNTVRPILSPQDVVEC--SPYSQGCDGGFPYLIAGKFAEDFGMAQ----- 159
           ++  R  I +    +PI+SP D++ C  S    GC+GG+P + A ++ +  G+       
Sbjct: 121 MISDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGGYP-IQALRWWDSKGVVTGGDYH 179

Query: 160 -ESCNPYK---GMNGKCSTTK------DCKRYFATNY---KYIG--GYYGATNEPLMRME 294
              C PY      +G C  +K       C+  ++T Y   K+ G   Y    N   ++ E
Sbjct: 180 GAGCKPYPIAPCTSGNCPESKTPSCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAE 239

Query: 295 LVKNGPIAVGFEVYDDFMSYSGGVYHHNFGTRKLTTSKFGFNPFELTNHAVLVVGYGETS 474
           +  NGP+   F VY+DF  Y  GVY H       T  K+      L  HA+ ++G+G T 
Sbjct: 240 IYANGPVEAAFSVYEDFYKYKSGVYKH-------TAGKY------LGGHAIKIIGWG-TE 285

Query: 475 SGEKFWIVKNSWGNGWRENGYFRIRRGNDECGIESLV 585
           SG  +W+V NSWG  W E+G+F+I RG+D+CGIES V
Sbjct: 286 SGSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESAV 322
>sp|P80884|ANAN_ANACO Ananain precursor
          Length = 345

 Score =  110 bits (276), Expect = 3e-24
 Identities = 65/183 (35%), Positives = 89/183 (48%)
 Frame = +1

Query: 4   LEARYRIRSNNTVRPILSPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYKG 183
           +E+ Y+I+  N V   LS Q V++C+  S GC GG+          + G+A  +  PYK 
Sbjct: 156 VESIYKIKRGNLVS--LSEQQVLDCA-VSYGCKGGWINKAYSFIISNKGVASAAIYPYKA 212

Query: 184 MNGKCSTTKDCKRYFATNYKYIGGYYGATNEPLMRMELVKNGPIAVGFEVYDDFMSYSGG 363
             G C T       + T Y Y+       N     M  V N PIA   +   +F  Y  G
Sbjct: 213 AKGTCKTNGVPNSAYITRYTYV-----QRNNERNMMYAVSNQPIAAALDASGNFQHYKRG 267

Query: 364 VYHHNFGTRKLTTSKFGFNPFELTNHAVLVVGYGETSSGEKFWIVKNSWGNGWRENGYFR 543
           V+    GTR               NHA++++GYG+ SSG+KFWIV+NSWG GW E GY R
Sbjct: 268 VFTGPCGTR--------------LNHAIVIIGYGQDSSGKKFWIVRNSWGAGWGEGGYIR 313

Query: 544 IRR 552
           + R
Sbjct: 314 LAR 316
>sp|P43508|CPR4_CAEEL Cathepsin B-like cysteine proteinase 4 precursor (Cysteine
           protease-related 4)
          Length = 335

 Score =  110 bits (275), Expect = 4e-24
 Identities = 73/222 (32%), Positives = 101/222 (45%), Gaps = 31/222 (13%)
 Frame = +1

Query: 13  RYRIRSNNTVRPILSPQDVVEC-SPYSQGCDGGFP-----YLIAGKFAEDFGM-AQESCN 171
           R+ I SN  V  +LS +DV+ C S    GC+GG+P     YL+   F       AQ  C 
Sbjct: 121 RFCIASNGAVNTLLSAEDVLSCCSNCGYGCEGGYPINAWKYLVKSGFCTGGSYEAQFGCK 180

Query: 172 PYK-------------------GMNGKCSTTKDCKRYFATNY---KYIGGYYGATNEPLM 285
           PY                    G +      K   + +   Y   K+ G    A  + + 
Sbjct: 181 PYSLAPCGETVGNVTWPSCPDDGYDTPACVNKCTNKNYNVAYTADKHFGSTAYAVGKKVS 240

Query: 286 RM--ELVKNGPIAVGFEVYDDFMSYSGGVYHHNFGTRKLTTSKFGFNPFELTNHAVLVVG 459
           ++  E++ +GP+   F VY+DF  Y  GVY H  G              EL  HA+ ++G
Sbjct: 241 QIQAEIIAHGPVEAAFTVYEDFYQYKTGVYVHTTGQ-------------ELGGHAIRILG 287

Query: 460 YGETSSGEKFWIVKNSWGNGWRENGYFRIRRGNDECGIESLV 585
           +G T +G  +W+V NSW   W ENGYFRI RG +ECGIE  V
Sbjct: 288 WG-TDNGTPYWLVANSWNVNWGENGYFRIIRGTNECGIEHAV 328
>sp|P92133|CATB3_GIALA Cathepsin B-like CP3 precursor (Cathepsin B-like protease B3)
          Length = 299

 Score =  110 bits (275), Expect = 4e-24
 Identities = 66/181 (36%), Positives = 81/181 (44%), Gaps = 4/181 (2%)
 Frame = +1

Query: 55  SPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYK----GMNGKCSTTKDCKR 222
           SPQ VV C      CDGG+   +  +F    G   + C PY+    G  G C T      
Sbjct: 126 SPQYVVSCDRGDMACDGGWLPSV-WRFLTKTGTTTDECVPYQSGSTGARGTCPTKCADGS 184

Query: 223 YFATNYKYIGGYYGATNEPLMRMELVKNGPIAVGFEVYDDFMSYSGGVYHHNFGTRKLTT 402
                YK         + P +   L   GP+   F VY DFM Y  GVY H +G  +   
Sbjct: 185 DLPHLYKATKAVDYGLDAPAIMKALATGGPLQTAFTVYSDFMYYESGVYQHTYGRVE--- 241

Query: 403 SKFGFNPFELTNHAVLVVGYGETSSGEKFWIVKNSWGNGWRENGYFRIRRGNDECGIESL 582
                       HAV +VGYG    G  +WI+KNSWG  W E+GYFRI R  +ECGIE  
Sbjct: 242 ----------GGHAVDMVGYGTDDDGVDYWIIKNSWGPDWGEDGYFRIIRMTNECGIEEQ 291

Query: 583 V 585
           V
Sbjct: 292 V 292
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 85,148,856
Number of Sequences: 369166
Number of extensions: 1852293
Number of successful extensions: 5177
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4699
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4872
length of database: 68,354,980
effective HSP length: 107
effective length of database: 48,588,335
effective search space used: 6073541875
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)