Planarian EST Database


Dr_sW_013_N14

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_013_N14
         (635 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P53634|CATC_HUMAN  Dipeptidyl-peptidase I precursor (DPP-...   262   5e-70
sp|Q26563|CATC_SCHMA  Cathepsin C precursor                       262   6e-70
sp|P97821|CATC_MOUSE  Dipeptidyl-peptidase I precursor (DPP-...   254   1e-67
sp|Q60HG6|CATC_MACFA  Dipeptidyl-peptidase I precursor (DPP-...   253   2e-67
sp|O97578|CATC_CANFA  Dipeptidyl-peptidase I precursor (DPP-...   253   4e-67
sp|P80067|CATC_RAT  Dipeptidyl-peptidase I precursor (DPP-I)...   251   1e-66
sp|P25807|CPR1_CAEEL  Gut-specific cysteine proteinase precu...   119   5e-27
sp|Q8IIJ9|CATC_PLAF7  Probable cathepsin C precursor              116   6e-26
sp|P92133|CATB3_GIALA  Cathepsin B-like CP3 precursor (Cathe...   112   6e-25
sp|P92132|CATB2_GIALA  Cathepsin B-like CP2 precursor (Cathe...   112   1e-24
>sp|P53634|CATC_HUMAN Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 463

 Score =  262 bits (670), Expect = 5e-70
 Identities = 117/187 (62%), Positives = 146/187 (78%), Gaps = 1/187 (0%)
 Frame = +3

Query: 9   PILSPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYKGMNGKCSTTKDCKRY 188
           PILSPQ+VV CS Y+QGC+GGFPYLIAGK+A+DFG+ +E+C PY G +  C   +DC RY
Sbjct: 281 PILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKMKEDCFRY 340

Query: 189 FATNYKYIGGYYGATNEPLMRMELVRNGPIAVGFEVYDDFMSYSGGVYHHNFGTRKLTTS 368
           +++ Y Y+GG+YG  NE LM++ELV +GP+AV FEVYDDF+ Y  G+YHH       T  
Sbjct: 341 YSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHH-------TGL 393

Query: 369 KFGFNPFELTNHAVLVVGYG-ETSSGEKFWIVKNSWGNGWGENGYFRIRRGNDECGIESL 545
           +  FNPFELTNHAVL+VGYG +++SG  +WIVKNSWG GWGENGYFRIRRG DEC IES+
Sbjct: 394 RDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIESI 453

Query: 546 GVASEPI 566
            VA+ PI
Sbjct: 454 AVAATPI 460
>sp|Q26563|CATC_SCHMA Cathepsin C precursor
          Length = 454

 Score =  262 bits (669), Expect = 6e-70
 Identities = 120/190 (63%), Positives = 147/190 (77%), Gaps = 2/190 (1%)
 Frame = +3

Query: 6   RPILSPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYKGMN-GKCSTTKDCK 182
           +PILSPQ VV+CSPYS+GC+GGFP+LIAGK+ EDFG+ Q+   PY G + GKC+ +K+C 
Sbjct: 269 QPILSPQTVVDCSPYSEGCNGGFPFLIAGKYGEDFGLPQKIVIPYTGEDTGKCTVSKNCT 328

Query: 183 RYFATNYKYIGGYYGATNEPLMRMELVRNGPIAVGFEVYDDFMSYSGGVYHHNFGTRKLT 362
           RY+ T+Y YIGGYYGATNE LM++EL+ NGP  VGFEVY+DF  Y  G+YHH      + 
Sbjct: 329 RYYTTDYSYIGGYYGATNEKLMQLELISNGPFPVGFEVYEDFQFYKEGIYHHT----TVQ 384

Query: 363 TSKFGFNPFELTNHAVLVVGYG-ETSSGEKFWIVKNSWGNGWGENGYFRIRRGNDECGIE 539
           T  + FNPFELTNHAVL+VGYG +  SGE +W VKNSWG  WGE GYFRI RG DECG+E
Sbjct: 385 TDHYNFNPFELTNHAVLLVGYGVDKLSGEPYWKVKNSWGVEWGEQGYFRILRGTDECGVE 444

Query: 540 SLGVASEPIL 569
           SLGV  +P+L
Sbjct: 445 SLGVRFDPVL 454
>sp|P97821|CATC_MOUSE Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 462

 Score =  254 bits (650), Expect = 1e-67
 Identities = 113/187 (60%), Positives = 146/187 (78%), Gaps = 1/187 (0%)
 Frame = +3

Query: 9   PILSPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYKGMNGKCSTTKDCKRY 188
           PILSPQ+VV CSPY+QGCDGGFPYLIAGK+A+DFG+ +ESC PY   +  C   ++C RY
Sbjct: 280 PILSPQEVVSCSPYAQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDSPCKpreNCLRY 339

Query: 189 FATNYKYIGGYYGATNEPLMRMELVRNGPIAVGFEVYDDFMSYSGGVYHHNFGTRKLTTS 368
           ++++Y Y+GG+YG  NE LM++ELV++GP+AV FEV+DDF+ Y  G+YHH       T  
Sbjct: 340 YSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHH-------TGL 392

Query: 369 KFGFNPFELTNHAVLVVGYG-ETSSGEKFWIVKNSWGNGWGENGYFRIRRGNDECGIESL 545
              FNPFELTNHAVL+VGYG +  +G ++WI+KNSWG+ WGE+GYFRIRRG DEC IES+
Sbjct: 393 SDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRGTDECAIESI 452

Query: 546 GVASEPI 566
            VA+ PI
Sbjct: 453 AVAAIPI 459
>sp|Q60HG6|CATC_MACFA Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 463

 Score =  253 bits (647), Expect = 2e-67
 Identities = 113/187 (60%), Positives = 143/187 (76%), Gaps = 1/187 (0%)
 Frame = +3

Query: 9   PILSPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYKGMNGKCSTTKDCKRY 188
           PILS Q+VV CS Y+QGC+GGFPYL AGK+A+DFG+ +E+C PY G +  C   +DC RY
Sbjct: 281 PILSSQEVVSCSQYAQGCEGGFPYLTAGKYAQDFGLVEEACFPYTGTDSPCKMKEDCFRY 340

Query: 189 FATNYKYIGGYYGATNEPLMRMELVRNGPIAVGFEVYDDFMSYSGGVYHHNFGTRKLTTS 368
           +++ Y Y+GG+YG  NE LM++ELV +GP+AV FEVYDDF+ Y  G+YHH       T  
Sbjct: 341 YSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYHH-------TGL 393

Query: 369 KFGFNPFELTNHAVLVVGYG-ETSSGEKFWIVKNSWGNGWGENGYFRIRRGNDECGIESL 545
           +  FNPFELTNHAVL+VGYG +++SG  +WIVKNSWG  WGE+GYFRIRRG DEC IES+
Sbjct: 394 RDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIESI 453

Query: 546 GVASEPI 566
            VA+ PI
Sbjct: 454 AVAATPI 460
>sp|O97578|CATC_CANFA Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain 1;
           Dipeptidyl-peptidase I heavy chain 2;
           Dipeptidyl-peptidase I heavy chain 3;
           Dipeptidyl-peptidase I heavy chain 4;
           Dipeptidyl-peptidase I light chain]
          Length = 435

 Score =  253 bits (645), Expect = 4e-67
 Identities = 115/187 (61%), Positives = 146/187 (78%), Gaps = 1/187 (0%)
 Frame = +3

Query: 9   PILSPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYKGMNGKCSTTKDCKRY 188
           PILSPQ++V CS Y+QGC+GGFPYLIAGK+A+DFG+ +E+C PY G +  C    DC RY
Sbjct: 254 PILSPQEIVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCK-PNDCFRY 312

Query: 189 FATNYKYIGGYYGATNEPLMRMELVRNGPIAVGFEVYDDFMSYSGGVYHHNFGTRKLTTS 368
           +++ Y Y+GG+YGA NE LM++ELVR+GP+AV FEVYDDF  Y  G+Y+H       T  
Sbjct: 313 YSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYH-------TGL 365

Query: 369 KFGFNPFELTNHAVLVVGYG-ETSSGEKFWIVKNSWGNGWGENGYFRIRRGNDECGIESL 545
           +  FNPFELTNHAVL+VGYG +++SG  +WIVKNSWG+ WGE+GYFRIRRG DEC IES+
Sbjct: 366 RDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESI 425

Query: 546 GVASEPI 566
            VA+ PI
Sbjct: 426 AVAATPI 432
>sp|P80067|CATC_RAT Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 462

 Score =  251 bits (640), Expect = 1e-66
 Identities = 112/187 (59%), Positives = 144/187 (77%), Gaps = 1/187 (0%)
 Frame = +3

Query: 9   PILSPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYKGMNGKCSTTKDCKRY 188
           PILSPQ+VV CSPY+QGCDGGFPYLIAGK+A+DFG+ +E+C PY   +  C   ++C RY
Sbjct: 280 PILSPQEVVSCSPYAQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDAPCKPKENCLRY 339

Query: 189 FATNYKYIGGYYGATNEPLMRMELVRNGPIAVGFEVYDDFMSYSGGVYHHNFGTRKLTTS 368
           +++ Y Y+GG+YG  NE LM++ELV++GP+AV FEV+DDF+ Y  G+YHH       T  
Sbjct: 340 YSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYHH-------TGL 392

Query: 369 KFGFNPFELTNHAVLVVGYG-ETSSGEKFWIVKNSWGNGWGENGYFRIRRGNDECGIESL 545
              FNPFELTNHAVL+VGYG +  +G  +WIVKNSWG+ WGE+GYFRIRRG DEC IES+
Sbjct: 393 SDPFNPFELTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRGTDECAIESI 452

Query: 546 GVASEPI 566
            +A+ PI
Sbjct: 453 AMAAIPI 459
>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase precursor
          Length = 329

 Score =  119 bits (299), Expect = 5e-27
 Identities = 73/207 (35%), Positives = 108/207 (52%), Gaps = 22/207 (10%)
 Frame = +3

Query: 6   RPILSPQDVVEC--SPYSQGCDGGFPYLIAGKFAEDFGMAQ------ESCNPYK---GMN 152
           +PI+SP D++ C  S    GC+GG+P + A ++ +  G+          C PY      +
Sbjct: 135 QPIISPDDLLSCCGSSCGNGCEGGYP-IQALRWWDSKGVVTGGDYHGAGCKPYPIAPCTS 193

Query: 153 GKCSTTK------DCKRYFATNY---KYIG--GYYGATNEPLMRMELVRNGPIAVGFEVY 299
           G C  +K       C+  ++T Y   K+ G   Y    N   ++ E+  NGP+   F VY
Sbjct: 194 GNCPESKTPSCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVY 253

Query: 300 DDFMSYSGGVYHHNFGTRKLTTSKFGFNPFELTNHAVLVVGYGETSSGEKFWIVKNSWGN 479
           +DF  Y  GVY H       T  K+      L  HA+ ++G+G T SG  +W+V NSWG 
Sbjct: 254 EDFYKYKSGVYKH-------TAGKY------LGGHAIKIIGWG-TESGSPYWLVANSWGV 299

Query: 480 GWGENGYFRIRRGNDECGIESLGVASE 560
            WGE+G+F+I RG+D+CGIES  VA +
Sbjct: 300 NWGESGFFKIYRGDDQCGIESAVVAGK 326
>sp|Q8IIJ9|CATC_PLAF7 Probable cathepsin C precursor
          Length = 700

 Score =  116 bits (290), Expect = 6e-26
 Identities = 80/252 (31%), Positives = 117/252 (46%), Gaps = 69/252 (27%)
 Frame = +3

Query: 15   LSPQDVVECSPYSQGCDGGFPYLIAGKFAE----------DFGMAQESCNPYK------G 146
            LS Q V+ CS Y QGC+GGFPYL++ K A+           +   +E+C PY        
Sbjct: 431  LSIQTVLSCSFYDQGCNGGFPYLVS-KLAKLQGIPLNVYFPYSATEETC-PYNISKHPND 488

Query: 147  MNGKCSTTK----------------------------------------DCKRYFATNYK 206
            MNG     +                                        +  R++A ++ 
Sbjct: 489  MNGSAKLREINAIFNSNNNMSTYNNINNDHHQLGVYANTASSQEQHGISEENRWYAKDFN 548

Query: 207  YIGGYYGATN---EPLMRMELVRNGPIAVGFEVYDDFMSYSGGVYH-HNFGTRKLTTSK- 371
            Y+GG YG      E +M  E+ RNGPI   FE   DF  Y+ GVY   +F   +  T + 
Sbjct: 549  YVGGCYGCNQCNGEKIMMNEIYRNGPIVSSFEASPDFYDYADGVYFVEDFPHARRCTIEP 608

Query: 372  -----FGFNPFELTNHAVLVVGYGETS-SGE--KFWIVKNSWGNGWGENGYFRIRRGNDE 527
                 +    ++  NHA++++G+GE   +G+  K+WI +NSWGNGWG+ GYF+I RG + 
Sbjct: 609  KNDGVYNITGWDRVNHAIVLLGWGEEEINGKLYKYWIGRNSWGNGWGKEGYFKILRGQNF 668

Query: 528  CGIESLGVASEP 563
             GIES  +  EP
Sbjct: 669  SGIESQSLFIEP 680
>sp|P92133|CATB3_GIALA Cathepsin B-like CP3 precursor (Cathepsin B-like protease B3)
          Length = 299

 Score =  112 bits (281), Expect = 6e-25
 Identities = 66/178 (37%), Positives = 81/178 (45%), Gaps = 4/178 (2%)
 Frame = +3

Query: 18  SPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYK----GMNGKCSTTKDCKR 185
           SPQ VV C      CDGG+   +  +F    G   + C PY+    G  G C T      
Sbjct: 126 SPQYVVSCDRGDMACDGGWLPSV-WRFLTKTGTTTDECVPYQSGSTGARGTCPTKCADGS 184

Query: 186 YFATNYKYIGGYYGATNEPLMRMELVRNGPIAVGFEVYDDFMSYSGGVYHHNFGTRKLTT 365
                YK         + P +   L   GP+   F VY DFM Y  GVY H +G  +   
Sbjct: 185 DLPHLYKATKAVDYGLDAPAIMKALATGGPLQTAFTVYSDFMYYESGVYQHTYGRVE--- 241

Query: 366 SKFGFNPFELTNHAVLVVGYGETSSGEKFWIVKNSWGNGWGENGYFRIRRGNDECGIE 539
                       HAV +VGYG    G  +WI+KNSWG  WGE+GYFRI R  +ECGIE
Sbjct: 242 ----------GGHAVDMVGYGTDDDGVDYWIIKNSWGPDWGEDGYFRIIRMTNECGIE 289
>sp|P92132|CATB2_GIALA Cathepsin B-like CP2 precursor (Cathepsin B-like protease B2)
          Length = 300

 Score =  112 bits (279), Expect = 1e-24
 Identities = 69/188 (36%), Positives = 89/188 (47%), Gaps = 9/188 (4%)
 Frame = +3

Query: 18  SPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYKG----MNGKCST-----T 170
           SPQ VV C      C+GG+   +  KF    G   + C PYK     + G C T     +
Sbjct: 127 SPQYVVSCDHGDMACNGGWLPNV-WKFLTKTGTTTDECVPYKSGSTTLRGTCPTKCADGS 185

Query: 171 KDCKRYFATNYKYIGGYYGATNEPLMRMELVRNGPIAVGFEVYDDFMSYSGGVYHHNFGT 350
                  AT+YK  G      + P M   L  +GP+ V F V+ DFM Y  GVY H +G 
Sbjct: 186 SKVHLATATSYKDYG-----LDIPAMMKALSTSGPLQVAFLVHSDFMYYESGVYQHTYGY 240

Query: 351 RKLTTSKFGFNPFELTNHAVLVVGYGETSSGEKFWIVKNSWGNGWGENGYFRIRRGNDEC 530
            +               HAV +VGYG    G  +WI+KNSWG  WGE+GYFR+ RG ++C
Sbjct: 241 ME-------------GGHAVEMVGYGTDDDGVDYWIIKNSWGPDWGEDGYFRMIRGINDC 287

Query: 531 GIESLGVA 554
            IE    A
Sbjct: 288 SIEEQAYA 295
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 78,389,434
Number of Sequences: 369166
Number of extensions: 1743682
Number of successful extensions: 5029
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4554
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4718
length of database: 68,354,980
effective HSP length: 106
effective length of database: 48,773,070
effective search space used: 5121172350
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)