Planarian EST Database


Dr_sW_019_I24

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_019_I24
         (498 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q26563|CATC_SCHMA  Cathepsin C precursor                       195   4e-50
sp|P53634|CATC_HUMAN  Dipeptidyl-peptidase I precursor (DPP-...   189   2e-48
sp|Q60HG6|CATC_MACFA  Dipeptidyl-peptidase I precursor (DPP-...   186   3e-47
sp|O97578|CATC_CANFA  Dipeptidyl-peptidase I precursor (DPP-...   179   2e-45
sp|P97821|CATC_MOUSE  Dipeptidyl-peptidase I precursor (DPP-...   178   7e-45
sp|P80067|CATC_RAT  Dipeptidyl-peptidase I precursor (DPP-I)...   175   5e-44
sp|Q8IIJ9|CATC_PLAF7  Probable cathepsin C precursor              102   5e-22
sp|P25807|CPR1_CAEEL  Gut-specific cysteine proteinase precu...   101   9e-22
sp|P07688|CATB_BOVIN  Cathepsin B precursor [Contains: Cathe...    98   1e-20
sp|P43509|CPR5_CAEEL  Cathepsin B-like cysteine proteinase 5...    97   2e-20
>sp|Q26563|CATC_SCHMA Cathepsin C precursor
          Length = 454

 Score =  195 bits (496), Expect = 4e-50
 Identities = 88/140 (62%), Positives = 106/140 (75%), Gaps = 1/140 (0%)
 Frame = +1

Query: 16  GKCSTTKDCKRYFATNYKYIGGYYGATNEPLMRMELVKNGPIAVGFEVYDDFMSYSGGVY 195
           GKC+ +K+C RY+ T+Y YIGGYYGATNE LM++EL+ NGP  VGFEVY+DF  Y  G+Y
Sbjct: 319 GKCTVSKNCTRYYTTDYSYIGGYYGATNEKLMQLELISNGPFPVGFEVYEDFQFYKEGIY 378

Query: 196 HHNFGTRKLTTSKFGFNPFELTNHAVLVVGYG-ETSSGEKFWIVKNSWGNGWGENGYFRI 372
           HH      + T  + FNPFELTNHAVL+VGYG +  SGE +W VKNSWG  WGE GYFRI
Sbjct: 379 HHT----TVQTDHYNFNPFELTNHAVLLVGYGVDKLSGEPYWKVKNSWGVEWGEQGYFRI 434

Query: 373 RRGNDECGIESLGVASEPIL 432
            RG DECG+ESLGV  +P+L
Sbjct: 435 LRGTDECGVESLGVRFDPVL 454
>sp|P53634|CATC_HUMAN Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 463

 Score =  189 bits (481), Expect = 2e-48
 Identities = 86/144 (59%), Positives = 107/144 (74%), Gaps = 1/144 (0%)
 Frame = +1

Query: 1   YKGMNGKCSTTKDCKRYFATNYKYIGGYYGATNEPLMRMELVKNGPIAVGFEVYDDFMSY 180
           Y G +  C   +DC RY+++ Y Y+GG+YG  NE LM++ELV +GP+AV FEVYDDF+ Y
Sbjct: 324 YTGTDSPCKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHY 383

Query: 181 SGGVYHHNFGTRKLTTSKFGFNPFELTNHAVLVVGYG-ETSSGEKFWIVKNSWGNGWGEN 357
             G+YHH       T  +  FNPFELTNHAVL+VGYG +++SG  +WIVKNSWG GWGEN
Sbjct: 384 KKGIYHH-------TGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEN 436

Query: 358 GYFRIRRGNDECGIESLGVASEPI 429
           GYFRIRRG DEC IES+ VA+ PI
Sbjct: 437 GYFRIRRGTDECAIESIAVAATPI 460
>sp|Q60HG6|CATC_MACFA Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 463

 Score =  186 bits (472), Expect = 3e-47
 Identities = 84/144 (58%), Positives = 106/144 (73%), Gaps = 1/144 (0%)
 Frame = +1

Query: 1   YKGMNGKCSTTKDCKRYFATNYKYIGGYYGATNEPLMRMELVKNGPIAVGFEVYDDFMSY 180
           Y G +  C   +DC RY+++ Y Y+GG+YG  NE LM++ELV +GP+AV FEVYDDF+ Y
Sbjct: 324 YTGTDSPCKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVYHGPLAVAFEVYDDFLHY 383

Query: 181 SGGVYHHNFGTRKLTTSKFGFNPFELTNHAVLVVGYG-ETSSGEKFWIVKNSWGNGWGEN 357
             G+YHH       T  +  FNPFELTNHAVL+VGYG +++SG  +WIVKNSWG  WGE+
Sbjct: 384 QNGIYHH-------TGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGED 436

Query: 358 GYFRIRRGNDECGIESLGVASEPI 429
           GYFRIRRG DEC IES+ VA+ PI
Sbjct: 437 GYFRIRRGTDECAIESIAVAATPI 460
>sp|O97578|CATC_CANFA Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain 1;
           Dipeptidyl-peptidase I heavy chain 2;
           Dipeptidyl-peptidase I heavy chain 3;
           Dipeptidyl-peptidase I heavy chain 4;
           Dipeptidyl-peptidase I light chain]
          Length = 435

 Score =  179 bits (455), Expect = 2e-45
 Identities = 84/144 (58%), Positives = 107/144 (74%), Gaps = 1/144 (0%)
 Frame = +1

Query: 1   YKGMNGKCSTTKDCKRYFATNYKYIGGYYGATNEPLMRMELVKNGPIAVGFEVYDDFMSY 180
           Y G +  C    DC RY+++ Y Y+GG+YGA NE LM++ELV++GP+AV FEVYDDF  Y
Sbjct: 297 YAGSDSPCKPN-DCFRYYSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHY 355

Query: 181 SGGVYHHNFGTRKLTTSKFGFNPFELTNHAVLVVGYG-ETSSGEKFWIVKNSWGNGWGEN 357
             G+Y+H       T  +  FNPFELTNHAVL+VGYG +++SG  +WIVKNSWG+ WGE+
Sbjct: 356 QKGIYYH-------TGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGED 408

Query: 358 GYFRIRRGNDECGIESLGVASEPI 429
           GYFRIRRG DEC IES+ VA+ PI
Sbjct: 409 GYFRIRRGTDECAIESIAVAATPI 432
>sp|P97821|CATC_MOUSE Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 462

 Score =  178 bits (451), Expect = 7e-45
 Identities = 80/144 (55%), Positives = 106/144 (73%), Gaps = 1/144 (0%)
 Frame = +1

Query: 1   YKGMNGKCSTTKDCKRYFATNYKYIGGYYGATNEPLMRMELVKNGPIAVGFEVYDDFMSY 180
           Y   +  C   ++C RY++++Y Y+GG+YG  NE LM++ELVK+GP+AV FEV+DDF+ Y
Sbjct: 323 YTAKDSPCKpreNCLRYYSSDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHY 382

Query: 181 SGGVYHHNFGTRKLTTSKFGFNPFELTNHAVLVVGYG-ETSSGEKFWIVKNSWGNGWGEN 357
             G+YHH       T     FNPFELTNHAVL+VGYG +  +G ++WI+KNSWG+ WGE+
Sbjct: 383 HSGIYHH-------TGLSDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGES 435

Query: 358 GYFRIRRGNDECGIESLGVASEPI 429
           GYFRIRRG DEC IES+ VA+ PI
Sbjct: 436 GYFRIRRGTDECAIESIAVAAIPI 459
>sp|P80067|CATC_RAT Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 462

 Score =  175 bits (444), Expect = 5e-44
 Identities = 80/144 (55%), Positives = 104/144 (72%), Gaps = 1/144 (0%)
 Frame = +1

Query: 1   YKGMNGKCSTTKDCKRYFATNYKYIGGYYGATNEPLMRMELVKNGPIAVGFEVYDDFMSY 180
           Y   +  C   ++C RY+++ Y Y+GG+YG  NE LM++ELVK+GP+AV FEV+DDF+ Y
Sbjct: 323 YTATDAPCKPKENCLRYYSSEYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHDDFLHY 382

Query: 181 SGGVYHHNFGTRKLTTSKFGFNPFELTNHAVLVVGYG-ETSSGEKFWIVKNSWGNGWGEN 357
             G+YHH       T     FNPFELTNHAVL+VGYG +  +G  +WIVKNSWG+ WGE+
Sbjct: 383 HSGIYHH-------TGLSDPFNPFELTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGES 435

Query: 358 GYFRIRRGNDECGIESLGVASEPI 429
           GYFRIRRG DEC IES+ +A+ PI
Sbjct: 436 GYFRIRRGTDECAIESIAMAAIPI 459
>sp|Q8IIJ9|CATC_PLAF7 Probable cathepsin C precursor
          Length = 700

 Score =  102 bits (254), Expect = 5e-22
 Identities = 54/140 (38%), Positives = 82/140 (58%), Gaps = 13/140 (9%)
 Frame = +1

Query: 46  RYFATNYKYIGGYYGATN---EPLMRMELVKNGPIAVGFEVYDDFMSYSGGVYH-HNFGT 213
           R++A ++ Y+GG YG      E +M  E+ +NGPI   FE   DF  Y+ GVY   +F  
Sbjct: 541 RWYAKDFNYVGGCYGCNQCNGEKIMMNEIYRNGPIVSSFEASPDFYDYADGVYFVEDFPH 600

Query: 214 RKLTTSK------FGFNPFELTNHAVLVVGYGETS-SGE--KFWIVKNSWGNGWGENGYF 366
            +  T +      +    ++  NHA++++G+GE   +G+  K+WI +NSWGNGWG+ GYF
Sbjct: 601 ARRCTIEPKNDGVYNITGWDRVNHAIVLLGWGEEEINGKLYKYWIGRNSWGNGWGKEGYF 660

Query: 367 RIRRGNDECGIESLGVASEP 426
           +I RG +  GIES  +  EP
Sbjct: 661 KILRGQNFSGIESQSLFIEP 680
>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase precursor
          Length = 329

 Score =  101 bits (252), Expect = 9e-22
 Identities = 54/138 (39%), Positives = 77/138 (55%), Gaps = 5/138 (3%)
 Frame = +1

Query: 25  STTKDCKRYFATNY---KYIG--GYYGATNEPLMRMELVKNGPIAVGFEVYDDFMSYSGG 189
           S +  C+  ++T Y   K+ G   Y    N   ++ E+  NGP+   F VY+DF  Y  G
Sbjct: 203 SCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSG 262

Query: 190 VYHHNFGTRKLTTSKFGFNPFELTNHAVLVVGYGETSSGEKFWIVKNSWGNGWGENGYFR 369
           VY H       T  K+      L  HA+ ++G+G T SG  +W+V NSWG  WGE+G+F+
Sbjct: 263 VYKH-------TAGKY------LGGHAIKIIGWG-TESGSPYWLVANSWGVNWGESGFFK 308

Query: 370 IRRGNDECGIESLGVASE 423
           I RG+D+CGIES  VA +
Sbjct: 309 IYRGDDQCGIESAVVAGK 326
>sp|P07688|CATB_BOVIN Cathepsin B precursor [Contains: Cathepsin B light chain; Cathepsin
           B heavy chain]
          Length = 335

 Score = 97.8 bits (242), Expect = 1e-20
 Identities = 54/146 (36%), Positives = 75/146 (51%), Gaps = 5/146 (3%)
 Frame = +1

Query: 4   KGMNGKCSTTKDCKRYFATNYKY-----IGGYYGATNEPLMRMELVKNGPIAVGFEVYDD 168
           +G   KC+ T  C+  ++ +YK         Y  A NE  +  E+ KNGP+   F VY D
Sbjct: 201 EGDTPKCNKT--CEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSD 258

Query: 169 FMSYSGGVYHHNFGTRKLTTSKFGFNPFELTNHAVLVVGYGETSSGEKFWIVKNSWGNGW 348
           F+ Y  GVY H  G               +  HA+ ++G+G   +G  +W+V NSW   W
Sbjct: 259 FLLYKSGVYQHVSGEI-------------MGGHAIRILGWG-VENGTPYWLVGNSWNTDW 304

Query: 349 GENGYFRIRRGNDECGIESLGVASEP 426
           G+NG+F+I RG D CGIES  VA  P
Sbjct: 305 GDNGFFKILRGQDHCGIESEIVAGMP 330
>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 precursor (Cysteine
           protease-related 5)
          Length = 344

 Score = 97.4 bits (241), Expect = 2e-20
 Identities = 57/143 (39%), Positives = 75/143 (52%), Gaps = 5/143 (3%)
 Frame = +1

Query: 19  KCSTTKDCKRYFATNY---KYIGGYYGATNEPL--MRMELVKNGPIAVGFEVYDDFMSYS 183
           KC  +   K  +AT Y   K+ G    A  + +  ++ E++ NGPI V F VY+DF  Y+
Sbjct: 212 KCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEVAFTVYEDFYQYT 271

Query: 184 GGVYHHNFGTRKLTTSKFGFNPFELTNHAVLVVGYGETSSGEKFWIVKNSWGNGWGENGY 363
            GVY H  G               L  HAV ++G+G   +G  +W+V NSW   WGE GY
Sbjct: 272 TGVYVHTAGA-------------SLGGHAVKILGWG-VDNGTPYWLVANSWNVAWGEKGY 317

Query: 364 FRIRRGNDECGIESLGVASEPIL 432
           FRI RG +ECGIE   VA  P L
Sbjct: 318 FRIIRGLNECGIEHSAVAGIPDL 340
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 59,729,215
Number of Sequences: 369166
Number of extensions: 1288864
Number of successful extensions: 3783
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3513
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3603
length of database: 68,354,980
effective HSP length: 102
effective length of database: 49,512,010
effective search space used: 3119256630
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)