Planarian EST Database


Dr_sW_022_C11

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_022_C11
         (689 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P53634|CATC_HUMAN  Dipeptidyl-peptidase I precursor (DPP-...   235   9e-62
sp|P97821|CATC_MOUSE  Dipeptidyl-peptidase I precursor (DPP-...   233   5e-61
sp|Q60HG6|CATC_MACFA  Dipeptidyl-peptidase I precursor (DPP-...   228   9e-60
sp|P80067|CATC_RAT  Dipeptidyl-peptidase I precursor (DPP-I)...   226   3e-59
sp|O97578|CATC_CANFA  Dipeptidyl-peptidase I precursor (DPP-...   223   4e-58
sp|Q26563|CATC_SCHMA  Cathepsin C precursor                       170   4e-42
sp|P82473|CPGP1_ZINOF  Cysteine proteinase GP-I                    80   7e-15
sp|O10364|CATV_NPVOP  Viral cathepsin (V-cath) (Cysteine pro...    79   9e-15
sp|P41715|CATV_NPVCF  Viral cathepsin (V-cath) (Cysteine pro...    77   3e-14
sp|Q80LP4|CATV_NPVAH  Viral cathepsin (V-cath) (Cysteine pro...    77   6e-14
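
Note: the rows of the summary table above can be split into fields mechanically. The following is a minimal sketch, assuming each row has the layout shown here (identifier, description, bit score, E-value, separated by whitespace). The names HIT_ROW and parse_hit_row are hypothetical and are not part of BLAST.

```python
import re

# Hypothetical pattern for one row of the summary table:
# identifier, free-text description, bit score, E-value.
HIT_ROW = re.compile(
    r"^(?P<ident>\S+)\s+(?P<desc>.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)


def parse_hit_row(line):
    """Return (ident, desc, bits, evalue), or None if the line is not a hit row."""
    m = HIT_ROW.match(line.rstrip())
    if m is None:
        return None
    ev = m.group("evalue")
    # Some reports abbreviate "1e-100" as "e-100"; treated here as an assumption.
    if ev.startswith("e"):
        ev = "1" + ev
    try:
        evalue = float(ev)
    except ValueError:
        return None
    return m.group("ident"), m.group("desc"), float(m.group("bits")), evalue


if __name__ == "__main__":
    row = "sp|Q26563|CATC_SCHMA  Cathepsin C precursor                       170   4e-42"
    print(parse_hit_row(row))
```

Descriptions that the table truncates with "..." are returned as printed; the full description is only available from the ">" header of the corresponding alignment.
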
>sp|P53634|CATC_HUMAN Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 463

 Score =  235 bits (599), Expect = 9e-62
 Identities = 113/233 (48%), Positives = 145/233 (62%), Gaps = 9/233 (3%)
 Frame = +3

Query: 18  SYCDRLQPSWFHDVLIRQWQCFKAQRTTTLKE---------KNNVLPHSNIFALTSLRYG 170
           +YC+     W HDVL R W CF  ++  T  E         KN+   +SN        Y 
Sbjct: 116 TYCNETMTGWVHDVLGRNWACFTGKKVGTASENVYVNTAHLKNSQEKYSNRL------YK 169

Query: 171 SQKRIVDKINLENNGWTAKDYPEFHEKTLYEVINMAGGSRSKLERPKPAPITKSILDSVK 350
                V  IN     WTA  Y E+   TL ++I  +GG   K+ RPKPAP+T  I   + 
Sbjct: 170 YDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKIL 229

Query: 351 LIPKSFDWRNVNGLNYVSPVRNQGGCGSCYSFASAGMLEARYRIRSNNTVRPILSPQDVV 530
            +P S+DWRNV+G+N+VSPVRNQ  CGSCYSFAS GMLEAR RI +NN+  PILSPQ+VV
Sbjct: 230 HLPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVV 289

Query: 531 ECSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYKGMNGKCSTTKDCKRYFA 689
            CS Y+QGC+GGFPYLIAGK+A+DFG+ +E+C PY G +  C   +DC RY++
Sbjct: 290 SCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKMKEDCFRYYS 342
>sp|P97821|CATC_MOUSE Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 462

 Score =  233 bits (593), Expect = 5e-61
 Identities = 115/232 (49%), Positives = 148/232 (63%), Gaps = 7/232 (3%)
 Frame = +3

Query: 15  ISYCDRLQPSWFHDVLIRQWQCFKAQRTTTLKEKNNVLPHSNIFALTSLR-------YGS 173
           ISYC      W HDVL R W CF  ++  +  EK N+    N   L  L+       Y  
Sbjct: 115 ISYCHETMTGWVHDVLGRNWACFVGKKVESHIEKVNM----NAAHLGGLQERYSERLYTH 170

Query: 174 QKRIVDKINLENNGWTAKDYPEFHEKTLYEVINMAGGSRSKLERPKPAPITKSILDSVKL 353
               V  IN     WTA  Y E+ + +L ++I  +G S+ ++ RPKPAP+T  I   +  
Sbjct: 171 NHNFVKAINTVQKSWTATAYKEYEKMSLRDLIRRSGHSQ-RIPRPKPAPMTDEIQQQILN 229

Query: 354 IPKSFDWRNVNGLNYVSPVRNQGGCGSCYSFASAGMLEARYRIRSNNTVRPILSPQDVVE 533
           +P+S+DWRNV G+NYVSPVRNQ  CGSCYSFAS GMLEAR RI +NN+  PILSPQ+VV 
Sbjct: 230 LPESWDWRNVQGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVS 289

Query: 534 CSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYKGMNGKCSTTKDCKRYFA 689
           CSPY+QGCDGGFPYLIAGK+A+DFG+ +ESC PY   +  C   ++C RY++
Sbjct: 290 CSPYAQGCDGGFPYLIAGKYAQDFGVVEESCFPYTAKDSPCKPRENCLRYYS 341
>sp|Q60HG6|CATC_MACFA Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 463

 Score =  228 bits (582), Expect = 9e-60
 Identities = 110/232 (47%), Positives = 141/232 (60%), Gaps = 9/232 (3%)
 Frame = +3

Query: 21  YCDRLQPSWFHDVLIRQWQCFKAQRTTTLKE---------KNNVLPHSNIFALTSLRYGS 173
           YC+     W HDVL R W CF  ++  T  E         KN+   +SN        Y  
Sbjct: 117 YCNETMTGWVHDVLGRNWACFTGKKVGTASENVYVNTAHLKNSQEKYSNRL------YKY 170

Query: 174 QKRIVDKINLENNGWTAKDYPEFHEKTLYEVINMAGGSRSKLERPKPAPITKSILDSVKL 353
               V  IN     WTA  Y E+   TL ++I  +GG   K+ RPKP P+T  I   +  
Sbjct: 171 DHNFVKAINAIQKSWTATTYMEYETLTLGDMIKRSGGHSRKIPRPKPTPLTAEIQQKILH 230

Query: 354 IPKSFDWRNVNGLNYVSPVRNQGGCGSCYSFASAGMLEARYRIRSNNTVRPILSPQDVVE 533
           +P S+DWRNV+G+N+VSPVRNQ  CGSCYSFAS GMLEAR RI +NN+  PILS Q+VV 
Sbjct: 231 LPTSWDWRNVHGINFVSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSSQEVVS 290

Query: 534 CSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYKGMNGKCSTTKDCKRYFA 689
           CS Y+QGC+GGFPYL AGK+A+DFG+ +E+C PY G +  C   +DC RY++
Sbjct: 291 CSQYAQGCEGGFPYLTAGKYAQDFGLVEEACFPYTGTDSPCKMKEDCFRYYS 342
>sp|P80067|CATC_RAT Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 462

 Score =  226 bits (577), Expect = 3e-59
 Identities = 111/232 (47%), Positives = 144/232 (62%), Gaps = 7/232 (3%)
 Frame = +3

Query: 15  ISYCDRLQPSWFHDVLIRQWQCFKAQRTTTLKEKNNVLPHSNIFALTSLR-------YGS 173
           ISYC      W HD L R W CF  ++     EK  V    N+  L  L+       Y  
Sbjct: 115 ISYCHETMTGWVHDYLGRNWACFVGKKMANHSEKVYV----NVAHLGGLQEKYSERLYSH 170

Query: 174 QKRIVDKINLENNGWTAKDYPEFHEKTLYEVINMAGGSRSKLERPKPAPITKSILDSVKL 353
               V  IN     WTA  Y  + + ++ ++I  +G S  ++ RPKPAPIT  I   +  
Sbjct: 171 HHNFVKAINSVQKSWTATTYRRYEKLSIRDLIRRSGHS-GRILRPKPAPITDEIQQQILS 229

Query: 354 IPKSFDWRNVNGLNYVSPVRNQGGCGSCYSFASAGMLEARYRIRSNNTVRPILSPQDVVE 533
           +P+S+DWRNV G+N+VSPVRNQ  CGSCYSFAS GMLEAR RI +NN+  PILSPQ+VV 
Sbjct: 230 LPESWDWRNVRGINFVSPVRNQESCGSCYSFASIGMLEARIRILTNNSQTPILSPQEVVS 289

Query: 534 CSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYKGMNGKCSTTKDCKRYFA 689
           CSPY+QGCDGGFPYLIAGK+A+DFG+ +E+C PY   +  C   ++C RY++
Sbjct: 290 CSPYAQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDAPCKPKENCLRYYS 341
>sp|O97578|CATC_CANFA Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain 1;
           Dipeptidyl-peptidase I heavy chain 2;
           Dipeptidyl-peptidase I heavy chain 3;
           Dipeptidyl-peptidase I heavy chain 4;
           Dipeptidyl-peptidase I light chain]
          Length = 435

 Score =  223 bits (568), Expect = 4e-58
 Identities = 111/231 (48%), Positives = 139/231 (60%), Gaps = 7/231 (3%)
 Frame = +3

Query: 18  SYCDRLQPSWFHDVLIRQWQCFKAQRTTTLKEKNNV-------LPHSNIFALTSLRYGSQ 176
           SYC+     W HDVL R W CF   +  T  EK  V       L  +N   L    Y   
Sbjct: 91  SYCNETMTGWVHDVLGRNWACFTGTKMGTTSEKAKVNTKHIERLQENNSNRLYKYNY--- 147

Query: 177 KRIVDKINLENNGWTAKDYPEFHEKTLYEVINMAGGSRSKLERPKPAPITKSILDSVKLI 356
              V  IN     WTA  Y E+   TL +++   GG   K+ RPKP P+T  I + +  +
Sbjct: 148 -EFVKAINTIQKSWTATRYIEYETLTLRDMMTRVGGR--KIPRPKPTPLTAEIHEEISRL 204

Query: 357 PKSFDWRNVNGLNYVSPVRNQGGCGSCYSFASAGMLEARYRIRSNNTVRPILSPQDVVEC 536
           P S+DWRNV G N+VSPVRNQ  CGSCY+FAS  MLEAR RI +NNT  PILSPQ++V C
Sbjct: 205 PTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSC 264

Query: 537 SPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYKGMNGKCSTTKDCKRYFA 689
           S Y+QGC+GGFPYLIAGK+A+DFG+ +E+C PY G +  C    DC RY++
Sbjct: 265 SQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYAGSDSPCK-PNDCFRYYS 314
>sp|Q26563|CATC_SCHMA Cathepsin C precursor
          Length = 454

 Score =  170 bits (430), Expect = 4e-42
 Identities = 96/228 (42%), Positives = 129/228 (56%), Gaps = 7/228 (3%)
 Frame = +3

Query: 24  CDRLQPSWFHDVLIRQWQCFKAQRTTTLKEKNNVLPHSNIFALTSLRYGSQKRIVDKINL 203
           C +  P W HD LI        +     K   N L  S  F  T   Y      V KIN 
Sbjct: 107 CHKSMPMWTHDTLIDSGSVCSGKIGVHDKFHINKLFGSKSFGRTL--YHINPSFVGKINA 164

Query: 204 ENNGWTAKDYPEFHEKTLYEVINMAGGSRSKLERP----KPAPITKSILDSVKLIPKSFD 371
               W  + YPE  + T+ E+ N AGG +S + RP    +  P +K ++     +P  FD
Sbjct: 165 HQKSWRGEIYPELSKYTIDELRNRAGGVKSMVTRPSVLNRKTP-SKELISLTGNLPLEFD 223

Query: 372 WRNV--NGLNYVSPVRNQGGCGSCYSFASAGMLEARYRIRSNNTVRPILSPQDVVECSPY 545
           W +      + V+P+RNQG CGSCY+  SA  LEAR R+ SN + +PILSPQ VV+CSPY
Sbjct: 224 WTSPPDGSRSPVTPIRNQGICGSCYASPSAAALEARIRLVSNFSEQPILSPQTVVDCSPY 283

Query: 546 SQGCDGGFPYLIAGKFAEDFGMAQESCNPYKGMN-GKCSTTKDCKRYF 686
           S+GC+GGFP+LIAGK+ EDFG+ Q+   PY G + GKC+ +K+C RY+
Sbjct: 284 SEGCNGGFPFLIAGKYGEDFGLPQKIVIPYTGEDTGKCTVSKNCTRYY 331
>sp|P82473|CPGP1_ZINOF Cysteine proteinase GP-I
          Length = 221

 Score = 79.7 bits (195), Expect = 7e-15
 Identities = 40/107 (37%), Positives = 60/107 (56%)
 Frame = +3

Query: 351 LIPKSFDWRNVNGLNYVSPVRNQGGCGSCYSFASAGMLEARYRIRSNNTVRPILSPQDVV 530
           ++P S DWR       V PV+NQGGCGSC++F +   +E   +I + + +   LS Q +V
Sbjct: 2   VLPDSIDWREKGA---VVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLIS--LSEQQLV 56

Query: 531 ECSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYKGMNGKCSTTKD 671
           +CS  + GC+GG+PY        + G+  E   PY G NG C T ++
Sbjct: 57  DCSTRNHGCEGGWPYRAFQYIINNGGINSEEHYPYTGTNGTCDTKEN 103
>sp|O10364|CATV_NPVOP Viral cathepsin (V-cath) (Cysteine proteinase) (CP)
          Length = 324

 Score = 79.3 bits (194), Expect = 9e-15
 Identities = 42/117 (35%), Positives = 62/117 (52%), Gaps = 8/117 (6%)
 Frame = +3

Query: 357 PKSFDWRNVNGLNYVSPVRNQGGCGSCYSFASAGMLEARYRIRSNNTVRPILSPQDVVEC 536
           P  FDWR     N V+ V+NQG CG+C++FA+ G LE+++ I+ N  +   LS Q  ++C
Sbjct: 114 PLEFDWRQ---FNKVTSVKNQGVCGACWAFATLGSLESQFAIKYNRLIN--LSEQQFIDC 168

Query: 537 SPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYKGMNGKCST--------TKDCKRY 683
              + GCDGG  +       E  G+  ES  PY+  NG+C           + C+RY
Sbjct: 169 DRVNAGCDGGLLHTAFESAMEMGGVQMESDYPYETANGQCRINPNRFVVGVRSCRRY 225
>sp|P41715|CATV_NPVCF Viral cathepsin (V-cath) (Cysteine proteinase) (CP)
 sp|O41479|CATV_NPVCD Viral cathepsin (V-cath) (Cysteine proteinase) (CP)
          Length = 324

 Score = 77.4 bits (189), Expect = 3e-14
 Identities = 43/117 (36%), Positives = 60/117 (51%), Gaps = 8/117 (6%)
 Frame = +3

Query: 357 PKSFDWRNVNGLNYVSPVRNQGGCGSCYSFASAGMLEARYRIRSNNTVRPILSPQDVVEC 536
           P  FDWR    LN V+ V+NQG CG+C++FA+ G LE+++ I+ N  +   LS Q +++C
Sbjct: 114 PLEFDWRR---LNKVTSVKNQGMCGACWAFATLGSLESQFAIKHNQFIN--LSEQQLIDC 168

Query: 537 SPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYKGMNGKCST--------TKDCKRY 683
                GCDGG  +          G+  ES  PY+  NG C           K C RY
Sbjct: 169 DFVDAGCDGGLLHTAFEAVMNMGGIQAESDYPYEANNGDCRANAAKFVVKVKKCYRY 225
>sp|Q80LP4|CATV_NPVAH Viral cathepsin (V-cath) (Cysteine proteinase) (CP)
          Length = 337

 Score = 76.6 bits (187), Expect = 6e-14
 Identities = 41/118 (34%), Positives = 63/118 (53%), Gaps = 8/118 (6%)
 Frame = +3

Query: 354 IPKSFDWRNVNGLNYVSPVRNQGGCGSCYSFASAGMLEARYRIRSNNTVRPILSPQDVVE 533
           +P++FDWR VN  N ++ V++QG CGSC++ A+ G LE  Y I+ N  +   LS Q +++
Sbjct: 126 LPQNFDWR-VN--NKMTSVKDQGACGSCWAHAAVGTLETLYAIKHNYLIN--LSEQQLID 180

Query: 534 CSPYSQGCDGGFPYLIAGKFAEDFGMAQESCNPYKGMNGKCS--------TTKDCKRY 683
           C   +  CDGG  +    +     G+ +E   PY+G  G C         +   CKRY
Sbjct: 181 CDSANMACDGGLMHTAFEQLMNAGGLMEEIDYPYQGTKGVCKIDNKKFALSVSSCKRY 238
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 85,403,790
Number of Sequences: 369166
Number of extensions: 1786441
Number of successful extensions: 5024
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4609
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4823
length of database: 68,354,980
effective HSP length: 107
effective length of database: 48,588,335
effective search space used: 5927776870
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
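
Note: the effective lengths in the statistics above are consistent with a simple length correction. The sketch below reproduces the figures printed in this report. It is a reading of those numbers, not the BLAST implementation, and it assumes the query length is counted in translated residues (689 letters // 3) and that the sequence count is the one listed for the first database (184,735). The function name is hypothetical.

```python
def effective_search_space(query_nt, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    query_aa = query_nt // 3                   # translated query length
    eff_query = query_aa - hsp_len             # query length minus HSP length
    eff_db = db_letters - db_seqs * hsp_len    # one HSP length per db sequence
    return eff_query, eff_db, eff_query * eff_db


if __name__ == "__main__":
    # Inputs copied from the report: 689 letters, 68,354,980 db letters,
    # 184,735 sequences, effective HSP length 107.
    print(effective_search_space(689, 68_354_980, 184_735, 107))
```

The second and third values returned correspond to the "effective length of database" and "effective search space used" lines of the report.
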