Planarian EST Database


Dr_sW_015_O17

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_015_O17
         (570 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P10605|CATB_MOUSE  Cathepsin B precursor (Cathepsin B1) [...   171   2e-42
sp|P07688|CATB_BOVIN  Cathepsin B precursor [Contains: Cathe...   170   2e-42
sp|P00787|CATB_RAT  Cathepsin B precursor (Cathepsin B1) (RS...   169   3e-42
sp|P07858|CATB_HUMAN  Cathepsin B precursor (Cathepsin B1) (...   168   1e-41
sp|P43233|CATB_CHICK  Cathepsin B precursor (Cathepsin B1) [...   156   3e-38
sp|P43157|CYSP_SCHJA  Cathepsin B-like cysteine proteinase p...   141   1e-33
sp|P25792|CYSP_SCHMA  Cathepsin B-like cysteine proteinase p...   138   1e-32
sp|P25807|CPR1_CAEEL  Gut-specific cysteine proteinase precu...   130   3e-30
sp|P43507|CPR3_CAEEL  Cathepsin B-like cysteine proteinase 3...   121   1e-27
sp|P43510|CPR6_CAEEL  Cathepsin B-like cysteine proteinase 6...   120   2e-27
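
The one-line hit summaries above have a regular layout: sequence identifier, truncated description, bit score, E-value. As a minimal sketch, they could be read into tuples as shown below. The function name `parse_hit_line` and the regular expression are illustrative assumptions, not part of BLAST. The handling of E-values printed without a leading digit (such as `e-100`) is a defensive guess about legacy output and is not exercised by this report.

```python
import re

# Hypothetical pattern: identifier, free-text description, bit score, E-value.
HIT_LINE = re.compile(r"^(\S+)\s+(.*?)\s+(\d+(?:\.\d+)?)\s+(\S+)\s*$")

def parse_hit_line(line):
    """Return (seq_id, description, bits, evalue) or None if the line is not a hit."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    seq_id, description, bits, evalue = m.groups()
    # Assumption: very small E-values may be printed as "e-100"; float() needs "1e-100".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    try:
        return seq_id, description, float(bits), float(evalue)
    except ValueError:
        return None  # e.g. a header line whose last token is not a number

# Two summary lines copied from the report above.
sample = [
    "sp|P10605|CATB_MOUSE  Cathepsin B precursor (Cathepsin B1) [...   171   2e-42",
    "sp|P43510|CPR6_CAEEL  Cathepsin B-like cysteine proteinase 6...   120   2e-27",
]
hits = [h for h in map(parse_hit_line, sample) if h is not None]
for seq_id, description, bits, evalue in hits:
    print(seq_id, bits, evalue)
```

The header line "Sequences producing significant alignments: ... (bits) Value" does not match the pattern, so the function returns None for it and it is skipped.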
>sp|P10605|CATB_MOUSE Cathepsin B precursor (Cathepsin B1) [Contains: Cathepsin B light
           chain; Cathepsin B heavy chain]
          Length = 339

 Score =  171 bits (432), Expect = 2e-42
 Identities = 85/164 (51%), Positives = 102/164 (62%), Gaps = 6/164 (3%)
 Frame = +1

Query: 25  KLPKRKPLLNRVRLPTTFDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAITDRYCIHSN 204
           KLP R      + LP TFDAR QW  C +IG+IRDQ +CGSCWAFGAVEAI+DR CIH+N
Sbjct: 67  KLPGRVAFGEDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTN 126

Query: 205 GTQTPRISAEDLLTCCGFRCGDGCN------EVLFTCMALLVTDGLLLVENTEHI*DVRI 366
           G     +SAEDLLTCCG +CGDGCN         F     LV+ G+       H+  +  
Sbjct: 127 GRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVY----NSHVGCLPY 182

Query: 367 MPS*CSHHVIGPYPNCTGEFPTPKCKKACQAGYSKTYAEDKQYG 498
               C HHV G  P CTGE  TP+C K+C+AGYS +Y EDK +G
Sbjct: 183 TIPPCEHHVNGSRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFG 226

 Score = 28.9 bits (63), Expect = 9.4
 Identities = 13/26 (50%), Positives = 18/26 (69%)
 Frame = +2

Query: 491 NMGKSSYSVDSNQQAIMQEILTNGPV 568
           + G +SYSV ++ + IM EI  NGPV
Sbjct: 224 HFGYTSYSVSNSVKEIMAEIYKNGPV 249
>sp|P07688|CATB_BOVIN Cathepsin B precursor [Contains: Cathepsin B light chain; Cathepsin
           B heavy chain]
          Length = 335

 Score =  170 bits (431), Expect = 2e-42
 Identities = 87/164 (53%), Positives = 99/164 (60%), Gaps = 6/164 (3%)
 Frame = +1

Query: 25  KLPKRKPLLNRVRLPTTFDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAITDRYCIHSN 204
           KLP+R      V LP +FDAR QWP C +I EIRDQ +CGSCWAFGAVEAI+DR CIHSN
Sbjct: 67  KLPQRDAFAADVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSN 126

Query: 205 GTQTPRISAEDLLTCCGFRCGDGCNEVL------FTCMALLVTDGLLLVENTEHI*DVRI 366
           G     +SAED+LTCCG  CGDGCN         F     LV+ GL       H+     
Sbjct: 127 GRVNVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLY----NSHVGCRPY 182

Query: 367 MPS*CSHHVIGPYPNCTGEFPTPKCKKACQAGYSKTYAEDKQYG 498
               C HHV G  P CTGE  TPKC K C+ GYS +Y EDK +G
Sbjct: 183 SIPPCEHHVNGSRPPCTGEGDTPKCNKTCEPGYSPSYKEDKHFG 226

 Score = 32.3 bits (72), Expect = 0.85
 Identities = 15/26 (57%), Positives = 19/26 (73%)
 Frame = +2

Query: 491 NMGKSSYSVDSNQQAIMQEILTNGPV 568
           + G SSYSV +N++ IM EI  NGPV
Sbjct: 224 HFGCSSYSVANNEKEIMAEIYKNGPV 249
>sp|P00787|CATB_RAT Cathepsin B precursor (Cathepsin B1) (RSG-2) [Contains: Cathepsin B
           light chain; Cathepsin B heavy chain]
          Length = 339

 Score =  169 bits (429), Expect = 3e-42
 Identities = 87/172 (50%), Positives = 104/172 (60%), Gaps = 6/172 (3%)
 Frame = +1

Query: 1   VMKDPNNFKLPKRKPLLNRVRLPTTFDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAIT 180
           V+  PN   LP+R      + LP +FDAR QW  C +I +IRDQ +CGSCWAFGAVEA++
Sbjct: 62  VLGGPN---LPERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMS 118

Query: 181 DRYCIHSNGTQTPRISAEDLLTCCGFRCGDGCNEVL------FTCMALLVTDGLLLVENT 342
           DR CIH+NG     +SAEDLLTCCG +CGDGCN         F     LV+ G+      
Sbjct: 119 DRICIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVY----N 174

Query: 343 EHI*DVRIMPS*CSHHVIGPYPNCTGEFPTPKCKKACQAGYSKTYAEDKQYG 498
            HI  +      C HHV G  P CTGE  TPKC K C+AGYS +Y EDK YG
Sbjct: 175 SHIGCLPYTIPPCEHHVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYG 226

 Score = 29.6 bits (65), Expect = 5.5
 Identities = 13/24 (54%), Positives = 17/24 (70%)
 Frame = +2

Query: 497 GKSSYSVDSNQQAIMQEILTNGPV 568
           G +SYSV  +++ IM EI  NGPV
Sbjct: 226 GYTSYSVSDSEKEIMAEIYKNGPV 249
>sp|P07858|CATB_HUMAN Cathepsin B precursor (Cathepsin B1) (APP secretase) (APPS)
           [Contains: Cathepsin B light chain; Cathepsin B heavy
           chain]
          Length = 339

 Score =  168 bits (425), Expect = 1e-41
 Identities = 87/164 (53%), Positives = 102/164 (62%), Gaps = 6/164 (3%)
 Frame = +1

Query: 25  KLPKRKPLLNRVRLPTTFDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAITDRYCIHSN 204
           K P+R      ++LP +FDAR QWP+C +I EIRDQ +CGSCWAFGAVEAI+DR CIH+N
Sbjct: 67  KPPQRVMFTEDLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTN 126

Query: 205 GTQTPRISAEDLLTCCGFRCGDGCNEVL------FTCMALLVTDGLLLVENTEHI*DVRI 366
              +  +SAEDLLTCCG  CGDGCN         F     LV+ G  L E+        I
Sbjct: 127 AHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGG--LYESHVGCRPYSI 184

Query: 367 MPS*CSHHVIGPYPNCTGEFPTPKCKKACQAGYSKTYAEDKQYG 498
            P  C HHV G  P CTGE  TPKC K C+ GYS TY +DK YG
Sbjct: 185 PP--CEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYG 226

 Score = 29.6 bits (65), Expect = 5.5
 Identities = 13/24 (54%), Positives = 18/24 (75%)
 Frame = +2

Query: 497 GKSSYSVDSNQQAIMQEILTNGPV 568
           G +SYSV ++++ IM EI  NGPV
Sbjct: 226 GYNSYSVSNSEKDIMAEIYKNGPV 249
>sp|P43233|CATB_CHICK Cathepsin B precursor (Cathepsin B1) [Contains: Cathepsin B light
           chain; Cathepsin B heavy chain]
          Length = 340

 Score =  156 bits (395), Expect = 3e-38
 Identities = 80/165 (48%), Positives = 97/165 (58%), Gaps = 7/165 (4%)
 Frame = +1

Query: 25  KLPKRKPLLNRVRLPTTFDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAITDRYCIHSN 204
           K P+R      + LP TFD R QWP C +I EIRDQ +CGSCWAFGAVEAI+DR C+H+N
Sbjct: 67  KAPERVDFAEDMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTN 126

Query: 205 GTQTPRISAEDLLTCCGFRCGDGCNEVL------FTCMALLVTDGLLLVENTEHI*DVRI 366
              +  +SAEDLL+CCGF CG GCN         +     LV+ GL       H+     
Sbjct: 127 AKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLY----DSHVGCRAY 182

Query: 367 MPS*CSHHVIGPYPNCTGEF-PTPKCKKACQAGYSKTYAEDKQYG 498
               C HHV G  P CTGE   TP+C + C+ GYS +Y EDK YG
Sbjct: 183 TIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYG 227
>sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase precursor (Antigen Sj31)
          Length = 342

 Score =  141 bits (355), Expect = 1e-33
 Identities = 73/152 (48%), Positives = 92/152 (60%), Gaps = 3/152 (1%)
 Frame = +1

Query: 58  VRLPTTFDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAITDRYCIHSNGTQTPRISAED 237
           V +P+ FD+R +WP CKSI +IRDQS CGSCWAFGAVEA+TDR CI S G Q+  +SA D
Sbjct: 88  VEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALD 147

Query: 238 LLTCCGFRCGDGCNEVL-FTCMALLVTDGLLLVENTEHI*DVRIMP-S*CSHHVIGPYPN 411
           L++CC   CGDGC            V  G++   + E+    +  P   C HH  G YP 
Sbjct: 148 LISCCK-DCGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPA 206

Query: 412 C-TGEFPTPKCKKACQAGYSKTYAEDKQYGKE 504
           C T  + TP+CK+ CQ GY   Y +DK YG E
Sbjct: 207 CGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDE 238
>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase precursor (Antigen Sm31)
          Length = 340

 Score =  138 bits (347), Expect = 1e-32
 Identities = 73/163 (44%), Positives = 95/163 (58%), Gaps = 7/163 (4%)
 Frame = +1

Query: 34  KRKPLLNR----VRLPTTFDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAITDRYCIHS 201
           KR+P ++     V +P+ FD+R +WP CKSI  IRDQS CGSCW+FGAVEA++DR CI S
Sbjct: 75  KRRPTVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWSFGAVEAMSDRSCIQS 134

Query: 202 NGTQTPRISAEDLLTCCGFRCGDGC-NEVLFTCMALLVTDGLLLVENTEHI*DVRIMP-S 375
            G Q   +SA DLLTCC   CG GC   +L       V +G++   + E+       P  
Sbjct: 135 GGKQNVELSAVDLLTCCE-SCGLGCEGGILGPAWDYWVKEGIVTASSKENHTGCEPYPFP 193

Query: 376 *CSHHVIGPYPNCTGE-FPTPKCKKACQAGYSKTYAEDKQYGK 501
            C HH  G YP C  + + TP+CK+ CQ  Y   Y +DK  GK
Sbjct: 194 KCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRGK 236

 Score = 31.2 bits (69), Expect = 1.9
 Identities = 13/24 (54%), Positives = 20/24 (83%)
 Frame = +2

Query: 497 GKSSYSVDSNQQAIMQEILTNGPV 568
           GKSSY+V ++++AI +EI+  GPV
Sbjct: 235 GKSSYNVKNDEKAIQKEIMKYGPV 258
>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase precursor
          Length = 329

 Score =  130 bits (326), Expect = 3e-30
 Identities = 69/147 (46%), Positives = 88/147 (59%), Gaps = 2/147 (1%)
 Frame = +1

Query: 64  LPTTFDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAITDRYCIHSNGTQTPRISAEDLL 243
           +P TFD+R QW +CKSI  IRDQ+ CGSCWAFGA E I+DR CI + G Q P IS +DLL
Sbjct: 85  VPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLL 144

Query: 244 TCCGFRCGDGCNEVLFTCMALLVTDGLLLVENTEHI*DVRIMPS*CSHHVIGPYP--NCT 417
           +CCG  CG+GC E  +   AL   D   +V   ++        + C  + I P    NC 
Sbjct: 145 SCCGSSCGNGC-EGGYPIQALRWWDSKGVVTGGDY------HGAGCKPYPIAPCTSGNCP 197

Query: 418 GEFPTPKCKKACQAGYSKTYAEDKQYG 498
            E  TP C  +CQ+GYS  YA+DK +G
Sbjct: 198 -ESKTPSCSMSCQSGYSTAYAKDKHFG 223
>sp|P43507|CPR3_CAEEL Cathepsin B-like cysteine proteinase 3 precursor (Cysteine
           protease-related 3)
          Length = 370

 Score =  121 bits (304), Expect = 1e-27
 Identities = 67/152 (44%), Positives = 82/152 (53%), Gaps = 7/152 (4%)
 Frame = +1

Query: 64  LPTTFDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAITDRYCIHSNGTQTPRISAEDLL 243
           LP TFDAR +WP C +I  IR+Q+ CGSCWAFGA E I+DR CI SNGTQ P IS ED+L
Sbjct: 92  LPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVEDIL 151

Query: 244 TCCGFRCGDGCN------EVLFTCMALLVTDGLLLVENTEHI*DVRIMPS*CSHHVIGPY 405
           +CCG  CG GC        + F   +  VT G    +   H          C  +   P 
Sbjct: 152 SCCGTTCGYGCKGGYSIEALRFWASSGAVTGG----DYGGHG---------CMPYSFAPC 198

Query: 406 PNCTGEFPTPKCKKACQAGY-SKTYAEDKQYG 498
                E  TP CK  CQ+ Y ++ Y +DK YG
Sbjct: 199 TKNCPESTTPSCKTTCQSSYKTEEYKKDKHYG 230
>sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6 precursor (Cysteine
           protease-related 6)
          Length = 379

 Score =  120 bits (302), Expect = 2e-27
 Identities = 70/152 (46%), Positives = 90/152 (59%), Gaps = 7/152 (4%)
 Frame = +1

Query: 64  LPTTFDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAITDRYCIHSNGTQTPRISAEDLL 243
           +P +FD+R  WPKC SI  IRDQS+CGSCWAFGAVEA++DR CI S+G     +SA+DLL
Sbjct: 105 IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLL 164

Query: 244 TCCGFRCGDGCN-EVLFTCMALLVTDGLLLVENTEHI*DVRIMP-S*CSHHV----IGPY 405
           +CC   CG GCN           V DG++   N       +  P   C HH       P 
Sbjct: 165 SCCK-SCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPC 223

Query: 406 PNCTGEFPTPKCKKACQAGYS-KTYAEDKQYG 498
           P+    +PTPKC+K C + Y+ KTY+EDK +G
Sbjct: 224 PH--DLYPTPKCEKKCVSDYTDKTYSEDKFFG 253
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 69,708,084
Number of Sequences: 369166
Number of extensions: 1454528
Number of successful extensions: 4112
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3935
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4075
length of database: 68,354,980
effective HSP length: 104
effective length of database: 49,142,540
effective search space used: 4177115900
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
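
The search statistics above can be cross-checked with a little arithmetic. The sketch below uses only numbers printed in this report. It assumes the standard Karlin-Altschul relationship E = (search space) x 2^(-bit score). It also assumes the effective database length is the total letter count minus one effective HSP length per sequence. That second assumption reproduces the figure printed in this report, but it is not claimed to be BLAST's exact internal procedure. The function names are illustrative.

```python
def effective_db_length(total_letters, num_sequences, effective_hsp_length):
    # Assumption: each database sequence is shortened by the effective HSP length.
    return total_letters - num_sequences * effective_hsp_length

def expect_from_bits(bit_score, search_space):
    # Karlin-Altschul relationship between a bit score and an E-value.
    return search_space * 2.0 ** (-bit_score)

# Values taken from the report footer.
TOTAL_LETTERS = 68_354_980
NUM_SEQUENCES = 184_735
EFFECTIVE_HSP_LENGTH = 104
SEARCH_SPACE = 4_177_115_900

db_len = effective_db_length(TOTAL_LETTERS, NUM_SEQUENCES, EFFECTIVE_HSP_LENGTH)
print(db_len)  # 49142540, matching "effective length of database"

# The search space divided by the effective database length gives the
# effective query length implied by the report.
eff_query_len = SEARCH_SPACE // db_len
print(eff_query_len)  # 85

# Top hit: 171 bits. The printed bit score is rounded, so this only
# approximates the reported Expect of 2e-42.
top_hit_expect = expect_from_bits(171, SEARCH_SPACE)
print(f"{top_hit_expect:.1e}")
```

Because the report rounds bit scores to whole numbers for the high-scoring hits, the recomputed E-value should be read as an order-of-magnitude check rather than an exact reproduction.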