Planarian EST Database


Dr_sW_022_N01

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_022_N01
         (825 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P00787|CATB_RAT  Cathepsin B precursor (Cathepsin B1) (RS...   224   2e-58
sp|P10605|CATB_MOUSE  Cathepsin B precursor (Cathepsin B1) [...   224   2e-58
sp|P07688|CATB_BOVIN  Cathepsin B precursor [Contains: Cathe...   222   1e-57
sp|P07858|CATB_HUMAN  Cathepsin B precursor (Cathepsin B1) (...   217   4e-56
sp|P43233|CATB_CHICK  Cathepsin B precursor (Cathepsin B1) [...   207   3e-53
sp|P43157|CYSP_SCHJA  Cathepsin B-like cysteine proteinase p...   202   9e-52
sp|P25792|CYSP_SCHMA  Cathepsin B-like cysteine proteinase p...   183   6e-46
sp|P43510|CPR6_CAEEL  Cathepsin B-like cysteine proteinase 6...   181   2e-45
sp|P43509|CPR5_CAEEL  Cathepsin B-like cysteine proteinase 5...   169   1e-41
sp|P25807|CPR1_CAEEL  Gut-specific cysteine proteinase precu...   162   1e-39
>sp|P00787|CATB_RAT Cathepsin B precursor (Cathepsin B1) (RSG-2) [Contains: Cathepsin B
           light chain; Cathepsin B heavy chain]
          Length = 339

 Score =  224 bits (572), Expect = 2e-58
 Identities = 100/207 (48%), Positives = 133/207 (64%)
 Frame = +1

Query: 205 SFDLINYINYVANTTWQAGPTNRFKSNSEFRNVLGLRKTPKSLRLPQKYGYSNKIVIPDF 384
           S D+INYIN   NTTWQAG        S  + + G         LP++ G+S  I +P+ 
Sbjct: 27  SDDMINYINK-QNTTWQAGRNFYNVDISYLKKLCGT--VLGGPNLPERVGFSEDINLPES 83

Query: 385 FDSRTQWSHCKYIDEVRDQSNCGSCWAVAATATITDRYCIHSNGNIQPRISDKDLLSCCG 564
           FD+R QWS+C  I ++RDQ +CGSCWA  A   ++DR CIH+NG +   +S +DLL+CCG
Sbjct: 84  FDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCG 143

Query: 565 SMCGEGCNGGSDHAAWKYWVNFGIVTGGPYNSHQGCQDYPFPPCSHHVIGPYPNCSGDSP 744
             CG+GCNGG    AW +W   G+V+GG YNSH GC  Y  PPC HHV G  P C+G+  
Sbjct: 144 IQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGSRPPCTGEGD 203

Query: 745 TPQCVEKCQSGYSKSYKDDKYFGQNSY 825
           TP+C + C++GYS SYK+DK++G  SY
Sbjct: 204 TPKCNKMCEAGYSTSYKEDKHYGYTSY 230
>sp|P10605|CATB_MOUSE Cathepsin B precursor (Cathepsin B1) [Contains: Cathepsin B light
           chain; Cathepsin B heavy chain]
          Length = 339

 Score =  224 bits (571), Expect = 2e-58
 Identities = 103/208 (49%), Positives = 133/208 (63%), Gaps = 1/208 (0%)
 Frame = +1

Query: 205 SFDLINYINYVANTTWQAGPTNRFKSNSEFRNVLG-LRKTPKSLRLPQKYGYSNKIVIPD 381
           S DLINYIN   NTTWQAG        S  + + G +   PK   LP +  +   I +P+
Sbjct: 27  SDDLINYINK-QNTTWQAGRNFYNVDISYLKKLCGTVLGGPK---LPGRVAFGEDIDLPE 82

Query: 382 FFDSRTQWSHCKYIDEVRDQSNCGSCWAVAATATITDRYCIHSNGNIQPRISDKDLLSCC 561
            FD+R QWS+C  I ++RDQ +CGSCWA  A   I+DR CIH+NG +   +S +DLL+CC
Sbjct: 83  TFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCC 142

Query: 562 GSMCGEGCNGGSDHAAWKYWVNFGIVTGGPYNSHQGCQDYPFPPCSHHVIGPYPNCSGDS 741
           G  CG+GCNGG    AW +W   G+V+GG YNSH GC  Y  PPC HHV G  P C+G+ 
Sbjct: 143 GIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCTGEG 202

Query: 742 PTPQCVEKCQSGYSKSYKDDKYFGQNSY 825
            TP+C + C++GYS SYK+DK+FG  SY
Sbjct: 203 DTPRCNKSCEAGYSPSYKEDKHFGYTSY 230
>sp|P07688|CATB_BOVIN Cathepsin B precursor [Contains: Cathepsin B light chain; Cathepsin
           B heavy chain]
          Length = 335

 Score =  222 bits (565), Expect = 1e-57
 Identities = 99/208 (47%), Positives = 136/208 (65%), Gaps = 1/208 (0%)
 Frame = +1

Query: 205 SFDLINYINYVANTTWQAGPTNRFKSNSEFRNVLG-LRKTPKSLRLPQKYGYSNKIVIPD 381
           S +L+N++N   NTTW+AG        S  + + G +   PK   LPQ+  ++  +V+P+
Sbjct: 27  SDELVNFVNK-QNTTWKAGHNFYNVDLSYVKKLCGAILGGPK---LPQRDAFAADVVLPE 82

Query: 382 FFDSRTQWSHCKYIDEVRDQSNCGSCWAVAATATITDRYCIHSNGNIQPRISDKDLLSCC 561
            FD+R QW +C  I E+RDQ +CGSCWA  A   I+DR CIHSNG +   +S +D+L+CC
Sbjct: 83  SFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCC 142

Query: 562 GSMCGEGCNGGSDHAAWKYWVNFGIVTGGPYNSHQGCQDYPFPPCSHHVIGPYPNCSGDS 741
           G  CG+GCNGG    AW +W   G+V+GG YNSH GC+ Y  PPC HHV G  P C+G+ 
Sbjct: 143 GGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCTGEG 202

Query: 742 PTPQCVEKCQSGYSKSYKDDKYFGQNSY 825
            TP+C + C+ GYS SYK+DK+FG +SY
Sbjct: 203 DTPKCNKTCEPGYSPSYKEDKHFGCSSY 230
>sp|P07858|CATB_HUMAN Cathepsin B precursor (Cathepsin B1) (APP secretase) (APPS)
           [Contains: Cathepsin B light chain; Cathepsin B heavy
           chain]
          Length = 339

 Score =  217 bits (552), Expect = 4e-56
 Identities = 98/208 (47%), Positives = 131/208 (62%), Gaps = 1/208 (0%)
 Frame = +1

Query: 205 SFDLINYINYVANTTWQAGPTNRFKSNSEFRNVLG-LRKTPKSLRLPQKYGYSNKIVIPD 381
           S +L+NY+N   NTTWQAG        S  + + G     PK    PQ+  ++  + +P 
Sbjct: 27  SDELVNYVNK-RNTTWQAGHNFYNVDMSYLKRLCGTFLGGPKP---PQRVMFTEDLKLPA 82

Query: 382 FFDSRTQWSHCKYIDEVRDQSNCGSCWAVAATATITDRYCIHSNGNIQPRISDKDLLSCC 561
            FD+R QW  C  I E+RDQ +CGSCWA  A   I+DR CIH+N ++   +S +DLL+CC
Sbjct: 83  SFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCC 142

Query: 562 GSMCGEGCNGGSDHAAWKYWVNFGIVTGGPYNSHQGCQDYPFPPCSHHVIGPYPNCSGDS 741
           GSMCG+GCNGG    AW +W   G+V+GG Y SH GC+ Y  PPC HHV G  P C+G+ 
Sbjct: 143 GSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEG 202

Query: 742 PTPQCVEKCQSGYSKSYKDDKYFGQNSY 825
            TP+C + C+ GYS +YK DK++G NSY
Sbjct: 203 DTPKCSKICEPGYSPTYKQDKHYGYNSY 230
>sp|P43233|CATB_CHICK Cathepsin B precursor (Cathepsin B1) [Contains: Cathepsin B light
           chain; Cathepsin B heavy chain]
          Length = 340

 Score =  207 bits (527), Expect = 3e-53
 Identities = 96/209 (45%), Positives = 131/209 (62%), Gaps = 2/209 (0%)
 Frame = +1

Query: 205 SFDLINYINYVANTTWQAGPTNRFKSNSEFRNVLG-LRKTPKSLRLPQKYGYSNKIVIPD 381
           S DL+N+IN + NTT +AG        S  + + G     PK+   P++  ++  + +PD
Sbjct: 27  SSDLVNHINKL-NTTGRAGHNFHNTDMSYVKKLCGTFLGGPKA---PERVDFAEDMDLPD 82

Query: 382 FFDSRTQWSHCKYIDEVRDQSNCGSCWAVAATATITDRYCIHSNGNIQPRISDKDLLSCC 561
            FD+R QW +C  I E+RDQ +CGSCWA  A   I+DR C+H+N  +   +S +DLLSCC
Sbjct: 83  TFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLSCC 142

Query: 562 GSMCGEGCNGGSDHAAWKYWVNFGIVTGGPYNSHQGCQDYPFPPCSHHVIGPYPNCSGD- 738
           G  CG GCNGG    AW+YW   G+V+GG Y+SH GC+ Y  PPC HHV G  P C+G+ 
Sbjct: 143 GFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRAYTIPPCEHHVNGSRPPCTGEG 202

Query: 739 SPTPQCVEKCQSGYSKSYKDDKYFGQNSY 825
             TP+C   C+ GYS SYK+DK++G  SY
Sbjct: 203 GETPRCSRHCEPGYSPSYKEDKHYGITSY 231
>sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase precursor (Antigen Sj31)
          Length = 342

 Score =  202 bits (514), Expect = 9e-52
 Identities = 96/211 (45%), Positives = 122/211 (57%), Gaps = 4/211 (1%)
 Frame = +1

Query: 205 SFDLINYINYVANTTWQAGPTNRFKSNSEFRNVLGLRKTPKSL---RLPQKYGYSNKIVI 375
           S ++I++IN   +  W+A  ++RF S  + R ++G RK    +   R P    +   + I
Sbjct: 31  SDEMISFINEHPDAGWKADKSDRFHSLDDARILMGARKEDAEMKRNRRPTVDHHDLNVEI 90

Query: 376 PDFFDSRTQWSHCKYIDEVRDQSNCGSCWAVAATATITDRYCIHSNGNIQPRISDKDLLS 555
           P  FDSR +W HCK I ++RDQS CGSCWA  A   +TDR CI S G     +S  DL+S
Sbjct: 91  PSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLIS 150

Query: 556 CCGSMCGEGCNGGSDHAAWKYWVNFGIVTGGPYNSHQGCQDYPFPPCSHHVIGPYPNCSG 735
           CC   CG+GC GG    AW YWV  GIVTGG   +H GCQ YPFP C HH  G YP C  
Sbjct: 151 CCKD-CGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGT 209

Query: 736 D-SPTPQCVEKCQSGYSKSYKDDKYFGQNSY 825
               TPQC + CQ GY   Y+ DK++G  SY
Sbjct: 210 KIYKTPQCKQTCQKGYKTPYEQDKHYGDESY 240
>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase precursor (Antigen Sm31)
          Length = 340

 Score =  183 bits (464), Expect = 6e-46
 Identities = 101/242 (41%), Positives = 130/242 (53%), Gaps = 4/242 (1%)
 Frame = +1

Query: 112 LILILSIFPNIKNESIKWKRLTKPDIIQALSSFDLINYINYVANTTWQAGPTNRFKSNSE 291
           LI  L    ++KNE  K++ L          S D+I+YIN   N  W+A  +NRF S  +
Sbjct: 11  LITFLEAHISVKNE--KFEPL----------SDDIISYINEHPNAGWRAEKSNRFHSLDD 58

Query: 292 FRNVLGLRKTPKSLRLPQKYGYSNK---IVIPDFFDSRTQWSHCKYIDEVRDQSNCGSCW 462
            R  +G R+    LR  ++    +    + IP  FDSR +W  CK I  +RDQS CGSCW
Sbjct: 59  ARIQMGARREEPDLRRKRRPTVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCW 118

Query: 463 AVAATATITDRYCIHSNGNIQPRISDKDLLSCCGSMCGEGCNGGSDHAAWKYWVNFGIVT 642
           +  A   ++DR CI S G     +S  DLL+CC S CG GC GG    AW YWV  GIVT
Sbjct: 119 SFGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCES-CGLGCEGGILGPAWDYWVKEGIVT 177

Query: 643 GGPYNSHQGCQDYPFPPCSHHVIGPYPNC-SGDSPTPQCVEKCQSGYSKSYKDDKYFGQN 819
                +H GC+ YPFP C HH  G YP C S    TP+C + CQ  Y   Y  DK+ G++
Sbjct: 178 ASSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRGKS 237

Query: 820 SY 825
           SY
Sbjct: 238 SY 239
>sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6 precursor (Cysteine
           protease-related 6)
          Length = 379

 Score =  181 bits (460), Expect = 2e-45
 Identities = 102/258 (39%), Positives = 142/258 (55%), Gaps = 10/258 (3%)
 Frame = +1

Query: 82  KLIRMWRCFVLILILSIFPNIKNESIKWKRLTKPDIIQALSSFDLINYINYVANTTWQAG 261
           K +    C V+    +   N+++   K++          L   DLI+Y+N   N  W A 
Sbjct: 2   KTLLFLSCIVVAAYCACNDNLESVLDKYRNREIDSEAAELDGDDLIDYVNENQNL-WTAK 60

Query: 262 PTNRFKS----NSEFR-NVLGLRKTPKSLRLPQKYGYSNKIV--IPDFFDSRTQWSHCKY 420
              RF S    N + +  ++G+     S++  Q    +  +   IP+ FDSR  W  C  
Sbjct: 61  KQRRFSSVYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDLDIPESFDSRDNWPKCDS 120

Query: 421 IDEVRDQSNCGSCWAVAATATITDRYCIHSNGNIQPRISDKDLLSCCGSMCGEGCNGGSD 600
           I  +RDQS+CGSCWA  A   ++DR CI S+G +Q  +S  DLLSCC S CG GCNGG  
Sbjct: 121 IKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCKS-CGFGCNGGDP 179

Query: 601 HAAWKYWVNFGIVTGGPYNSHQGCQDYPFPPCSHHVIGP-YPNCSGD-SPTPQCVEKCQS 774
            AAW+YWV  GIVTG  Y ++ GC+ YPFPPC HH     +  C  D  PTP+C +KC S
Sbjct: 180 LAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVS 239

Query: 775 GYS-KSYKDDKYFGQNSY 825
            Y+ K+Y +DK+FG ++Y
Sbjct: 240 DYTDKTYSEDKFFGASAY 257
>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 precursor (Cysteine
           protease-related 5)
          Length = 344

 Score =  169 bits (427), Expect = 1e-41
 Identities = 96/226 (42%), Positives = 122/226 (53%), Gaps = 16/226 (7%)
 Frame = +1

Query: 196 ALSSFDLINYINYVANTTWQAGPTNRFKSNSEFRNVLGLRKTPKSLRLPQKYGYSNKI-- 369
           AL+   LI+Y+N  A   W AG             V+   K  K L +  KY   +K   
Sbjct: 26  ALTGQALIDYVNS-AQKLWTAG-----------HQVIPKEKITKKL-MDVKYLVPHKDED 72

Query: 370 --------VIPDFFDSRTQWSHCKYIDEVRDQSNCGSCWAVAATATITDRYCIHSNGNIQ 525
                    IPD FD+R QW +C  I+ +RDQS+CGSCWA AA   I+DR CI SNG + 
Sbjct: 73  IVATEVSDAIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVN 132

Query: 526 PRISDKDLLSCCGSM--CGEGCNGGSDHAAWKYWVNFGIVTGGPYNSHQGCQDYPFPPCS 699
             +S +DLLSCC  M  CG GC GG    AWK+WV  G+VTGG Y +  GC+ Y   PC 
Sbjct: 133 TLLSSEDLLSCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCG 192

Query: 700 HHVIG-PYPNCSGDS-PTPQCVEKCQS--GYSKSYKDDKYFGQNSY 825
             V G  +P C  D+ PTP+CV+ C S   Y+  Y  DK+FG  +Y
Sbjct: 193 ETVNGVKWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAY 238
>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase precursor
          Length = 329

 Score =  162 bits (409), Expect = 1e-39
 Identities = 78/151 (51%), Positives = 95/151 (62%)
 Frame = +1

Query: 373 IPDFFDSRTQWSHCKYIDEVRDQSNCGSCWAVAATATITDRYCIHSNGNIQPRISDKDLL 552
           +P  FDSRTQWS CK I  +RDQ+ CGSCWA  A   I+DR CI + G  QP IS  DLL
Sbjct: 85  VPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLL 144

Query: 553 SCCGSMCGEGCNGGSDHAAWKYWVNFGIVTGGPYNSHQGCQDYPFPPCSHHVIGPYPNCS 732
           SCCGS CG GC GG    A ++W + G+VTGG Y+   GC+ YP  PC+        NC 
Sbjct: 145 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHG-AGCKPYPIAPCTS------GNCP 197

Query: 733 GDSPTPQCVEKCQSGYSKSYKDDKYFGQNSY 825
            +S TP C   CQSGYS +Y  DK+FG ++Y
Sbjct: 198 -ESKTPSCSMSCQSGYSTAYAKDKHFGVSAY 227
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 93,653,422
Number of Sequences: 369166
Number of extensions: 2067949
Number of successful extensions: 5354
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4970
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5215
length of database: 68,354,980
effective HSP length: 109
effective length of database: 48,218,865
effective search space used: 7956112725
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)