Planarian EST Database


Dr_sW_012_K19

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_012_K19
         (641 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P07688|CATB_BOVIN  Cathepsin B precursor [Contains: Cathe...   337   2e-92
sp|P07858|CATB_HUMAN  Cathepsin B precursor (Cathepsin B1) (...   329   4e-90
sp|P00787|CATB_RAT  Cathepsin B precursor (Cathepsin B1) (RS...   326   3e-89
sp|P10605|CATB_MOUSE  Cathepsin B precursor (Cathepsin B1) [...   323   3e-88
sp|P43233|CATB_CHICK  Cathepsin B precursor (Cathepsin B1) [...   310   2e-84
sp|P43157|CYSP_SCHJA  Cathepsin B-like cysteine proteinase p...   293   3e-79
sp|P25792|CYSP_SCHMA  Cathepsin B-like cysteine proteinase p...   284   2e-76
sp|P43510|CPR6_CAEEL  Cathepsin B-like cysteine proteinase 6...   261   1e-69
sp|P43509|CPR5_CAEEL  Cathepsin B-like cysteine proteinase 5...   258   1e-68
sp|P25807|CPR1_CAEEL  Gut-specific cysteine proteinase precu...   257   2e-68
>sp|P07688|CATB_BOVIN Cathepsin B precursor [Contains: Cathepsin B light chain; Cathepsin
           B heavy chain]
          Length = 335

 Score =  337 bits (863), Expect = 2e-92
 Identities = 147/213 (69%), Positives = 168/213 (78%)
 Frame = +1

Query: 1   SCWAFGAVEAITDRYCIHSNGTQTPRISAEDLLTCCGFRCGDGCNGGFPSGAWHYWVTDG 180
           SCWAFGAVEAI+DR CIHSNG     +SAED+LTCCG  CGDGCNGGFPSGAW++W   G
Sbjct: 107 SCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKG 166

Query: 181 LVTGGEYGAHLGCQDYAFPKCSHHVIGPYPNCTGEFPTPKCKKACQAGYSKTYAEDKQYG 360
           LV+GG Y +H+GC+ Y+ P C HHV G  P CTGE  TPKC K C+ GYS +Y EDK +G
Sbjct: 167 LVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCNKTCEPGYSPSYKEDKHFG 226

Query: 361 KSSYSVDSNQQAIMQEILTNGPVEAAFSVYADFPSYKSGVYQHVSGGMLGGHAIKILGWG 540
            SSYSV +N++ IM EI  NGPVE AFSVY+DF  YKSGVYQHVSG ++GGHAI+ILGWG
Sbjct: 227 CSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWG 286

Query: 541 VENNTPYWLVANSWNPTWGDNGYFKILRGSDEC 639
           VEN TPYWLV NSWN  WGDNG+FKILRG D C
Sbjct: 287 VENGTPYWLVGNSWNTDWGDNGFFKILRGQDHC 319
>sp|P07858|CATB_HUMAN Cathepsin B precursor (Cathepsin B1) (APP secretase) (APPS)
           [Contains: Cathepsin B light chain; Cathepsin B heavy
           chain]
          Length = 339

 Score =  329 bits (843), Expect = 4e-90
 Identities = 143/213 (67%), Positives = 168/213 (78%)
 Frame = +1

Query: 1   SCWAFGAVEAITDRYCIHSNGTQTPRISAEDLLTCCGFRCGDGCNGGFPSGAWHYWVTDG 180
           SCWAFGAVEAI+DR CIH+N   +  +SAEDLLTCCG  CGDGCNGG+P+ AW++W   G
Sbjct: 107 SCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKG 166

Query: 181 LVTGGEYGAHLGCQDYAFPKCSHHVIGPYPNCTGEFPTPKCKKACQAGYSKTYAEDKQYG 360
           LV+GG Y +H+GC+ Y+ P C HHV G  P CTGE  TPKC K C+ GYS TY +DK YG
Sbjct: 167 LVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYG 226

Query: 361 KSSYSVDSNQQAIMQEILTNGPVEAAFSVYADFPSYKSGVYQHVSGGMLGGHAIKILGWG 540
            +SYSV ++++ IM EI  NGPVE AFSVY+DF  YKSGVYQHV+G M+GGHAI+ILGWG
Sbjct: 227 YNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWG 286

Query: 541 VENNTPYWLVANSWNPTWGDNGYFKILRGSDEC 639
           VEN TPYWLVANSWN  WGDNG+FKILRG D C
Sbjct: 287 VENGTPYWLVANSWNTDWGDNGFFKILRGQDHC 319
>sp|P00787|CATB_RAT Cathepsin B precursor (Cathepsin B1) (RSG-2) [Contains: Cathepsin B
           light chain; Cathepsin B heavy chain]
          Length = 339

 Score =  326 bits (836), Expect = 3e-89
 Identities = 138/213 (64%), Positives = 167/213 (78%)
 Frame = +1

Query: 1   SCWAFGAVEAITDRYCIHSNGTQTPRISAEDLLTCCGFRCGDGCNGGFPSGAWHYWVTDG 180
           SCWAFGAVEA++DR CIH+NG     +SAEDLLTCCG +CGDGCNGG+PSGAW++W   G
Sbjct: 107 SCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKG 166

Query: 181 LVTGGEYGAHLGCQDYAFPKCSHHVIGPYPNCTGEFPTPKCKKACQAGYSKTYAEDKQYG 360
           LV+GG Y +H+GC  Y  P C HHV G  P CTGE  TPKC K C+AGYS +Y EDK YG
Sbjct: 167 LVSGGVYNSHIGCLPYTIPPCEHHVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYG 226

Query: 361 KSSYSVDSNQQAIMQEILTNGPVEAAFSVYADFPSYKSGVYQHVSGGMLGGHAIKILGWG 540
            +SYSV  +++ IM EI  NGPVE AF+V++DF +YKSGVY+H +G ++GGHAI+ILGWG
Sbjct: 227 YTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWG 286

Query: 541 VENNTPYWLVANSWNPTWGDNGYFKILRGSDEC 639
           +EN  PYWLVANSWN  WGDNG+FKILRG + C
Sbjct: 287 IENGVPYWLVANSWNVDWGDNGFFKILRGENHC 319
>sp|P10605|CATB_MOUSE Cathepsin B precursor (Cathepsin B1) [Contains: Cathepsin B light
           chain; Cathepsin B heavy chain]
          Length = 339

 Score =  323 bits (827), Expect = 3e-88
 Identities = 138/213 (64%), Positives = 166/213 (77%)
 Frame = +1

Query: 1   SCWAFGAVEAITDRYCIHSNGTQTPRISAEDLLTCCGFRCGDGCNGGFPSGAWHYWVTDG 180
           SCWAFGAVEAI+DR CIH+NG     +SAEDLLTCCG +CGDGCNGG+PSGAW +W   G
Sbjct: 107 SCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKG 166

Query: 181 LVTGGEYGAHLGCQDYAFPKCSHHVIGPYPNCTGEFPTPKCKKACQAGYSKTYAEDKQYG 360
           LV+GG Y +H+GC  Y  P C HHV G  P CTGE  TP+C K+C+AGYS +Y EDK +G
Sbjct: 167 LVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFG 226

Query: 361 KSSYSVDSNQQAIMQEILTNGPVEAAFSVYADFPSYKSGVYQHVSGGMLGGHAIKILGWG 540
            +SYSV ++ + IM EI  NGPVE AF+V++DF +YKSGVY+H +G M+GGHAI+ILGWG
Sbjct: 227 YTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWG 286

Query: 541 VENNTPYWLVANSWNPTWGDNGYFKILRGSDEC 639
           VEN  PYWL ANSWN  WGDNG+FKILRG + C
Sbjct: 287 VENGVPYWLAANSWNLDWGDNGFFKILRGENHC 319
>sp|P43233|CATB_CHICK Cathepsin B precursor (Cathepsin B1) [Contains: Cathepsin B light
           chain; Cathepsin B heavy chain]
          Length = 340

 Score =  310 bits (795), Expect = 2e-84
 Identities = 137/214 (64%), Positives = 159/214 (74%), Gaps = 1/214 (0%)
 Frame = +1

Query: 1   SCWAFGAVEAITDRYCIHSNGTQTPRISAEDLLTCCGFRCGDGCNGGFPSGAWHYWVTDG 180
           SCWAFGAVEAI+DR C+H+N   +  +SAEDLL+CCGF CG GCNGG+PSGAW YW   G
Sbjct: 107 SCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERG 166

Query: 181 LVTGGEYGAHLGCQDYAFPKCSHHVIGPYPNCTGEF-PTPKCKKACQAGYSKTYAEDKQY 357
           LV+GG Y +H+GC+ Y  P C HHV G  P CTGE   TP+C + C+ GYS +Y EDK Y
Sbjct: 167 LVSGGLYDSHVGCRAYTIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHY 226

Query: 358 GKSSYSVDSNQQAIMQEILTNGPVEAAFSVYADFPSYKSGVYQHVSGGMLGGHAIKILGW 537
           G +SY V  +++ IM EI  NGPVE AF VY DF  YKSGVYQHVSG  +GGHAI+ILGW
Sbjct: 227 GITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGW 286

Query: 538 GVENNTPYWLVANSWNPTWGDNGYFKILRGSDEC 639
           GVEN TPYWL ANSWN  WG  G+FKILRG D C
Sbjct: 287 GVENGTPYWLAANSWNTDWGITGFFKILRGEDHC 320
>sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase precursor (Antigen Sj31)
          Length = 342

 Score =  293 bits (750), Expect = 3e-79
 Identities = 127/214 (59%), Positives = 157/214 (73%), Gaps = 1/214 (0%)
 Frame = +1

Query: 1   SCWAFGAVEAITDRYCIHSNGTQTPRISAEDLLTCCGFRCGDGCNGGFPSGAWHYWVTDG 180
           SCWAFGAVEA+TDR CI S G Q+  +SA DL++CC   CGDGC GGFP  AW YWV  G
Sbjct: 117 SCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCK-DCGDGCQGGFPGVAWDYWVKRG 175

Query: 181 LVTGGEYGAHLGCQDYAFPKCSHHVIGPYPNC-TGEFPTPKCKKACQAGYSKTYAEDKQY 357
           +VTGG    H GCQ Y FPKC HH  G YP C T  + TP+CK+ CQ GY   Y +DK Y
Sbjct: 176 IVTGGSKENHTGCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHY 235

Query: 358 GKSSYSVDSNQQAIMQEILTNGPVEAAFSVYADFPSYKSGVYQHVSGGMLGGHAIKILGW 537
           G  SY+V +N++ I ++I+  GPVEAAF VY DF +YKSG+Y+HV+G ++GGHAI+I+GW
Sbjct: 236 GDESYNVQNNEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGW 295

Query: 538 GVENNTPYWLVANSWNPTWGDNGYFKILRGSDEC 639
           GVE  TPYWL+ANSWN  WG+ G F+++RG DEC
Sbjct: 296 GVEKRTPYWLIANSWNEDWGEKGLFRMVRGRDEC 329
>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase precursor (Antigen Sm31)
          Length = 340

 Score =  284 bits (726), Expect = 2e-76
 Identities = 124/214 (57%), Positives = 156/214 (72%), Gaps = 1/214 (0%)
 Frame = +1

Query: 1   SCWAFGAVEAITDRYCIHSNGTQTPRISAEDLLTCCGFRCGDGCNGGFPSGAWHYWVTDG 180
           SCW+FGAVEA++DR CI S G Q   +SA DLLTCC   CG GC GG    AW YWV +G
Sbjct: 116 SCWSFGAVEAMSDRSCIQSGGKQNVELSAVDLLTCCE-SCGLGCEGGILGPAWDYWVKEG 174

Query: 181 LVTGGEYGAHLGCQDYAFPKCSHHVIGPYPNCTGE-FPTPKCKKACQAGYSKTYAEDKQY 357
           +VT      H GC+ Y FPKC HH  G YP C  + + TP+CK+ CQ  Y   Y +DK  
Sbjct: 175 IVTASSKENHTGCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHR 234

Query: 358 GKSSYSVDSNQQAIMQEILTNGPVEAAFSVYADFPSYKSGVYQHVSGGMLGGHAIKILGW 537
           GKSSY+V ++++AI +EI+  GPVEA+F+VY DF +YKSG+Y+H++G  LGGHAI+I+GW
Sbjct: 235 GKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGW 294

Query: 538 GVENNTPYWLVANSWNPTWGDNGYFKILRGSDEC 639
           GVEN TPYWL+ANSWN  WG+NGYF+I+RG DEC
Sbjct: 295 GVENKTPYWLIANSWNEDWGENGYFRIVRGRDEC 328
>sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6 precursor (Cysteine
           protease-related 6)
          Length = 379

 Score =  261 bits (666), Expect = 1e-69
 Identities = 118/218 (54%), Positives = 154/218 (70%), Gaps = 5/218 (2%)
 Frame = +1

Query: 1   SCWAFGAVEAITDRYCIHSNGTQTPRISAEDLLTCCGFRCGDGCNGGFPSGAWHYWVTDG 180
           SCWAFGAVEA++DR CI S+G     +SA+DLL+CC   CG GCNGG P  AW YWV DG
Sbjct: 132 SCWAFGAVEAMSDRICIASHGELQVTLSADDLLSCCK-SCGFGCNGGDPLAAWRYWVKDG 190

Query: 181 LVTGGEYGAHLGCQDYAFPKCSHHV----IGPYPNCTGEFPTPKCKKACQAGYS-KTYAE 345
           +VTG  Y A+ GC+ Y FP C HH       P P+    +PTPKC+K C + Y+ KTY+E
Sbjct: 191 IVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDL--YPTPKCEKKCVSDYTDKTYSE 248

Query: 346 DKQYGKSSYSVDSNQQAIMQEILTNGPVEAAFSVYADFPSYKSGVYQHVSGGMLGGHAIK 525
           DK +G S+Y V  + +AI +E++T+GP+E AF VY DF +Y  GVY H  G + GGHA+K
Sbjct: 249 DKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVK 308

Query: 526 ILGWGVENNTPYWLVANSWNPTWGDNGYFKILRGSDEC 639
           ++GWG+++  PYW VANSWN  WG++G+F+ILRG DEC
Sbjct: 309 LIGWGIDDGIPYWTVANSWNTDWGEDGFFRILRGVDEC 346
>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 precursor (Cysteine
           protease-related 5)
          Length = 344

 Score =  258 bits (658), Expect = 1e-68
 Identities = 118/219 (53%), Positives = 148/219 (67%), Gaps = 6/219 (2%)
 Frame = +1

Query: 1   SCWAFGAVEAITDRYCIHSNGTQTPRISAEDLLTCCG--FRCGDGCNGGFPSGAWHYWVT 174
           SCWAF A EAI+DR CI SNG     +S+EDLL+CC   F CG+GC GG+P  AW +WV 
Sbjct: 109 SCWAFAAAEAISDRTCIASNGAVNTLLSSEDLLSCCTGMFSCGNGCEGGYPIQAWKWWVK 168

Query: 175 DGLVTGGEYGAHLGCQDYAFPKCSHHVIG-PYPNCTGEF-PTPKCKKACQA--GYSKTYA 342
            GLVTGG Y    GC+ Y+   C   V G  +P C  +  PTPKC  +C +   Y+  Y 
Sbjct: 169 HGLVTGGSYETQFGCKPYSIAPCGETVNGVKWPACPEDTEPTPKCVDSCTSKNNYATPYL 228

Query: 343 EDKQYGKSSYSVDSNQQAIMQEILTNGPVEAAFSVYADFPSYKSGVYQHVSGGMLGGHAI 522
           +DK +G ++Y+V    + I  EILTNGP+E AF+VY DF  Y +GVY H +G  LGGHA+
Sbjct: 229 QDKHFGSTAYAVGKKVEQIQTEILTNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAV 288

Query: 523 KILGWGVENNTPYWLVANSWNPTWGDNGYFKILRGSDEC 639
           KILGWGV+N TPYWLVANSWN  WG+ GYF+I+RG +EC
Sbjct: 289 KILGWGVDNGTPYWLVANSWNVAWGEKGYFRIIRGLNEC 327
>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase precursor
          Length = 329

 Score =  257 bits (656), Expect = 2e-68
 Identities = 118/213 (55%), Positives = 146/213 (68%)
 Frame = +1

Query: 1   SCWAFGAVEAITDRYCIHSNGTQTPRISAEDLLTCCGFRCGDGCNGGFPSGAWHYWVTDG 180
           SCWAFGA E I+DR CI + G Q P IS +DLL+CCG  CG+GC GG+P  A  +W + G
Sbjct: 112 SCWAFGAAEMISDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKG 171

Query: 181 LVTGGEYGAHLGCQDYAFPKCSHHVIGPYPNCTGEFPTPKCKKACQAGYSKTYAEDKQYG 360
           +VTGG+Y    GC+ Y    C+        NC  E  TP C  +CQ+GYS  YA+DK +G
Sbjct: 172 VVTGGDYHG-AGCKPYPIAPCTSG------NCP-ESKTPSCSMSCQSGYSTAYAKDKHFG 223

Query: 361 KSSYSVDSNQQAIMQEILTNGPVEAAFSVYADFPSYKSGVYQHVSGGMLGGHAIKILGWG 540
            S+Y+V  N  +I  EI  NGPVEAAFSVY DF  YKSGVY+H +G  LGGHAIKI+GWG
Sbjct: 224 VSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSGVYKHTAGKYLGGHAIKIIGWG 283

Query: 541 VENNTPYWLVANSWNPTWGDNGYFKILRGSDEC 639
            E+ +PYWLVANSW   WG++G+FKI RG D+C
Sbjct: 284 TESGSPYWLVANSWGVNWGESGFFKIYRGDDQC 316
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 84,369,890
Number of Sequences: 369166
Number of extensions: 1997207
Number of successful extensions: 6276
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 5748
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 6139
length of database: 68,354,980
effective HSP length: 106
effective length of database: 48,773,070
effective search space used: 5218718490
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)