Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= Dr_sW_008_K01
(820 letters)
Database: Non-redundant SwissProt sequences
184,735 sequences; 68,354,980 total letters
Score E
Sequences producing significant alignments: (bits) Value
sp|P07688|CATB_BOVIN Cathepsin B precursor [Contains: Cathe... 294 2e-79
sp|P00787|CATB_RAT Cathepsin B precursor (Cathepsin B1) (RS... 290 4e-78
sp|P10605|CATB_MOUSE Cathepsin B precursor (Cathepsin B1) [... 288 1e-77
sp|P07858|CATB_HUMAN Cathepsin B precursor (Cathepsin B1) (... 281 2e-75
sp|P43233|CATB_CHICK Cathepsin B precursor (Cathepsin B1) [... 272 7e-73
sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase p... 264 3e-70
sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase p... 246 5e-65
sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6... 219 5e-57
sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5... 201 2e-51
sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase precu... 199 6e-51
>sp|P07688|CATB_BOVIN Cathepsin B precursor [Contains: Cathepsin B light chain; Cathepsin
B heavy chain]
Length = 335
Score = 294 bits (752), Expect = 2e-79
Identities = 139/238 (58%), Positives = 169/238 (71%), Gaps = 1/238 (0%)
Frame = +3
Query: 108 PLSFDLINYVNYVAQTTWKAGPTTRFQSISDIRKVLG-VMKDPNNFKLPKRKPLLNRVRL 284
PLS +L+N+VN TTWKAG +S ++K+ G ++ P KLP+R V L
Sbjct: 25 PLSDELVNFVNK-QNTTWKAGHNFYNVDLSYVKKLCGAILGGP---KLPQRDAFAADVVL 80
Query: 285 PTTFDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAITDRYCIHSNGTQTPRISAEDLLT 464
P +FDAR QWP C +I EIRDQ +CGSCWAFGAVEAI+DR CIHSNG +SAED+LT
Sbjct: 81 PESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLT 140
Query: 465 CCGFRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGAHLGCQDYAFPKCSHHVIGPYPNCTG 644
CCG CGDGCNGGFPSGAW++W GLV+GG Y +H+GC+ Y+ P C HHV G P CTG
Sbjct: 141 CCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCTG 200
Query: 645 EFPTPKCKKACQAGYSKTYAEDKQYGKSSYSVDSNQQAIMQEILTNGPVEAAFSVYAD 818
E TPKC K C+ GYS +Y EDK +G SSYSV +N++ IM EI NGPVE AFSVY+D
Sbjct: 201 EGDTPKCNKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSD 258
>sp|P00787|CATB_RAT Cathepsin B precursor (Cathepsin B1) (RSG-2) [Contains: Cathepsin B
light chain; Cathepsin B heavy chain]
Length = 339
Score = 290 bits (741), Expect = 4e-78
Identities = 135/242 (55%), Positives = 168/242 (69%), Gaps = 1/242 (0%)
Frame = +3
Query: 96 PIHTPLSFDLINYVNYVAQTTWKAGPTTRFQSISDIRKVLG-VMKDPNNFKLPKRKPLLN 272
P PLS D+INY+N TTW+AG IS ++K+ G V+ PN LP+R
Sbjct: 21 PSSHPLSDDMINYINK-QNTTWQAGRNFYNVDISYLKKLCGTVLGGPN---LPERVGFSE 76
Query: 273 RVRLPTTFDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAITDRYCIHSNGTQTPRISAE 452
+ LP +FDAR QW C +I +IRDQ +CGSCWAFGAVEA++DR CIH+NG +SAE
Sbjct: 77 DINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAE 136
Query: 453 DLLTCCGFRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGAHLGCQDYAFPKCSHHVIGPYP 632
DLLTCCG +CGDGCNGG+PSGAW++W GLV+GG Y +H+GC Y P C HHV G P
Sbjct: 137 DLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGSRP 196
Query: 633 NCTGEFPTPKCKKACQAGYSKTYAEDKQYGKSSYSVDSNQQAIMQEILTNGPVEAAFSVY 812
CTGE TPKC K C+AGYS +Y EDK YG +SYSV +++ IM EI NGPVE AF+V+
Sbjct: 197 PCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVF 256
Query: 813 AD 818
+D
Sbjct: 257 SD 258
>sp|P10605|CATB_MOUSE Cathepsin B precursor (Cathepsin B1) [Contains: Cathepsin B light
chain; Cathepsin B heavy chain]
Length = 339
Score = 288 bits (737), Expect = 1e-77
Identities = 137/242 (56%), Positives = 168/242 (69%), Gaps = 1/242 (0%)
Frame = +3
Query: 96 PIHTPLSFDLINYVNYVAQTTWKAGPTTRFQSISDIRKVLG-VMKDPNNFKLPKRKPLLN 272
P PLS DLINY+N TTW+AG IS ++K+ G V+ P KLP R
Sbjct: 21 PSFHPLSDDLINYINK-QNTTWQAGRNFYNVDISYLKKLCGTVLGGP---KLPGRVAFGE 76
Query: 273 RVRLPTTFDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAITDRYCIHSNGTQTPRISAE 452
+ LP TFDAR QW C +IG+IRDQ +CGSCWAFGAVEAI+DR CIH+NG +SAE
Sbjct: 77 DIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAE 136
Query: 453 DLLTCCGFRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGAHLGCQDYAFPKCSHHVIGPYP 632
DLLTCCG +CGDGCNGG+PSGAW +W GLV+GG Y +H+GC Y P C HHV G P
Sbjct: 137 DLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRP 196
Query: 633 NCTGEFPTPKCKKACQAGYSKTYAEDKQYGKSSYSVDSNQQAIMQEILTNGPVEAAFSVY 812
CTGE TP+C K+C+AGYS +Y EDK +G +SYSV ++ + IM EI NGPVE AF+V+
Sbjct: 197 PCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVF 256
Query: 813 AD 818
+D
Sbjct: 257 SD 258
>sp|P07858|CATB_HUMAN Cathepsin B precursor (Cathepsin B1) (APP secretase) (APPS)
[Contains: Cathepsin B light chain; Cathepsin B heavy
chain]
Length = 339
Score = 281 bits (719), Expect = 2e-75
Identities = 131/241 (54%), Positives = 167/241 (69%)
Frame = +3
Query: 96 PIHTPLSFDLINYVNYVAQTTWKAGPTTRFQSISDIRKVLGVMKDPNNFKLPKRKPLLNR 275
P PLS +L+NYVN TTW+AG +S ++++ G K P+R
Sbjct: 21 PSFHPLSDELVNYVNK-RNTTWQAGHNFYNVDMSYLKRLCGTFL--GGPKPPQRVMFTED 77
Query: 276 VRLPTTFDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAITDRYCIHSNGTQTPRISAED 455
++LP +FDAR QWP+C +I EIRDQ +CGSCWAFGAVEAI+DR CIH+N + +SAED
Sbjct: 78 LKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAED 137
Query: 456 LLTCCGFRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGAHLGCQDYAFPKCSHHVIGPYPN 635
LLTCCG CGDGCNGG+P+ AW++W GLV+GG Y +H+GC+ Y+ P C HHV G P
Sbjct: 138 LLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPP 197
Query: 636 CTGEFPTPKCKKACQAGYSKTYAEDKQYGKSSYSVDSNQQAIMQEILTNGPVEAAFSVYA 815
CTGE TPKC K C+ GYS TY +DK YG +SYSV ++++ IM EI NGPVE AFSVY+
Sbjct: 198 CTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYS 257
Query: 816 D 818
D
Sbjct: 258 D 258
>sp|P43233|CATB_CHICK Cathepsin B precursor (Cathepsin B1) [Contains: Cathepsin B light
chain; Cathepsin B heavy chain]
Length = 340
Score = 272 bits (696), Expect = 7e-73
Identities = 128/243 (52%), Positives = 161/243 (66%), Gaps = 1/243 (0%)
Frame = +3
Query: 93 IPIHTPLSFDLINYVNYVAQTTWKAGPTTRFQSISDIRKVLGVMKDPNNFKLPKRKPLLN 272
IP + PLS DL+N++N + TT +AG +S ++K+ G K P+R
Sbjct: 20 IPYYPPLSSDLVNHINKL-NTTGRAGHNFHNTDMSYVKKLCGTFL--GGPKAPERVDFAE 76
Query: 273 RVRLPTTFDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAITDRYCIHSNGTQTPRISAE 452
+ LP TFD R QWP C +I EIRDQ +CGSCWAFGAVEAI+DR C+H+N + +SAE
Sbjct: 77 DMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAE 136
Query: 453 DLLTCCGFRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGAHLGCQDYAFPKCSHHVIGPYP 632
DLL+CCGF CG GCNGG+PSGAW YW GLV+GG Y +H+GC+ Y P C HHV G P
Sbjct: 137 DLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRAYTIPPCEHHVNGSRP 196
Query: 633 NCTGE-FPTPKCKKACQAGYSKTYAEDKQYGKSSYSVDSNQQAIMQEILTNGPVEAAFSV 809
CTGE TP+C + C+ GYS +Y EDK YG +SY V +++ IM EI NGPVE AF V
Sbjct: 197 PCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIV 256
Query: 810 YAD 818
Y D
Sbjct: 257 YED 259
>sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase precursor (Antigen Sj31)
Length = 342
Score = 264 bits (674), Expect = 3e-70
Identities = 124/242 (51%), Positives = 159/242 (65%), Gaps = 5/242 (2%)
Frame = +3
Query: 108 PLSFDLINYVNYVAQTTWKAGPTTRFQSISDIRKVLGVMKDPNNFKLPKRKPLLNR---- 275
PLS ++I+++N WKA + RF S+ D R ++G K+ K R+P ++
Sbjct: 29 PLSDEMISFINEHPDAGWKADKSDRFHSLDDARILMGARKEDAEMKR-NRRPTVDHHDLN 87
Query: 276 VRLPTTFDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAITDRYCIHSNGTQTPRISAED 455
V +P+ FD+R +WP CKSI +IRDQS CGSCWAFGAVEA+TDR CI S G Q+ +SA D
Sbjct: 88 VEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALD 147
Query: 456 LLTCCGFRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGAHLGCQDYAFPKCSHHVIGPYPN 635
L++CC CGDGC GGFP AW YWV G+VTGG H GCQ Y FPKC HH G YP
Sbjct: 148 LISCCK-DCGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHTKGKYPA 206
Query: 636 C-TGEFPTPKCKKACQAGYSKTYAEDKQYGKSSYSVDSNQQAIMQEILTNGPVEAAFSVY 812
C T + TP+CK+ CQ GY Y +DK YG SY+V +N++ I ++I+ GPVEAAF VY
Sbjct: 207 CGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQNNEKVIQRDIMMYGPVEAAFDVY 266
Query: 813 AD 818
D
Sbjct: 267 ED 268
>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase precursor (Antigen Sm31)
Length = 340
Score = 246 bits (628), Expect = 5e-65
Identities = 117/242 (48%), Positives = 156/242 (64%), Gaps = 5/242 (2%)
Frame = +3
Query: 108 PLSFDLINYVNYVAQTTWKAGPTTRFQSISDIRKVLGVMKDPNNFKLPKRKPLLNR---- 275
PLS D+I+Y+N W+A + RF S+ D R +G ++ + + KR+P ++
Sbjct: 28 PLSDDIISYINEHPNAGWRAEKSNRFHSLDDARIQMGARREEPDLRR-KRRPTVDHNDWN 86
Query: 276 VRLPTTFDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAITDRYCIHSNGTQTPRISAED 455
V +P+ FD+R +WP CKSI IRDQS CGSCW+FGAVEA++DR CI S G Q +SA D
Sbjct: 87 VEIPSNFDSRKKWPGCKSIATIRDQSRCGSCWSFGAVEAMSDRSCIQSGGKQNVELSAVD 146
Query: 456 LLTCCGFRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGAHLGCQDYAFPKCSHHVIGPYPN 635
LLTCC CG GC GG AW YWV +G+VT H GC+ Y FPKC HH G YP
Sbjct: 147 LLTCCE-SCGLGCEGGILGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCEHHTKGKYPP 205
Query: 636 CTGE-FPTPKCKKACQAGYSKTYAEDKQYGKSSYSVDSNQQAIMQEILTNGPVEAAFSVY 812
C + + TP+CK+ CQ Y Y +DK GKSSY+V ++++AI +EI+ GPVEA+F+VY
Sbjct: 206 CGSKIYNTPRCKQTCQRKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVY 265
Query: 813 AD 818
D
Sbjct: 266 ED 267
>sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6 precursor (Cysteine
protease-related 6)
Length = 379
Score = 219 bits (559), Expect = 5e-57
Identities = 118/245 (48%), Positives = 155/245 (63%), Gaps = 12/245 (4%)
Frame = +3
Query: 120 DLINYVNYVAQTTWKAGPTTRFQSI-SDIRKVLGVMKDPNNFKLP-KRKPLLNRVR---- 281
DLI+YVN Q W A RF S+ + K + N+ +L K K L++ +
Sbjct: 45 DLIDYVNE-NQNLWTAKKQRRFSSVYGENDKAKWGLMGVNHVRLSVKGKQHLSKTKDLDL 103
Query: 282 -LPTTFDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAITDRYCIHSNGTQTPRISAEDL 458
+P +FD+R WPKC SI IRDQS+CGSCWAFGAVEA++DR CI S+G +SA+DL
Sbjct: 104 DIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDL 163
Query: 459 LTCCGFRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGAHLGCQDYAFPKCSHHV----IGP 626
L+CC CG GCNGG P AW YWV DG+VTG Y A+ GC+ Y FP C HH P
Sbjct: 164 LSCCK-SCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDP 222
Query: 627 YPNCTGEFPTPKCKKACQAGYS-KTYAEDKQYGKSSYSVDSNQQAIMQEILTNGPVEAAF 803
P+ +PTPKC+K C + Y+ KTY+EDK +G S+Y V + +AI +E++T+GP+E AF
Sbjct: 223 CPH--DLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAF 280
Query: 804 SVYAD 818
VY D
Sbjct: 281 EVYED 285
>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 precursor (Cysteine
protease-related 5)
Length = 344
Score = 201 bits (511), Expect = 2e-51
Identities = 108/241 (44%), Positives = 139/241 (57%), Gaps = 9/241 (3%)
Frame = +3
Query: 123 LINYVNYVAQTTWKAGPTTRFQSISDIRKVLGVMKDPNNFKLPKRKPLLNRV---RLPTT 293
LI+YVN AQ W AG + K+ + D K + ++ +P
Sbjct: 32 LIDYVNS-AQKLWTAG-----HQVIPKEKITKKLMDVKYLVPHKDEDIVATEVSDAIPDH 85
Query: 294 FDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAITDRYCIHSNGTQTPRISAEDLLTCCG 473
FDAR QWP C SI IRDQS+CGSCWAF A EAI+DR CI SNG +S+EDLL+CC
Sbjct: 86 FDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSEDLLSCCT 145
Query: 474 --FRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGAHLGCQDYAFPKCSHHVIG-PYPNCTG 644
F CG+GC GG+P AW +WV GLVTGG Y GC+ Y+ C V G +P C
Sbjct: 146 GMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGVKWPACPE 205
Query: 645 EF-PTPKCKKACQA--GYSKTYAEDKQYGKSSYSVDSNQQAIMQEILTNGPVEAAFSVYA 815
+ PTPKC +C + Y+ Y +DK +G ++Y+V + I EILTNGP+E AF+VY
Sbjct: 206 DTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEVAFTVYE 265
Query: 816 D 818
D
Sbjct: 266 D 266
>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase precursor
Length = 329
Score = 199 bits (507), Expect = 6e-51
Identities = 95/179 (53%), Positives = 118/179 (65%)
Frame = +3
Query: 282 LPTTFDARVQWPKCKSIGEIRDQSNCGSCWAFGAVEAITDRYCIHSNGTQTPRISAEDLL 461
+P TFD+R QW +CKSI IRDQ+ CGSCWAFGA E I+DR CI + G Q P IS +DLL
Sbjct: 85 VPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISPDDLL 144
Query: 462 TCCGFRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGAHLGCQDYAFPKCSHHVIGPYPNCT 641
+CCG CG+GC GG+P A +W + G+VTGG+Y GC+ Y C+ NC
Sbjct: 145 SCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHG-AGCKPYPIAPCTS------GNCP 197
Query: 642 GEFPTPKCKKACQAGYSKTYAEDKQYGKSSYSVDSNQQAIMQEILTNGPVEAAFSVYAD 818
E TP C +CQ+GYS YA+DK +G S+Y+V N +I EI NGPVEAAFSVY D
Sbjct: 198 -ESKTPSCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYED 255
Database: Non-redundant SwissProt sequences
Posted date: Dec 6, 2005 7:40 AM
Number of letters in database: 68,354,980
Number of sequences in database: 184,735
Database: swissprot.01
Posted date: Dec 6, 2005 8:18 AM
Number of letters in database: 66,202,850
Number of sequences in database: 184,431
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 98,427,794
Number of Sequences: 369166
Number of extensions: 2161148
Number of successful extensions: 6694
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 6198
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 6534
length of database: 68,354,980
effective HSP length: 109
effective length of database: 48,218,865
effective search space used: 7859674995
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)