Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= Dr_sW_028_K05
(581 letters)
Database: Non-redundant SwissProt sequences
184,735 sequences; 68,354,980 total letters
                                                                Score    E
Sequences producing significant alignments:                     (bits) Value
sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precu...   194   1e-49
sp|Q9GLE3|CATK_PIG Cathepsin K precursor                         183   2e-46
sp|P43235|CATK_HUMAN Cathepsin K precursor (Cathepsin O) (C...   183   3e-46
sp|P61277|CATK_MACMU Cathepsin K precursor >gi|47117667|sp|...   183   3e-46
sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precu...   183   3e-46
sp|O35186|CATK_RAT Cathepsin K precursor                         182   4e-46
sp|P43236|CATK_RABIT Cathepsin K precursor (OC-2 protein)        182   5e-46
sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted ...   181   1e-45
sp|P55097|CATK_MOUSE Cathepsin K precursor                       180   2e-45
sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted pr...   180   3e-45
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precursor
Length = 323
Score = 194 bits (493), Expect = 1e-49
Identities = 95/166 (57%), Positives = 116/166 (69%), Gaps = 4/166 (2%)
Frame = +3
Query: 3 QLVDCVTKNS--GCNGGWMNIAFEYI-SSHGIESEDNYPYQAKQGNCVFDKSKVVANCKG 173
QLVDC GCNGGWMN AF+YI +++GI++E YPY+A+ G+C FD + V A C G
Sbjct: 158 QLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPYEARDGSCRFDSNSVAATCSG 217
Query: 174 FQNINSCNEKDLAVAVATVGPISVAIDVGYS-FQQYKQGVYYEAKCDPTIQNHAVLVVGY 350
NI S +E L AV +GPISV ID +S FQ Y GVYYE C P+ +HAVL VGY
Sbjct: 218 HTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPSYLDHAVLAVGY 277
Query: 351 GVENGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFPIV 488
G E G +WLVKNSW SWG GYIKMS++R+NNCGIAT AS+P+V
Sbjct: 278 GSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVASYPLV 323
>sp|Q9GLE3|CATK_PIG Cathepsin K precursor
Length = 330
Score = 183 bits (465), Expect = 2e-46
Identities = 84/161 (52%), Positives = 114/161 (70%), Gaps = 2/161 (1%)
Frame = +3
Query: 6 LVDCVTKNSGCNGGWMNIAFEYISSH-GIESEDNYPYQAKQGNCVFDKSKVVANCKGFQN 182
LVDCV++N GC GG+M AF+Y+ + GI+SED YPY + NC+++ + A C+G++
Sbjct: 168 LVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDENCMYNPTGKAAKCRGYRE 227
Query: 183 INSCNEKDLAVAVATVGPISVAIDVGY-SFQQYKQGVYYEAKCDPTIQNHAVLVVGYGVE 359
I NEK L AVA VGP+SVAID SFQ Y +GVYY+ C+ NHAVL VGYG++
Sbjct: 228 IPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQ 287
Query: 360 NGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFP 482
G K+W++KNSWG +WG GYI M+++++N CGIA ASFP
Sbjct: 288 KGKKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFP 328
>sp|P43235|CATK_HUMAN Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2)
Length = 329
Score = 183 bits (464), Expect = 3e-46
Identities = 83/161 (51%), Positives = 116/161 (72%), Gaps = 2/161 (1%)
Frame = +3
Query: 6 LVDCVTKNSGCNGGWMNIAFEYISSH-GIESEDNYPYQAKQGNCVFDKSKVVANCKGFQN 182
LVDCV++N GC GG+M AF+Y+ + GI+SED YPY ++ +C+++ + A C+G++
Sbjct: 167 LVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYRE 226
Query: 183 INSCNEKDLAVAVATVGPISVAIDVGY-SFQQYKQGVYYEAKCDPTIQNHAVLVVGYGVE 359
I NEK L AVA VGP+SVAID SFQ Y +GVYY+ C+ NHAVL VGYG++
Sbjct: 227 IPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQ 286
Query: 360 NGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFP 482
G+K+W++KNSWG +WG GYI M+++++N CGIA ASFP
Sbjct: 287 KGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>sp|P61277|CATK_MACMU Cathepsin K precursor
sp|P61276|CATK_MACFA Cathepsin K precursor
Length = 329
Score = 183 bits (464), Expect = 3e-46
Identities = 83/161 (51%), Positives = 116/161 (72%), Gaps = 2/161 (1%)
Frame = +3
Query: 6 LVDCVTKNSGCNGGWMNIAFEYISSH-GIESEDNYPYQAKQGNCVFDKSKVVANCKGFQN 182
LVDCV++N GC GG+M AF+Y+ + GI+SED YPY ++ +C+++ + A C+G++
Sbjct: 167 LVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYRE 226
Query: 183 INSCNEKDLAVAVATVGPISVAIDVGY-SFQQYKQGVYYEAKCDPTIQNHAVLVVGYGVE 359
I NEK L AVA VGP+SVAID SFQ Y +GVYY+ C+ NHAVL VGYG++
Sbjct: 227 IPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQ 286
Query: 360 NGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFP 482
G+K+W++KNSWG +WG GYI M+++++N CGIA ASFP
Sbjct: 287 KGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFP 327
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precursor
Length = 321
Score = 183 bits (464), Expect = 3e-46
Identities = 92/166 (55%), Positives = 112/166 (67%), Gaps = 4/166 (2%)
Frame = +3
Query: 3 QLVDCVTK--NSGCNGGWMNIAFEYISSHG-IESEDNYPYQAKQGNCVFDKSKVVANCKG 173
QLVDC T N GC GGWM AF+YI +G I++E +YPY+A+ +C FD + + A C G
Sbjct: 157 QLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEAEDRSCRFDANSIGAICTG 216
Query: 174 FQNINSCNEKDLAVAVATVGPISVAIDVG-YSFQQYKQGVYYEAKCDPTIQNHAVLVVGY 350
+ E+ L AV+ VGPISVAID +SFQ Y GVYYE C PT +H VL VGY
Sbjct: 217 SVEVQH-TEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQNCSPTFLDHGVLAVGY 275
Query: 351 GVENGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFPIV 488
G E+ YWLVKNSWG SWG GYIKMS++RDNNCGIA+ S+P V
Sbjct: 276 GTESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNNCGIASEPSYPTV 321
>sp|O35186|CATK_RAT Cathepsin K precursor
Length = 329
Score = 182 bits (463), Expect = 4e-46
Identities = 83/161 (51%), Positives = 114/161 (70%), Gaps = 2/161 (1%)
Frame = +3
Query: 6 LVDCVTKNSGCNGGWMNIAFEYISSHG-IESEDNYPYQAKQGNCVFDKSKVVANCKGFQN 182
LVDCV++N GC GG+M AF+Y+ +G I+SED YPY + +C+++ + A C+G++
Sbjct: 167 LVDCVSENYGCGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYRE 226
Query: 183 INSCNEKDLAVAVATVGPISVAIDVGY-SFQQYKQGVYYEAKCDPTIQNHAVLVVGYGVE 359
I NEK L AVA VGP+SV+ID SFQ Y +GVYY+ CD NHAVLVVGYG +
Sbjct: 227 IPVGNEKALKRAVARVGPVSVSIDASLTSFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQ 286
Query: 360 NGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFP 482
G+KYW++KNSWG SWG GY+ ++++++N CGI ASFP
Sbjct: 287 KGNKYWIIKNSWGESWGNKGYVLLARNKNNACGITNLASFP 327
>sp|P43236|CATK_RABIT Cathepsin K precursor (OC-2 protein)
Length = 329
Score = 182 bits (462), Expect = 5e-46
Identities = 84/161 (52%), Positives = 114/161 (70%), Gaps = 2/161 (1%)
Frame = +3
Query: 6 LVDCVTKNSGCNGGWMNIAFEYIS-SHGIESEDNYPYQAKQGNCVFDKSKVVANCKGFQN 182
LVDCV++N GC GG+M AF+Y+ + GI+SED YPY + +C+++ + A C+G++
Sbjct: 167 LVDCVSENYGCGGGYMTNAFQYVQRNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYRE 226
Query: 183 INSCNEKDLAVAVATVGPISVAIDVGY-SFQQYKQGVYYEAKCDPTIQNHAVLVVGYGVE 359
I NEK L AVA VGP+SVAID SFQ Y +GVYY+ C NHAVL VGYG++
Sbjct: 227 IPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCSSDNVNHAVLAVGYGIQ 286
Query: 360 NGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFP 482
G+K+W++KNSWG SWG GYI M+++++N CGIA ASFP
Sbjct: 287 KGNKHWIIKNSWGESWGNKGYILMARNKNNACGIANLASFP 327
>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
proteinase) [Contains: Cathepsin L heavy chain;
Cathepsin L light chain]
Length = 334
Score = 181 bits (459), Expect = 1e-45
Identities = 93/169 (55%), Positives = 116/169 (68%), Gaps = 8/169 (4%)
Frame = +3
Query: 6 LVDC--VTKNSGCNGGWMNIAFEYISSHG-IESEDNYPYQAKQGNCVFDKSKVVANCKGF 176
LVDC N GCNGG M+ AF+YI +G ++SE++YPY+AK G+C + VAN GF
Sbjct: 166 LVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGF 225
Query: 177 QNINSCNEKDLAVAVATVGPISVAIDVGY-SFQQYKQGVYYEAKCDPTIQNHAVLVVGYG 353
+I EK L AVATVGPISVA+D + S Q Y G+YYE C +H VL+VGYG
Sbjct: 226 VDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYG 284
Query: 354 VE----NGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFPIV 488
E N +KYWLVKNSWG WGM GYIK++KDRDN+CG+AT AS+P+V
Sbjct: 285 YEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>sp|P55097|CATK_MOUSE Cathepsin K precursor
Length = 329
Score = 180 bits (457), Expect = 2e-45
Identities = 84/161 (52%), Positives = 112/161 (69%), Gaps = 2/161 (1%)
Frame = +3
Query: 6 LVDCVTKNSGCNGGWMNIAFEYISSHG-IESEDNYPYQAKQGNCVFDKSKVVANCKGFQN 182
LVDCVT+N GC GG+M AF+Y+ +G I+SED YPY + +C+++ + A C+G++
Sbjct: 167 LVDCVTENYGCGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYRE 226
Query: 183 INSCNEKDLAVAVATVGPISVAIDVGY-SFQQYKQGVYYEAKCDPTIQNHAVLVVGYGVE 359
I NEK L AVA VGPISV+ID SFQ Y +GVYY+ CD NHAVLVVGYG +
Sbjct: 227 IPVGNEKALKRAVARVGPISVSIDASLASFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQ 286
Query: 360 NGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFP 482
G K+W++KNSWG SWG GY ++++++N CGI ASFP
Sbjct: 287 KGSKHWIIKNSWGESWGNKGYALLARNKNNACGITNMASFP 327
>sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted protein) (MEP) (Cyclic
protein 2) (CP-2) [Contains: Cathepsin L heavy chain;
Cathepsin L light chain]
Length = 334
Score = 180 bits (456), Expect = 3e-45
Identities = 94/169 (55%), Positives = 116/169 (68%), Gaps = 8/169 (4%)
Frame = +3
Query: 6 LVDCVTK--NSGCNGGWMNIAFEYISSHG-IESEDNYPYQAKQGNCVFDKSKVVANCKGF 176
LVDC N GCNGG M+ AF+YI +G ++SE++YPY+AK G+C + VAN GF
Sbjct: 166 LVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGF 225
Query: 177 QNINSCNEKDLAVAVATVGPISVAIDVGY-SFQQYKQGVYYEAKCDPTIQNHAVLVVGYG 353
+I EK L AVATVGPISVA+D + S Q Y G+YYE C +H VLVVGYG
Sbjct: 226 VDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYG 284
Query: 354 VE----NGHKYWLVKNSWGPSWGMNGYIKMSKDRDNNCGIATTASFPIV 488
E N KYWLVKNSWG WGM+GYIK++KDR+N+CG+AT AS+PIV
Sbjct: 285 YEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIV 333
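The header lines of each alignment above can be cross-checked against its coordinates. The sketch below is a minimal illustration, assuming that query coordinates in a "Frame = +3" alignment count nucleotides while subject coordinates count amino acids, and that the percentages shown are rounded down (which is what the figures in this report suggest). The function names are hypothetical helpers, not part of any BLAST tool.

```python
def plus_strand_frame(nt_start):
    """Reading frame (1, 2 or 3) implied by a 1-based plus-strand start."""
    return (nt_start - 1) % 3 + 1


def codons_spanned(nt_start, nt_end):
    """Number of whole codons between two 1-based nucleotide coordinates."""
    span = nt_end - nt_start + 1
    if span <= 0 or span % 3 != 0:
        raise ValueError("span is not a positive multiple of three")
    return span // 3


def percent_floor(count, total):
    """Integer percentage rounded down (assumed from this report's figures)."""
    return count * 100 // total


# First query row of the top hit (CYSP2_HOMAM), copied from the report.
first_row = "QLVDCVTKNS--GCNGGWMNIAFEYI-SSHGIESEDNYPYQAKQGNCVFDKSKVVANCKG"
residues_in_row = len(first_row.replace("-", ""))

print("frame for start 3:", plus_strand_frame(3))
print("frame for start 6:", plus_strand_frame(6))
print("codons in 3..173:", codons_spanned(3, 173), "residues:", residues_in_row)
print("codons in 3..488:", codons_spanned(3, 488))
print("identities %:", percent_floor(95, 166))
print("positives %:", percent_floor(116, 166))
```

For the top hit, the query span 3..488 gives 162 codons, which agrees with an alignment length of 166 columns minus the 4 reported gaps, all of which fall in the query rows.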
Database: Non-redundant SwissProt sequences
Posted date: Dec 6, 2005 7:40 AM
Number of letters in database: 68,354,980
Number of sequences in database: 184,735
Database: swissprot.01
Posted date: Dec 6, 2005 8:18 AM
Number of letters in database: 66,202,850
Number of sequences in database: 184,431
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 69,720,567
Number of Sequences: 369166
Number of extensions: 1425456
Number of successful extensions: 3920
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3376
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3513
length of database: 68,354,980
effective HSP length: 105
effective length of database: 48,957,805
effective search space used: 4308286840
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
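The parameters above are enough to relate the raw scores in parentheses to the reported bit scores and E-values. The sketch below uses the standard Karlin-Altschul relations, bits = (lambda * S - ln K) / ln 2 and E = search_space * 2^(-bits). It assumes that the gapped Lambda and K apply to the gapped alignments and that the "effective search space used" is the right size term. Because the printed Lambda and K are rounded, the results only approximate what the program computed internally.

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance alignments scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


# Values copied from the report footer.
GAPPED_LAMBDA, GAPPED_K = 0.267, 0.0410
UNGAPPED_LAMBDA, UNGAPPED_K = 0.318, 0.134
SEARCH_SPACE = 4308286840

# Top hit: "Score = 194 bits (493), Expect = 1e-49".
top_bits = bit_score(493, GAPPED_LAMBDA, GAPPED_K)
top_e = expect_value(top_bits, SEARCH_SPACE)

# Threshold line: "S1: 41 (21.7 bits)", computed with the ungapped parameters.
s1_bits = bit_score(41, UNGAPPED_LAMBDA, UNGAPPED_K)

print(f"top hit: {top_bits:.1f} bits, E = {top_e:.1e}")
print(f"S1: {s1_bits:.1f} bits")
```

With these inputs the top hit comes out near 194.5 bits, so the report's "194" looks like a truncated rather than rounded value, and the E-value lands at about 1e-49, in line with the first row of the hit list.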