Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= Dr_sW_027_G02
(797 letters)
Database: Non-redundant SwissProt sequences
184,735 sequences; 68,354,980 total letters
                                                                  Score   E
Sequences producing significant alignments:                      (bits) Value
sp|P43236|CATK_RABIT Cathepsin K precursor (OC-2 protein)         248   1e-65
sp|Q9GLE3|CATK_PIG Cathepsin K precursor                          248   2e-65
sp|P61277|CATK_MACMU Cathepsin K precursor >gi|47117667|sp|...    245   9e-65
sp|P43235|CATK_HUMAN Cathepsin K precursor (Cathepsin O) (C...    242   7e-64
sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathe...    241   2e-63
sp|O35186|CATK_RAT Cathepsin K precursor                          240   3e-63
sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted ...    239   5e-63
sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) ...    239   8e-63
sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted pr...    238   1e-62
sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Catheps...    238   2e-62
>sp|P43236|CATK_RABIT Cathepsin K precursor (OC-2 protein)
Length = 329
Score = 248 bits (634), Expect = 1e-65
Identities = 126/256 (49%), Positives = 168/256 (65%), Gaps = 4/256 (1%)
Frame = +1
Query: 40 LNDDWESYKIKFGKKYES-LNEISRRLIWESNLKYIQKHNIESDLGKHTYTLGLNHFADM 216
L+ WE +K + K+Y S ++EISRRLIWE NLK+I HN+E+ LG HTY L +NH DM
Sbjct: 22 LDTQWELWKKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDM 81
Query: 217 TNEEFRAKY--LSVPPSRKKISTVFMAPKNMGKLPDTVDWRTEGYVTPVKNQEQCGSCWS 390
T+EE K L VPPSR + P G+ PD++D+R +GYVTPVKNQ QCGSCW+
Sbjct: 82 TSEEVVQKMTGLKVPPSRSHSNDTLYIPDWEGRTPDSIDYRKKGYVTPVKNQGQCGSCWA 141
Query: 391 FSATGSLEGQHFRKTGNLTSFSEQQLVDXXXXXXXXXXXXXLMDNAFEYIEK-FGIESED 567
FS+ G+LEGQ +KTG L + S Q LVD M NAF+Y+++ GI+SED
Sbjct: 142 FSSVGALEGQLKKKTGKLLNLSPQNLVD--CVSENYGCGGGYMTNAFQYVQRNRGIDSED 199
Query: 568 SYPYTAEDGTCLYDKSKVVGSCTGYVDIPEGSETSLATAAATVGPISVAIDASNYSFQLY 747
+YPY +D +C+Y+ + C GY +IPEG+E +L A A VGP+SVAIDAS SFQ Y
Sbjct: 200 AYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFY 259
Query: 748 KSGIYNEPDCSSTQLD 795
G+Y + +CSS ++
Sbjct: 260 SKGVYYDENCSSDNVN 275
>sp|Q9GLE3|CATK_PIG Cathepsin K precursor
Length = 330
Score = 248 bits (632), Expect = 2e-65
Identities = 126/256 (49%), Positives = 167/256 (65%), Gaps = 4/256 (1%)
Frame = +1
Query: 40 LNDDWESYKIKFGKKYES-LNEISRRLIWESNLKYIQKHNIESDLGKHTYTLGLNHFADM 216
L+ WE +K + K+Y S ++EISRRLIWE NLK+I HN+E+ LG HTY L +NH DM
Sbjct: 23 LDTQWELWKKTYRKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDM 82
Query: 217 TNEEFRAKY--LSVPPSRKKISTVFMAPKNMGKLPDTVDWRTEGYVTPVKNQEQCGSCWS 390
T+EE K L VPPS + + P G+ PD++D+R +GYVTPVKNQ QCGSCW+
Sbjct: 83 TSEEVVQKMTGLKVPPSHSRSNDTLYIPDWEGRTPDSIDYRKKGYVTPVKNQGQCGSCWA 142
Query: 391 FSATGSLEGQHFRKTGNLTSFSEQQLVDXXXXXXXXXXXXXLMDNAFEYIEK-FGIESED 567
FS+ G+LEGQ +KTG L + S Q LVD M NAF+Y++K GI+SED
Sbjct: 143 FSSVGALEGQLKKKTGKLLNLSPQNLVD--CVSENDGCGGGYMTNAFQYVQKNRGIDSED 200
Query: 568 SYPYTAEDGTCLYDKSKVVGSCTGYVDIPEGSETSLATAAATVGPISVAIDASNYSFQLY 747
+YPY +D C+Y+ + C GY +IPEG+E +L A A VGP+SVAIDAS SFQ Y
Sbjct: 201 AYPYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFY 260
Query: 748 KSGIYNEPDCSSTQLD 795
G+Y + +C+S L+
Sbjct: 261 SKGVYYDENCNSDNLN 276
>sp|P61277|CATK_MACMU Cathepsin K precursor
sp|P61276|CATK_MACFA Cathepsin K precursor
Length = 329
Score = 245 bits (626), Expect = 9e-65
Identities = 127/256 (49%), Positives = 165/256 (64%), Gaps = 4/256 (1%)
Frame = +1
Query: 40 LNDDWESYKIKFGKKYES-LNEISRRLIWESNLKYIQKHNIESDLGKHTYTLGLNHFADM 216
L+ WE +K K+Y S ++EISRRLIWE NLKYI HN+E+ LG HTY L +NH DM
Sbjct: 22 LDTHWELWKKTHRKQYNSKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDM 81
Query: 217 TNEEFRAKY--LSVPPSRKKISTVFMAPKNMGKLPDTVDWRTEGYVTPVKNQEQCGSCWS 390
TNEE K L VP S + + P G+ PD+VD+R +GYVTPVKNQ QCGSCW+
Sbjct: 82 TNEEVVQKMTGLKVPASHSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWA 141
Query: 391 FSATGSLEGQHFRKTGNLTSFSEQQLVDXXXXXXXXXXXXXLMDNAFEYIEKF-GIESED 567
FS+ G+LEGQ +KTG L + S Q LVD M NAF+Y++K GI+SED
Sbjct: 142 FSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGY--MTNAFQYVQKNRGIDSED 199
Query: 568 SYPYTAEDGTCLYDKSKVVGSCTGYVDIPEGSETSLATAAATVGPISVAIDASNYSFQLY 747
+YPY ++ +C+Y+ + C GY +IPEG+E +L A A VGP+SVAIDAS SFQ Y
Sbjct: 200 AYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFY 259
Query: 748 KSGIYNEPDCSSTQLD 795
G+Y + C+S L+
Sbjct: 260 SKGVYYDESCNSDNLN 275
>sp|P43235|CATK_HUMAN Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2)
Length = 329
Score = 242 bits (618), Expect = 7e-64
Identities = 125/256 (48%), Positives = 166/256 (64%), Gaps = 4/256 (1%)
Frame = +1
Query: 40 LNDDWESYKIKFGKKYES-LNEISRRLIWESNLKYIQKHNIESDLGKHTYTLGLNHFADM 216
L+ WE +K K+Y + ++EISRRLIWE NLKYI HN+E+ LG HTY L +NH DM
Sbjct: 22 LDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDM 81
Query: 217 TNEEFRAKY--LSVPPSRKKISTVFMAPKNMGKLPDTVDWRTEGYVTPVKNQEQCGSCWS 390
T+EE K L VP S + + P+ G+ PD+VD+R +GYVTPVKNQ QCGSCW+
Sbjct: 82 TSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWA 141
Query: 391 FSATGSLEGQHFRKTGNLTSFSEQQLVDXXXXXXXXXXXXXLMDNAFEYIEKF-GIESED 567
FS+ G+LEGQ +KTG L + S Q LVD M NAF+Y++K GI+SED
Sbjct: 142 FSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGY--MTNAFQYVQKNRGIDSED 199
Query: 568 SYPYTAEDGTCLYDKSKVVGSCTGYVDIPEGSETSLATAAATVGPISVAIDASNYSFQLY 747
+YPY ++ +C+Y+ + C GY +IPEG+E +L A A VGP+SVAIDAS SFQ Y
Sbjct: 200 AYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFY 259
Query: 748 KSGIYNEPDCSSTQLD 795
G+Y + C+S L+
Sbjct: 260 SKGVYYDESCNSDNLN 275
>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 333
Score = 241 bits (615), Expect = 2e-63
Identities = 128/257 (49%), Positives = 158/257 (61%), Gaps = 2/257 (0%)
Frame = +1
Query: 31 NSELNDDWESYKIKFGKKYESLNEISRRLIWESNLKYIQKHNIESDLGKHTYTLGLNHFA 210
+ LN W +K + Y E RR +WE N+K I+ HN E GKH +T+ +N F
Sbjct: 22 DQSLNAQWYQWKATHRRLYGMNEEGWRRAVWEKNMKMIELHNREYSQGKHGFTMAMNAFG 81
Query: 211 DMTNEEFRAKYLSVPPSRKKISTVFMAPKNMGKLPDTVDWRTEGYVTPVKNQEQCGSCWS 390
DMTNEEFR + K +F P ++P +VDWR +GYVTPVKNQ QCGSCW+
Sbjct: 82 DMTNEEFRQVMNGFQNQKHKKGKMFQEPL-FAEIPKSVDWREKGYVTPVKNQGQCGSCWA 140
Query: 391 FSATGSLEGQHFRKTGNLTSFSEQQLVDXXXXXXXXXXXXXLMDNAFEYI-EKFGIESED 567
FSATG+LEGQ FRKTG L S SEQ LVD LMDNAF Y+ + G++SE+
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRYVKDNGGLDSEE 200
Query: 568 SYPYTAED-GTCLYDKSKVVGSCTGYVDIPEGSETSLATAAATVGPISVAIDASNYSFQL 744
SYPY D TC Y + TG+VD+P+ E +L A AT+GPISVAIDA + SFQ
Sbjct: 201 SYPYLGRDTETCNYKPECSAANDTGFVDLPQ-REKALMKAVATLGPISVAIDAGHQSFQF 259
Query: 745 YKSGIYNEPDCSSTQLD 795
YKSGIY +PDCSS LD
Sbjct: 260 YKSGIYFDPDCSSKDLD 276
>sp|O35186|CATK_RAT Cathepsin K precursor
Length = 329
Score = 240 bits (613), Expect = 3e-63
Identities = 124/261 (47%), Positives = 166/261 (63%), Gaps = 4/261 (1%)
Frame = +1
Query: 7 VAPHKLTVNSELNDDWESYKIKFGKKYES-LNEISRRLIWESNLKYIQKHNIESDLGKHT 183
V L+ L+ WE +K GK+Y S ++EISRRLIWE NLK I HN+E+ LG HT
Sbjct: 11 VVSFALSPEETLDTQWELWKKTHGKQYNSKVDEISRRLIWEKNLKKISVHNLEASLGAHT 70
Query: 184 YTLGLNHFADMTNEEFRAKY--LSVPPSRKKISTVFMAPKNMGKLPDTVDWRTEGYVTPV 357
Y L +NH DMT+EE K L VPPSR + P+ G++PD++D+R +GYVTPV
Sbjct: 71 YELAMNHLGDMTSEEVVQKMTGLRVPPSRSFSNDTLYTPEWEGRVPDSIDYRKKGYVTPV 130
Query: 358 KNQEQCGSCWSFSATGSLEGQHFRKTGNLTSFSEQQLVDXXXXXXXXXXXXXLMDNAFEY 537
KNQ QCGSCW+FS+ G+LEGQ +KTG L + S Q LVD M AF+Y
Sbjct: 131 KNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVD--CVSENYGCGGGYMTTAFQY 188
Query: 538 IEK-FGIESEDSYPYTAEDGTCLYDKSKVVGSCTGYVDIPEGSETSLATAAATVGPISVA 714
+++ GI+SED+YPY +D +C+Y+ + C GY +IP G+E +L A A VGP+SV+
Sbjct: 189 VQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYREIPVGNEKALKRAVARVGPVSVS 248
Query: 715 IDASNYSFQLYKSGIYNEPDC 777
IDAS SFQ Y G+Y + +C
Sbjct: 249 IDASLTSFQFYSRGVYYDENC 269
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
Cathepsin L heavy chain; Cathepsin L light chain]
Length = 333
Score = 239 bits (611), Expect = 5e-63
Identities = 128/264 (48%), Positives = 162/264 (61%), Gaps = 1/264 (0%)
Frame = +1
Query: 7 VAPHKLTVNSELNDDWESYKIKFGKKYESLNEISRRLIWESNLKYIQKHNIESDLGKHTY 186
+A LT + L W +K + Y E RR +WE N+K I+ HN E GKH++
Sbjct: 14 IASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSF 73
Query: 187 TLGLNHFADMTNEEFRAKYLSVPPSRKKISTVFMAPKNMGKLPDTVDWRTEGYVTPVKNQ 366
T+ +N F DMT+EEFR + + VF P + P +VDWR +GYVTPVKNQ
Sbjct: 74 TMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPL-FYEAPRSVDWREKGYVTPVKNQ 132
Query: 367 EQCGSCWSFSATGSLEGQHFRKTGNLTSFSEQQLVDXXXXXXXXXXXXXLMDNAFEYI-E 543
QCGSCW+FSATG+LEGQ FRKTG L S SEQ LVD LMD AF+Y+ +
Sbjct: 133 GQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQD 192
Query: 544 KFGIESEDSYPYTAEDGTCLYDKSKVVGSCTGYVDIPEGSETSLATAAATVGPISVAIDA 723
G++SE+SYPY A + +C Y+ V + TG+VDIP+ E +L A ATVGPISVAIDA
Sbjct: 193 NGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKALMKAVATVGPISVAIDA 251
Query: 724 SNYSFQLYKSGIYNEPDCSSTQLD 795
+ SF YK GIY EPDCSS +D
Sbjct: 252 GHESFLFYKEGIYFEPDCSSEDMD 275
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
Length = 334
Score = 239 bits (609), Expect = 8e-63
Identities = 126/256 (49%), Positives = 156/256 (60%), Gaps = 1/256 (0%)
Frame = +1
Query: 31 NSELNDDWESYKIKFGKKYESLNEISRRLIWESNLKYIQKHNIESDLGKHTYTLGLNHFA 210
+ L+ W +K + Y + E RR +WE N+K I+ HN E GKH +T+ +N F
Sbjct: 22 DQNLDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMAMNAFG 81
Query: 211 DMTNEEFRAKYLSVPPSRKKISTVFMAPKNMGKLPDTVDWRTEGYVTPVKNQEQCGSCWS 390
DMTNEEFR + + VF P + LP +VDWR +GYVTPVKNQ+QCGSCW+
Sbjct: 82 DMTNEEFRQMMGCFRNQKFRKGKVFREPLFLD-LPKSVDWRKKGYVTPVKNQKQCGSCWA 140
Query: 391 FSATGSLEGQHFRKTGNLTSFSEQQLVDXXXXXXXXXXXXXLMDNAFEYI-EKFGIESED 567
FSATG+LEGQ FRKTG L S SEQ LVD M AF+Y+ E G++SE+
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEE 200
Query: 568 SYPYTAEDGTCLYDKSKVVGSCTGYVDIPEGSETSLATAAATVGPISVAIDASNYSFQLY 747
SYPY A D C Y V + TG+ + G E +L A ATVGPISVA+DA + SFQ Y
Sbjct: 201 SYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFY 260
Query: 748 KSGIYNEPDCSSTQLD 795
KSGIY EPDCSS LD
Sbjct: 261 KSGIYFEPDCSSKNLD 276
>sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted protein) (MEP) (Cyclic
protein 2) (CP-2) [Contains: Cathepsin L heavy chain;
Cathepsin L light chain]
Length = 334
Score = 238 bits (608), Expect = 1e-62
Identities = 129/252 (51%), Positives = 158/252 (62%), Gaps = 1/252 (0%)
Frame = +1
Query: 43 NDDWESYKIKFGKKYESLNEISRRLIWESNLKYIQKHNIESDLGKHTYTLGLNHFADMTN 222
N W +K + Y + E RR +WE N++ IQ HN E GKH +T+ +N F DMTN
Sbjct: 26 NAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTN 85
Query: 223 EEFRAKYLSVPPSRKKISTVFMAPKNMGKLPDTVDWRTEGYVTPVKNQEQCGSCWSFSAT 402
EEFR + K +F P M ++P TVDWR +G VTPVKNQ QCGSCW+FSA+
Sbjct: 86 EEFRQIVNGYRHQKHKKGRLFQEPL-MLQIPKTVDWREKGCVTPVKNQGQCGSCWAFSAS 144
Query: 403 GSLEGQHFRKTGNLTSFSEQQLVDXXXXXXXXXXXXXLMDNAFEYI-EKFGIESEDSYPY 579
G LEGQ F KTG L S SEQ LVD LMD AF+YI E G++SE+SYPY
Sbjct: 145 GCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPY 204
Query: 580 TAEDGTCLYDKSKVVGSCTGYVDIPEGSETSLATAAATVGPISVAIDASNYSFQLYKSGI 759
A+DG+C Y V + TG+VDIP+ E +L A ATVGPISVA+DAS+ S Q Y SGI
Sbjct: 205 EAKDGSCKYRAEYAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSSGI 263
Query: 760 YNEPDCSSTQLD 795
Y EP+CSS LD
Sbjct: 264 YYEPNCSSKDLD 275
>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 334
Score = 238 bits (606), Expect = 2e-62
Identities = 127/258 (49%), Positives = 161/258 (62%), Gaps = 2/258 (0%)
Frame = +1
Query: 28 VNSELNDDWESYKIKFGKKYESLNEISRRLIWESNLKYIQKHNIESDLGKHTYTLGLNHF 207
++ L+ DW +K G+ Y E RR +WE N+K I+ HN E GKH +++ +N F
Sbjct: 21 LDQNLDADWYKWKATHGRLYGMNEEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAF 80
Query: 208 ADMTNEEFRAKYLSVPPSRKKISTVFMAPKNMGKLPDTVDWRTEGYVTPVKNQEQCGSCW 387
DMTNEEFR + K VF + ++P +VDWR +GYVT VKNQ QCGSCW
Sbjct: 81 GDMTNEEFRQVMNGFQNQKHKKGKVFHESLVL-EVPKSVDWREKGYVTAVKNQGQCGSCW 139
Query: 388 SFSATGSLEGQHFRKTGNLTSFSEQQLVDXXXXXXXXXXXXXLMDNAFEYI-EKFGIESE 564
+FSATG+LEGQ FRKTG L S SEQ LVD LMDNAF+Y+ + G+++E
Sbjct: 140 AFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTE 199
Query: 565 DSYPYTA-EDGTCLYDKSKVVGSCTGYVDIPEGSETSLATAAATVGPISVAIDASNYSFQ 741
+SYPY E +C Y + TG+VDIP+ E +L A ATVGPISVAIDA + SFQ
Sbjct: 200 ESYPYLGRETNSCTYKPECSAANDTGFVDIPQ-REKALMKAVATVGPISVAIDAGHSSFQ 258
Query: 742 LYKSGIYNEPDCSSTQLD 795
YKSGIY +PDCSS LD
Sbjct: 259 FYKSGIYYDPDCSSKDLD 276
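
Each alignment above carries a summary line of the form "Identities = a/n (p%), Positives = b/n (q%), Gaps = c/n (r%)". The following is a minimal sketch of how such a line could be read back and its percentages rechecked. The names HSP_STATS, parse_hsp_stats and percent are hypothetical helpers, not part of BLAST. The percentages printed in this report agree with integer truncation (168/256 is 65.6% and is printed as 65%), so the sketch assumes truncation rather than rounding.

```python
import re

# Pattern for the per-alignment summary line as it appears in this report.
# The three denominators are the same alignment length, so only the first is captured.
HSP_STATS = re.compile(
    r"Identities = (\d+)/(\d+) \(\d+%\), "
    r"Positives = (\d+)/\d+ \(\d+%\), "
    r"Gaps = (\d+)/\d+ \(\d+%\)"
)


def parse_hsp_stats(line):
    """Return the counts from an 'Identities = ...' line, or None if it does not match."""
    match = HSP_STATS.search(line)
    if match is None:
        return None
    identities, length, positives, gaps = map(int, match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "gaps": gaps,
        "length": length,
    }


def percent(count, length):
    """Whole-number percentage, truncated (assumed to mirror this report's output)."""
    return 100 * count // length


line = "Identities = 126/256 (49%), Positives = 168/256 (65%), Gaps = 4/256 (1%)"
stats = parse_hsp_stats(line)
print({name: percent(stats[name], stats["length"])
       for name in ("identities", "positives", "gaps")})
# prints {'identities': 49, 'positives': 65, 'gaps': 1}
```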
Database: Non-redundant SwissProt sequences
Posted date: Dec 6, 2005 7:40 AM
Number of letters in database: 68,354,980
Number of sequences in database: 184,735
Database: swissprot.01
Posted date: Dec 6, 2005 8:18 AM
Number of letters in database: 66,202,850
Number of sequences in database: 184,431
Lambda K H
0.313 0.130 0.385
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 86,822,253
Number of Sequences: 369166
Number of extensions: 1727161
Number of successful extensions: 4370
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3922
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4019
length of database: 68,354,980
effective HSP length: 109
effective length of database: 48,218,865
effective search space used: 7522142940
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
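
The figures in this footer are related through the standard Karlin-Altschul formulas: a raw score S becomes a bit score (lambda*S - ln K)/ln 2, a drop-off value X becomes lambda*X/ln 2 bits, and the expected number of chance hits is the effective search space times 2^(-bits). The sketch below recomputes several reported values from the printed parameters. All function and constant names are hypothetical. The effective query length is assumed to be the translated query length (797 // 3) minus the effective HSP length, which reproduces the reported search space. Lambda and K are printed to three significant figures, so bit scores and E-values only match approximately.

```python
import math

# Parameters copied from the report footer (rounded as printed).
UNGAPPED = {"lambda": 0.313, "K": 0.130}
GAPPED = {"lambda": 0.267, "K": 0.0410}
DB_LETTERS = 68_354_980
DB_SEQUENCES = 184_735
EFFECTIVE_HSP_LENGTH = 109
QUERY_NUCLEOTIDES = 797


def bit_score(raw, params):
    """Normalized score in bits: (lambda * S - ln K) / ln 2."""
    return (params["lambda"] * raw - math.log(params["K"])) / math.log(2)


def dropoff_bits(raw, params):
    """X-dropoff in bits: lambda * X / ln 2 (no K term)."""
    return params["lambda"] * raw / math.log(2)


def expect(bits, search_space):
    """Expected number of chance alignments scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


# Edge correction: every database sequence loses one effective HSP length.
effective_db = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH
# Assumption: the query is counted in translated residues for this search.
effective_query = QUERY_NUCLEOTIDES // 3 - EFFECTIVE_HSP_LENGTH
search_space = effective_query * effective_db

print(effective_db)                              # 48218865, as reported
print(search_space)                              # 7522142940, as reported
print(round(dropoff_bits(16, UNGAPPED), 1))      # X1: 7.2
print(round(dropoff_bits(38, GAPPED), 1))        # X2: 14.6
print(round(dropoff_bits(64, GAPPED), 1))        # X3: 24.7
print(round(bit_score(42, UNGAPPED), 1))         # S1: 21.9

# Top hit (CATK_RABIT), raw score 634: close to the reported 248 bits and 1e-65.
top_bits = bit_score(634, GAPPED)
print(top_bits, expect(top_bits, search_space))
```

X1 and S1 use the ungapped parameters while X2 and X3 use the gapped ones, which is the combination that reproduces the bit values printed in the report.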