Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= Dr_sW_003_J22
(752 letters)
Database: Non-redundant SwissProt sequences
184,735 sequences; 68,354,980 total letters
                                                               Score     E
Sequences producing significant alignments:                    (bits)  Value
sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) ...   235   8e-62
sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted ...   235   1e-61
sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathe...   233   5e-61
sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathe...   230   4e-60
sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Catheps...   226   5e-59
sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathe...   220   4e-57
sp|Q24940|CATLP_FASHE Cathepsin L-like proteinase precursor      219   5e-57
sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted ...   219   8e-57
sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted pr...   218   1e-56
sp|P43236|CATK_RABIT Cathepsin K precursor (OC-2 protein)        216   5e-56
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
Length = 334
Score = 235 bits (600), Expect = 8e-62
Identities = 123/248 (49%), Positives = 159/248 (64%), Gaps = 1/248 (0%)
Frame = +1
Query: 7 TAVPQKFSVNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHT 186
+AVP KF N L+ +W +K T R Y + RR +WE+N+K I+ HN EY GKH
Sbjct: 16 SAVP-KFDQN--LDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIELHNGEYSQGKHG 72
Query: 187 YSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVK 366
+++ +N F DMTNEEF+ + +G + + +G + P + LP SVDWR+KGYVTPVK
Sbjct: 73 FTMAMNAFGDMTNEEFR-QMMGCFRNQKFRKGKVFREPLFLD-LPKSVDWRKKGYVTPVK 130
Query: 367 NQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYI 546
NQ+QCGSCW+FSATG+LEGQ FRK +L+S SEQ LVDCS M AF+Y+
Sbjct: 131 NQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYV 190
Query: 547 KDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAI 723
K+ G++SE YPY A D CK P V TGFT + E L AVATVGP+SVA+
Sbjct: 191 KENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMKAVATVGPISVAM 250
Query: 724 DAGHASFQ 747
DAGH+SFQ
Sbjct: 251 DAGHSSFQ 258
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
Cathepsin L heavy chain; Cathepsin L light chain]
Length = 333
Score = 235 bits (599), Expect = 1e-61
Identities = 123/235 (52%), Positives = 150/235 (63%), Gaps = 1/235 (0%)
Frame = +1
Query: 43 LNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMT 222
L +W +K R Y + RR +WE+N+K I+ HN EY GKH++++ +N F DMT
Sbjct: 25 LEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMT 84
Query: 223 NEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFS 402
+EEF+ G KP +G + P P SVDWR+KGYVTPVKNQ QCGSCW+FS
Sbjct: 85 SEEFRQVMNGFQNRKPR-KGKVFQEPLFYEA-PRSVDWREKGYVTPVKNQGQCGSCWAFS 142
Query: 403 ATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEGDY 579
ATG+LEGQ FRK RLIS SEQ LVDCS LMD AF+Y++D G++SE Y
Sbjct: 143 ATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESY 202
Query: 580 PYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASF 744
PY AT+ +CK NP V TGF DI Q E L AVATVGP+SVAIDAGH SF
Sbjct: 203 PYEATEESCKYNPKYSVANDTGFVDIPKQ-EKALMKAVATVGPISVAIDAGHESF 256
>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 333
Score = 233 bits (593), Expect = 5e-61
Identities = 125/250 (50%), Positives = 153/250 (61%), Gaps = 2/250 (0%)
Frame = +1
Query: 4 ITAVPQKFSVNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKH 183
I + KF + LN +W +K T R Y + RR +WE+N+K I+ HN EY GKH
Sbjct: 14 IASAAPKF--DQSLNAQWYQWKATHRRLYGMNEEGWRRAVWEKNMKMIELHNREYSQGKH 71
Query: 184 TYSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPV 363
+++ +N F DMTNEEF+ G K +G + P +P SVDWR+KGYVTPV
Sbjct: 72 GFTMAMNAFGDMTNEEFRQVMNGFQNQKHK-KGKMFQEPL-FAEIPKSVDWREKGYVTPV 129
Query: 364 KNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRY 543
KNQ QCGSCW+FSATG+LEGQ FRK +L+S SEQ LVDCS LMDNAFRY
Sbjct: 130 KNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRY 189
Query: 544 IKDQ-GIESEGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSV 717
+KD G++SE YPY D TC P TGF D+ Q E L AVAT+GP+SV
Sbjct: 190 VKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDL-PQREKALMKAVATLGPISV 248
Query: 718 AIDAGHASFQ 747
AIDAGH SFQ
Sbjct: 249 AIDAGHQSFQ 258
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 334
Score = 230 bits (586), Expect = 4e-60
Identities = 124/241 (51%), Positives = 148/241 (61%), Gaps = 2/241 (0%)
Frame = +1
Query: 31 VNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQF 210
++ L+ W +K T R Y + RR +WE+N K I HN EY GKH + + +N F
Sbjct: 21 LDPNLDAHWHQWKATHRRLYGMNEEEWRRAVWEKNKKIIDLHNQEYSEGKHAFRMAMNAF 80
Query: 211 ADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSC 390
DMTNEEF+ G K +G + P + V P SVDW +KGYVTPVKNQ QCGSC
Sbjct: 81 GDMTNEEFRQVMNGFQNQKHK-KGKLFHEPLLVDV-PKSVDWTKKGYVTPVKNQGQCGSC 138
Query: 391 WSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIES 567
W+FSATG+LEGQ FRK +L+S SEQ LVDCS LMDNAF+YIKD G++S
Sbjct: 139 WAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLDS 198
Query: 568 EGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASF 744
E YPY ATD +C P TGF DI Q E L AVATVGP+SVAIDAGH SF
Sbjct: 199 EESYPYLATDTNSCNYKPECSAANDTGFVDI-PQREKALMKAVATVGPISVAIDAGHTSF 257
Query: 745 Q 747
Q
Sbjct: 258 Q 258
>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 334
Score = 226 bits (576), Expect = 5e-59
Identities = 120/241 (49%), Positives = 151/241 (62%), Gaps = 2/241 (0%)
Frame = +1
Query: 31 VNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQF 210
++ L+ +W +K T GR Y + RR +WE+N+K I+ HN EY GKH +S+ +N F
Sbjct: 21 LDQNLDADWYKWKATHGRLYGMNEEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAF 80
Query: 211 ADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSC 390
DMTNEEF+ G K +G + + V P SVDWR+KGYVT VKNQ QCGSC
Sbjct: 81 GDMTNEEFRQVMNGFQNQKHK-KGKVFHESLVLEV-PKSVDWREKGYVTAVKNQGQCGSC 138
Query: 391 WSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIES 567
W+FSATG+LEGQ FRK +L+S SEQ LVDCS LMDNAF+Y+KD G+++
Sbjct: 139 WAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDT 198
Query: 568 EGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASF 744
E YPY + +C P TGF DI Q E L AVATVGP+SVAIDAGH+SF
Sbjct: 199 EESYPYLGRETNSCTYKPECSAANDTGFVDI-PQREKALMKAVATVGPISVAIDAGHSSF 257
Query: 745 Q 747
Q
Sbjct: 258 Q 258
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 339
Score = 220 bits (560), Expect = 4e-57
Identities = 123/259 (47%), Positives = 158/259 (61%), Gaps = 9/259 (3%)
Frame = +1
Query: 1 AITAVPQKFSVNSELNEEWETYKTTFGRKY-DELTDITRRLIWEQNLKYIQKHNLEYDLG 177
A+ A+ Q S + EEW TYK + Y +E+ + R I+ +N I KHN + G
Sbjct: 10 ALVALTQAISPLDLIKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQG 69
Query: 178 KHTYSLGLNQFADMTNEEFKAKYLG-------IMKTKPTLEGSTYMAPENIGVLPASVDW 336
K +Y LGLN++ADM + EFK G +M+ + L G+TY+ P ++ V P SVDW
Sbjct: 70 KVSYKLGLNKYADMLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTV-PKSVDW 128
Query: 337 RQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXX 516
R+ G VT VK+Q CGSCW+FS+TG+LEGQ+FRK L+S SEQ LVDCS
Sbjct: 129 REHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNG 188
Query: 517 XLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAV 693
LMDNAFRYIKD GI++E YPY D +C N + I TGF DI +E + AV
Sbjct: 189 GLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAV 248
Query: 694 ATVGPVSVAIDAGHASFQL 750
AT+GPVSVAIDA H SFQL
Sbjct: 249 ATMGPVSVAIDASHESFQL 267
>sp|Q24940|CATLP_FASHE Cathepsin L-like proteinase precursor
Length = 326
Score = 219 bits (559), Expect = 5e-57
Identities = 106/224 (47%), Positives = 140/224 (62%)
Frame = +1
Query: 55 WETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEEF 234
W +K + ++Y+ D RR IWE+N+K+IQ+HNL +DLG TY+LGLNQF DMT EEF
Sbjct: 21 WHQWKRMYNKEYNGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEF 80
Query: 235 KAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGS 414
KAKYL M + N +P +DWR+ GYVT VK+Q CGSCW+FS TG+
Sbjct: 81 KAKYLTEMSRASDILSHGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGT 140
Query: 415 LEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQGIESEGDYPYTAT 594
+EGQY + ISFSEQQLVDCS LM+NA++Y+K G+E+E YPYTA
Sbjct: 141 MEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQFGLETESSYPYTAV 200
Query: 595 DGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAID 726
+G C+ N V K TG+ + S +E +L N V P +VA+D
Sbjct: 201 EGQCRYNKQLGVAKVTGYYTVHSGSEVELKNLVGARRPAAVAVD 244
>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
proteinase) [Contains: Cathepsin L heavy chain;
Cathepsin L light chain]
Length = 334
Score = 219 bits (557), Expect = 8e-57
Identities = 119/233 (51%), Positives = 145/233 (62%), Gaps = 1/233 (0%)
Frame = +1
Query: 52 EWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEE 231
EW +K+T R Y + RR IWE+N++ IQ HN EY G+H +S+ +N F DMTNEE
Sbjct: 28 EWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEE 87
Query: 232 FKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATG 411
F+ G K +G + P + + P SVDWR+KG VTPVKNQ QCGSCW+FSA+G
Sbjct: 88 FRQVVNGYRHQKHK-KGRLFQEPLMLKI-PKSVDWREKGCVTPVKNQGQCGSCWAFSASG 145
Query: 412 SLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEGDYPYT 588
LEGQ F K +LIS SEQ LVDCS LMD AF+YIK+ G++SE YPY
Sbjct: 146 CLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYE 205
Query: 589 ATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQ 747
A DG+CK V TGF DI Q E L AVATVGP+SVA+DA H S Q
Sbjct: 206 AKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDASHPSLQ 257
>sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted protein) (MEP) (Cyclic
protein 2) (CP-2) [Contains: Cathepsin L heavy chain;
Cathepsin L light chain]
Length = 334
Score = 218 bits (556), Expect = 1e-56
Identities = 117/235 (49%), Positives = 146/235 (62%), Gaps = 1/235 (0%)
Frame = +1
Query: 46 NEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTN 225
N +W +K+T R Y + RR +WE+N++ IQ HN EY GKH +++ +N F DMTN
Sbjct: 26 NAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTN 85
Query: 226 EEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSA 405
EEF+ G K +G + P + + P +VDWR+KG VTPVKNQ QCGSCW+FSA
Sbjct: 86 EEFRQIVNGYRHQKHK-KGRLFQEPLMLQI-PKTVDWREKGCVTPVKNQGQCGSCWAFSA 143
Query: 406 TGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEGDYP 582
+G LEGQ F K +LIS SEQ LVDCS LMD AF+YIK+ G++SE YP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203
Query: 583 YTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQ 747
Y A DG+CK V TGF DI Q E L AVATVGP+SVA+DA H S Q
Sbjct: 204 YEAKDGSCKYRAEYAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDASHPSLQ 257
>sp|P43236|CATK_RABIT Cathepsin K precursor (OC-2 protein)
Length = 329
Score = 216 bits (550), Expect = 5e-56
Identities = 116/239 (48%), Positives = 151/239 (63%), Gaps = 4/239 (1%)
Frame = +1
Query: 43 LNEEWETYKTTFGRKYDELTD-ITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADM 219
L+ +WE +K T+ ++Y+ D I+RRLIWE+NLK+I HNLE LG HTY L +N DM
Sbjct: 22 LDTQWELWKKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDM 81
Query: 220 TNEEFKAKYLGIMKTKPTLEGS--TYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCW 393
T+EE K G+ K P+ S T P+ G P S+D+R+KGYVTPVKNQ QCGSCW
Sbjct: 82 TSEEVVQKMTGL-KVPPSRSHSNDTLYIPDWEGRTPDSIDYRKKGYVTPVKNQGQCGSCW 140
Query: 394 SFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYI-KDQGIESE 570
+FS+ G+LEGQ +K +L++ S Q LVDC M NAF+Y+ +++GI+SE
Sbjct: 141 AFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENYGCGGGYMTNAFQYVQRNRGIDSE 198
Query: 571 GDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQ 747
YPY D +C NP+ KC G+ +I NE L AVA VGPVSVAIDA SFQ
Sbjct: 199 DAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQ 257
Database: Non-redundant SwissProt sequences
Posted date: Dec 6, 2005 7:40 AM
Number of letters in database: 68,354,980
Number of sequences in database: 184,735
Database: swissprot.01
Posted date: Dec 6, 2005 8:18 AM
Number of letters in database: 66,202,850
Number of sequences in database: 184,431
Lambda K H
0.315 0.131 0.389
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 81,216,881
Number of Sequences: 369166
Number of extensions: 1562889
Number of successful extensions: 4556
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4102
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4224
length of database: 68,354,980
effective HSP length: 108
effective length of database: 48,403,600
effective search space used: 6873311200
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)