Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= Dr_sW_012_B18
(771 letters)
Database: Non-redundant SwissProt sequences
184,735 sequences; 68,354,980 total letters
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) ...   238   1e-62
sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted ...   237   3e-62
sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathe...   236   7e-62
sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathe...   233   3e-61
sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Catheps...   229   8e-60
sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathe...   224   3e-58
sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted ...   221   1e-57
sp|Q24940|CATLP_FASHE Cathepsin L-like proteinase precursor      221   2e-57
sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted pr...   221   2e-57
sp|P43236|CATK_RABIT Cathepsin K precursor (OC-2 protein)        219   6e-57
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
Length = 334
Score = 238 bits (608), Expect = 1e-62
Identities = 128/262 (48%), Positives = 165/262 (62%), Gaps = 6/262 (2%)
Frame = +3
Query: 3 SLVVVAI-----TAVPQKFSVNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQ 167
SLV+ A +AVP KF N L+ +W +K T R Y + RR +WE+N+K I+
Sbjct: 4 SLVLAAFCLGIASAVP-KFDQN--LDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIE 60
Query: 168 KHNLEYDLGKHTYSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASV 347
HN EY GKH +++ +N F DMTNEEF+ + +G + + +G + P + LP SV
Sbjct: 61 LHNGEYSQGKHGFTMAMNAFGDMTNEEFR-QMMGCFRNQKFRKGKVFREPLFLD-LPKSV 118
Query: 348 DWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXX 527
DWR+KGYVTPVKNQ+QCGSCW+FSATG+LEGQ FRK +L+S SEQ LVDCS
Sbjct: 119 DWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGC 178
Query: 528 XXXLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLAN 704
M AF+Y+K+ G++SE YPY A D CK P V TGFT + E L
Sbjct: 179 NGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMK 238
Query: 705 AVATVGPVSVAIDAGHASFQLY 770
AVATVGP+SVA+DAGH+SFQ Y
Sbjct: 239 AVATVGPISVAMDAGHSSFQFY 260
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
Cathepsin L heavy chain; Cathepsin L light chain]
Length = 333
Score = 237 bits (604), Expect = 3e-62
Identities = 124/238 (52%), Positives = 151/238 (63%), Gaps = 1/238 (0%)
Frame = +3
Query: 60 LNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMT 239
L +W +K R Y + RR +WE+N+K I+ HN EY GKH++++ +N F DMT
Sbjct: 25 LEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMT 84
Query: 240 NEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFS 419
+EEF+ G KP +G + P P SVDWR+KGYVTPVKNQ QCGSCW+FS
Sbjct: 85 SEEFRQVMNGFQNRKPR-KGKVFQEPLFYEA-PRSVDWREKGYVTPVKNQGQCGSCWAFS 142
Query: 420 ATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEGDY 596
ATG+LEGQ FRK RLIS SEQ LVDCS LMD AF+Y++D G++SE Y
Sbjct: 143 ATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESY 202
Query: 597 PYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 770
PY AT+ +CK NP V TGF DI Q E L AVATVGP+SVAIDAGH SF Y
Sbjct: 203 PYEATEESCKYNPKYSVANDTGFVDIPKQ-EKALMKAVATVGPISVAIDAGHESFLFY 259
>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 333
Score = 236 bits (601), Expect = 7e-62
Identities = 126/254 (49%), Positives = 155/254 (61%), Gaps = 2/254 (0%)
Frame = +3
Query: 15 VAITAVPQKFSVNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLG 194
+ I + KF + LN +W +K T R Y + RR +WE+N+K I+ HN EY G
Sbjct: 12 LGIASAAPKF--DQSLNAQWYQWKATHRRLYGMNEEGWRRAVWEKNMKMIELHNREYSQG 69
Query: 195 KHTYSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVT 374
KH +++ +N F DMTNEEF+ G K +G + P +P SVDWR+KGYVT
Sbjct: 70 KHGFTMAMNAFGDMTNEEFRQVMNGFQNQKHK-KGKMFQEPL-FAEIPKSVDWREKGYVT 127
Query: 375 PVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAF 554
PVKNQ QCGSCW+FSATG+LEGQ FRK +L+S SEQ LVDCS LMDNAF
Sbjct: 128 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAF 187
Query: 555 RYIKDQ-GIESEGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPV 728
RY+KD G++SE YPY D TC P TGF D+ Q E L AVAT+GP+
Sbjct: 188 RYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDL-PQREKALMKAVATLGPI 246
Query: 729 SVAIDAGHASFQLY 770
SVAIDAGH SFQ Y
Sbjct: 247 SVAIDAGHQSFQFY 260
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 334
Score = 233 bits (595), Expect = 3e-61
Identities = 127/257 (49%), Positives = 153/257 (59%), Gaps = 2/257 (0%)
Frame = +3
Query: 6 LVVVAITAVPQKFSVNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEY 185
L V+ + ++ L+ W +K T R Y + RR +WE+N K I HN EY
Sbjct: 7 LTVLCLGVASAAPKLDPNLDAHWHQWKATHRRLYGMNEEEWRRAVWEKNKKIIDLHNQEY 66
Query: 186 DLGKHTYSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKG 365
GKH + + +N F DMTNEEF+ G K +G + P + V P SVDW +KG
Sbjct: 67 SEGKHAFRMAMNAFGDMTNEEFRQVMNGFQNQKHK-KGKLFHEPLLVDV-PKSVDWTKKG 124
Query: 366 YVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMD 545
YVTPVKNQ QCGSCW+FSATG+LEGQ FRK +L+S SEQ LVDCS LMD
Sbjct: 125 YVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMD 184
Query: 546 NAFRYIKDQ-GIESEGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATV 719
NAF+YIKD G++SE YPY ATD +C P TGF DI Q E L AVATV
Sbjct: 185 NAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDI-PQREKALMKAVATV 243
Query: 720 GPVSVAIDAGHASFQLY 770
GP+SVAIDAGH SFQ Y
Sbjct: 244 GPISVAIDAGHTSFQFY 260
>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 334
Score = 229 bits (583), Expect = 8e-60
Identities = 121/243 (49%), Positives = 152/243 (62%), Gaps = 2/243 (0%)
Frame = +3
Query: 48 VNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQF 227
++ L+ +W +K T GR Y + RR +WE+N+K I+ HN EY GKH +S+ +N F
Sbjct: 21 LDQNLDADWYKWKATHGRLYGMNEEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAF 80
Query: 228 ADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSC 407
DMTNEEF+ G K +G + + V P SVDWR+KGYVT VKNQ QCGSC
Sbjct: 81 GDMTNEEFRQVMNGFQNQKHK-KGKVFHESLVLEV-PKSVDWREKGYVTAVKNQGQCGSC 138
Query: 408 WSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIES 584
W+FSATG+LEGQ FRK +L+S SEQ LVDCS LMDNAF+Y+KD G+++
Sbjct: 139 WAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDT 198
Query: 585 EGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASF 761
E YPY + +C P TGF DI Q E L AVATVGP+SVAIDAGH+SF
Sbjct: 199 EESYPYLGRETNSCTYKPECSAANDTGFVDI-PQREKALMKAVATVGPISVAIDAGHSSF 257
Query: 762 QLY 770
Q Y
Sbjct: 258 QFY 260
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 339
Score = 224 bits (570), Expect = 3e-58
Identities = 124/264 (46%), Positives = 162/264 (61%), Gaps = 9/264 (3%)
Frame = +3
Query: 6 LVVVAITAVPQKFSVNSELNEEWETYKTTFGRKY-DELTDITRRLIWEQNLKYIQKHNLE 182
+ ++A+ A+ Q S + EEW TYK + Y +E+ + R I+ +N I KHN
Sbjct: 6 VALLALVALTQAISPLDLIKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQL 65
Query: 183 YDLGKHTYSLGLNQFADMTNEEFKAKYLG-------IMKTKPTLEGSTYMAPENIGVLPA 341
+ GK +Y LGLN++ADM + EFK G +M+ + L G+TY+ P ++ V P
Sbjct: 66 FAQGKVSYKLGLNKYADMLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTV-PK 124
Query: 342 SVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXX 521
SVDWR+ G VT VK+Q CGSCW+FS+TG+LEGQ+FRK L+S SEQ LVDCS
Sbjct: 125 SVDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNN 184
Query: 522 XXXXXLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDL 698
LMDNAFRYIKD GI++E YPY D +C N + I TGF DI +E +
Sbjct: 185 GCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKM 244
Query: 699 ANAVATVGPVSVAIDAGHASFQLY 770
AVAT+GPVSVAIDA H SFQLY
Sbjct: 245 KKAVATMGPVSVAIDASHESFQLY 268
>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
proteinase) [Contains: Cathepsin L heavy chain;
Cathepsin L light chain]
Length = 334
Score = 221 bits (564), Expect = 1e-57
Identities = 120/235 (51%), Positives = 146/235 (62%), Gaps = 1/235 (0%)
Frame = +3
Query: 69 EWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEE 248
EW +K+T R Y + RR IWE+N++ IQ HN EY G+H +S+ +N F DMTNEE
Sbjct: 28 EWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEE 87
Query: 249 FKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATG 428
F+ G K +G + P + + P SVDWR+KG VTPVKNQ QCGSCW+FSA+G
Sbjct: 88 FRQVVNGYRHQKHK-KGRLFQEPLMLKI-PKSVDWREKGCVTPVKNQGQCGSCWAFSASG 145
Query: 429 SLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEGDYPYT 605
LEGQ F K +LIS SEQ LVDCS LMD AF+YIK+ G++SE YPY
Sbjct: 146 CLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYE 205
Query: 606 ATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 770
A DG+CK V TGF DI Q E L AVATVGP+SVA+DA H S Q Y
Sbjct: 206 AKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDASHPSLQFY 259
>sp|Q24940|CATLP_FASHE Cathepsin L-like proteinase precursor
Length = 326
Score = 221 bits (563), Expect = 2e-57
Identities = 108/233 (46%), Positives = 144/233 (61%)
Frame = +3
Query: 72 WETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEEF 251
W +K + ++Y+ D RR IWE+N+K+IQ+HNL +DLG TY+LGLNQF DMT EEF
Sbjct: 21 WHQWKRMYNKEYNGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEF 80
Query: 252 KAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGS 431
KAKYL M + N +P +DWR+ GYVT VK+Q CGSCW+FS TG+
Sbjct: 81 KAKYLTEMSRASDILSHGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGT 140
Query: 432 LEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQGIESEGDYPYTAT 611
+EGQY + ISFSEQQLVDCS LM+NA++Y+K G+E+E YPYTA
Sbjct: 141 MEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQFGLETESSYPYTAV 200
Query: 612 DGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 770
+G C+ N V K TG+ + S +E +L N V P +VA+D + F +Y
Sbjct: 201 EGQCRYNKQLGVAKVTGYYTVHSGSEVELKNLVGARRPAAVAVDV-ESDFMMY 252
>sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted protein) (MEP) (Cyclic
protein 2) (CP-2) [Contains: Cathepsin L heavy chain;
Cathepsin L light chain]
Length = 334
Score = 221 bits (563), Expect = 2e-57
Identities = 118/237 (49%), Positives = 147/237 (62%), Gaps = 1/237 (0%)
Frame = +3
Query: 63 NEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTN 242
N +W +K+T R Y + RR +WE+N++ IQ HN EY GKH +++ +N F DMTN
Sbjct: 26 NAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTN 85
Query: 243 EEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSA 422
EEF+ G K +G + P + + P +VDWR+KG VTPVKNQ QCGSCW+FSA
Sbjct: 86 EEFRQIVNGYRHQKHK-KGRLFQEPLMLQI-PKTVDWREKGCVTPVKNQGQCGSCWAFSA 143
Query: 423 TGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEGDYP 599
+G LEGQ F K +LIS SEQ LVDCS LMD AF+YIK+ G++SE YP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203
Query: 600 YTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 770
Y A DG+CK V TGF DI Q E L AVATVGP+SVA+DA H S Q Y
Sbjct: 204 YEAKDGSCKYRAEYAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDASHPSLQFY 259
>sp|P43236|CATK_RABIT Cathepsin K precursor (OC-2 protein)
Length = 329
Score = 219 bits (558), Expect = 6e-57
Identities = 121/259 (46%), Positives = 159/259 (61%), Gaps = 4/259 (1%)
Frame = +3
Query: 6 LVVVAITAVPQKFSVNSELNEEWETYKTTFGRKYDELTD-ITRRLIWEQNLKYIQKHNLE 182
L VV+ P++ L+ +WE +K T+ ++Y+ D I+RRLIWE+NLK+I HNLE
Sbjct: 9 LPVVSFALHPEEI-----LDTQWELWKKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLE 63
Query: 183 YDLGKHTYSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGS--TYMAPENIGVLPASVDWR 356
LG HTY L +N DMT+EE K G+ K P+ S T P+ G P S+D+R
Sbjct: 64 ASLGVHTYELAMNHLGDMTSEEVVQKMTGL-KVPPSRSHSNDTLYIPDWEGRTPDSIDYR 122
Query: 357 QKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXX 536
+KGYVTPVKNQ QCGSCW+FS+ G+LEGQ +K +L++ S Q LVDC
Sbjct: 123 KKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENYGCGGG 180
Query: 537 LMDNAFRYI-KDQGIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVA 713
M NAF+Y+ +++GI+SE YPY D +C NP+ KC G+ +I NE L AVA
Sbjct: 181 YMTNAFQYVQRNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVA 240
Query: 714 TVGPVSVAIDAGHASFQLY 770
VGPVSVAIDA SFQ Y
Sbjct: 241 RVGPVSVAIDASLTSFQFY 259
Database: Non-redundant SwissProt sequences
Posted date: Dec 6, 2005 7:40 AM
Number of letters in database: 68,354,980
Number of sequences in database: 184,735
Database: swissprot.01
Posted date: Dec 6, 2005 8:18 AM
Number of letters in database: 66,202,850
Number of sequences in database: 184,431
Lambda     K      H
   0.318    0.134    0.401

Gapped
Lambda     K      H
   0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 83,373,739
Number of Sequences: 369166
Number of extensions: 1600622
Number of successful extensions: 4692
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4221
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4355
length of database: 68,354,980
effective HSP length: 108
effective length of database: 48,403,600
effective search space used: 7163732800
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
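
The figures in this statistics block can be cross-checked against the reported hits. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2, and E = search space * 2^(-S')) and assumes that the effective database length is the database length minus the number of sequences times the effective HSP length. The function names are illustrative only. Because the report prints lambda and K rounded to a few digits, the results agree with the report only approximately.

```python
import math

# Values copied from the report above.
GAPPED_LAMBDA = 0.267          # "Gapped" Lambda
GAPPED_K = 0.0410              # "Gapped" K
DB_LETTERS = 68_354_980        # length of database
DB_SEQUENCES = 184_735         # number of sequences in database
EFFECTIVE_HSP_LENGTH = 108     # effective HSP length
SEARCH_SPACE = 7_163_732_800   # effective search space used


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score (assumed Karlin-Altschul form)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits` in the search space."""
    return search_space * 2.0 ** (-bits)


def effective_db_length(letters, sequences, hsp_length):
    """Database length after trimming the effective HSP length from every sequence."""
    return letters - sequences * hsp_length


# Top hit (CATL2_HUMAN): raw score 608, reported as 238 bits with Expect = 1e-62.
top_bits = bit_score(608, GAPPED_LAMBDA, GAPPED_K)
top_e = expect_value(top_bits, SEARCH_SPACE)
eff_db = effective_db_length(DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH)

print(f"bit score ~ {top_bits:.1f}, E ~ {top_e:.1e}, effective db length = {eff_db:,}")
```

Under these assumptions the computed effective database length matches the reported 48,403,600 exactly, and the bit score and E-value for the top hit land close to the reported 238 bits and 1e-62, with the small differences attributable to rounding in the printed parameters.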