Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= Dr_sW_022_D22
(754 letters)
Database: Non-redundant SwissProt sequences
184,735 sequences; 68,354,980 total letters
Score E
Sequences producing significant alignments: (bits) Value
sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) ... 238 1e-62
sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted ... 237 3e-62
sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathe... 235 8e-62
sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathe... 233 5e-61
sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Catheps... 229 8e-60
sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathe... 223 6e-58
sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted ... 221 1e-57
sp|Q24940|CATLP_FASHE Cathepsin L-like proteinase precursor 221 2e-57
sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted pr... 221 2e-57
sp|P43236|CATK_RABIT Cathepsin K precursor (OC-2 protein) 219 8e-57
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
Length = 334
Score = 238 bits (607), Expect = 1e-62
Identities = 124/250 (49%), Positives = 160/250 (64%), Gaps = 1/250 (0%)
Frame = +2
Query: 8 TAVPQKFSVNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHT 187
+AVP KF N L+ +W +K T R Y + RR +WE+N+K I+ HN EY GKH
Sbjct: 16 SAVP-KFDQN--LDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIELHNGEYSQGKHG 72
Query: 188 YSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVK 367
+++ +N F DMTNEEF+ + +G + + +G + P + LP SVDWR+KGYVTPVK
Sbjct: 73 FTMAMNAFGDMTNEEFR-QMMGCFRNQKFRKGKVFREPLFLD-LPKSVDWRKKGYVTPVK 130
Query: 368 NQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYI 547
NQ+QCGSCW+FSATG+LEGQ FRK +L+S SEQ LVDCS M AF+Y+
Sbjct: 131 NQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYV 190
Query: 548 KDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAI 724
K+ G++SE YPY A D CK P V TGFT + E L AVATVGP+SVA+
Sbjct: 191 KENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMKAVATVGPISVAM 250
Query: 725 DAGHASFQLY 754
DAGH+SFQ Y
Sbjct: 251 DAGHSSFQFY 260
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
Cathepsin L heavy chain; Cathepsin L light chain]
Length = 333
Score = 237 bits (604), Expect = 3e-62
Identities = 124/238 (52%), Positives = 151/238 (63%), Gaps = 1/238 (0%)
Frame = +2
Query: 44 LNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMT 223
L +W +K R Y + RR +WE+N+K I+ HN EY GKH++++ +N F DMT
Sbjct: 25 LEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMT 84
Query: 224 NEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFS 403
+EEF+ G KP +G + P P SVDWR+KGYVTPVKNQ QCGSCW+FS
Sbjct: 85 SEEFRQVMNGFQNRKPR-KGKVFQEPLFYEA-PRSVDWREKGYVTPVKNQGQCGSCWAFS 142
Query: 404 ATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEGDY 580
ATG+LEGQ FRK RLIS SEQ LVDCS LMD AF+Y++D G++SE Y
Sbjct: 143 ATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESY 202
Query: 581 PYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 754
PY AT+ +CK NP V TGF DI Q E L AVATVGP+SVAIDAGH SF Y
Sbjct: 203 PYEATEESCKYNPKYSVANDTGFVDIPKQ-EKALMKAVATVGPISVAIDAGHESFLFY 259
>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 333
Score = 235 bits (600), Expect = 8e-62
Identities = 126/252 (50%), Positives = 154/252 (61%), Gaps = 2/252 (0%)
Frame = +2
Query: 5 ITAVPQKFSVNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKH 184
I + KF + LN +W +K T R Y + RR +WE+N+K I+ HN EY GKH
Sbjct: 14 IASAAPKF--DQSLNAQWYQWKATHRRLYGMNEEGWRRAVWEKNMKMIELHNREYSQGKH 71
Query: 185 TYSLGLNQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPV 364
+++ +N F DMTNEEF+ G K +G + P +P SVDWR+KGYVTPV
Sbjct: 72 GFTMAMNAFGDMTNEEFRQVMNGFQNQKHK-KGKMFQEPL-FAEIPKSVDWREKGYVTPV 129
Query: 365 KNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRY 544
KNQ QCGSCW+FSATG+LEGQ FRK +L+S SEQ LVDCS LMDNAFRY
Sbjct: 130 KNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRY 189
Query: 545 IKDQ-GIESEGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSV 718
+KD G++SE YPY D TC P TGF D+ Q E L AVAT+GP+SV
Sbjct: 190 VKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDL-PQREKALMKAVATLGPISV 248
Query: 719 AIDAGHASFQLY 754
AIDAGH SFQ Y
Sbjct: 249 AIDAGHQSFQFY 260
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 334
Score = 233 bits (593), Expect = 5e-61
Identities = 125/243 (51%), Positives = 149/243 (61%), Gaps = 2/243 (0%)
Frame = +2
Query: 32 VNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQF 211
++ L+ W +K T R Y + RR +WE+N K I HN EY GKH + + +N F
Sbjct: 21 LDPNLDAHWHQWKATHRRLYGMNEEEWRRAVWEKNKKIIDLHNQEYSEGKHAFRMAMNAF 80
Query: 212 ADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSC 391
DMTNEEF+ G K +G + P + V P SVDW +KGYVTPVKNQ QCGSC
Sbjct: 81 GDMTNEEFRQVMNGFQNQKHK-KGKLFHEPLLVDV-PKSVDWTKKGYVTPVKNQGQCGSC 138
Query: 392 WSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIES 568
W+FSATG+LEGQ FRK +L+S SEQ LVDCS LMDNAF+YIKD G++S
Sbjct: 139 WAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLDS 198
Query: 569 EGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASF 745
E YPY ATD +C P TGF DI Q E L AVATVGP+SVAIDAGH SF
Sbjct: 199 EESYPYLATDTNSCNYKPECSAANDTGFVDI-PQREKALMKAVATVGPISVAIDAGHTSF 257
Query: 746 QLY 754
Q Y
Sbjct: 258 QFY 260
>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 334
Score = 229 bits (583), Expect = 8e-60
Identities = 121/243 (49%), Positives = 152/243 (62%), Gaps = 2/243 (0%)
Frame = +2
Query: 32 VNSELNEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQF 211
++ L+ +W +K T GR Y + RR +WE+N+K I+ HN EY GKH +S+ +N F
Sbjct: 21 LDQNLDADWYKWKATHGRLYGMNEEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAF 80
Query: 212 ADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSC 391
DMTNEEF+ G K +G + + V P SVDWR+KGYVT VKNQ QCGSC
Sbjct: 81 GDMTNEEFRQVMNGFQNQKHK-KGKVFHESLVLEV-PKSVDWREKGYVTAVKNQGQCGSC 138
Query: 392 WSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIES 568
W+FSATG+LEGQ FRK +L+S SEQ LVDCS LMDNAF+Y+KD G+++
Sbjct: 139 WAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDT 198
Query: 569 EGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASF 745
E YPY + +C P TGF DI Q E L AVATVGP+SVAIDAGH+SF
Sbjct: 199 EESYPYLGRETNSCTYKPECSAANDTGFVDI-PQREKALMKAVATVGPISVAIDAGHSSF 257
Query: 746 QLY 754
Q Y
Sbjct: 258 QFY 260
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 339
Score = 223 bits (567), Expect = 6e-58
Identities = 124/260 (47%), Positives = 159/260 (61%), Gaps = 9/260 (3%)
Frame = +2
Query: 2 AITAVPQKFSVNSELNEEWETYKTTFGRKY-DELTDITRRLIWEQNLKYIQKHNLEYDLG 178
A+ A+ Q S + EEW TYK + Y +E+ + R I+ +N I KHN + G
Sbjct: 10 ALVALTQAISPLDLIKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQG 69
Query: 179 KHTYSLGLNQFADMTNEEFKAKYLG-------IMKTKPTLEGSTYMAPENIGVLPASVDW 337
K +Y LGLN++ADM + EFK G +M+ + L G+TY+ P ++ V P SVDW
Sbjct: 70 KVSYKLGLNKYADMLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTV-PKSVDW 128
Query: 338 RQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXX 517
R+ G VT VK+Q CGSCW+FS+TG+LEGQ+FRK L+S SEQ LVDCS
Sbjct: 129 REHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNG 188
Query: 518 XLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAV 694
LMDNAFRYIKD GI++E YPY D +C N + I TGF DI +E + AV
Sbjct: 189 GLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAV 248
Query: 695 ATVGPVSVAIDAGHASFQLY 754
AT+GPVSVAIDA H SFQLY
Sbjct: 249 ATMGPVSVAIDASHESFQLY 268
>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
proteinase) [Contains: Cathepsin L heavy chain;
Cathepsin L light chain]
Length = 334
Score = 221 bits (564), Expect = 1e-57
Identities = 120/235 (51%), Positives = 146/235 (62%), Gaps = 1/235 (0%)
Frame = +2
Query: 53 EWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEE 232
EW +K+T R Y + RR IWE+N++ IQ HN EY G+H +S+ +N F DMTNEE
Sbjct: 28 EWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEE 87
Query: 233 FKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATG 412
F+ G K +G + P + + P SVDWR+KG VTPVKNQ QCGSCW+FSA+G
Sbjct: 88 FRQVVNGYRHQKHK-KGRLFQEPLMLKI-PKSVDWREKGCVTPVKNQGQCGSCWAFSASG 145
Query: 413 SLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEGDYPYT 589
LEGQ F K +LIS SEQ LVDCS LMD AF+YIK+ G++SE YPY
Sbjct: 146 CLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYE 205
Query: 590 ATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 754
A DG+CK V TGF DI Q E L AVATVGP+SVA+DA H S Q Y
Sbjct: 206 AKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDASHPSLQFY 259
>sp|Q24940|CATLP_FASHE Cathepsin L-like proteinase precursor
Length = 326
Score = 221 bits (563), Expect = 2e-57
Identities = 108/233 (46%), Positives = 144/233 (61%)
Frame = +2
Query: 56 WETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTNEEF 235
W +K + ++Y+ D RR IWE+N+K+IQ+HNL +DLG TY+LGLNQF DMT EEF
Sbjct: 21 WHQWKRMYNKEYNGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFEEF 80
Query: 236 KAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGS 415
KAKYL M + N +P +DWR+ GYVT VK+Q CGSCW+FS TG+
Sbjct: 81 KAKYLTEMSRASDILSHGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGT 140
Query: 416 LEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQGIESEGDYPYTAT 595
+EGQY + ISFSEQQLVDCS LM+NA++Y+K G+E+E YPYTA
Sbjct: 141 MEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQFGLETESSYPYTAV 200
Query: 596 DGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 754
+G C+ N V K TG+ + S +E +L N V P +VA+D + F +Y
Sbjct: 201 EGQCRYNKQLGVAKVTGYYTVHSGSEVELKNLVGARRPAAVAVDV-ESDFMMY 252
>sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted protein) (MEP) (Cyclic
protein 2) (CP-2) [Contains: Cathepsin L heavy chain;
Cathepsin L light chain]
Length = 334
Score = 221 bits (563), Expect = 2e-57
Identities = 118/237 (49%), Positives = 147/237 (62%), Gaps = 1/237 (0%)
Frame = +2
Query: 47 NEEWETYKTTFGRKYDELTDITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADMTN 226
N +W +K+T R Y + RR +WE+N++ IQ HN EY GKH +++ +N F DMTN
Sbjct: 26 NAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTN 85
Query: 227 EEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSA 406
EEF+ G K +G + P + + P +VDWR+KG VTPVKNQ QCGSCW+FSA
Sbjct: 86 EEFRQIVNGYRHQKHK-KGRLFQEPLMLQI-PKTVDWREKGCVTPVKNQGQCGSCWAFSA 143
Query: 407 TGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEGDYP 583
+G LEGQ F K +LIS SEQ LVDCS LMD AF+YIK+ G++SE YP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203
Query: 584 YTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 754
Y A DG+CK V TGF DI Q E L AVATVGP+SVA+DA H S Q Y
Sbjct: 204 YEAKDGSCKYRAEYAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDASHPSLQFY 259
>sp|P43236|CATK_RABIT Cathepsin K precursor (OC-2 protein)
Length = 329
Score = 219 bits (557), Expect = 8e-57
Identities = 117/241 (48%), Positives = 152/241 (63%), Gaps = 4/241 (1%)
Frame = +2
Query: 44 LNEEWETYKTTFGRKYDELTD-ITRRLIWEQNLKYIQKHNLEYDLGKHTYSLGLNQFADM 220
L+ +WE +K T+ ++Y+ D I+RRLIWE+NLK+I HNLE LG HTY L +N DM
Sbjct: 22 LDTQWELWKKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDM 81
Query: 221 TNEEFKAKYLGIMKTKPTLEGS--TYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCW 394
T+EE K G+ K P+ S T P+ G P S+D+R+KGYVTPVKNQ QCGSCW
Sbjct: 82 TSEEVVQKMTGL-KVPPSRSHSNDTLYIPDWEGRTPDSIDYRKKGYVTPVKNQGQCGSCW 140
Query: 395 SFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYI-KDQGIESE 571
+FS+ G+LEGQ +K +L++ S Q LVDC M NAF+Y+ +++GI+SE
Sbjct: 141 AFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENYGCGGGYMTNAFQYVQRNRGIDSE 198
Query: 572 GDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQL 751
YPY D +C NP+ KC G+ +I NE L AVA VGPVSVAIDA SFQ
Sbjct: 199 DAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQF 258
Query: 752 Y 754
Y
Sbjct: 259 Y 259
Database: Non-redundant SwissProt sequences
Posted date: Dec 6, 2005 7:40 AM
Number of letters in database: 68,354,980
Number of sequences in database: 184,735
Database: swissprot.01
Posted date: Dec 6, 2005 8:18 AM
Number of letters in database: 66,202,850
Number of sequences in database: 184,431
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 81,530,022
Number of Sequences: 369166
Number of extensions: 1568010
Number of successful extensions: 4573
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4114
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4236
length of database: 68,354,980
effective HSP length: 108
effective length of database: 48,403,600
effective search space used: 6873311200
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)