Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= Dr_sW_003_I18
(725 letters)
Database: Non-redundant SwissProt sequences
184,735 sequences; 68,354,980 total letters
Score E
Sequences producing significant alignments: (bits) Value
sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precu... 209 8e-54
sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine protei... 203 3e-52
sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precu... 201 2e-51
sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathe... 200 3e-51
sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) ... 200 3e-51
sp|Q10991|CATL_SHEEP Cathepsin L [Contains: Cathepsin L hea... 199 5e-51
sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted ... 195 1e-49
sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathe... 194 2e-49
sp|Q9GLE3|CATK_PIG Cathepsin K precursor 194 2e-49
sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted ... 193 5e-49
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precursor
Length = 323
Score = 209 bits (531), Expect = 8e-54
Identities = 104/204 (50%), Positives = 133/204 (65%), Gaps = 1/204 (0%)
Frame = +2
Query: 2 PVKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSF 181
PVK+Q CGSC+AFS TGSLEGQ+F +T L+S +EQQ+VDCS +G +GC GG+ +F
Sbjct: 121 PVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAF 180
Query: 182 DXXXXXXXXXXXXXXXXXXXX-RCRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPIS 358
D CR++ + V G TNI S E L QAV IGPIS
Sbjct: 181 DYIKANNGIDTEAAYPYEARDGSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPIS 240
Query: 359 VRIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGYGSKDGKNFWIVKNSWGTTWGRK 538
V IDA+ SF YS +G+YY+P+C +L HAVL VGYGS+ G++FW+VKNSW T+WG
Sbjct: 241 VTIDAAHSSFQFYS-SGVYYEPSCSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDA 299
Query: 539 GYILMSKDEDNQCGIATEASYPLI 610
GYI MS++ +N CGIAT ASYPL+
Sbjct: 300 GYIKMSRNRNNNCGIATVASYPLV 323
>sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine proteinase 1) [Contains: Cathepsin
L heavy chain; Cathepsin L light chain]
Length = 341
Score = 203 bits (517), Expect = 3e-52
Identities = 100/204 (49%), Positives = 138/204 (67%), Gaps = 2/204 (0%)
Frame = +2
Query: 5 VKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSFD 184
VK+Q CGSC+AFS+TG+LEGQ+FR++ LVS SEQ +VDCS ++GN GC GG +F
Sbjct: 139 VKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFR 198
Query: 185 -XXXXXXXXXXXXXXXXXXXXRCRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPISV 361
C +NK V +GFT+I DE+ +A+AVA +GP+SV
Sbjct: 199 YIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSV 258
Query: 362 RIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGYGS-KDGKNFWIVKNSWGTTWGRK 538
IDAS SF YS G+Y +P CD+ +L H VLVVG+G+ + G+++W+VKNSWGTTWG K
Sbjct: 259 AIDASHESFQFYS-EGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDK 317
Query: 539 GYILMSKDEDNQCGIATEASYPLI 610
G+I M ++++NQCGIA+ +SYPL+
Sbjct: 318 GFIKMLRNKENQCGIASASSYPLV 341
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precursor
Length = 321
Score = 201 bits (511), Expect = 2e-51
Identities = 100/204 (49%), Positives = 135/204 (66%), Gaps = 1/204 (0%)
Frame = +2
Query: 2 PVKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSF 181
PVK+Q+ CGSC+AFSATG+LEGQ+F + +LVS SEQQ+VDCS ++GN GCGGG+ +F
Sbjct: 120 PVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAF 179
Query: 182 DXXXXXXXXXXXXXXXXXXXXR-CRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPIS 358
D R CR++ + + G ++ EEAL +AV+ +GPIS
Sbjct: 180 DYIKDNGGIDTESSYPYEAEDRSCRFDANSIGAICTGSVEVQ-HTEEALQEAVSGVGPIS 238
Query: 359 VRIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGYGSKDGKNFWIVKNSWGTTWGRK 538
V IDAS SF YS +G+YY+ NC L H VL VGYG++ K++W+VKNSWG++WG
Sbjct: 239 VAIDASHFSFQFYS-SGVYYEQNCSPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDA 297
Query: 539 GYILMSKDEDNQCGIATEASYPLI 610
GYI MS++ DN CGIA+E SYP +
Sbjct: 298 GYIKMSRNRDNNCGIASEPSYPTV 321
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 339
Score = 200 bits (509), Expect = 3e-51
Identities = 99/204 (48%), Positives = 133/204 (65%), Gaps = 2/204 (0%)
Frame = +2
Query: 5 VKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSFD 184
VK+Q CGSC+AFS+TG+LEGQ+FR+ LVS SEQ +VDCS ++GN GC GG +F
Sbjct: 137 VKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFR 196
Query: 185 -XXXXXXXXXXXXXXXXXXXXRCRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPISV 361
C +NK+ + GF +I DEE + +AVA +GP+SV
Sbjct: 197 YIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSV 256
Query: 362 RIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGYGS-KDGKNFWIVKNSWGTTWGRK 538
IDAS SF YS G+Y +P CD +L H VLVVGYG+ + G ++W+VKNSWGTTWG +
Sbjct: 257 AIDASHESFQLYS-EGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQ 315
Query: 539 GYILMSKDEDNQCGIATEASYPLI 610
GYI M+++++NQCGIAT +SYP +
Sbjct: 316 GYIKMARNQNNQCGIATASSYPTV 339
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
Length = 334
Score = 200 bits (509), Expect = 3e-51
Identities = 100/206 (48%), Positives = 129/206 (62%), Gaps = 5/206 (2%)
Frame = +2
Query: 2 PVKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSF 181
PVKNQK CGSC+AFSATG+LEGQ FR+T KLVS SEQ +VDCS GN+GC GGF +F
Sbjct: 128 PVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAF 187
Query: 182 DXXXXXXXXXXXXXXXXXXXXR-CRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPIS 358
C+Y + GFT + E+AL +AVA +GPIS
Sbjct: 188 QYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMKAVATVGPIS 247
Query: 359 VRIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGYG----SKDGKNFWIVKNSWGTT 526
V +DA SF ++ +GIY++P+C S +L H VLVVGYG + + +W+VKNSWG
Sbjct: 248 VAMDAGHSSF-QFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPE 306
Query: 527 WGRKGYILMSKDEDNQCGIATEASYP 604
WG GY+ ++KD++N CGIAT ASYP
Sbjct: 307 WGSNGYVKIAKDKNNHCGIATAASYP 332
>sp|Q10991|CATL_SHEEP Cathepsin L [Contains: Cathepsin L heavy chain; Cathepsin L light
chain]
Length = 217
Score = 199 bits (507), Expect = 5e-51
Identities = 107/205 (52%), Positives = 127/205 (61%), Gaps = 2/205 (0%)
Frame = +2
Query: 2 PVKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSF 181
PVKNQ CGSC+AFSATG+LEGQ FR+T KLVS SEQ +VD S GN+GC GG +F
Sbjct: 15 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQGNQGCNGGLMDNAF 74
Query: 182 D-XXXXXXXXXXXXXXXXXXXXRCRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPIS 358
C Y K GF +I R E+AL +AVA +GPIS
Sbjct: 75 QYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDTGFVDIPQR-EKALMKAVATVGPIS 133
Query: 359 VRIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGYGSKDGKN-FWIVKNSWGTTWGR 535
V IDA SF ++ +GIYYDP+C S L H VLVVGYG + N FWIVKNSWG WG
Sbjct: 134 VAIDAGHSSF-QFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTNNKFWIVKNSWGPEWGN 192
Query: 536 KGYILMSKDEDNQCGIATEASYPLI 610
KGY+ M+KD++N CGIAT ASYP +
Sbjct: 193 KGYVKMAKDQNNHCGIATAASYPTV 217
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
Cathepsin L heavy chain; Cathepsin L light chain]
Length = 333
Score = 195 bits (495), Expect = 1e-49
Identities = 99/208 (47%), Positives = 125/208 (60%), Gaps = 5/208 (2%)
Frame = +2
Query: 2 PVKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSF 181
PVKNQ CGSC+AFSATG+LEGQ FR+T +L+S SEQ +VDCS GN GC GG +F
Sbjct: 128 PVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAF 187
Query: 182 DXXXXXXXXXXXXXXXXXXXXR-CRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPIS 358
C+YN + GF +I + E+AL +AVA +GPIS
Sbjct: 188 QYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDI-PKQEKALMKAVATVGPIS 246
Query: 359 VRIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGYG----SKDGKNFWIVKNSWGTT 526
V IDA SF+ Y GIY++P+C S+ + H VLVVGYG D +W+VKNSWG
Sbjct: 247 VAIDAGHESFLFYK-EGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEE 305
Query: 527 WGRKGYILMSKDEDNQCGIATEASYPLI 610
WG GY+ M+KD N CGIA+ ASYP +
Sbjct: 306 WGMGGYVKMAKDRRNHCGIASAASYPTV 333
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 334
Score = 194 bits (493), Expect = 2e-49
Identities = 104/209 (49%), Positives = 124/209 (59%), Gaps = 6/209 (2%)
Frame = +2
Query: 2 PVKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSF 181
PVKNQ CGSC+AFSATG+LEGQ FR+T KLVS SEQ +VDCS GN+GC GG +F
Sbjct: 128 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAF 187
Query: 182 D--XXXXXXXXXXXXXXXXXXXXRCRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPI 355
C Y GF +I R E+AL +AVA +GPI
Sbjct: 188 QYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQR-EKALMKAVATVGPI 246
Query: 356 SVRIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGYG----SKDGKNFWIVKNSWGT 523
SV IDA SF ++ +GIYYDP+C L H VLVVGYG + FWIVKNSWG
Sbjct: 247 SVAIDAGHTSF-QFYKSGIYYDPDCSCKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGP 305
Query: 524 TWGRKGYILMSKDEDNQCGIATEASYPLI 610
WG GY+ M+KD++N CGIAT ASYP +
Sbjct: 306 EWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>sp|Q9GLE3|CATK_PIG Cathepsin K precursor
Length = 330
Score = 194 bits (493), Expect = 2e-49
Identities = 99/202 (49%), Positives = 128/202 (63%), Gaps = 1/202 (0%)
Frame = +2
Query: 2 PVKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSF 181
PVKNQ CGSC+AFS+ G+LEGQ ++T KL++ S Q +VDC E N GCGGG+ +F
Sbjct: 130 PVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDGCGGGYMTNAF 187
Query: 182 DXXXXXXXXXXXXXXXXXXXXR-CRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPIS 358
C YN + K +G+ I +E+AL +AVA +GP+S
Sbjct: 188 QYVQKNRGIDSEDAYPYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVS 247
Query: 359 VRIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGYGSKDGKNFWIVKNSWGTTWGRK 538
V IDAS SF YS G+YYD NC+SD+L HAVL VGYG + GK WI+KNSWG WG K
Sbjct: 248 VAIDASLTSFQFYS-KGVYYDENCNSDNLNHAVLAVGYGIQKGKKHWIIKNSWGENWGNK 306
Query: 539 GYILMSKDEDNQCGIATEASYP 604
GYILM+++++N CGIA AS+P
Sbjct: 307 GYILMARNKNNACGIANLASFP 328
>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
proteinase) [Contains: Cathepsin L heavy chain;
Cathepsin L light chain]
Length = 334
Score = 193 bits (490), Expect = 5e-49
Identities = 99/208 (47%), Positives = 131/208 (62%), Gaps = 5/208 (2%)
Frame = +2
Query: 2 PVKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSF 181
PVKNQ CGSC+AFSA+G LEGQ F +T KL+S SEQ +VDCS GN+GC GG +F
Sbjct: 128 PVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAF 187
Query: 182 D-XXXXXXXXXXXXXXXXXXXXRCRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPIS 358
C+Y + GF +I + E+AL +AVA +GPIS
Sbjct: 188 QYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPIS 246
Query: 359 VRIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGY---GSKDGKN-FWIVKNSWGTT 526
V +DAS S +++ +GIYY+PNC S +L H VL+VGY G+ KN +W+VKNSWG+
Sbjct: 247 VAMDASHPS-LQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSE 305
Query: 527 WGRKGYILMSKDEDNQCGIATEASYPLI 610
WG +GYI ++KD DN CG+AT ASYP++
Sbjct: 306 WGMEGYIKIAKDRDNHCGLATAASYPVV 333
Database: Non-redundant SwissProt sequences
Posted date: Dec 6, 2005 7:40 AM
Number of letters in database: 68,354,980
Number of sequences in database: 184,735
Database: swissprot.01
Posted date: Dec 6, 2005 8:18 AM
Number of letters in database: 66,202,850
Number of sequences in database: 184,431
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 75,956,470
Number of Sequences: 369166
Number of extensions: 1453623
Number of successful extensions: 4198
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3712
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3854
length of database: 68,354,980
effective HSP length: 107
effective length of database: 48,588,335
effective search space used: 6510836890
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)