Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= Dr_sW_002_G08
(832 letters)
Database: Non-redundant SwissProt sequences
184,735 sequences; 68,354,980 total letters
Sequences producing significant alignments: Score (bits) E Value
sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) ... 285 8e-77
sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted ... 285 1e-76
sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathe... 282 9e-76
sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathe... 281 2e-75
sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathe... 281 2e-75
sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine protei... 276 4e-74
sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precu... 275 9e-74
sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precu... 274 2e-73
sp|Q10991|CATL_SHEEP Cathepsin L [Contains: Cathepsin L hea... 270 3e-72
sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Catheps... 270 5e-72
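
The hit summary above can be read programmatically. The following is a minimal Python sketch, not part of the BLAST output. It assumes each row has the whitespace-separated shape shown above: an `sp|ACCESSION|ENTRY` identifier, a possibly truncated description, a bit score, and an E-value. The names `Hit`, `HIT_LINE`, and `parse_hits` are hypothetical and chosen for illustration only.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    """One row of the hit summary (hypothetical container)."""
    accession: str
    entry: str
    description: str
    bits: int
    evalue: float


# Assumed row shape: sp|ACCESSION|ENTRY description ... BITS EVALUE
HIT_LINE = re.compile(
    r"^sp\|(?P<accession>[A-Z0-9]+)\|(?P<entry>\S+)\s+"
    r"(?P<description>.*?)\s+(?P<bits>\d+)\s+(?P<evalue>\S+)$"
)


def parse_hits(lines: List[str]) -> List[Hit]:
    """Return a Hit for every line that matches the assumed row shape."""
    hits = []
    for line in lines:
        match = HIT_LINE.match(line.strip())
        if match is None:
            continue  # header or unrelated line
        evalue_text = match["evalue"]
        # BLAST may abbreviate very small values as e.g. "e-100"
        if evalue_text.startswith("e"):
            evalue_text = "1" + evalue_text
        hits.append(
            Hit(
                accession=match["accession"],
                entry=match["entry"],
                description=match["description"],
                bits=int(match["bits"]),
                evalue=float(evalue_text),
            )
        )
    return hits


rows = [
    "sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) ... 285 8e-77",
    "sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precu... 275 9e-74",
]
for hit in parse_hits(rows):
    print(hit.accession, hit.entry, hit.bits, hit.evalue)
```

The description is matched lazily so that digits inside it (for example the "3" in "proteinase 3") are not mistaken for the score column.
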
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
Length = 334
Score = 285 bits (730), Expect = 8e-77
Identities = 143/259 (55%), Positives = 172/259 (66%), Gaps = 5/259 (1%)
Frame = +2
Query: 2 NQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQC 181
N F DMTNEEF+ + +G + + +G + P + LP SVDWR+KGYVTPVKNQ+QC
Sbjct: 78 NAFGDMTNEEFR-QMMGCFRNQKFRKGKVFREPLFLD-LPKSVDWRKKGYVTPVKNQKQC 135
Query: 182 GSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-G 358
GSCW+FSATG+LEGQ FRK +L+S SEQ LVDCS M AF+Y+K+ G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGG 195
Query: 359 IESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHA 538
++SE YPY A D CK P V TGFT + E L AVATVGP+SVA+DAGH+
Sbjct: 196 LDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHS 255
Query: 539 SFQLYKSGIYNEESCSTTQLDHGVLAVGYG----TQIGKKYWIVKNSWDVTWGESGYIKM 706
SFQ YKSGIY E CS+ LDHGVL VGYG KYW+VKNSW WG +GY+K+
Sbjct: 256 SFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKI 315
Query: 707 SKDKKNQCGIATMASYPLV 763
+KDK N CGIAT ASYP V
Sbjct: 316 AKDKNNHCGIATAASYPNV 334
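
In the alignment above, the Query coordinates are nucleotide positions in the translated query (Frame = +2), while the Sbjct coordinates are amino-acid positions in the protein. The arithmetic can be sketched in Python as follows. This is an illustration only: it assumes 1-based inclusive coordinates and forward frames, and the helper names `frame_of` and `codons_spanned` are hypothetical.

```python
def frame_of(start: int) -> int:
    """Forward reading frame (1, 2 or 3) of a codon starting at 1-based position `start`."""
    return (start - 1) % 3 + 1


def codons_spanned(start: int, end: int) -> int:
    """Number of complete codons in the 1-based, inclusive nucleotide range."""
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError("range does not cover a whole number of codons")
    return length // 3


# First Query row of the alignment above: nucleotides 2 to 181
print(frame_of(2))               # frame of the first codon
print(codons_spanned(2, 181))    # residues shown on that row
# Whole aligned query segment: nucleotides 2 to 763
print(codons_spanned(2, 763))
```

Every Query row in this alignment starts at a position in the same frame (2, 182, 359, ...), which is what a single Frame = +2 HSP implies.
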
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
Cathepsin L heavy chain; Cathepsin L light chain]
Length = 333
Score = 285 bits (728), Expect = 1e-76
Identities = 147/259 (56%), Positives = 171/259 (66%), Gaps = 5/259 (1%)
Frame = +2
Query: 2 NQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQC 181
N F DMT+EEF+ G KP +G + P P SVDWR+KGYVTPVKNQ QC
Sbjct: 78 NAFGDMTSEEFRQVMNGFQNRKPR-KGKVFQEPLFYEA-PRSVDWREKGYVTPVKNQGQC 135
Query: 182 GSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-G 358
GSCW+FSATG+LEGQ FRK RLIS SEQ LVDCS LMD AF+Y++D G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGG 195
Query: 359 IESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHA 538
++SE YPY AT+ +CK NP V TGF DI Q E L AVATVGP+SVAIDAGH
Sbjct: 196 LDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQ-EKALMKAVATVGPISVAIDAGHE 254
Query: 539 SFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQI----GKKYWIVKNSWDVTWGESGYIKM 706
SF YK GIY E CS+ +DHGVL VGYG + KYW+VKNSW WG GY+KM
Sbjct: 255 SFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKM 314
Query: 707 SKDKKNQCGIATMASYPLV 763
+KD++N CGIA+ ASYP V
Sbjct: 315 AKDRRNHCGIASAASYPTV 333
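
The "Identities / Positives / Gaps" line of each alignment header can be parsed and cross-checked. The Python sketch below is illustrative only; `STAT`, `parse_alignment_stats`, and `truncated_percent` are hypothetical names. The percentages printed in this report are consistent with integer truncation rather than rounding (for example 147/259 is about 56.8% but is printed as 56%), and the sketch assumes that behaviour.

```python
import re
from typing import Dict, Tuple

# Matches e.g. "Identities = 147/259 (56%)"
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_alignment_stats(line: str) -> Dict[str, Tuple[int, int, int]]:
    """Map each statistic name to (count, alignment length, printed percent)."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in STAT.findall(line)
    }


def truncated_percent(count: int, length: int) -> int:
    """Percentage with the fractional part dropped (assumed BLAST behaviour)."""
    return 100 * count // length


line = "Identities = 147/259 (56%), Positives = 171/259 (66%), Gaps = 5/259 (1%)"
for name, (count, length, printed) in parse_alignment_stats(line).items():
    print(name, count, length, printed, truncated_percent(count, length))
```
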
>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 333
Score = 282 bits (721), Expect = 9e-76
Identities = 147/259 (56%), Positives = 169/259 (65%), Gaps = 5/259 (1%)
Frame = +2
Query: 2 NQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQC 181
N F DMTNEEF+ G K +G + P +P SVDWR+KGYVTPVKNQ QC
Sbjct: 78 NAFGDMTNEEFRQVMNGFQNQKHK-KGKMFQEPL-FAEIPKSVDWREKGYVTPVKNQGQC 135
Query: 182 GSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-G 358
GSCW+FSATG+LEGQ FRK +L+S SEQ LVDCS LMDNAFRY+KD G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRYVKDNGG 195
Query: 359 IESEGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGH 535
++SE YPY D TC P TGF D+ Q E L AVAT+GP+SVAIDAGH
Sbjct: 196 LDSEESYPYLGRDTETCNYKPECSAANDTGFVDL-PQREKALMKAVATLGPISVAIDAGH 254
Query: 536 ASFQLYKSGIYNEESCSTTQLDHGVLAVGY---GTQIGKKYWIVKNSWDVTWGESGYIKM 706
SFQ YKSGIY + CS+ LDHGVL VGY GT K+WIVKNSW WG +GY+KM
Sbjct: 255 QSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKFWIVKNSWGPEWGWNGYVKM 314
Query: 707 SKDKKNQCGIATMASYPLV 763
+KD+ N CGIAT ASYP V
Sbjct: 315 AKDQNNHCGIATAASYPTV 333
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 339
Score = 281 bits (719), Expect = 2e-75
Identities = 143/263 (54%), Positives = 175/263 (66%), Gaps = 9/263 (3%)
Frame = +2
Query: 2 NQFADMTNEEFKAKYLG-------IMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTP 160
N++ADM + EFK G +M+ + L G+TY+ P ++ V P SVDWR+ G VT
Sbjct: 78 NKYADMLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTV-PKSVDWREHGAVTG 136
Query: 161 VKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFR 340
VK+Q CGSCW+FS+TG+LEGQ+FRK L+S SEQ LVDCS LMDNAFR
Sbjct: 137 VKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFR 196
Query: 341 YIKDQG-IESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSV 517
YIKD G I++E YPY D +C N + I TGF DI +E + AVAT+GPVSV
Sbjct: 197 YIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSV 256
Query: 518 AIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGT-QIGKKYWIVKNSWDVTWGESG 694
AIDA H SFQLY G+YNE C LDHGVL VGYGT + G YW+VKNSW TWGE G
Sbjct: 257 AIDASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQG 316
Query: 695 YIKMSKDKKNQCGIATMASYPLV 763
YIKM++++ NQCGIAT +SYP V
Sbjct: 317 YIKMARNQNNQCGIATASSYPTV 339
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 334
Score = 281 bits (718), Expect = 2e-75
Identities = 149/260 (57%), Positives = 170/260 (65%), Gaps = 6/260 (2%)
Frame = +2
Query: 2 NQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQC 181
N F DMTNEEF+ G K +G + P + V P SVDW +KGYVTPVKNQ QC
Sbjct: 78 NAFGDMTNEEFRQVMNGFQNQKHK-KGKLFHEPLLVDV-PKSVDWTKKGYVTPVKNQGQC 135
Query: 182 GSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-G 358
GSCW+FSATG+LEGQ FRK +L+S SEQ LVDCS LMDNAF+YIKD G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGG 195
Query: 359 IESEGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGH 535
++SE YPY ATD +C P TGF DI Q E L AVATVGP+SVAIDAGH
Sbjct: 196 LDSEESYPYLATDTNSCNYKPECSAANDTGFVDI-PQREKALMKAVATVGPISVAIDAGH 254
Query: 536 ASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQ----IGKKYWIVKNSWDVTWGESGYIK 703
SFQ YKSGIY + CS LDHGVL VGYG + K+WIVKNSW WG +GY+K
Sbjct: 255 TSFQFYKSGIYYDPDCSCKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVK 314
Query: 704 MSKDKKNQCGIATMASYPLV 763
M+KD+ N CGIAT ASYP V
Sbjct: 315 MAKDQNNHCGIATAASYPTV 334
>sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine proteinase 1) [Contains: Cathepsin
L heavy chain; Cathepsin L light chain]
Length = 341
Score = 276 bits (707), Expect = 4e-74
Identities = 139/264 (52%), Positives = 176/264 (66%), Gaps = 10/264 (3%)
Frame = +2
Query: 2 NQFADMTNEEFKAKYLGIMKT--------KPTLEGSTYMAPENIGVLPASVDWRQKGYVT 157
N++AD+ + EF+ G T + +G T+++P ++ LP SVDWR KG VT
Sbjct: 79 NKYADLLHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHV-TLPKSVDWRTKGAVT 137
Query: 158 PVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAF 337
VK+Q CGSCW+FS+TG+LEGQ+FRK+ L+S SEQ LVDCS LMDNAF
Sbjct: 138 AVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAF 197
Query: 338 RYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVS 514
RYIKD GI++E YPY A D +C N + GFTDI +E +A AVATVGPVS
Sbjct: 198 RYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVS 257
Query: 515 VAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGT-QIGKKYWIVKNSWDVTWGES 691
VAIDA H SFQ Y G+YNE C LDHGVL VG+GT + G+ YW+VKNSW TWG+
Sbjct: 258 VAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDK 317
Query: 692 GYIKMSKDKKNQCGIATMASYPLV 763
G+IKM ++K+NQCGIA+ +SYPLV
Sbjct: 318 GFIKMLRNKENQCGIASASSYPLV 341
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precursor
Length = 321
Score = 275 bits (704), Expect = 9e-74
Identities = 139/255 (54%), Positives = 172/255 (67%), Gaps = 1/255 (0%)
Frame = +2
Query: 2 NQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQC 181
NQF DMTNEEF A G K + + A G + A VDWR K VTPVK+Q+QC
Sbjct: 70 NQFGDMTNEEFNAVMKGYKKGSRGEPKAVFTA--EAGPMAADVDWRTKALVTPVKDQEQC 127
Query: 182 GSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG- 358
GSCW+FSATG+LEGQ+F KN+ L+S SEQQLVDCS M +AF YIKD G
Sbjct: 128 GSCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGG 187
Query: 359 IESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHA 538
I++E YPY A D +C+ + + I CTG ++Q E L AV+ VGP+SVAIDA H
Sbjct: 188 IDTESSYPYEAEDRSCRFDANSIGAICTGSVEVQHTEEA-LQEAVSGVGPISVAIDASHF 246
Query: 539 SFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQIGKKYWIVKNSWDVTWGESGYIKMSKDK 718
SFQ Y SG+Y E++CS T LDHGVLAVGYGT+ K YW+VKNSW +WG++GYIKMS+++
Sbjct: 247 SFQFYSSGVYYEQNCSPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDAGYIKMSRNR 306
Query: 719 KNQCGIATMASYPLV 763
N CGIA+ SYP V
Sbjct: 307 DNNCGIASEPSYPTV 321
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precursor
Length = 323
Score = 274 bits (701), Expect = 2e-73
Identities = 134/255 (52%), Positives = 174/255 (68%), Gaps = 1/255 (0%)
Frame = +2
Query: 2 NQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQC 181
N+F DMT EEF A G + + + S + + G VDWR KG VTPVK+Q QC
Sbjct: 70 NKFGDMTLEEFNAVMKGNIPRR-SAPVSVFYPKKETGPQATEVDWRTKGAVTPVKDQGQC 128
Query: 182 GSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIK-DQG 358
GSCW+FS TGSLEGQ+F K LIS +EQQLVDCS M++AF YIK + G
Sbjct: 129 GSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNG 188
Query: 359 IESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHA 538
I++E YPY A DG+C+ + + + C+G T+I S +ET L AV +GP+SV IDA H+
Sbjct: 189 IDTEAAYPYEARDGSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHS 248
Query: 539 SFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQIGKKYWIVKNSWDVTWGESGYIKMSKDK 718
SFQ Y SG+Y E SCS + LDH VLAVGYG++ G+ +W+VKNSW +WG++GYIKMS+++
Sbjct: 249 SFQFYSSGVYYEPSCSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNR 308
Query: 719 KNQCGIATMASYPLV 763
N CGIAT+ASYPLV
Sbjct: 309 NNNCGIATVASYPLV 323
>sp|Q10991|CATL_SHEEP Cathepsin L [Contains: Cathepsin L heavy chain; Cathepsin L light
chain]
Length = 217
Score = 270 bits (691), Expect = 3e-72
Identities = 134/218 (61%), Positives = 154/218 (70%), Gaps = 2/218 (0%)
Frame = +2
Query: 116 LPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXX 295
+P SVDW +KGYVTPVKNQ QCGSCW+FSATG+LEGQ FRK +L+S SEQ LVD S
Sbjct: 1 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQ 60
Query: 296 XXXXXXXXLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNE 472
LMDNAF+YIK+ G++SE YPY ATD +C P K TGF DI Q E
Sbjct: 61 GNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDTGFVDI-PQRE 119
Query: 473 TDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQ-IGKKY 649
L AVATVGP+SVAIDAGH+SFQ YKSGIY + CS+ LDHGVL VGYG + K+
Sbjct: 120 KALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTNNKF 179
Query: 650 WIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 763
WIVKNSW WG GY+KM+KD+ N CGIAT ASYP V
Sbjct: 180 WIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAASYPTV 217
>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 334
Score = 270 bits (689), Expect = 5e-72
Identities = 142/260 (54%), Positives = 169/260 (65%), Gaps = 6/260 (2%)
Frame = +2
Query: 2 NQFADMTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQC 181
N F DMTNEEF+ G K +G + + V P SVDWR+KGYVT VKNQ QC
Sbjct: 78 NAFGDMTNEEFRQVMNGFQNQKHK-KGKVFHESLVLEV-PKSVDWREKGYVTAVKNQGQC 135
Query: 182 GSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-G 358
GSCW+FSATG+LEGQ FRK +L+S SEQ LVDCS LMDNAF+Y+KD G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGG 195
Query: 359 IESEGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGH 535
+++E YPY + +C P TGF DI Q E L AVATVGP+SVAIDAGH
Sbjct: 196 LDTEESYPYLGRETNSCTYKPECSAANDTGFVDI-PQREKALMKAVATVGPISVAIDAGH 254
Query: 536 ASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQ----IGKKYWIVKNSWDVTWGESGYIK 703
+SFQ YKSGIY + CS+ LDHGVL VGYG + K+WIVKNSW WG +GY+K
Sbjct: 255 SSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNGYVK 314
Query: 704 MSKDKKNQCGIATMASYPLV 763
M+KD+ N CGI+T ASYP V
Sbjct: 315 MAKDQNNHCGISTAASYPTV 334
Database: Non-redundant SwissProt sequences
Posted date: Dec 6, 2005 7:40 AM
Number of letters in database: 68,354,980
Number of sequences in database: 184,735
Database: swissprot.01
Posted date: Dec 6, 2005 8:18 AM
Number of letters in database: 66,202,850
Number of sequences in database: 184,431
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 92,962,174
Number of Sequences: 369166
Number of extensions: 1845692
Number of successful extensions: 5721
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4985
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5156
length of database: 68,354,980
effective HSP length: 109
effective length of database: 48,218,865
effective search space used: 8052550455
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
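
The statistical parameters above are enough to reproduce, approximately, the scores reported for each hit. The Python sketch below uses the standard Karlin-Altschul relations: bit score S' = (lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-S'). It is an illustration under stated assumptions, not the exact BLAST implementation: lambda and K are taken as printed (rounded to three significant figures), so results agree with the report only to within rounding. The function names are hypothetical. The effective query length of 167 used in the last check is not printed in the report; it is inferred here by dividing the effective search space by the effective database length.

```python
import math


def bit_score(raw_score: float, lam: float, k: float) -> float:
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def e_value(bits: float, search_space: float) -> float:
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


def nats_to_bits(raw: float, lam: float) -> float:
    """Convert a raw drop-off value to bits (no K term): lambda * X / ln 2."""
    return lam * raw / math.log(2)


def effective_db_length(total_letters: int, n_sequences: int, hsp_len: int) -> int:
    """Database length after removing the effective HSP length from every sequence."""
    return total_letters - n_sequences * hsp_len


# Gapped parameters from the report: Lambda = 0.267, K = 0.0410
top_hit_bits = bit_score(730, 0.267, 0.0410)          # reported as 285 bits
top_hit_e = e_value(top_hit_bits, 8052550455)          # reported as 8e-77
print(round(top_hit_bits, 1), f"{top_hit_e:.1e}")

# Ungapped parameters: Lambda = 0.318, K = 0.134
print(round(bit_score(41, 0.318, 0.134), 1))           # S1: 41 (21.7 bits)
print(round(nats_to_bits(16, 0.318), 1))               # X1: 16 ( 7.3 bits)

# Effective database length from the database totals and effective HSP length
print(effective_db_length(68354980, 184735, 109))      # reported as 48,218,865
```

The X2 and X3 values follow the same drop-off conversion with the gapped lambda (0.267).
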