Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= Dr_sW_001_N22
(817 letters)
Database: Non-redundant SwissProt sequences
184,735 sequences; 68,354,980 total letters
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) ...   279   7e-75
sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted ...   278   1e-74
sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathe...   275   8e-74
sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathe...   274   2e-73
sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathe...   273   3e-73
sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine protei...   271   2e-72
sp|Q10991|CATL_SHEEP Cathepsin L [Contains: Cathepsin L hea...   270   3e-72
sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precu...   267   3e-71
sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precu...   266   4e-71
sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Catheps...   263   4e-70
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
Length = 334
Score = 279 bits (713), Expect = 7e-75
Identities = 140/254 (55%), Positives = 169/254 (66%), Gaps = 5/254 (1%)
Frame = +2
Query: 2 MTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWS 181
MTNEEF+ + +G + + +G + P + LP SVDWR+KGYVTPVKNQ+QCGSCW+
Sbjct: 83 MTNEEFR-QMMGCFRNQKFRKGKVFREPLFLD-LPKSVDWRKKGYVTPVKNQKQCGSCWA 140
Query: 182 FSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEG 358
FSATG+LEGQ FRK +L+S SEQ LVDCS M AF+Y+K+ G++SE
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEE 200
Query: 359 DYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 538
YPY A D CK P V TGFT + E L AVATVGP+SVA+DAGH+SFQ Y
Sbjct: 201 SYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFY 260
Query: 539 KSGIYNEESCSTTQLDHGVLAVGYG----TQIGKKYWIVKNSWDVTWGESGYIKMSKDKK 706
KSGIY E CS+ LDHGVL VGYG KYW+VKNSW WG +GY+K++KDK
Sbjct: 261 KSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKN 320
Query: 707 NQCGIATMASYPLV 748
N CGIAT ASYP V
Sbjct: 321 NHCGIATAASYPNV 334
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
Cathepsin L heavy chain; Cathepsin L light chain]
Length = 333
Score = 278 bits (711), Expect = 1e-74
Identities = 144/254 (56%), Positives = 168/254 (66%), Gaps = 5/254 (1%)
Frame = +2
Query: 2 MTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWS 181
MT+EEF+ G KP +G + P P SVDWR+KGYVTPVKNQ QCGSCW+
Sbjct: 83 MTSEEFRQVMNGFQNRKPR-KGKVFQEPLFYEA-PRSVDWREKGYVTPVKNQGQCGSCWA 140
Query: 182 FSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEG 358
FSATG+LEGQ FRK RLIS SEQ LVDCS LMD AF+Y++D G++SE
Sbjct: 141 FSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEE 200
Query: 359 DYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 538
YPY AT+ +CK NP V TGF DI Q E L AVATVGP+SVAIDAGH SF Y
Sbjct: 201 SYPYEATEESCKYNPKYSVANDTGFVDIPKQ-EKALMKAVATVGPISVAIDAGHESFLFY 259
Query: 539 KSGIYNEESCSTTQLDHGVLAVGYGTQI----GKKYWIVKNSWDVTWGESGYIKMSKDKK 706
K GIY E CS+ +DHGVL VGYG + KYW+VKNSW WG GY+KM+KD++
Sbjct: 260 KEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRR 319
Query: 707 NQCGIATMASYPLV 748
N CGIA+ ASYP V
Sbjct: 320 NHCGIASAASYPTV 333
>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 333
Score = 275 bits (704), Expect = 8e-74
Identities = 144/254 (56%), Positives = 166/254 (65%), Gaps = 5/254 (1%)
Frame = +2
Query: 2 MTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWS 181
MTNEEF+ G K +G + P +P SVDWR+KGYVTPVKNQ QCGSCW+
Sbjct: 83 MTNEEFRQVMNGFQNQKHK-KGKMFQEPL-FAEIPKSVDWREKGYVTPVKNQGQCGSCWA 140
Query: 182 FSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEG 358
FSATG+LEGQ FRK +L+S SEQ LVDCS LMDNAFRY+KD G++SE
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRYVKDNGGLDSEE 200
Query: 359 DYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQL 535
YPY D TC P TGF D+ Q E L AVAT+GP+SVAIDAGH SFQ
Sbjct: 201 SYPYLGRDTETCNYKPECSAANDTGFVDL-PQREKALMKAVATLGPISVAIDAGHQSFQF 259
Query: 536 YKSGIYNEESCSTTQLDHGVLAVGY---GTQIGKKYWIVKNSWDVTWGESGYIKMSKDKK 706
YKSGIY + CS+ LDHGVL VGY GT K+WIVKNSW WG +GY+KM+KD+
Sbjct: 260 YKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKFWIVKNSWGPEWGWNGYVKMAKDQN 319
Query: 707 NQCGIATMASYPLV 748
N CGIAT ASYP V
Sbjct: 320 NHCGIATAASYPTV 333
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 334
Score = 274 bits (701), Expect = 2e-73
Identities = 146/255 (57%), Positives = 167/255 (65%), Gaps = 6/255 (2%)
Frame = +2
Query: 2 MTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWS 181
MTNEEF+ G K +G + P + V P SVDW +KGYVTPVKNQ QCGSCW+
Sbjct: 83 MTNEEFRQVMNGFQNQKHK-KGKLFHEPLLVDV-PKSVDWTKKGYVTPVKNQGQCGSCWA 140
Query: 182 FSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEG 358
FSATG+LEGQ FRK +L+S SEQ LVDCS LMDNAF+YIKD G++SE
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLDSEE 200
Query: 359 DYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQL 535
YPY ATD +C P TGF DI Q E L AVATVGP+SVAIDAGH SFQ
Sbjct: 201 SYPYLATDTNSCNYKPECSAANDTGFVDI-PQREKALMKAVATVGPISVAIDAGHTSFQF 259
Query: 536 YKSGIYNEESCSTTQLDHGVLAVGYGTQ----IGKKYWIVKNSWDVTWGESGYIKMSKDK 703
YKSGIY + CS LDHGVL VGYG + K+WIVKNSW WG +GY+KM+KD+
Sbjct: 260 YKSGIYYDPDCSCKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKMAKDQ 319
Query: 704 KNQCGIATMASYPLV 748
N CGIAT ASYP V
Sbjct: 320 NNHCGIATAASYPTV 334
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 339
Score = 273 bits (699), Expect = 3e-73
Identities = 140/258 (54%), Positives = 170/258 (65%), Gaps = 9/258 (3%)
Frame = +2
Query: 2 MTNEEFKAKYLG-------IMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQ 160
M + EFK G +M+ + L G+TY+ P ++ V P SVDWR+ G VT VK+Q
Sbjct: 83 MLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTV-PKSVDWREHGAVTGVKDQG 141
Query: 161 QCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ 340
CGSCW+FS+TG+LEGQ+FRK L+S SEQ LVDCS LMDNAFRYIKD
Sbjct: 142 HCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 201
Query: 341 G-IESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAG 517
G I++E YPY D +C N + I TGF DI +E + AVAT+GPVSVAIDA
Sbjct: 202 GGIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDAS 261
Query: 518 HASFQLYKSGIYNEESCSTTQLDHGVLAVGYGT-QIGKKYWIVKNSWDVTWGESGYIKMS 694
H SFQLY G+YNE C LDHGVL VGYGT + G YW+VKNSW TWGE GYIKM+
Sbjct: 262 HESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMA 321
Query: 695 KDKKNQCGIATMASYPLV 748
+++ NQCGIAT +SYP V
Sbjct: 322 RNQNNQCGIATASSYPTV 339
>sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine proteinase 1) [Contains: Cathepsin
L heavy chain; Cathepsin L light chain]
Length = 341
Score = 271 bits (693), Expect = 2e-72
Identities = 132/233 (56%), Positives = 164/233 (70%), Gaps = 2/233 (0%)
Frame = +2
Query: 56 TLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRL 235
+ +G T+++P ++ LP SVDWR KG VT VK+Q CGSCW+FS+TG+LEGQ+FRK+ L
Sbjct: 110 SFKGVTFISPAHV-TLPKSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVL 168
Query: 236 ISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKI 412
+S SEQ LVDCS LMDNAFRYIKD GI++E YPY A D +C N +
Sbjct: 169 VSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTV 228
Query: 413 VTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHG 592
GFTDI +E +A AVATVGPVSVAIDA H SFQ Y G+YNE C LDHG
Sbjct: 229 GATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHG 288
Query: 593 VLAVGYGT-QIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 748
VL VG+GT + G+ YW+VKNSW TWG+ G+IKM ++K+NQCGIA+ +SYPLV
Sbjct: 289 VLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 341
>sp|Q10991|CATL_SHEEP Cathepsin L [Contains: Cathepsin L heavy chain; Cathepsin L light
chain]
Length = 217
Score = 270 bits (691), Expect = 3e-72
Identities = 134/218 (61%), Positives = 154/218 (70%), Gaps = 2/218 (0%)
Frame = +2
Query: 101 LPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXX 280
+P SVDW +KGYVTPVKNQ QCGSCW+FSATG+LEGQ FRK +L+S SEQ LVD S
Sbjct: 1 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQ 60
Query: 281 XXXXXXXXLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNE 457
LMDNAF+YIK+ G++SE YPY ATD +C P K TGF DI Q E
Sbjct: 61 GNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDTGFVDI-PQRE 119
Query: 458 TDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQ-IGKKY 634
L AVATVGP+SVAIDAGH+SFQ YKSGIY + CS+ LDHGVL VGYG + K+
Sbjct: 120 KALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTNNKF 179
Query: 635 WIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 748
WIVKNSW WG GY+KM+KD+ N CGIAT ASYP V
Sbjct: 180 WIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAASYPTV 217
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precursor
Length = 323
Score = 267 bits (682), Expect = 3e-71
Identities = 131/250 (52%), Positives = 170/250 (68%), Gaps = 1/250 (0%)
Frame = +2
Query: 2 MTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWS 181
MT EEF A G + + + S + + G VDWR KG VTPVK+Q QCGSCW+
Sbjct: 75 MTLEEFNAVMKGNIPRR-SAPVSVFYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWA 133
Query: 182 FSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIK-DQGIESEG 358
FS TGSLEGQ+F K LIS +EQQLVDCS M++AF YIK + GI++E
Sbjct: 134 FSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEA 193
Query: 359 DYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 538
YPY A DG+C+ + + + C+G T+I S +ET L AV +GP+SV IDA H+SFQ Y
Sbjct: 194 AYPYEARDGSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFY 253
Query: 539 KSGIYNEESCSTTQLDHGVLAVGYGTQIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQCG 718
SG+Y E SCS + LDH VLAVGYG++ G+ +W+VKNSW +WG++GYIKMS+++ N CG
Sbjct: 254 SSGVYYEPSCSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCG 313
Query: 719 IATMASYPLV 748
IAT+ASYPLV
Sbjct: 314 IATVASYPLV 323
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precursor
Length = 321
Score = 266 bits (681), Expect = 4e-71
Identities = 135/250 (54%), Positives = 168/250 (67%), Gaps = 1/250 (0%)
Frame = +2
Query: 2 MTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWS 181
MTNEEF A G K + + A G + A VDWR K VTPVK+Q+QCGSCW+
Sbjct: 75 MTNEEFNAVMKGYKKGSRGEPKAVFTA--EAGPMAADVDWRTKALVTPVKDQEQCGSCWA 132
Query: 182 FSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESEG 358
FSATG+LEGQ+F KN+ L+S SEQQLVDCS M +AF YIKD G I++E
Sbjct: 133 FSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTES 192
Query: 359 DYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 538
YPY A D +C+ + + I CTG ++Q E L AV+ VGP+SVAIDA H SFQ Y
Sbjct: 193 SYPYEAEDRSCRFDANSIGAICTGSVEVQHTEEA-LQEAVSGVGPISVAIDASHFSFQFY 251
Query: 539 KSGIYNEESCSTTQLDHGVLAVGYGTQIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQCG 718
SG+Y E++CS T LDHGVLAVGYGT+ K YW+VKNSW +WG++GYIKMS+++ N CG
Sbjct: 252 SSGVYYEQNCSPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNNCG 311
Query: 719 IATMASYPLV 748
IA+ SYP V
Sbjct: 312 IASEPSYPTV 321
>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 334
Score = 263 bits (672), Expect = 4e-70
Identities = 139/255 (54%), Positives = 166/255 (65%), Gaps = 6/255 (2%)
Frame = +2
Query: 2 MTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWS 181
MTNEEF+ G K +G + + V P SVDWR+KGYVT VKNQ QCGSCW+
Sbjct: 83 MTNEEFRQVMNGFQNQKHK-KGKVFHESLVLEV-PKSVDWREKGYVTAVKNQGQCGSCWA 140
Query: 182 FSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEG 358
FSATG+LEGQ FRK +L+S SEQ LVDCS LMDNAF+Y+KD G+++E
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEE 200
Query: 359 DYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQL 535
YPY + +C P TGF DI Q E L AVATVGP+SVAIDAGH+SFQ
Sbjct: 201 SYPYLGRETNSCTYKPECSAANDTGFVDI-PQREKALMKAVATVGPISVAIDAGHSSFQF 259
Query: 536 YKSGIYNEESCSTTQLDHGVLAVGYGTQ----IGKKYWIVKNSWDVTWGESGYIKMSKDK 703
YKSGIY + CS+ LDHGVL VGYG + K+WIVKNSW WG +GY+KM+KD+
Sbjct: 260 YKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNGYVKMAKDQ 319
Query: 704 KNQCGIATMASYPLV 748
N CGI+T ASYP V
Sbjct: 320 NNHCGISTAASYPTV 334
Database: Non-redundant SwissProt sequences
Posted date: Dec 6, 2005 7:40 AM
Number of letters in database: 68,354,980
Number of sequences in database: 184,735
Database: swissprot.01
Posted date: Dec 6, 2005 8:18 AM
Number of letters in database: 66,202,850
Number of sequences in database: 184,431
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 91,363,911
Number of Sequences: 369166
Number of extensions: 1807801
Number of successful extensions: 5551
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4882
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5050
length of database: 68,354,980
effective HSP length: 109
effective length of database: 48,218,865
effective search space used: 7811456130
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
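
Note on the statistics above: the bit scores and Expect values in this report can be cross-checked from the Lambda and K parameters using the standard Karlin-Altschul relations, bits = (Lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-bits). The Python sketch below is only an illustration and is not part of the BLAST output. The function names are made up for this example, and the choice of which Lambda applies to each cutoff (ungapped for X1 and S1, gapped for X2 and X3) is an assumption inferred from the printed bit values rather than something the report states.

```python
import math

# Parameters copied from the statistics block of this report.
LAMBDA_UNGAPPED, K_UNGAPPED = 0.318, 0.134
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410
SEARCH_SPACE = 7811456130      # "effective search space used"
DB_LETTERS = 68354980          # "length of database"
DB_SEQUENCES = 184735          # "Number of sequences in database"
EFFECTIVE_HSP_LENGTH = 109     # "effective HSP length"


def bit_score(raw, lam, k):
    """Convert a raw alignment score to bits (hypothetical helper)."""
    return (lam * raw - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


def dropoff_bits(raw, lam):
    """Convert an X-dropoff value to bits (no K term for dropoffs)."""
    return lam * raw / math.log(2)


# Top hit (CATL2_HUMAN): raw score 713, reported as 279 bits, E = 7e-75.
top_bits = bit_score(713, LAMBDA_GAPPED, K_GAPPED)
top_e = expect(top_bits, SEARCH_SPACE)

# Cutoffs from the footer; the Lambda pairing is an assumption (see above).
x1 = dropoff_bits(16, LAMBDA_UNGAPPED)
x2 = dropoff_bits(38, LAMBDA_GAPPED)
x3 = dropoff_bits(64, LAMBDA_GAPPED)
s1 = bit_score(41, LAMBDA_UNGAPPED, K_UNGAPPED)

# Effective database length: each sequence is shortened by the HSP length.
effective_db = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH

print(f"top hit: {top_bits:.1f} bits, E = {top_e:.0e}")
print(f"X1 = {x1:.1f}, X2 = {x2:.1f}, X3 = {x3:.1f}, S1 = {s1:.1f} bits")
print(f"effective length of database: {effective_db:,}")
```

Under these assumptions the computed values agree with the report to the printed precision: the top hit rounds to 279 bits, and the effective database length comes out at 48,218,865. Small differences in the Expect value are to be expected because Lambda and K are printed with only three significant digits.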