Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= Dr_sW_016_M11
(761 letters)
Database: Non-redundant SwissProt sequences
184,735 sequences; 68,354,980 total letters
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) ... 192 1e-48
sp|Q24940|CATLP_FASHE Cathepsin L-like proteinase precursor 188 1e-47
sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathe... 187 3e-47
sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine protei... 185 1e-46
sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathe... 181 2e-45
sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Catheps... 178 1e-44
sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted ... 178 2e-44
sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathe... 178 2e-44
sp|O35186|CATK_RAT Cathepsin K precursor 177 3e-44
sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted ... 176 5e-44
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
Length = 334
Score = 192 bits (487), Expect = 1e-48
Identities = 102/257 (39%), Positives = 144/257 (56%), Gaps = 5/257 (1%)
Frame = +3
Query: 6 HNLQFDLGKTKFSVGLNEFSDMNQKEFQSNILSVXXXXXXXXXXXXXXXXXXLSRNDFIT 185
HN ++ GK F++ +N F DM +EF+ + +
Sbjct: 62 HNGEYSQGKHGFTMAMNAFGDMTNEEFRQMM--------------GCFRNQKFRKGKVFR 107
Query: 186 TPNNIKLPDSWDWREKGAVSPVGNQRNNSCGYAFAAAGALEGQNFNLTKTLEILSKQQII 365
P + LP S DWR+KG V+PV NQ+ +AF+A GALEGQ F T L LS+Q ++
Sbjct: 108 EPLFLDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLV 167
Query: 366 DCSIYYGNSGCYGGILSKAYAYLADYGS-ELDEDYPFVGCNSNCKYDKSLATVKPYGFKY 542
DCS GN GC GG +++A+ Y+ + G + +E YP+V + CKY + GF
Sbjct: 168 DCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTV 227
Query: 543 VSRGKEADLMNAVYNIGPISAAIDASPTSFKQYKTGIYDDTSCSSNNVNHAVLVIGYGEE 722
V+ GKE LM AV +GPIS A+DA +SF+ YK+GIY + CSS N++H VLV+GYG E
Sbjct: 228 VAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFE 287
Query: 723 SGQS----FWIIKNSWG 761
S +W++KNSWG
Sbjct: 288 GANSNNSKYWLVKNSWG 304
>sp|Q24940|CATLP_FASHE Cathepsin L-like proteinase precursor
Length = 326
Score = 188 bits (478), Expect = 1e-47
Identities = 97/259 (37%), Positives = 141/259 (54%), Gaps = 6/259 (2%)
Frame = +3
Query: 3 QHNLQFDLGKTKFSVGLNEFSDMNQKEFQSNILSVXXXXXXXXXXXXXXXXXXLSRNDFI 182
+HNL+ DLG +++GLN+F+DM +EF++ L+ +SR I
Sbjct: 53 EHNLRHDLGLVTYTLGLNQFTDMTFEEFKAKYLT------------------EMSRASDI 94
Query: 183 TT------PNNIKLPDSWDWREKGAVSPVGNQRNNSCGYAFAAAGALEGQNFNLTKTLEI 344
+ NN +PD DWRE G V+ V +Q N +AF+ G +EGQ +T
Sbjct: 95 LSHGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSIS 154
Query: 345 LSKQQIIDCSIYYGNSGCYGGILSKAYAYLADYGSELDEDYPFVGCNSNCKYDKSLATVK 524
S+QQ++DCS +GN+GC GG++ AY YL +G E + YP+ C+Y+K L K
Sbjct: 155 FSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQFGLETESSYPYTAVEGQCRYNKQLGVAK 214
Query: 525 PYGFKYVSRGKEADLMNAVYNIGPISAAIDASPTSFKQYKTGIYDDTSCSSNNVNHAVLV 704
G+ V G E +L N V P + A+D + F Y++GIY +CS VNHAVL
Sbjct: 215 VTGYYTVHSGSEVELKNLVGARRPAAVAVDVE-SDFMMYRSGIYQSQTCSPLRVNHAVLA 273
Query: 705 IGYGEESGQSFWIIKNSWG 761
+GYG + G +WI+KNSWG
Sbjct: 274 VGYGTQGGTDYWIVKNSWG 292
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 339
Score = 187 bits (474), Expect = 3e-47
Identities = 95/255 (37%), Positives = 146/255 (57%), Gaps = 2/255 (0%)
Frame = +3
Query: 3 QHNLQFDLGKTKFSVGLNEFSDMNQKEFQSNILSVXXXXXXXXXXXXXXXXXXLSRNDFI 182
+HN F GK + +GLN+++DM EF+ + L +I
Sbjct: 61 KHNQLFAQGKVSYKLGLNKYADMLHHEFKETM-----NGYNHTLRQLMRERTGLVGATYI 115
Query: 183 TTPNNIKLPDSWDWREKGAVSPVGNQRNNSCGYAFAAAGALEGQNFNLTKTLEILSKQQI 362
P ++ +P S DWRE GAV+ V +Q + +AF++ GALEGQ+F L LS+Q +
Sbjct: 116 -PPAHVTVPKSVDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNL 174
Query: 363 IDCSIYYGNSGCYGGILSKAYAYLADYGS-ELDEDYPFVGCNSNCKYDKSLATVKPYGFK 539
+DCS YGN+GC GG++ A+ Y+ D G + ++ YP+ G + +C ++K+ GF
Sbjct: 175 VDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDTGFV 234
Query: 540 YVSRGKEADLMNAVYNIGPISAAIDASPTSFKQYKTGIYDDTSCSSNNVNHAVLVIGYG- 716
+ G E + AV +GP+S AIDAS SF+ Y G+Y++ C N++H VLV+GYG
Sbjct: 235 DIPEGDEEKMKKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGT 294
Query: 717 EESGQSFWIIKNSWG 761
+ESG +W++KNSWG
Sbjct: 295 DESGMDYWLVKNSWG 309
>sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine proteinase 1) [Contains: Cathepsin
L heavy chain; Cathepsin L light chain]
Length = 341
Score = 185 bits (469), Expect = 1e-46
Identities = 92/255 (36%), Positives = 148/255 (58%), Gaps = 2/255 (0%)
Frame = +3
Query: 3 QHNLQFDLGKTKFSVGLNEFSDMNQKEFQSNILSVXXXXXXXXXXXXXXXXXXLSRNDFI 182
+HN +F GK F + +N+++D+ EF+ + FI
Sbjct: 62 KHNQRFAEGKVSFKLAVNKYADLLHHEFRQ----LMNGFNYTLHKQLRAADESFKGVTFI 117
Query: 183 TTPNNIKLPDSWDWREKGAVSPVGNQRNNSCGYAFAAAGALEGQNFNLTKTLEILSKQQI 362
+ P ++ LP S DWR KGAV+ V +Q + +AF++ GALEGQ+F + L LS+Q +
Sbjct: 118 S-PAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNL 176
Query: 363 IDCSIYYGNSGCYGGILSKAYAYLADYGS-ELDEDYPFVGCNSNCKYDKSLATVKPYGFK 539
+DCS YGN+GC GG++ A+ Y+ D G + ++ YP+ + +C ++K GF
Sbjct: 177 VDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATDRGFT 236
Query: 540 YVSRGKEADLMNAVYNIGPISAAIDASPTSFKQYKTGIYDDTSCSSNNVNHAVLVIGYG- 716
+ +G E + AV +GP+S AIDAS SF+ Y G+Y++ C + N++H VLV+G+G
Sbjct: 237 DIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT 296
Query: 717 EESGQSFWIIKNSWG 761
+ESG+ +W++KNSWG
Sbjct: 297 DESGEDYWLVKNSWG 311
>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 333
Score = 181 bits (459), Expect = 2e-45
Identities = 102/257 (39%), Positives = 141/257 (54%), Gaps = 5/257 (1%)
Frame = +3
Query: 6 HNLQFDLGKTKFSVGLNEFSDMNQKEFQSNILSVXXXXXXXXXXXXXXXXXXLSRNDFIT 185
HN ++ GK F++ +N F DM +EF+ + +
Sbjct: 62 HNREYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFQNQKH--------------KKGKMFQ 107
Query: 186 TPNNIKLPDSWDWREKGAVSPVGNQRNNSCGYAFAAAGALEGQNFNLTKTLEILSKQQII 365
P ++P S DWREKG V+PV NQ +AF+A GALEGQ F T L LS+Q ++
Sbjct: 108 EPLFAEIPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLV 167
Query: 366 DCSIYYGNSGCYGGILSKAYAYLADYGS-ELDEDYPFVGCNS-NCKYDKSLATVKPYGFK 539
DCS GN GC GG++ A+ Y+ D G + +E YP++G ++ C Y + GF
Sbjct: 168 DCSRAQGNEGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFV 227
Query: 540 YVSRGKEADLMNAVYNIGPISAAIDASPTSFKQYKTGIYDDTSCSSNNVNHAVLVIGYGE 719
+ + +E LM AV +GPIS AIDA SF+ YK+GIY D CSS +++H VLV+GYG
Sbjct: 228 DLPQ-REKALMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGF 286
Query: 720 E---SGQSFWIIKNSWG 761
E S FWI+KNSWG
Sbjct: 287 EGTDSNNKFWIVKNSWG 303
>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 334
Score = 178 bits (452), Expect = 1e-44
Identities = 102/258 (39%), Positives = 141/258 (54%), Gaps = 6/258 (2%)
Frame = +3
Query: 6 HNLQFDLGKTKFSVGLNEFSDMNQKEFQSNILSVXXXXXXXXXXXXXXXXXXLSRNDFIT 185
HN ++ GK FS+ +N F DM +EF+ + +
Sbjct: 62 HNQEYSQGKHGFSMAMNAFGDMTNEEFRQVMNGFQNQKH--------------KKGKVFH 107
Query: 186 TPNNIKLPDSWDWREKGAVSPVGNQRNNSCGYAFAAAGALEGQNFNLTKTLEILSKQQII 365
+++P S DWREKG V+ V NQ +AF+A GALEGQ F T L LS+Q ++
Sbjct: 108 ESLVLEVPKSVDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLV 167
Query: 366 DCSIYYGNSGCYGGILSKAYAYLADYGS-ELDEDYPFVGCNSN-CKYDKSLATVKPYGFK 539
DCS GN GC GG++ A+ Y+ D G + +E YP++G +N C Y + GF
Sbjct: 168 DCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFV 227
Query: 540 YVSRGKEADLMNAVYNIGPISAAIDASPTSFKQYKTGIYDDTSCSSNNVNHAVLVIGYGE 719
+ + +E LM AV +GPIS AIDA +SF+ YK+GIY D CSS +++H VLV+GYG
Sbjct: 228 DIPQ-REKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGF 286
Query: 720 ESGQS----FWIIKNSWG 761
E S FWI+KNSWG
Sbjct: 287 EGTDSNSSKFWIVKNSWG 304
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
Cathepsin L heavy chain; Cathepsin L light chain]
Length = 333
Score = 178 bits (451), Expect = 2e-44
Identities = 100/257 (38%), Positives = 140/257 (54%), Gaps = 5/257 (1%)
Frame = +3
Query: 6 HNLQFDLGKTKFSVGLNEFSDMNQKEFQSNILSVXXXXXXXXXXXXXXXXXXLSRNDFIT 185
HN ++ GK F++ +N F DM +EF+ + +
Sbjct: 62 HNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPR--------------KGKVFQ 107
Query: 186 TPNNIKLPDSWDWREKGAVSPVGNQRNNSCGYAFAAAGALEGQNFNLTKTLEILSKQQII 365
P + P S DWREKG V+PV NQ +AF+A GALEGQ F T L LS+Q ++
Sbjct: 108 EPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLV 167
Query: 366 DCSIYYGNSGCYGGILSKAYAYLADYGS-ELDEDYPFVGCNSNCKYDKSLATVKPYGFKY 542
DCS GN GC GG++ A+ Y+ D G + +E YP+ +CKY+ + GF
Sbjct: 168 DCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVD 227
Query: 543 VSRGKEADLMNAVYNIGPISAAIDASPTSFKQYKTGIYDDTSCSSNNVNHAVLVIGYGEE 722
+ + ++A LM AV +GPIS AIDA SF YK GIY + CSS +++H VLV+GYG E
Sbjct: 228 IPKQEKA-LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFE 286
Query: 723 SGQS----FWIIKNSWG 761
S +S +W++KNSWG
Sbjct: 287 STESDNNKYWLVKNSWG 303
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
L light chain]
Length = 334
Score = 178 bits (451), Expect = 2e-44
Identities = 100/258 (38%), Positives = 139/258 (53%), Gaps = 6/258 (2%)
Frame = +3
Query: 6 HNLQFDLGKTKFSVGLNEFSDMNQKEFQSNILSVXXXXXXXXXXXXXXXXXXLSRNDFIT 185
HN ++ GK F + +N F DM +EF+ + +
Sbjct: 62 HNQEYSEGKHAFRMAMNAFGDMTNEEFRQVMNGFQNQKH--------------KKGKLFH 107
Query: 186 TPNNIKLPDSWDWREKGAVSPVGNQRNNSCGYAFAAAGALEGQNFNLTKTLEILSKQQII 365
P + +P S DW +KG V+PV NQ +AF+A GALEGQ F T L LS+Q ++
Sbjct: 108 EPLLVDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLV 167
Query: 366 DCSIYYGNSGCYGGILSKAYAYLADYGS-ELDEDYPFVGCNSN-CKYDKSLATVKPYGFK 539
DCS GN GC GG++ A+ Y+ D G + +E YP++ ++N C Y + GF
Sbjct: 168 DCSRAQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFV 227
Query: 540 YVSRGKEADLMNAVYNIGPISAAIDASPTSFKQYKTGIYDDTSCSSNNVNHAVLVIGYGE 719
+ + +E LM AV +GPIS AIDA TSF+ YK+GIY D CS +++H VLV+GYG
Sbjct: 228 DIPQ-REKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSCKDLDHGVLVVGYGF 286
Query: 720 ESGQS----FWIIKNSWG 761
E S FWI+KNSWG
Sbjct: 287 EGTDSNNNKFWIVKNSWG 304
>sp|O35186|CATK_RAT Cathepsin K precursor
Length = 329
Score = 177 bits (449), Expect = 3e-44
Identities = 99/254 (38%), Positives = 139/254 (54%), Gaps = 2/254 (0%)
Frame = +3
Query: 6 HNLQFDLGKTKFSVGLNEFSDMNQKEFQSNILSVXXXXXXXXXXXXXXXXXXLSRNDFIT 185
HNL+ LG + + +N DM +E + + ND +
Sbjct: 60 HNLEASLGAHTYELAMNHLGDMTSEEVVQKMTGLRVPPSRSFS------------NDTLY 107
Query: 186 TPN-NIKLPDSWDWREKGAVSPVGNQRNNSCGYAFAAAGALEGQNFNLTKTLEILSKQQI 362
TP ++PDS D+R+KG V+PV NQ +AF++AGALEGQ T L LS Q +
Sbjct: 108 TPEWEGRVPDSIDYRKKGYVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNL 167
Query: 363 IDCSIYYGNSGCYGGILSKAYAYLADYGSELDED-YPFVGCNSNCKYDKSLATVKPYGFK 539
+DC N GC GG ++ A+ Y+ G ED YP+VG + +C Y+ + K G++
Sbjct: 168 VDC--VSENYGCGGGYMTTAFQYVQQNGGIDSEDAYPYVGQDESCMYNATAKAAKCRGYR 225
Query: 540 YVSRGKEADLMNAVYNIGPISAAIDASPTSFKQYKTGIYDDTSCSSNNVNHAVLVIGYGE 719
+ G E L AV +GP+S +IDAS TSF+ Y G+Y D +C +NVNHAVLV+GYG
Sbjct: 226 EIPVGNEKALKRAVARVGPVSVSIDASLTSFQFYSRGVYYDENCDRDNVNHAVLVVGYGT 285
Query: 720 ESGQSFWIIKNSWG 761
+ G +WIIKNSWG
Sbjct: 286 QKGNKYWIIKNSWG 299
>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
proteinase) [Contains: Cathepsin L heavy chain;
Cathepsin L light chain]
Length = 334
Score = 176 bits (447), Expect = 5e-44
Identities = 97/257 (37%), Positives = 142/257 (55%), Gaps = 5/257 (1%)
Frame = +3
Query: 6 HNLQFDLGKTKFSVGLNEFSDMNQKEFQSNILSVXXXXXXXXXXXXXXXXXXLSRNDFIT 185
HN ++ G+ FS+ +N F DM +EF+ + +
Sbjct: 62 HNGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKH--------------KKGRLFQ 107
Query: 186 TPNNIKLPDSWDWREKGAVSPVGNQRNNSCGYAFAAAGALEGQNFNLTKTLEILSKQQII 365
P +K+P S DWREKG V+PV NQ +AF+A+G LEGQ F T L LS+Q ++
Sbjct: 108 EPLMLKIPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLV 167
Query: 366 DCSIYYGNSGCYGGILSKAYAYLADYGS-ELDEDYPFVGCNSNCKYDKSLATVKPYGFKY 542
DCS GN GC GG++ A+ Y+ + G + +E YP+ + +CKY A GF
Sbjct: 168 DCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVD 227
Query: 543 VSRGKEADLMNAVYNIGPISAAIDASPTSFKQYKTGIYDDTSCSSNNVNHAVLVIGYGEE 722
+ + ++A LM AV +GPIS A+DAS S + Y +GIY + +CSS N++H VL++GYG E
Sbjct: 228 IPQQEKA-LMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYE 286
Query: 723 SGQS----FWIIKNSWG 761
S +W++KNSWG
Sbjct: 287 GTDSNKNKYWLVKNSWG 303
Database: Non-redundant SwissProt sequences
Posted date: Dec 6, 2005 7:40 AM
Number of letters in database: 68,354,980
Number of sequences in database: 184,735
Database: swissprot.01
Posted date: Dec 6, 2005 8:18 AM
Number of letters in database: 66,202,850
Number of sequences in database: 184,431
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 76,102,904
Number of Sequences: 369166
Number of extensions: 1367992
Number of successful extensions: 4411
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3811
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3969
length of database: 68,354,980
effective HSP length: 108
effective length of database: 48,403,600
effective search space used: 7018522000
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
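
The statistics above can be cross-checked against the reported scores. The following is a minimal sketch, not part of the BLAST output itself. It assumes the standard Karlin-Altschul relations (bit score = (Lambda * raw - ln K) / ln 2, and E = search space * 2^-bits) and assumes that the 761-letter nucleotide query is counted as 761 // 3 = 253 residues after translation, as the "Frame = +3" lines suggest. The function names are hypothetical and chosen for illustration only.

```python
import math


def bit_score(raw, lam, k):
    """Convert a raw alignment score to bits using Lambda and K."""
    return (lam * raw - math.log(k)) / math.log(2)


def effective_lengths(query_nt, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length).

    Assumption: the nucleotide query is translated, so its length in
    residues is query_nt // 3. The effective HSP length is subtracted
    once from the query and once per database sequence.
    """
    query_aa = query_nt // 3
    eff_query = query_aa - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db


def expect(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


# Values copied from the report: gapped Lambda and K, the raw score of
# the top hit (487), and the database and query sizes.
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410
bits = bit_score(487, LAMBDA_GAPPED, K_GAPPED)
eff_query, eff_db = effective_lengths(761, 68_354_980, 184_735, 108)
space = eff_query * eff_db
e_value = expect(bits, space)

print(f"bit score      : {bits:.1f}")
print(f"effective db   : {eff_db:,}")
print(f"search space   : {space:,}")
print(f"expect value   : {e_value:.1e}")
```

Under these assumptions the sketch gives about 192 bits for the top hit, an effective database length of 48,403,600 and a search space of 7,018,522,000, in line with the figures reported above. Because Lambda and K are printed to only a few digits, recomputed bit scores for other hits can differ slightly from the integer values in the hit table.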