Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_001_N22
         (817 letters)

Database: Non-redundant SwissProt sequences
           184,735 sequences; 68,354,980 total letters

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) ...   279   7e-75
sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted ...   278   1e-74
sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathe...   275   8e-74
sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathe...   274   2e-73
sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathe...   273   3e-73
sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine protei...   271   2e-72
sp|Q10991|CATL_SHEEP Cathepsin L [Contains: Cathepsin L hea...   270   3e-72
sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precu...   267   3e-71
sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precu...   266   4e-71
sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Catheps...   263   4e-70
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score = 279 bits (713), Expect = 7e-75
 Identities = 140/254 (55%), Positives = 169/254 (66%), Gaps = 5/254 (1%)
 Frame = +2

Query: 2   MTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWS 181
           MTNEEF+ + +G + + +G + P + LP SVDWR+KGYVTPVKNQ+QCGSCW+
Sbjct: 83  MTNEEFR-QMMGCFRNQKFRKGKVFREPLFLD-LPKSVDWRKKGYVTPVKNQKQCGSCWA 140

Query: 182 FSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEG 358
           FSATG+LEGQ FRK +L+S SEQ LVDCS M AF+Y+K+ G++SE
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEE 200

Query: 359 DYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 538
           YPY A D CK P V TGFT + E L AVATVGP+SVA+DAGH+SFQ Y
Sbjct: 201 SYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFY 260

Query: 539 KSGIYNEESCSTTQLDHGVLAVGYG----TQIGKKYWIVKNSWDVTWGESGYIKMSKDKK 706
           KSGIY E CS+ LDHGVL VGYG KYW+VKNSW WG +GY+K++KDK
Sbjct: 261 KSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKN 320

Query: 707 NQCGIATMASYPLV 748
           N CGIAT ASYP V
Sbjct: 321 NHCGIATAASYPNV 334
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP)
          [Contains: Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 333

 Score = 278 bits (711), Expect = 1e-74
 Identities = 144/254 (56%), Positives = 168/254 (66%), Gaps = 5/254 (1%)
 Frame = +2

Query: 2   MTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWS 181
           MT+EEF+ G KP +G + P P SVDWR+KGYVTPVKNQ QCGSCW+
Sbjct: 83  MTSEEFRQVMNGFQNRKPR-KGKVFQEPLFYEA-PRSVDWREKGYVTPVKNQGQCGSCWA 140

Query: 182 FSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEG 358
           FSATG+LEGQ FRK RLIS SEQ LVDCS LMD AF+Y++D G++SE
Sbjct: 141 FSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEE 200

Query: 359 DYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 538
           YPY AT+ +CK NP V TGF DI Q E L AVATVGP+SVAIDAGH SF Y
Sbjct: 201 SYPYEATEESCKYNPKYSVANDTGFVDIPKQ-EKALMKAVATVGPISVAIDAGHESFLFY 259

Query: 539 KSGIYNEESCSTTQLDHGVLAVGYGTQI----GKKYWIVKNSWDVTWGESGYIKMSKDKK 706
           K GIY E CS+ +DHGVL VGYG + KYW+VKNSW WG GY+KM+KD++
Sbjct: 260 KEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRR 319

Query: 707 NQCGIATMASYPLV 748
           N CGIA+ ASYP V
Sbjct: 320 NHCGIASAASYPTV 333
>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy
          chain; Cathepsin L light chain]
          Length = 333

 Score = 275 bits (704), Expect = 8e-74
 Identities = 144/254 (56%), Positives = 166/254 (65%), Gaps = 5/254 (1%)
 Frame = +2

Query: 2   MTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWS 181
           MTNEEF+ G K +G + P +P SVDWR+KGYVTPVKNQ QCGSCW+
Sbjct: 83  MTNEEFRQVMNGFQNQKHK-KGKMFQEPL-FAEIPKSVDWREKGYVTPVKNQGQCGSCWA 140

Query: 182 FSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEG 358
           FSATG+LEGQ FRK +L+S SEQ LVDCS LMDNAFRY+KD G++SE
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRYVKDNGGLDSEE 200

Query: 359 DYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQL 535
           YPY D TC P TGF D+ Q E L AVAT+GP+SVAIDAGH SFQ
Sbjct: 201 SYPYLGRDTETCNYKPECSAANDTGFVDL-PQREKALMKAVATLGPISVAIDAGHQSFQF 259

Query: 536 YKSGIYNEESCSTTQLDHGVLAVGY---GTQIGKKYWIVKNSWDVTWGESGYIKMSKDKK 706
           YKSGIY + CS+ LDHGVL VGY GT K+WIVKNSW WG +GY+KM+KD+
Sbjct: 260 YKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKFWIVKNSWGPEWGWNGYVKMAKDQN 319

Query: 707 NQCGIATMASYPLV 748
           N CGIAT ASYP V
Sbjct: 320 NHCGIATAASYPTV 333
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy
          chain; Cathepsin L light chain]
          Length = 334

 Score = 274 bits (701), Expect = 2e-73
 Identities = 146/255 (57%), Positives = 167/255 (65%), Gaps = 6/255 (2%)
 Frame = +2

Query: 2   MTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWS 181
           MTNEEF+ G K +G + P + V P SVDW +KGYVTPVKNQ QCGSCW+
Sbjct: 83  MTNEEFRQVMNGFQNQKHK-KGKLFHEPLLVDV-PKSVDWTKKGYVTPVKNQGQCGSCWA 140

Query: 182 FSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEG 358
           FSATG+LEGQ FRK +L+S SEQ LVDCS LMDNAF+YIKD G++SE
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLDSEE 200

Query: 359 DYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQL 535
           YPY ATD +C P TGF DI Q E L AVATVGP+SVAIDAGH SFQ
Sbjct: 201 SYPYLATDTNSCNYKPECSAANDTGFVDI-PQREKALMKAVATVGPISVAIDAGHTSFQF 259

Query: 536 YKSGIYNEESCSTTQLDHGVLAVGYGTQ----IGKKYWIVKNSWDVTWGESGYIKMSKDK 703
           YKSGIY + CS LDHGVL VGYG + K+WIVKNSW WG +GY+KM+KD+
Sbjct: 260 YKSGIYYDPDCSCKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKMAKDQ 319

Query: 704 KNQCGIATMASYPLV 748
           N CGIAT ASYP V
Sbjct: 320 NNHCGIATAASYPTV 334
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy
          chain; Cathepsin L light chain]
          Length = 339

 Score = 273 bits (699), Expect = 3e-73
 Identities = 140/258 (54%), Positives = 170/258 (65%), Gaps = 9/258 (3%)
 Frame = +2

Query: 2   MTNEEFKAKYLG-------IMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQ 160
           M + EFK G +M+ + L G+TY+ P ++ V P SVDWR+ G VT VK+Q
Sbjct: 83  MLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTV-PKSVDWREHGAVTGVKDQG 141

Query: 161 QCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ 340
           CGSCW+FS+TG+LEGQ+FRK L+S SEQ LVDCS LMDNAFRYIKD
Sbjct: 142 HCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 201

Query: 341 G-IESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAG 517
           G I++E YPY D +C N + I TGF DI +E + AVAT+GPVSVAIDA
Sbjct: 202 GGIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDAS 261

Query: 518 HASFQLYKSGIYNEESCSTTQLDHGVLAVGYGT-QIGKKYWIVKNSWDVTWGESGYIKMS 694
           H SFQLY G+YNE C LDHGVL VGYGT + G YW+VKNSW TWGE GYIKM+
Sbjct: 262 HESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMA 321

Query: 695 KDKKNQCGIATMASYPLV 748
           +++ NQCGIAT +SYP V
Sbjct: 322 RNQNNQCGIATASSYPTV 339
>sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine proteinase 1)
          [Contains: Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 341

 Score = 271 bits (693), Expect = 2e-72
 Identities = 132/233 (56%), Positives = 164/233 (70%), Gaps = 2/233 (0%)
 Frame = +2

Query: 56  TLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRL 235
           + +G T+++P ++ LP SVDWR KG VT VK+Q CGSCW+FS+TG+LEGQ+FRK+ L
Sbjct: 110 SFKGVTFISPAHV-TLPKSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVL 168

Query: 236 ISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKI 412
           +S SEQ LVDCS LMDNAFRYIKD GI++E YPY A D +C N +
Sbjct: 169 VSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTV 228

Query: 413 VTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHG 592
           GFTDI +E +A AVATVGPVSVAIDA H SFQ Y G+YNE C LDHG
Sbjct: 229 GATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHG 288

Query: 593 VLAVGYGT-QIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 748
           VL VG+GT + G+ YW+VKNSW TWG+ G+IKM ++K+NQCGIA+ +SYPLV
Sbjct: 289 VLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 341
>sp|Q10991|CATL_SHEEP Cathepsin L [Contains: Cathepsin L heavy chain;
          Cathepsin L light chain]
          Length = 217

 Score = 270 bits (691), Expect = 3e-72
 Identities = 134/218 (61%), Positives = 154/218 (70%), Gaps = 2/218 (0%)
 Frame = +2

Query: 101 LPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXX 280
           +P SVDW +KGYVTPVKNQ QCGSCW+FSATG+LEGQ FRK +L+S SEQ LVD S
Sbjct: 1   VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQ 60

Query: 281 XXXXXXXXLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNE 457
           LMDNAF+YIK+ G++SE YPY ATD +C P K TGF DI Q E
Sbjct: 61  GNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDTGFVDI-PQRE 119

Query: 458 TDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQ-IGKKY 634
           L AVATVGP+SVAIDAGH+SFQ YKSGIY + CS+ LDHGVL VGYG + K+
Sbjct: 120 KALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTNNKF 179

Query: 635 WIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 748
           WIVKNSW WG GY+KM+KD+ N CGIAT ASYP V
Sbjct: 180 WIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAASYPTV 217
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precursor
          Length = 323

 Score = 267 bits (682), Expect = 3e-71
 Identities = 131/250 (52%), Positives = 170/250 (68%), Gaps = 1/250 (0%)
 Frame = +2

Query: 2   MTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWS 181
           MT EEF A G + + + S + + G VDWR KG VTPVK+Q QCGSCW+
Sbjct: 75  MTLEEFNAVMKGNIPRR-SAPVSVFYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWA 133

Query: 182 FSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIK-DQGIESEG 358
           FS TGSLEGQ+F K LIS +EQQLVDCS M++AF YIK + GI++E
Sbjct: 134 FSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEA 193

Query: 359 DYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 538
           YPY A DG+C+ + + + C+G T+I S +ET L AV +GP+SV IDA H+SFQ Y
Sbjct: 194 AYPYEARDGSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFY 253

Query: 539 KSGIYNEESCSTTQLDHGVLAVGYGTQIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQCG 718
           SG+Y E SCS + LDH VLAVGYG++ G+ +W+VKNSW +WG++GYIKMS+++ N CG
Sbjct: 254 SSGVYYEPSCSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCG 313

Query: 719 IATMASYPLV 748
           IAT+ASYPLV
Sbjct: 314 IATVASYPLV 323
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precursor
          Length = 321

 Score = 266 bits (681), Expect = 4e-71
 Identities = 135/250 (54%), Positives = 168/250 (67%), Gaps = 1/250 (0%)
 Frame = +2

Query: 2   MTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWS 181
           MTNEEF A G K + + A G + A VDWR K VTPVK+Q+QCGSCW+
Sbjct: 75  MTNEEFNAVMKGYKKGSRGEPKAVFTA--EAGPMAADVDWRTKALVTPVKDQEQCGSCWA 132

Query: 182 FSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESEG 358
           FSATG+LEGQ+F KN+ L+S SEQQLVDCS M +AF YIKD G I++E
Sbjct: 133 FSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTES 192

Query: 359 DYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLY 538
           YPY A D +C+ + + I CTG ++Q E L AV+ VGP+SVAIDA H SFQ Y
Sbjct: 193 SYPYEAEDRSCRFDANSIGAICTGSVEVQHTEEA-LQEAVSGVGPISVAIDASHFSFQFY 251

Query: 539 KSGIYNEESCSTTQLDHGVLAVGYGTQIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQCG 718
           SG+Y E++CS T LDHGVLAVGYGT+ K YW+VKNSW +WG++GYIKMS+++ N CG
Sbjct: 252 SSGVYYEQNCSPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNNCG 311

Query: 719 IATMASYPLV 748
           IA+ SYP V
Sbjct: 312 IASEPSYPTV 321
>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy
          chain; Cathepsin L light chain]
          Length = 334

 Score = 263 bits (672), Expect = 4e-70
 Identities = 139/255 (54%), Positives = 166/255 (65%), Gaps = 6/255 (2%)
 Frame = +2

Query: 2   MTNEEFKAKYLGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWS 181
           MTNEEF+ G K +G + + V P SVDWR+KGYVT VKNQ QCGSCW+
Sbjct: 83  MTNEEFRQVMNGFQNQKHK-KGKVFHESLVLEV-PKSVDWREKGYVTAVKNQGQCGSCWA 140

Query: 182 FSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEG 358
           FSATG+LEGQ FRK +L+S SEQ LVDCS LMDNAF+Y+KD G+++E
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEE 200

Query: 359 DYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQL 535
           YPY + +C P TGF DI Q E L AVATVGP+SVAIDAGH+SFQ
Sbjct: 201 SYPYLGRETNSCTYKPECSAANDTGFVDI-PQREKALMKAVATVGPISVAIDAGHSSFQF 259

Query: 536 YKSGIYNEESCSTTQLDHGVLAVGYGTQ----IGKKYWIVKNSWDVTWGESGYIKMSKDK 703
           YKSGIY + CS+ LDHGVL VGYG + K+WIVKNSW WG +GY+KM+KD+
Sbjct: 260 YKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNGYVKMAKDQ 319

Query: 704 KNQCGIATMASYPLV 748
           N CGI+T ASYP V
Sbjct: 320 NNHCGISTAASYPTV 334
  Database: Non-redundant SwissProt sequences
    Posted date: Dec 6, 2005 7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database: 184,735

  Database: swissprot.01
    Posted date: Dec 6, 2005 8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database: 184,431

Lambda     K        H
   0.318    0.134    0.401

Gapped
Lambda     K        H
   0.267    0.0410   0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 91,363,911
Number of Sequences: 369166
Number of extensions: 1807801
Number of successful extensions: 5551
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4882
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5050
length of database: 68,354,980
effective HSP length: 109
effective length of database: 48,218,865
effective search space used: 7811456130
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
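
The statistics above can be tied back to the per-hit scores with a little arithmetic. The sketch below is illustrative only and is not produced by BLAST. It assumes the standard Karlin-Altschul relations: bit score = (lambda * S - ln K) / ln 2 and E = N * 2^(-bits), with N the effective search space. It also assumes the effective database length is the total letter count minus the effective HSP length for each sequence. The function names are hypothetical, and because the report prints Lambda and K rounded to three significant digits, the results only approximate the reported values.

```python
import math

# Values copied from the "Gapped Lambda K H" and search-space lines of the report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 7_811_456_130


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Expected number of chance hits scoring at least `bits`: N * 2**(-bits)."""
    return search_space * 2.0 ** (-bits)


def effective_db_length(total_letters, num_sequences, hsp_length):
    """Total letters minus the effective HSP length for every sequence."""
    return total_letters - num_sequences * hsp_length


if __name__ == "__main__":
    # Raw scores in parentheses from the first, second and last hits.
    for raw in (713, 711, 672):
        bits = bit_score(raw)
        print(f"raw={raw}  bits={bits:.1f}  E={expect_value(bits):.1e}")
    # Compare with the "effective length of database" line.
    print(effective_db_length(68_354_980, 184_735, 109))
```

For the top hit (raw score 713) this lands within one bit of the reported 279 bits and within the same order of magnitude as the reported Expect of 7e-75. The effective database length works out to exactly the 48,218,865 shown in the report.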