Planaria EST Database


DrC_00852

BLASTX 2.2.13 [Nov-27-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= DrC_00852
         (1046 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   355   2e-97
sp|P07154|CATL_RAT  Cathepsin L precursor (Major excreted pr...   354   3e-97
sp|P06797|CATL_MOUSE  Cathepsin L precursor (Major excreted ...   352   1e-96
sp|Q9GL24|CATL_CANFA  Cathepsin L precursor [Contains: Cathe...   348   1e-95
sp|P25975|CATL_BOVIN  Cathepsin L precursor [Contains: Cathe...   347   3e-95
sp|P07711|CATL_HUMAN  Cathepsin L precursor (Major excreted ...   345   1e-94
sp|Q26636|CATL_SARPE  Cathepsin L precursor [Contains: Cathe...   342   1e-93
sp|Q28944|CATL_PIG  Cathepsin L precursor [Contains: Catheps...   340   4e-93
sp|P43236|CATK_RABIT  Cathepsin K precursor (OC-2 protein)        336   6e-92
sp|P43235|CATK_HUMAN  Cathepsin K precursor (Cathepsin O) (C...   333   6e-91
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score =  355 bits (910), Expect = 2e-97
 Identities = 182/336 (54%), Positives = 229/336 (68%), Gaps = 5/336 (1%)
 Frame = +1

Query: 13   ITSVLLLFTILLRSSAFKFQINEELNDEWNLYKTRYQKSYLSIVELTRRIIWEENLRYIQ 192
            ++ VL  F + + S+  KF  N  L+ +W  +K  +++ Y +  E  RR +WE+N++ I+
Sbjct: 3    LSLVLAAFCLGIASAVPKFDQN--LDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIE 60

Query: 193  HHNLEYDLGKHSYYLGVNEYADLTHEEFKIKFLGLKPINRNYSRTIFMPPENMDNYPKSV 372
             HN EY  GKH + + +N + D+T+EEF+ + +G     +     +F  P  +D  PKSV
Sbjct: 61   LHNGEYSQGKHGFTMAMNAFGDMTNEEFR-QMMGCFRNQKFRKGKVFREPLFLD-LPKSV 118

Query: 373  DWRKKGYVTPVKNQGQCGSCWSFSTTGALEGQHFRKRKQLVSLSEQQLVDCSKDYQNNGC 552
            DWRKKGYVTPVKNQ QCGSCW+FS TGALEGQ FRK  +LVSLSEQ LVDCS+   N GC
Sbjct: 119  DWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGC 178

Query: 553  NGGLMDNAFQYIQKY-GLESEADYPYTAMDGPCKYDSTKVVAHCTGFVDIKKGNEKDLTK 729
            NGG M  AFQY+++  GL+SE  YPY A+D  CKY     VA+ TGF  +  G EK L K
Sbjct: 179  NGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMK 238

Query: 730  AVATVGPISVAIDASRPSFQLYKGGIYNEVNCSSNNLDHGVLAVGYGAEGKN----HFWI 897
            AVATVGPISVA+DA   SFQ YK GIY E +CSS NLDHGVL VGYG EG N     +W+
Sbjct: 239  AVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWL 298

Query: 898  VKNSWGPTWGISGYIKMSKDKKNQCGIATMASYPTV 1005
            VKNSWGP WG +GY+K++KDK N CGIAT ASYP V
Sbjct: 299  VKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334
>sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted protein) (MEP) (Cyclic protein
            2) (CP-2) [Contains: Cathepsin L heavy chain; Cathepsin L
            light chain]
          Length = 334

 Score =  354 bits (908), Expect = 3e-97
 Identities = 182/336 (54%), Positives = 232/336 (69%), Gaps = 5/336 (1%)
 Frame = +1

Query: 13   ITSVLLLFTILLRSSAFKFQINEELNDEWNLYKTRYQKSYLSIVELTRRIIWEENLRYIQ 192
            +T +LLL  + L ++    + ++  N +W+ +K+ +++ Y +  E  RR +WE+N+R IQ
Sbjct: 1    MTPLLLLAVLCLGTALATPKFDQTFNAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQ 60

Query: 193  HHNLEYDLGKHSYYLGVNEYADLTHEEFKIKFLGLKPINRNYSRTIFMPPENMDNYPKSV 372
             HN EY  GKH + + +N + D+T+EEF+    G +       R +F  P  M   PK+V
Sbjct: 61   LHNGEYSNGKHGFTMEMNAFGDMTNEEFRQIVNGYRHQKHKKGR-LFQEPL-MLQIPKTV 118

Query: 373  DWRKKGYVTPVKNQGQCGSCWSFSTTGALEGQHFRKRKQLVSLSEQQLVDCSKDYQNNGC 552
            DWR+KG VTPVKNQGQCGSCW+FS +G LEGQ F K  +L+SLSEQ LVDCS D  N GC
Sbjct: 119  DWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGC 178

Query: 553  NGGLMDNAFQYIQKY-GLESEADYPYTAMDGPCKYDSTKVVAHCTGFVDIKKGNEKDLTK 729
            NGGLMD AFQYI++  GL+SE  YPY A DG CKY +   VA+ TGFVDI +  EK L K
Sbjct: 179  NGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDIPQ-QEKALMK 237

Query: 730  AVATVGPISVAIDASRPSFQLYKGGIYNEVNCSSNNLDHGVLAVGYGAEG----KNHFWI 897
            AVATVGPISVA+DAS PS Q Y  GIY E NCSS +LDHGVL VGYG EG    K+ +W+
Sbjct: 238  AVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWL 297

Query: 898  VKNSWGPTWGISGYIKMSKDKKNQCGIATMASYPTV 1005
            VKNSWG  WG+ GYIK++KD+ N CG+AT ASYP V
Sbjct: 298  VKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIV 333
>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
            proteinase) [Contains: Cathepsin L heavy chain; Cathepsin
            L light chain]
          Length = 334

 Score =  352 bits (902), Expect = 1e-96
 Identities = 183/333 (54%), Positives = 229/333 (68%), Gaps = 5/333 (1%)
 Frame = +1

Query: 22   VLLLFTILLRSSAFKFQINEELNDEWNLYKTRYQKSYLSIVELTRRIIWEENLRYIQHHN 201
            +LLL  + L ++    + ++  + EW+ +K+ +++ Y +  E  RR IWE+N+R IQ HN
Sbjct: 4    LLLLAVLCLGTALATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHN 63

Query: 202  LEYDLGKHSYYLGVNEYADLTHEEFKIKFLGLKPINRNYSRTIFMPPENMDNYPKSVDWR 381
             EY  G+H + + +N + D+T+EEF+    G +       R +F  P  M   PKSVDWR
Sbjct: 64   GEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRHQKHKKGR-LFQEPL-MLKIPKSVDWR 121

Query: 382  KKGYVTPVKNQGQCGSCWSFSTTGALEGQHFRKRKQLVSLSEQQLVDCSKDYQNNGCNGG 561
            +KG VTPVKNQGQCGSCW+FS +G LEGQ F K  +L+SLSEQ LVDCS    N GCNGG
Sbjct: 122  EKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGG 181

Query: 562  LMDNAFQYIQKY-GLESEADYPYTAMDGPCKYDSTKVVAHCTGFVDIKKGNEKDLTKAVA 738
            LMD AFQYI++  GL+SE  YPY A DG CKY +   VA+ TGFVDI +  EK L KAVA
Sbjct: 182  LMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQ-QEKALMKAVA 240

Query: 739  TVGPISVAIDASRPSFQLYKGGIYNEVNCSSNNLDHGVLAVGYGAEG----KNHFWIVKN 906
            TVGPISVA+DAS PS Q Y  GIY E NCSS NLDHGVL VGYG EG    KN +W+VKN
Sbjct: 241  TVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKN 300

Query: 907  SWGPTWGISGYIKMSKDKKNQCGIATMASYPTV 1005
            SWG  WG+ GYIK++KD+ N CG+AT ASYP V
Sbjct: 301  SWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin L
            light chain]
          Length = 333

 Score =  348 bits (893), Expect = 1e-95
 Identities = 179/325 (55%), Positives = 223/325 (68%), Gaps = 5/325 (1%)
 Frame = +1

Query: 46   LRSSAFKFQINEELNDEWNLYKTRYQKSYLSIVELTRRIIWEENLRYIQHHNLEYDLGKH 225
            + S+A KF  ++ LN +W  +K  +++ Y    E  RR +WE+N++ I+ HN EY  GKH
Sbjct: 14   IASAAPKF--DQSLNAQWYQWKATHRRLYGMNEEGWRRAVWEKNMKMIELHNREYSQGKH 71

Query: 226  SYYLGVNEYADLTHEEFKIKFLGLKPINRNYSRTIFMPPENMDNYPKSVDWRKKGYVTPV 405
             + + +N + D+T+EEF+    G +  N+ + +            PKSVDWR+KGYVTPV
Sbjct: 72   GFTMAMNAFGDMTNEEFRQVMNGFQ--NQKHKKGKMFQEPLFAEIPKSVDWREKGYVTPV 129

Query: 406  KNQGQCGSCWSFSTTGALEGQHFRKRKQLVSLSEQQLVDCSKDYQNNGCNGGLMDNAFQY 585
            KNQGQCGSCW+FS TGALEGQ FRK  +LVSLSEQ LVDCS+   N GCNGGLMDNAF+Y
Sbjct: 130  KNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRY 189

Query: 586  IQ-KYGLESEADYPYTAMD-GPCKYDSTKVVAHCTGFVDIKKGNEKDLTKAVATVGPISV 759
            ++   GL+SE  YPY   D   C Y      A+ TGFVD+ +  EK L KAVAT+GPISV
Sbjct: 190  VKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLPQ-REKALMKAVATLGPISV 248

Query: 760  AIDASRPSFQLYKGGIYNEVNCSSNNLDHGVLAVGYGAEG---KNHFWIVKNSWGPTWGI 930
            AIDA   SFQ YK GIY + +CSS +LDHGVL VGYG EG    N FWIVKNSWGP WG 
Sbjct: 249  AIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKFWIVKNSWGPEWGW 308

Query: 931  SGYIKMSKDKKNQCGIATMASYPTV 1005
            +GY+KM+KD+ N CGIAT ASYPTV
Sbjct: 309  NGYVKMAKDQNNHCGIATAASYPTV 333
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin L
            light chain]
          Length = 334

 Score =  347 bits (890), Expect = 3e-95
 Identities = 178/318 (55%), Positives = 218/318 (68%), Gaps = 6/318 (1%)
 Frame = +1

Query: 70   QINEELNDEWNLYKTRYQKSYLSIVELTRRIIWEENLRYIQHHNLEYDLGKHSYYLGVNE 249
            +++  L+  W+ +K  +++ Y    E  RR +WE+N + I  HN EY  GKH++ + +N 
Sbjct: 20   KLDPNLDAHWHQWKATHRRLYGMNEEEWRRAVWEKNKKIIDLHNQEYSEGKHAFRMAMNA 79

Query: 250  YADLTHEEFKIKFLGLKPINRNYSRTIFMPPENMDNYPKSVDWRKKGYVTPVKNQGQCGS 429
            + D+T+EEF+    G +  N+ + +        + + PKSVDW KKGYVTPVKNQGQCGS
Sbjct: 80   FGDMTNEEFRQVMNGFQ--NQKHKKGKLFHEPLLVDVPKSVDWTKKGYVTPVKNQGQCGS 137

Query: 430  CWSFSTTGALEGQHFRKRKQLVSLSEQQLVDCSKDYQNNGCNGGLMDNAFQYIQ-KYGLE 606
            CW+FS TGALEGQ FRK  +LVSLSEQ LVDCS+   N GCNGGLMDNAFQYI+   GL+
Sbjct: 138  CWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLD 197

Query: 607  SEADYPYTAMD-GPCKYDSTKVVAHCTGFVDIKKGNEKDLTKAVATVGPISVAIDASRPS 783
            SE  YPY A D   C Y      A+ TGFVDI +  EK L KAVATVGPISVAIDA   S
Sbjct: 198  SEESYPYLATDTNSCNYKPECSAANDTGFVDIPQ-REKALMKAVATVGPISVAIDAGHTS 256

Query: 784  FQLYKGGIYNEVNCSSNNLDHGVLAVGYGAEG----KNHFWIVKNSWGPTWGISGYIKMS 951
            FQ YK GIY + +CS  +LDHGVL VGYG EG     N FWIVKNSWGP WG +GY+KM+
Sbjct: 257  FQFYKSGIYYDPDCSCKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKMA 316

Query: 952  KDKKNQCGIATMASYPTV 1005
            KD+ N CGIAT ASYPTV
Sbjct: 317  KDQNNHCGIATAASYPTV 334
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
            Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 333

 Score =  345 bits (885), Expect = 1e-94
 Identities = 179/335 (53%), Positives = 222/335 (66%), Gaps = 5/335 (1%)
 Frame = +1

Query: 16   TSVLLLFTILLRSSAFKFQINEELNDEWNLYKTRYQKSYLSIVELTRRIIWEENLRYIQH 195
            T +L  F + + S+   F  +  L  +W  +K  + + Y    E  RR +WE+N++ I+ 
Sbjct: 4    TLILAAFCLGIASATLTF--DHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIEL 61

Query: 196  HNLEYDLGKHSYYLGVNEYADLTHEEFKIKFLGLKPINRNYSRTIFMPPENMDNYPKSVD 375
            HN EY  GKHS+ + +N + D+T EEF+    G +  NR   +            P+SVD
Sbjct: 62   HNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQ--NRKPRKGKVFQEPLFYEAPRSVD 119

Query: 376  WRKKGYVTPVKNQGQCGSCWSFSTTGALEGQHFRKRKQLVSLSEQQLVDCSKDYQNNGCN 555
            WR+KGYVTPVKNQGQCGSCW+FS TGALEGQ FRK  +L+SLSEQ LVDCS    N GCN
Sbjct: 120  WREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCN 179

Query: 556  GGLMDNAFQYIQ-KYGLESEADYPYTAMDGPCKYDSTKVVAHCTGFVDIKKGNEKDLTKA 732
            GGLMD AFQY+Q   GL+SE  YPY A +  CKY+    VA+ TGFVDI K  EK L KA
Sbjct: 180  GGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKALMKA 238

Query: 733  VATVGPISVAIDASRPSFQLYKGGIYNEVNCSSNNLDHGVLAVGYGAEG----KNHFWIV 900
            VATVGPISVAIDA   SF  YK GIY E +CSS ++DHGVL VGYG E      N +W+V
Sbjct: 239  VATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV 298

Query: 901  KNSWGPTWGISGYIKMSKDKKNQCGIATMASYPTV 1005
            KNSWG  WG+ GY+KM+KD++N CGIA+ ASYPTV
Sbjct: 299  KNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin L
            light chain]
          Length = 339

 Score =  342 bits (877), Expect = 1e-93
 Identities = 173/317 (54%), Positives = 218/317 (68%), Gaps = 10/317 (3%)
 Frame = +1

Query: 85   LNDEWNLYKTRYQKSYLSIVELTRRI-IWEENLRYIQHHNLEYDLGKHSYYLGVNEYADL 261
            + +EW+ YK +++K+Y + VE   R+ I+ EN   I  HN  +  GK SY LG+N+YAD+
Sbjct: 24   IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83

Query: 262  THEEFKIKFLGLKPINRNYSR-------TIFMPPENMDNYPKSVDWRKKGYVTPVKNQGQ 420
             H EFK    G     R   R         ++PP ++   PKSVDWR+ G VT VK+QG 
Sbjct: 84   LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHV-TVPKSVDWREHGAVTGVKDQGH 142

Query: 421  CGSCWSFSTTGALEGQHFRKRKQLVSLSEQQLVDCSKDYQNNGCNGGLMDNAFQYIQ-KY 597
            CGSCW+FS+TGALEGQHFRK   LVSLSEQ LVDCS  Y NNGCNGGLMDNAF+YI+   
Sbjct: 143  CGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 202

Query: 598  GLESEADYPYTAMDGPCKYDSTKVVAHCTGFVDIKKGNEKDLTKAVATVGPISVAIDASR 777
            G+++E  YPY  +D  C ++   + A  TGFVDI +G+E+ + KAVAT+GP+SVAIDAS 
Sbjct: 203  GIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASH 262

Query: 778  PSFQLYKGGIYNEVNCSSNNLDHGVLAVGYGA-EGKNHFWIVKNSWGPTWGISGYIKMSK 954
             SFQLY  G+YNE  C   NLDHGVL VGYG  E    +W+VKNSWG TWG  GYIKM++
Sbjct: 263  ESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMAR 322

Query: 955  DKKNQCGIATMASYPTV 1005
            ++ NQCGIAT +SYPTV
Sbjct: 323  NQNNQCGIATASSYPTV 339
>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin L
            light chain]
          Length = 334

 Score =  340 bits (872), Expect = 4e-93
 Identities = 175/333 (52%), Positives = 224/333 (67%), Gaps = 6/333 (1%)
 Frame = +1

Query: 25   LLLFTILLRSSAFKFQINEELNDEWNLYKTRYQKSYLSIVELTRRIIWEENLRYIQHHNL 204
            L L  + L  ++   ++++ L+ +W  +K  + + Y    E  RR +WE+N++ I+ HN 
Sbjct: 5    LFLTALCLGIASAAPKLDQNLDADWYKWKATHGRLYGMNEEGWRRAVWEKNMKMIELHNQ 64

Query: 205  EYDLGKHSYYLGVNEYADLTHEEFKIKFLGLKPINRNYSRTIFMPPENMDNYPKSVDWRK 384
            EY  GKH + + +N + D+T+EEF+    G +  N+ + +        +   PKSVDWR+
Sbjct: 65   EYSQGKHGFSMAMNAFGDMTNEEFRQVMNGFQ--NQKHKKGKVFHESLVLEVPKSVDWRE 122

Query: 385  KGYVTPVKNQGQCGSCWSFSTTGALEGQHFRKRKQLVSLSEQQLVDCSKDYQNNGCNGGL 564
            KGYVT VKNQGQCGSCW+FS TGALEGQ FRK  +LVSLSEQ LVDCS+   N GCNGGL
Sbjct: 123  KGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGL 182

Query: 565  MDNAFQYIQ-KYGLESEADYPYTAMD-GPCKYDSTKVVAHCTGFVDIKKGNEKDLTKAVA 738
            MDNAFQY++   GL++E  YPY   +   C Y      A+ TGFVDI +  EK L KAVA
Sbjct: 183  MDNAFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFVDIPQ-REKALMKAVA 241

Query: 739  TVGPISVAIDASRPSFQLYKGGIYNEVNCSSNNLDHGVLAVGYGAEG----KNHFWIVKN 906
            TVGPISVAIDA   SFQ YK GIY + +CSS +LDHGVL VGYG EG     + FWIVKN
Sbjct: 242  TVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKN 301

Query: 907  SWGPTWGISGYIKMSKDKKNQCGIATMASYPTV 1005
            SWGP WG +GY+KM+KD+ N CGI+T ASYPTV
Sbjct: 302  SWGPEWGWNGYVKMAKDQNNHCGISTAASYPTV 334
>sp|P43236|CATK_RABIT Cathepsin K precursor (OC-2 protein)
          Length = 329

 Score =  336 bits (862), Expect = 6e-92
 Identities = 170/326 (52%), Positives = 220/326 (67%), Gaps = 3/326 (0%)
 Frame = +1

Query: 31  LFTILLRSSAFKFQINEELNDEWNLYKTRYQKSYLSIV-ELTRRIIWEENLRYIQHHNLE 207
           L  +LL   +F     E L+ +W L+K  Y K Y S V E++RR+IWE+NL++I  HNLE
Sbjct: 4   LKVLLLPVVSFALHPEEILDTQWELWKKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLE 63

Query: 208 YDLGKHSYYLGVNEYADLTHEEFKIKFLGLK-PINRNYSRTIFMPPENMDNYPKSVDWRK 384
             LG H+Y L +N   D+T EE   K  GLK P +R++S      P+     P S+D+RK
Sbjct: 64  ASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPPSRSHSNDTLYIPDWEGRTPDSIDYRK 123

Query: 385 KGYVTPVKNQGQCGSCWSFSTTGALEGQHFRKRKQLVSLSEQQLVDCSKDYQNNGCNGGL 564
           KGYVTPVKNQGQCGSCW+FS+ GALEGQ  +K  +L++LS Q LVDC  +  N GC GG 
Sbjct: 124 KGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NYGCGGGY 181

Query: 565 MDNAFQYIQKY-GLESEADYPYTAMDGPCKYDSTKVVAHCTGFVDIKKGNEKDLTKAVAT 741
           M NAFQY+Q+  G++SE  YPY   D  C Y+ T   A C G+ +I +GNEK L +AVA 
Sbjct: 182 MTNAFQYVQRNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKALKRAVAR 241

Query: 742 VGPISVAIDASRPSFQLYKGGIYNEVNCSSNNLDHGVLAVGYGAEGKNHFWIVKNSWGPT 921
           VGP+SVAIDAS  SFQ Y  G+Y + NCSS+N++H VLAVGYG +  N  WI+KNSWG +
Sbjct: 242 VGPVSVAIDASLTSFQFYSKGVYYDENCSSDNVNHAVLAVGYGIQKGNKHWIIKNSWGES 301

Query: 922 WGISGYIKMSKDKKNQCGIATMASYP 999
           WG  GYI M+++K N CGIA +AS+P
Sbjct: 302 WGNKGYILMARNKNNACGIANLASFP 327
>sp|P43235|CATK_HUMAN Cathepsin K precursor (Cathepsin O) (Cathepsin X) (Cathepsin O2)
          Length = 329

 Score =  333 bits (853), Expect = 6e-91
 Identities = 169/326 (51%), Positives = 219/326 (67%), Gaps = 3/326 (0%)
 Frame = +1

Query: 31  LFTILLRSSAFKFQINEELNDEWNLYKTRYQKSYLSIV-ELTRRIIWEENLRYIQHHNLE 207
           L  +LL   +F     E L+  W L+K  ++K Y + V E++RR+IWE+NL+YI  HNLE
Sbjct: 4   LKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLE 63

Query: 208 YDLGKHSYYLGVNEYADLTHEEFKIKFLGLK-PINRNYSRTIFMPPENMDNYPKSVDWRK 384
             LG H+Y L +N   D+T EE   K  GLK P++ + S      PE     P SVD+RK
Sbjct: 64  ASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRK 123

Query: 385 KGYVTPVKNQGQCGSCWSFSTTGALEGQHFRKRKQLVSLSEQQLVDCSKDYQNNGCNGGL 564
           KGYVTPVKNQGQCGSCW+FS+ GALEGQ  +K  +L++LS Q LVDC  +  N+GC GG 
Sbjct: 124 KGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDGCGGGY 181

Query: 565 MDNAFQYIQKY-GLESEADYPYTAMDGPCKYDSTKVVAHCTGFVDIKKGNEKDLTKAVAT 741
           M NAFQY+QK  G++SE  YPY   +  C Y+ T   A C G+ +I +GNEK L +AVA 
Sbjct: 182 MTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVAR 241

Query: 742 VGPISVAIDASRPSFQLYKGGIYNEVNCSSNNLDHGVLAVGYGAEGKNHFWIVKNSWGPT 921
           VGP+SVAIDAS  SFQ Y  G+Y + +C+S+NL+H VLAVGYG +  N  WI+KNSWG  
Sbjct: 242 VGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGEN 301

Query: 922 WGISGYIKMSKDKKNQCGIATMASYP 999
           WG  GYI M+++K N CGIA +AS+P
Sbjct: 302 WGNKGYILMARNKNNACGIANLASFP 327
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 119,714,775
Number of Sequences: 369166
Number of extensions: 2541671
Number of successful extensions: 7510
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 6544
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 6832
length of database: 68,354,980
effective HSP length: 111
effective length of database: 47,849,395
effective search space used: 11340306615
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)

Cluster detail

DrC_00852

  1. Dr_sW_021_L06
  2. Dr_sW_025_L04
  3. Dr_sW_014_I02
  4. Dr_sW_012_K17
  5. Dr_sW_007_H07
  6. Dr_sW_012_C02
  7. Dr_sW_001_L07
  8. Dr_sW_008_E04
  9. Dr_sW_004_D08