Planarian EST Database


Dr_sW_003_I18

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_003_I18
         (725 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P25782|CYSP2_HOMAM  Digestive cysteine proteinase 2 precu...   209   8e-54
sp|Q95029|CATL_DROME  Cathepsin L precursor (Cysteine protei...   203   3e-52
sp|P25784|CYSP3_HOMAM  Digestive cysteine proteinase 3 precu...   201   2e-51
sp|Q26636|CATL_SARPE  Cathepsin L precursor [Contains: Cathe...   200   3e-51
sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   200   3e-51
sp|Q10991|CATL_SHEEP  Cathepsin L [Contains: Cathepsin L hea...   199   5e-51
sp|P07711|CATL_HUMAN  Cathepsin L precursor (Major excreted ...   195   1e-49
sp|P25975|CATL_BOVIN  Cathepsin L precursor [Contains: Cathe...   194   2e-49
sp|Q9GLE3|CATK_PIG  Cathepsin K precursor                         194   2e-49
sp|P06797|CATL_MOUSE  Cathepsin L precursor (Major excreted ...   193   5e-49

>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precursor
          Length = 323

 Score =  209 bits (531), Expect = 8e-54
 Identities = 104/204 (50%), Positives = 133/204 (65%), Gaps = 1/204 (0%)
 Frame = +2

Query: 2   PVKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSF 181
           PVK+Q  CGSC+AFS TGSLEGQ+F +T  L+S +EQQ+VDCS  +G +GC GG+   +F
Sbjct: 121 PVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAF 180

Query: 182 DXXXXXXXXXXXXXXXXXXXX-RCRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPIS 358
           D                      CR++ + V     G TNI S  E  L QAV  IGPIS
Sbjct: 181 DYIKANNGIDTEAAYPYEARDGSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPIS 240

Query: 359 VRIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGYGSKDGKNFWIVKNSWGTTWGRK 538
           V IDA+  SF  YS +G+YY+P+C   +L HAVL VGYGS+ G++FW+VKNSW T+WG  
Sbjct: 241 VTIDAAHSSFQFYS-SGVYYEPSCSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDA 299

Query: 539 GYILMSKDEDNQCGIATEASYPLI 610
           GYI MS++ +N CGIAT ASYPL+
Sbjct: 300 GYIKMSRNRNNNCGIATVASYPLV 323

>sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine proteinase 1) [Contains: Cathepsin
           L heavy chain; Cathepsin L light chain]
          Length = 341

 Score =  203 bits (517), Expect = 3e-52
 Identities = 100/204 (49%), Positives = 138/204 (67%), Gaps = 2/204 (0%)
 Frame = +2

Query: 5   VKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSFD 184
           VK+Q  CGSC+AFS+TG+LEGQ+FR++  LVS SEQ +VDCS ++GN GC GG    +F 
Sbjct: 139 VKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFR 198

Query: 185 -XXXXXXXXXXXXXXXXXXXXRCRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPISV 361
                                 C +NK  V    +GFT+I   DE+ +A+AVA +GP+SV
Sbjct: 199 YIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSV 258

Query: 362 RIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGYGS-KDGKNFWIVKNSWGTTWGRK 538
            IDAS  SF  YS  G+Y +P CD+ +L H VLVVG+G+ + G+++W+VKNSWGTTWG K
Sbjct: 259 AIDASHESFQFYS-EGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDK 317

Query: 539 GYILMSKDEDNQCGIATEASYPLI 610
           G+I M ++++NQCGIA+ +SYPL+
Sbjct: 318 GFIKMLRNKENQCGIASASSYPLV 341

>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precursor
          Length = 321

 Score =  201 bits (511), Expect = 2e-51
 Identities = 100/204 (49%), Positives = 135/204 (66%), Gaps = 1/204 (0%)
 Frame = +2

Query: 2   PVKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSF 181
           PVK+Q+ CGSC+AFSATG+LEGQ+F +  +LVS SEQQ+VDCS ++GN GCGGG+   +F
Sbjct: 120 PVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAF 179

Query: 182 DXXXXXXXXXXXXXXXXXXXXR-CRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPIS 358
           D                    R CR++ + +     G   ++   EEAL +AV+ +GPIS
Sbjct: 180 DYIKDNGGIDTESSYPYEAEDRSCRFDANSIGAICTGSVEVQ-HTEEALQEAVSGVGPIS 238

Query: 359 VRIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGYGSKDGKNFWIVKNSWGTTWGRK 538
           V IDAS  SF  YS +G+YY+ NC    L H VL VGYG++  K++W+VKNSWG++WG  
Sbjct: 239 VAIDASHFSFQFYS-SGVYYEQNCSPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDA 297

Query: 539 GYILMSKDEDNQCGIATEASYPLI 610
           GYI MS++ DN CGIA+E SYP +
Sbjct: 298 GYIKMSRNRDNNCGIASEPSYPTV 321

>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 339

 Score =  200 bits (509), Expect = 3e-51
 Identities = 99/204 (48%), Positives = 133/204 (65%), Gaps = 2/204 (0%)
 Frame = +2

Query: 5   VKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSFD 184
           VK+Q  CGSC+AFS+TG+LEGQ+FR+   LVS SEQ +VDCS ++GN GC GG    +F 
Sbjct: 137 VKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFR 196

Query: 185 -XXXXXXXXXXXXXXXXXXXXRCRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPISV 361
                                 C +NK+ +     GF +I   DEE + +AVA +GP+SV
Sbjct: 197 YIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSV 256

Query: 362 RIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGYGS-KDGKNFWIVKNSWGTTWGRK 538
            IDAS  SF  YS  G+Y +P CD  +L H VLVVGYG+ + G ++W+VKNSWGTTWG +
Sbjct: 257 AIDASHESFQLYS-EGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQ 315

Query: 539 GYILMSKDEDNQCGIATEASYPLI 610
           GYI M+++++NQCGIAT +SYP +
Sbjct: 316 GYIKMARNQNNQCGIATASSYPTV 339

>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score =  200 bits (509), Expect = 3e-51
 Identities = 100/206 (48%), Positives = 129/206 (62%), Gaps = 5/206 (2%)
 Frame = +2

Query: 2   PVKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSF 181
           PVKNQK CGSC+AFSATG+LEGQ FR+T KLVS SEQ +VDCS   GN+GC GGF   +F
Sbjct: 128 PVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAF 187

Query: 182 DXXXXXXXXXXXXXXXXXXXXR-CRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPIS 358
                                  C+Y     +    GFT +    E+AL +AVA +GPIS
Sbjct: 188 QYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMKAVATVGPIS 247

Query: 359 VRIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGYG----SKDGKNFWIVKNSWGTT 526
           V +DA   SF ++  +GIY++P+C S +L H VLVVGYG    + +   +W+VKNSWG  
Sbjct: 248 VAMDAGHSSF-QFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPE 306

Query: 527 WGRKGYILMSKDEDNQCGIATEASYP 604
           WG  GY+ ++KD++N CGIAT ASYP
Sbjct: 307 WGSNGYVKIAKDKNNHCGIATAASYP 332

>sp|Q10991|CATL_SHEEP Cathepsin L [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]
          Length = 217

 Score =  199 bits (507), Expect = 5e-51
 Identities = 107/205 (52%), Positives = 127/205 (61%), Gaps = 2/205 (0%)
 Frame = +2

Query: 2   PVKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSF 181
           PVKNQ  CGSC+AFSATG+LEGQ FR+T KLVS SEQ +VD S   GN+GC GG    +F
Sbjct: 15  PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQGNQGCNGGLMDNAF 74

Query: 182 D-XXXXXXXXXXXXXXXXXXXXRCRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPIS 358
                                  C Y       K  GF +I  R E+AL +AVA +GPIS
Sbjct: 75  QYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDTGFVDIPQR-EKALMKAVATVGPIS 133

Query: 359 VRIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGYGSKDGKN-FWIVKNSWGTTWGR 535
           V IDA   SF ++  +GIYYDP+C S  L H VLVVGYG +   N FWIVKNSWG  WG 
Sbjct: 134 VAIDAGHSSF-QFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTNNKFWIVKNSWGPEWGN 192

Query: 536 KGYILMSKDEDNQCGIATEASYPLI 610
           KGY+ M+KD++N CGIAT ASYP +
Sbjct: 193 KGYVKMAKDQNNHCGIATAASYPTV 217

>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
           Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 333

 Score =  195 bits (495), Expect = 1e-49
 Identities = 99/208 (47%), Positives = 125/208 (60%), Gaps = 5/208 (2%)
 Frame = +2

Query: 2   PVKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSF 181
           PVKNQ  CGSC+AFSATG+LEGQ FR+T +L+S SEQ +VDCS   GN GC GG    +F
Sbjct: 128 PVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAF 187

Query: 182 DXXXXXXXXXXXXXXXXXXXXR-CRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPIS 358
                                  C+YN    +    GF +I  + E+AL +AVA +GPIS
Sbjct: 188 QYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDI-PKQEKALMKAVATVGPIS 246

Query: 359 VRIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGYG----SKDGKNFWIVKNSWGTT 526
           V IDA   SF+ Y   GIY++P+C S+ + H VLVVGYG      D   +W+VKNSWG  
Sbjct: 247 VAIDAGHESFLFYK-EGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEE 305

Query: 527 WGRKGYILMSKDEDNQCGIATEASYPLI 610
           WG  GY+ M+KD  N CGIA+ ASYP +
Sbjct: 306 WGMGGYVKMAKDRRNHCGIASAASYPTV 333

>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  194 bits (493), Expect = 2e-49
 Identities = 104/209 (49%), Positives = 124/209 (59%), Gaps = 6/209 (2%)
 Frame = +2

Query: 2   PVKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSF 181
           PVKNQ  CGSC+AFSATG+LEGQ FR+T KLVS SEQ +VDCS   GN+GC GG    +F
Sbjct: 128 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAF 187

Query: 182 D--XXXXXXXXXXXXXXXXXXXXRCRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPI 355
                                   C Y          GF +I  R E+AL +AVA +GPI
Sbjct: 188 QYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQR-EKALMKAVATVGPI 246

Query: 356 SVRIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGYG----SKDGKNFWIVKNSWGT 523
           SV IDA   SF ++  +GIYYDP+C    L H VLVVGYG      +   FWIVKNSWG 
Sbjct: 247 SVAIDAGHTSF-QFYKSGIYYDPDCSCKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGP 305

Query: 524 TWGRKGYILMSKDEDNQCGIATEASYPLI 610
            WG  GY+ M+KD++N CGIAT ASYP +
Sbjct: 306 EWGWNGYVKMAKDQNNHCGIATAASYPTV 334

>sp|Q9GLE3|CATK_PIG Cathepsin K precursor
          Length = 330

 Score =  194 bits (493), Expect = 2e-49
 Identities = 99/202 (49%), Positives = 128/202 (63%), Gaps = 1/202 (0%)
 Frame = +2

Query: 2   PVKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSF 181
           PVKNQ  CGSC+AFS+ G+LEGQ  ++T KL++ S Q +VDC  E  N GCGGG+   +F
Sbjct: 130 PVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE--NDGCGGGYMTNAF 187

Query: 182 DXXXXXXXXXXXXXXXXXXXXR-CRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPIS 358
                                  C YN +    K +G+  I   +E+AL +AVA +GP+S
Sbjct: 188 QYVQKNRGIDSEDAYPYVGQDENCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVS 247

Query: 359 VRIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGYGSKDGKNFWIVKNSWGTTWGRK 538
           V IDAS  SF  YS  G+YYD NC+SD+L HAVL VGYG + GK  WI+KNSWG  WG K
Sbjct: 248 VAIDASLTSFQFYS-KGVYYDENCNSDNLNHAVLAVGYGIQKGKKHWIIKNSWGENWGNK 306

Query: 539 GYILMSKDEDNQCGIATEASYP 604
           GYILM+++++N CGIA  AS+P
Sbjct: 307 GYILMARNKNNACGIANLASFP 328

>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
           proteinase) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  193 bits (490), Expect = 5e-49
 Identities = 99/208 (47%), Positives = 131/208 (62%), Gaps = 5/208 (2%)
 Frame = +2

Query: 2   PVKNQKTCGSCYAFSATGSLEGQYFRETKKLVSFSEQQIVDCSEEFGNRGCGGGFSKLSF 181
           PVKNQ  CGSC+AFSA+G LEGQ F +T KL+S SEQ +VDCS   GN+GC GG    +F
Sbjct: 128 PVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAF 187

Query: 182 D-XXXXXXXXXXXXXXXXXXXXRCRYNKSKVIVKSKGFTNIRSRDEEALAQAVAYIGPIS 358
                                  C+Y     +    GF +I  + E+AL +AVA +GPIS
Sbjct: 188 QYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPIS 246

Query: 359 VRIDASRRSFIEYSGNGIYYDPNCDSDHLRHAVLVVGY---GSKDGKN-FWIVKNSWGTT 526
           V +DAS  S +++  +GIYY+PNC S +L H VL+VGY   G+   KN +W+VKNSWG+ 
Sbjct: 247 VAMDASHPS-LQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSE 305

Query: 527 WGRKGYILMSKDEDNQCGIATEASYPLI 610
           WG +GYI ++KD DN CG+AT ASYP++
Sbjct: 306 WGMEGYIKIAKDRDNHCGLATAASYPVV 333

  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 75,956,470
Number of Sequences: 369166
Number of extensions: 1453623
Number of successful extensions: 4198
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3712
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3854
length of database: 68,354,980
effective HSP length: 107
effective length of database: 48,588,335
effective search space used: 6510836890
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
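
Note on the statistics above: the bit scores and E-values in the hit list appear consistent with the standard Karlin-Altschul relations, S' = (lambda*S - ln K) / ln 2 and E = (effective search space) * 2^(-S'), applied with the gapped Lambda and K. The Python sketch below recomputes them from the raw scores shown in parentheses in the alignment headers. The function names are illustrative and not part of BLAST, and because the report prints Lambda and K rounded to three significant figures, the results should agree with the reported values only approximately.

```python
import math

# Values copied from the statistics block of this report (gapped parameters).
LAMBDA = 0.267
K = 0.0410
SEARCH_SPACE = 6510836890  # "effective search space used"


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score to bits: S' = (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=SEARCH_SPACE):
    """Expected number of chance hits: E = search_space * 2**(-S')."""
    return search_space * 2.0 ** (-bit_score(raw_score))


# Raw scores taken from the "Score = ... bits (raw)" lines of three hits.
for name, raw in [("CYSP2_HOMAM", 531), ("CATL_DROME", 517), ("CATL_MOUSE", 490)]:
    print(f"{name}: {bit_score(raw):.1f} bits, E ~ {expect_value(raw):.1e}")
```

Small differences from the printed scores and E-values are expected, since BLAST computes them with unrounded parameters.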