Planarian EST Database


Dr_sW_025_P20

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_025_P20
         (820 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P07154|CATL_RAT  Cathepsin L precursor (Major excreted pr...   281   2e-75
sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   280   3e-75
sp|P06797|CATL_MOUSE  Cathepsin L precursor (Major excreted ...   280   4e-75
sp|Q95029|CATL_DROME  Cathepsin L precursor (Cysteine protei...   276   6e-74
sp|Q10991|CATL_SHEEP  Cathepsin L [Contains: Cathepsin L hea...   271   2e-72
sp|P25975|CATL_BOVIN  Cathepsin L precursor [Contains: Cathe...   271   2e-72
sp|P07711|CATL_HUMAN  Cathepsin L precursor (Major excreted ...   268   1e-71
sp|Q9GL24|CATL_CANFA  Cathepsin L precursor [Contains: Cathe...   267   2e-71
sp|Q28944|CATL_PIG  Cathepsin L precursor [Contains: Catheps...   266   7e-71
sp|Q26636|CATL_SARPE  Cathepsin L precursor [Contains: Cathe...   265   1e-70
>sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted protein) (MEP) (Cyclic
           protein 2) (CP-2) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  281 bits (718), Expect = 2e-75
 Identities = 133/220 (60%), Positives = 165/220 (75%), Gaps = 3/220 (1%)
 Frame = +2

Query: 17  KLPDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSMEGQYFKNNKQLVSFSEQQLVDCSND 196
           ++P +V+W +KG VT VKNQGQCGSCWAFS +G +EGQ F    +L+S SEQ LVDCS+D
Sbjct: 113 QIPKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHD 172

Query: 197 FGNQGCNGGLMDSAFQYIMQY-GLEKESDYPYTAQDGDCEYSKEKVVAHCQGFTDISHGS 373
            GNQGCNGGLMD AFQYI +  GL+ E  YPY A+DG C+Y  E  VA+  GF DI    
Sbjct: 173 QGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDIPQ-Q 231

Query: 374 EEDLAEKLATVGPISVGIDASNPSFQFYKGGVYDEENCSSTQLDHGVLAVGYGND--EDS 547
           E+ L + +ATVGPISV +DAS+PS QFY  G+Y E NCSS  LDHGVL VGYG +  + +
Sbjct: 232 EKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSN 291

Query: 548 QQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYP 667
           +  YWLVKNSWGK WG++GYIK++KD++N CG+AT ASYP
Sbjct: 292 KDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYP 331
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score =  280 bits (717), Expect = 3e-75
 Identities = 135/221 (61%), Positives = 161/221 (72%), Gaps = 3/221 (1%)
 Frame = +2

Query: 20  LPDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSMEGQYFKNNKQLVSFSEQQLVDCSNDF 199
           LP SV+W KKGYVT VKNQ QCGSCWAFS TG++EGQ F+   +LVS SEQ LVDCS   
Sbjct: 114 LPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQ 173

Query: 200 GNQGCNGGLMDSAFQYIMQY-GLEKESDYPYTAQDGDCEYSKEKVVAHCQGFTDISHGSE 376
           GNQGCNGG M  AFQY+ +  GL+ E  YPY A D  C+Y  E  VA+  GFT ++ G E
Sbjct: 174 GNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKE 233

Query: 377 EDLAEKLATVGPISVGIDASNPSFQFYKGGVYDEENCSSTQLDHGVLAVGYGNDEDSQQN 556
           + L + +ATVGPISV +DA + SFQFYK G+Y E +CSS  LDHGVL VGYG +  +  N
Sbjct: 234 KALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNN 293

Query: 557 --YWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 673
             YWLVKNSWG  WG NGY+K++KDK+N CGIAT ASYPN+
Sbjct: 294 SKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334
>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
           proteinase) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  280 bits (715), Expect = 4e-75
 Identities = 137/220 (62%), Positives = 163/220 (74%), Gaps = 3/220 (1%)
 Frame = +2

Query: 17  KLPDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSMEGQYFKNNKQLVSFSEQQLVDCSND 196
           K+P SV+W +KG VT VKNQGQCGSCWAFS +G +EGQ F    +L+S SEQ LVDCS+ 
Sbjct: 113 KIPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHA 172

Query: 197 FGNQGCNGGLMDSAFQYIMQY-GLEKESDYPYTAQDGDCEYSKEKVVAHCQGFTDISHGS 373
            GNQGCNGGLMD AFQYI +  GL+ E  YPY A+DG C+Y  E  VA+  GF DI    
Sbjct: 173 QGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQ-Q 231

Query: 374 EEDLAEKLATVGPISVGIDASNPSFQFYKGGVYDEENCSSTQLDHGVLAVGYGND-EDSQ 550
           E+ L + +ATVGPISV +DAS+PS QFY  G+Y E NCSS  LDHGVL VGYG +  DS 
Sbjct: 232 EKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSN 291

Query: 551 QN-YWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYP 667
           +N YWLVKNSWG  WG+ GYIK++KD+DN CG+AT ASYP
Sbjct: 292 KNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYP 331
>sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine proteinase 1) [Contains: Cathepsin
           L heavy chain; Cathepsin L light chain]
          Length = 341

 Score =  276 bits (705), Expect = 6e-74
 Identities = 130/217 (59%), Positives = 163/217 (75%), Gaps = 1/217 (0%)
 Frame = +2

Query: 20  LPDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSMEGQYFKNNKQLVSFSEQQLVDCSNDF 199
           LP SV+W  KG VT VK+QG CGSCWAFS+TG++EGQ+F+ +  LVS SEQ LVDCS  +
Sbjct: 124 LPKSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKY 183

Query: 200 GNQGCNGGLMDSAFQYIMQY-GLEKESDYPYTAQDGDCEYSKEKVVAHCQGFTDISHGSE 376
           GN GCNGGLMD+AF+YI    G++ E  YPY A D  C ++K  V A  +GFTDI  G E
Sbjct: 184 GNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDE 243

Query: 377 EDLAEKLATVGPISVGIDASNPSFQFYKGGVYDEENCSSTQLDHGVLAVGYGNDEDSQQN 556
           + +AE +ATVGP+SV IDAS+ SFQFY  GVY+E  C +  LDHGVL VG+G DE S ++
Sbjct: 244 KKMAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDE-SGED 302

Query: 557 YWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYP 667
           YWLVKNSWG +WG  G+IKM ++K+NQCGIA+ +SYP
Sbjct: 303 YWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYP 339
>sp|Q10991|CATL_SHEEP Cathepsin L [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]
          Length = 217

 Score =  271 bits (693), Expect = 2e-72
 Identities = 132/219 (60%), Positives = 156/219 (71%), Gaps = 1/219 (0%)
 Frame = +2

Query: 20  LPDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSMEGQYFKNNKQLVSFSEQQLVDCSNDF 199
           +P SV+W KKGYVT VKNQGQCGSCWAFS TG++EGQ F+   +LVS SEQ LVD S   
Sbjct: 1   VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQ 60

Query: 200 GNQGCNGGLMDSAFQYIMQY-GLEKESDYPYTAQDGDCEYSKEKVVAHCQGFTDISHGSE 376
           GNQGCNGGLMD+AFQYI +  GL+ E  YPY A D  C Y  E   A   GF DI    E
Sbjct: 61  GNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDTGFVDIPQ-RE 119

Query: 377 EDLAEKLATVGPISVGIDASNPSFQFYKGGVYDEENCSSTQLDHGVLAVGYGNDEDSQQN 556
           + L + +ATVGPISV IDA + SFQFYK G+Y + +CSS  LDHGVL VGYG  E +   
Sbjct: 120 KALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYG-FEGTNNK 178

Query: 557 YWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 673
           +W+VKNSWG  WG  GY+KM+KD++N CGIAT ASYP +
Sbjct: 179 FWIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAASYPTV 217
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  271 bits (692), Expect = 2e-72
 Identities = 135/222 (60%), Positives = 158/222 (71%), Gaps = 4/222 (1%)
 Frame = +2

Query: 20  LPDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSMEGQYFKNNKQLVSFSEQQLVDCSNDF 199
           +P SV+W KKGYVT VKNQGQCGSCWAFS TG++EGQ F+   +LVS SEQ LVDCS   
Sbjct: 114 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQ 173

Query: 200 GNQGCNGGLMDSAFQYIMQY-GLEKESDYPYTAQD-GDCEYSKEKVVAHCQGFTDISHGS 373
           GNQGCNGGLMD+AFQYI    GL+ E  YPY A D   C Y  E   A+  GF DI    
Sbjct: 174 GNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQ-R 232

Query: 374 EEDLAEKLATVGPISVGIDASNPSFQFYKGGVYDEENCSSTQLDHGVLAVGYG-NDEDSQ 550
           E+ L + +ATVGPISV IDA + SFQFYK G+Y + +CS   LDHGVL VGYG    DS 
Sbjct: 233 EKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSCKDLDHGVLVVGYGFEGTDSN 292

Query: 551 QN-YWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 673
            N +W+VKNSWG  WG NGY+KM+KD++N CGIAT ASYP +
Sbjct: 293 NNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
           Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 333

 Score =  268 bits (685), Expect = 1e-71
 Identities = 128/220 (58%), Positives = 158/220 (71%), Gaps = 3/220 (1%)
 Frame = +2

Query: 23  PDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSMEGQYFKNNKQLVSFSEQQLVDCSNDFG 202
           P SV+W +KGYVT VKNQGQCGSCWAFS TG++EGQ F+   +L+S SEQ LVDCS   G
Sbjct: 115 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 174

Query: 203 NQGCNGGLMDSAFQYIMQY-GLEKESDYPYTAQDGDCEYSKEKVVAHCQGFTDISHGSEE 379
           N+GCNGGLMD AFQY+    GL+ E  YPY A +  C+Y+ +  VA+  GF DI    E+
Sbjct: 175 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEK 233

Query: 380 DLAEKLATVGPISVGIDASNPSFQFYKGGVYDEENCSSTQLDHGVLAVGYG--NDEDSQQ 553
            L + +ATVGPISV IDA + SF FYK G+Y E +CSS  +DHGVL VGYG  + E    
Sbjct: 234 ALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNN 293

Query: 554 NYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 673
            YWLVKNSWG+ WG+ GY+KM+KD+ N CGIA+ ASYP +
Sbjct: 294 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 333

 Score =  267 bits (683), Expect = 2e-71
 Identities = 128/222 (57%), Positives = 160/222 (72%), Gaps = 3/222 (1%)
 Frame = +2

Query: 17  KLPDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSMEGQYFKNNKQLVSFSEQQLVDCSND 196
           ++P SV+W +KGYVT VKNQGQCGSCWAFS TG++EGQ F+   +LVS SEQ LVDCS  
Sbjct: 113 EIPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRA 172

Query: 197 FGNQGCNGGLMDSAFQYIMQY-GLEKESDYPYTAQDGD-CEYSKEKVVAHCQGFTDISHG 370
            GN+GCNGGLMD+AF+Y+    GL+ E  YPY  +D + C Y  E   A+  GF D+   
Sbjct: 173 QGNEGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLPQ- 231

Query: 371 SEEDLAEKLATVGPISVGIDASNPSFQFYKGGVYDEENCSSTQLDHGVLAVGYG-NDEDS 547
            E+ L + +AT+GPISV IDA + SFQFYK G+Y + +CSS  LDHGVL VGYG    DS
Sbjct: 232 REKALMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDS 291

Query: 548 QQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 673
              +W+VKNSWG  WG NGY+KM+KD++N CGIAT ASYP +
Sbjct: 292 NNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 333
>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  266 bits (679), Expect = 7e-71
 Identities = 128/223 (57%), Positives = 159/223 (71%), Gaps = 4/223 (1%)
 Frame = +2

Query: 17  KLPDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSMEGQYFKNNKQLVSFSEQQLVDCSND 196
           ++P SV+W +KGYVT VKNQGQCGSCWAFS TG++EGQ F+   +LVS SEQ LVDCS  
Sbjct: 113 EVPKSVDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRP 172

Query: 197 FGNQGCNGGLMDSAFQYIMQY-GLEKESDYPYTAQD-GDCEYSKEKVVAHCQGFTDISHG 370
            GNQGCNGGLMD+AFQY+    GL+ E  YPY  ++   C Y  E   A+  GF DI   
Sbjct: 173 QGNQGCNGGLMDNAFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFVDIPQ- 231

Query: 371 SEEDLAEKLATVGPISVGIDASNPSFQFYKGGVYDEENCSSTQLDHGVLAVGYG--NDED 544
            E+ L + +ATVGPISV IDA + SFQFYK G+Y + +CSS  LDHGVL VGYG    + 
Sbjct: 232 REKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDS 291

Query: 545 SQQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 673
           +   +W+VKNSWG  WG NGY+KM+KD++N CGI+T ASYP +
Sbjct: 292 NSSKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGISTAASYPTV 334
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 339

 Score =  265 bits (677), Expect = 1e-70
 Identities = 124/219 (56%), Positives = 157/219 (71%), Gaps = 1/219 (0%)
 Frame = +2

Query: 20  LPDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSMEGQYFKNNKQLVSFSEQQLVDCSNDF 199
           +P SV+W + G VT VK+QG CGSCWAFS+TG++EGQ+F+    LVS SEQ LVDCS  +
Sbjct: 122 VPKSVDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKY 181

Query: 200 GNQGCNGGLMDSAFQYIMQY-GLEKESDYPYTAQDGDCEYSKEKVVAHCQGFTDISHGSE 376
           GN GCNGGLMD+AF+YI    G++ E  YPY   D  C ++K  + A   GF DI  G E
Sbjct: 182 GNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDE 241

Query: 377 EDLAEKLATVGPISVGIDASNPSFQFYKGGVYDEENCSSTQLDHGVLAVGYGNDEDSQQN 556
           E + + +AT+GP+SV IDAS+ SFQ Y  GVY+E  C    LDHGVL VGYG DE S  +
Sbjct: 242 EKMKKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDE-SGMD 300

Query: 557 YWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 673
           YWLVKNSWG +WG  GYIKM+++++NQCGIAT +SYP +
Sbjct: 301 YWLVKNSWGTTWGEQGYIKMARNQNNQCGIATASSYPTV 339
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 96,660,745
Number of Sequences: 369166
Number of extensions: 2017483
Number of successful extensions: 5975
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 5121
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5367
length of database: 68,354,980
effective HSP length: 109
effective length of database: 48,218,865
effective search space used: 7859674995
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)