Planarian EST Database


Dr_sW_018_I20

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_018_I20
         (891 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P07154|CATL_RAT  Cathepsin L precursor (Major excreted pr...   281   2e-75
sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   281   2e-75
sp|P06797|CATL_MOUSE  Cathepsin L precursor (Major excreted ...   280   4e-75
sp|Q95029|CATL_DROME  Cathepsin L precursor (Cysteine protei...   278   1e-74
sp|Q10991|CATL_SHEEP  Cathepsin L [Contains: Cathepsin L hea...   271   1e-72
sp|P25975|CATL_BOVIN  Cathepsin L precursor [Contains: Cathe...   271   2e-72
sp|P07711|CATL_HUMAN  Cathepsin L precursor (Major excreted ...   268   1e-71
sp|Q9GL24|CATL_CANFA  Cathepsin L precursor [Contains: Cathe...   268   2e-71
sp|Q26636|CATL_SARPE  Cathepsin L precursor [Contains: Cathe...   266   6e-71
sp|Q28944|CATL_PIG  Cathepsin L precursor [Contains: Catheps...   266   6e-71

>sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted protein) (MEP) (Cyclic
           protein 2) (CP-2) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  281 bits (719), Expect = 2e-75
 Identities = 133/220 (60%), Positives = 166/220 (75%), Gaps = 3/220 (1%)
 Frame = +1

Query: 88  KLPDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSMEGQYFKNNKQLVSFSEQQLVDCSND 267
           ++P +V+W +KG VT VKNQGQCGSCWAFS +G +EGQ F    +L+S SEQ LVDCS+D
Sbjct: 113 QIPKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHD 172

Query: 268 FGNQGCNGGLMDSAFQYIMQY-GLEKERDYPYTAQDGDCEYSKEKVVAHCQGFTDISHGS 444
            GNQGCNGGLMD AFQYI +  GL+ E  YPY A+DG C+Y  E  VA+  GF DI    
Sbjct: 173 QGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDIPQ-Q 231

Query: 445 EEDLAEKLATVGPISVGIDASNPSFQFYKAGVYDEENCSSTQLDHGVLAVGYGND--EDS 618
           E+ L + +ATVGPISV +DAS+PS QFY +G+Y E NCSS  LDHGVL VGYG +  + +
Sbjct: 232 EKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSN 291

Query: 619 QQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYP 738
           +  YWLVKNSWGK WG++GYIK++KD++N CG+AT ASYP
Sbjct: 292 KDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYP 331
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score =  281 bits (718), Expect = 2e-75
 Identities = 135/221 (61%), Positives = 162/221 (73%), Gaps = 3/221 (1%)
 Frame = +1

Query: 91  LPDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSMEGQYFKNNKQLVSFSEQQLVDCSNDF 270
           LP SV+W KKGYVT VKNQ QCGSCWAFS TG++EGQ F+   +LVS SEQ LVDCS   
Sbjct: 114 LPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQ 173

Query: 271 GNQGCNGGLMDSAFQYIMQY-GLEKERDYPYTAQDGDCEYSKEKVVAHCQGFTDISHGSE 447
           GNQGCNGG M  AFQY+ +  GL+ E  YPY A D  C+Y  E  VA+  GFT ++ G E
Sbjct: 174 GNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKE 233

Query: 448 EDLAEKLATVGPISVGIDASNPSFQFYKAGVYDEENCSSTQLDHGVLAVGYGNDEDSQQN 627
           + L + +ATVGPISV +DA + SFQFYK+G+Y E +CSS  LDHGVL VGYG +  +  N
Sbjct: 234 KALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNN 293

Query: 628 --YWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 744
             YWLVKNSWG  WG NGY+K++KDK+N CGIAT ASYPN+
Sbjct: 294 SKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334
>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
           proteinase) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  280 bits (716), Expect = 4e-75
 Identities = 137/220 (62%), Positives = 164/220 (74%), Gaps = 3/220 (1%)
 Frame = +1

Query: 88  KLPDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSMEGQYFKNNKQLVSFSEQQLVDCSND 267
           K+P SV+W +KG VT VKNQGQCGSCWAFS +G +EGQ F    +L+S SEQ LVDCS+ 
Sbjct: 113 KIPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHA 172

Query: 268 FGNQGCNGGLMDSAFQYIMQY-GLEKERDYPYTAQDGDCEYSKEKVVAHCQGFTDISHGS 444
            GNQGCNGGLMD AFQYI +  GL+ E  YPY A+DG C+Y  E  VA+  GF DI    
Sbjct: 173 QGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQ-Q 231

Query: 445 EEDLAEKLATVGPISVGIDASNPSFQFYKAGVYDEENCSSTQLDHGVLAVGYGND-EDSQ 621
           E+ L + +ATVGPISV +DAS+PS QFY +G+Y E NCSS  LDHGVL VGYG +  DS 
Sbjct: 232 EKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSN 291

Query: 622 QN-YWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYP 738
           +N YWLVKNSWG  WG+ GYIK++KD+DN CG+AT ASYP
Sbjct: 292 KNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYP 331
>sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine proteinase 1) [Contains: Cathepsin
           L heavy chain; Cathepsin L light chain]
          Length = 341

 Score =  278 bits (712), Expect = 1e-74
 Identities = 134/243 (55%), Positives = 172/243 (70%), Gaps = 3/243 (1%)
 Frame = +1

Query: 19  YLQYKPMVKLDHPIKSANYSTKTK--LPDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSM 192
           Y  +K +   D   K   + +     LP SV+W  KG VT VK+QG CGSCWAFS+TG++
Sbjct: 98  YTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAFSSTGAL 157

Query: 193 EGQYFKNNKQLVSFSEQQLVDCSNDFGNQGCNGGLMDSAFQYIMQY-GLEKERDYPYTAQ 369
           EGQ+F+ +  LVS SEQ LVDCS  +GN GCNGGLMD+AF+YI    G++ E+ YPY A 
Sbjct: 158 EGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAI 217

Query: 370 DGDCEYSKEKVVAHCQGFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKAGVYDE 549
           D  C ++K  V A  +GFTDI  G E+ +AE +ATVGP+SV IDAS+ SFQFY  GVY+E
Sbjct: 218 DDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGVYNE 277

Query: 550 ENCSSTQLDHGVLAVGYGNDEDSQQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMA 729
             C +  LDHGVL VG+G DE S ++YWLVKNSWG +WG  G+IKM ++K+NQCGIA+ +
Sbjct: 278 PQCDAQNLDHGVLVVGFGTDE-SGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASAS 336

Query: 730 SYP 738
           SYP
Sbjct: 337 SYP 339
>sp|Q10991|CATL_SHEEP Cathepsin L [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]
          Length = 217

 Score =  271 bits (694), Expect = 1e-72
 Identities = 132/219 (60%), Positives = 157/219 (71%), Gaps = 1/219 (0%)
 Frame = +1

Query: 91  LPDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSMEGQYFKNNKQLVSFSEQQLVDCSNDF 270
           +P SV+W KKGYVT VKNQGQCGSCWAFS TG++EGQ F+   +LVS SEQ LVD S   
Sbjct: 1   VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQ 60

Query: 271 GNQGCNGGLMDSAFQYIMQY-GLEKERDYPYTAQDGDCEYSKEKVVAHCQGFTDISHGSE 447
           GNQGCNGGLMD+AFQYI +  GL+ E  YPY A D  C Y  E   A   GF DI    E
Sbjct: 61  GNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDTGFVDIPQ-RE 119

Query: 448 EDLAEKLATVGPISVGIDASNPSFQFYKAGVYDEENCSSTQLDHGVLAVGYGNDEDSQQN 627
           + L + +ATVGPISV IDA + SFQFYK+G+Y + +CSS  LDHGVL VGYG  E +   
Sbjct: 120 KALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYG-FEGTNNK 178

Query: 628 YWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 744
           +W+VKNSWG  WG  GY+KM+KD++N CGIAT ASYP +
Sbjct: 179 FWIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAASYPTV 217
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  271 bits (693), Expect = 2e-72
 Identities = 135/222 (60%), Positives = 159/222 (71%), Gaps = 4/222 (1%)
 Frame = +1

Query: 91  LPDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSMEGQYFKNNKQLVSFSEQQLVDCSNDF 270
           +P SV+W KKGYVT VKNQGQCGSCWAFS TG++EGQ F+   +LVS SEQ LVDCS   
Sbjct: 114 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQ 173

Query: 271 GNQGCNGGLMDSAFQYIMQY-GLEKERDYPYTAQD-GDCEYSKEKVVAHCQGFTDISHGS 444
           GNQGCNGGLMD+AFQYI    GL+ E  YPY A D   C Y  E   A+  GF DI    
Sbjct: 174 GNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQ-R 232

Query: 445 EEDLAEKLATVGPISVGIDASNPSFQFYKAGVYDEENCSSTQLDHGVLAVGYG-NDEDSQ 621
           E+ L + +ATVGPISV IDA + SFQFYK+G+Y + +CS   LDHGVL VGYG    DS 
Sbjct: 233 EKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSCKDLDHGVLVVGYGFEGTDSN 292

Query: 622 QN-YWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 744
            N +W+VKNSWG  WG NGY+KM+KD++N CGIAT ASYP +
Sbjct: 293 NNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
           Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 333

 Score =  268 bits (686), Expect = 1e-71
 Identities = 128/220 (58%), Positives = 158/220 (71%), Gaps = 3/220 (1%)
 Frame = +1

Query: 94  PDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSMEGQYFKNNKQLVSFSEQQLVDCSNDFG 273
           P SV+W +KGYVT VKNQGQCGSCWAFS TG++EGQ F+   +L+S SEQ LVDCS   G
Sbjct: 115 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 174

Query: 274 NQGCNGGLMDSAFQYIMQY-GLEKERDYPYTAQDGDCEYSKEKVVAHCQGFTDISHGSEE 450
           N+GCNGGLMD AFQY+    GL+ E  YPY A +  C+Y+ +  VA+  GF DI    E+
Sbjct: 175 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEK 233

Query: 451 DLAEKLATVGPISVGIDASNPSFQFYKAGVYDEENCSSTQLDHGVLAVGYG--NDEDSQQ 624
            L + +ATVGPISV IDA + SF FYK G+Y E +CSS  +DHGVL VGYG  + E    
Sbjct: 234 ALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNN 293

Query: 625 NYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 744
            YWLVKNSWG+ WG+ GY+KM+KD+ N CGIA+ ASYP +
Sbjct: 294 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 333

 Score =  268 bits (684), Expect = 2e-71
 Identities = 128/222 (57%), Positives = 161/222 (72%), Gaps = 3/222 (1%)
 Frame = +1

Query: 88  KLPDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSMEGQYFKNNKQLVSFSEQQLVDCSND 267
           ++P SV+W +KGYVT VKNQGQCGSCWAFS TG++EGQ F+   +LVS SEQ LVDCS  
Sbjct: 113 EIPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRA 172

Query: 268 FGNQGCNGGLMDSAFQYIMQY-GLEKERDYPYTAQDGD-CEYSKEKVVAHCQGFTDISHG 441
            GN+GCNGGLMD+AF+Y+    GL+ E  YPY  +D + C Y  E   A+  GF D+   
Sbjct: 173 QGNEGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLPQ- 231

Query: 442 SEEDLAEKLATVGPISVGIDASNPSFQFYKAGVYDEENCSSTQLDHGVLAVGYG-NDEDS 618
            E+ L + +AT+GPISV IDA + SFQFYK+G+Y + +CSS  LDHGVL VGYG    DS
Sbjct: 232 REKALMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDS 291

Query: 619 QQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 744
              +W+VKNSWG  WG NGY+KM+KD++N CGIAT ASYP +
Sbjct: 292 NNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 333
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 339

 Score =  266 bits (680), Expect = 6e-71
 Identities = 124/219 (56%), Positives = 158/219 (72%), Gaps = 1/219 (0%)
 Frame = +1

Query: 91  LPDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSMEGQYFKNNKQLVSFSEQQLVDCSNDF 270
           +P SV+W + G VT VK+QG CGSCWAFS+TG++EGQ+F+    LVS SEQ LVDCS  +
Sbjct: 122 VPKSVDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKY 181

Query: 271 GNQGCNGGLMDSAFQYIMQY-GLEKERDYPYTAQDGDCEYSKEKVVAHCQGFTDISHGSE 447
           GN GCNGGLMD+AF+YI    G++ E+ YPY   D  C ++K  + A   GF DI  G E
Sbjct: 182 GNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDE 241

Query: 448 EDLAEKLATVGPISVGIDASNPSFQFYKAGVYDEENCSSTQLDHGVLAVGYGNDEDSQQN 627
           E + + +AT+GP+SV IDAS+ SFQ Y  GVY+E  C    LDHGVL VGYG DE S  +
Sbjct: 242 EKMKKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDE-SGMD 300

Query: 628 YWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 744
           YWLVKNSWG +WG  GYIKM+++++NQCGIAT +SYP +
Sbjct: 301 YWLVKNSWGTTWGEQGYIKMARNQNNQCGIATASSYPTV 339
>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  266 bits (680), Expect = 6e-71
 Identities = 128/223 (57%), Positives = 160/223 (71%), Gaps = 4/223 (1%)
 Frame = +1

Query: 88  KLPDSVNWVKKGYVTQVKNQGQCGSCWAFSTTGSMEGQYFKNNKQLVSFSEQQLVDCSND 267
           ++P SV+W +KGYVT VKNQGQCGSCWAFS TG++EGQ F+   +LVS SEQ LVDCS  
Sbjct: 113 EVPKSVDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRP 172

Query: 268 FGNQGCNGGLMDSAFQYIMQY-GLEKERDYPYTAQD-GDCEYSKEKVVAHCQGFTDISHG 441
            GNQGCNGGLMD+AFQY+    GL+ E  YPY  ++   C Y  E   A+  GF DI   
Sbjct: 173 QGNQGCNGGLMDNAFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFVDIPQ- 231

Query: 442 SEEDLAEKLATVGPISVGIDASNPSFQFYKAGVYDEENCSSTQLDHGVLAVGYG--NDED 615
            E+ L + +ATVGPISV IDA + SFQFYK+G+Y + +CSS  LDHGVL VGYG    + 
Sbjct: 232 REKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDS 291

Query: 616 SQQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 744
           +   +W+VKNSWG  WG NGY+KM+KD++N CGI+T ASYP +
Sbjct: 292 NSSKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGISTAASYPTV 334
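
Note on coordinates: BLASTX translates the query, so the Query positions in the alignments above count nucleotides while the Sbjct positions count amino acids. The sketch below shows how the two relate. It assumes forward frames only (every hit in this report is Frame = +1), and the function names are illustrative, not part of BLAST.

```python
def forward_frame(query_start: int) -> int:
    """Reading frame (+1, +2 or +3) for a forward-strand hit.

    query_start is the 1-based nucleotide position of the first
    aligned codon, as printed on the first 'Query:' line.
    """
    return (query_start - 1) % 3 + 1


def translated_length(query_start: int, query_end: int) -> int:
    """Number of codons (translated residues) spanned by a
    forward-strand segment, using inclusive 1-based coordinates."""
    span = query_end - query_start + 1
    if span % 3 != 0:
        raise ValueError("segment does not cover whole codons")
    return span // 3


# First alignment row of the CATL_RAT hit: Query: 88 ... 267
print(forward_frame(88))            # frame of the hit
print(translated_length(88, 267))   # residues shown on that row
```

Gap characters in the Query row are not counted by these coordinates, so a row containing "-" shows more columns than translated residues.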
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 100,371,015
Number of Sequences: 369166
Number of extensions: 2062493
Number of successful extensions: 6090
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 5226
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5486
length of database: 68,354,980
effective HSP length: 110
effective length of database: 48,034,130
effective search space used: 8934348180
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
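
Note on the statistics: the bit scores and Expect values in this report can be approximately reproduced from the parameters listed above with the standard Karlin-Altschul relations, S' = (lambda * S - ln K) / ln 2 and E = (search space) * 2^(-S'). The sketch below assumes the gapped Lambda and K apply to the reported alignments and that "effective search space used" is the search-space term. Because Lambda and K are printed to only three significant digits, the results match the report only to within rounding.

```python
import math

# Values copied from the report footer.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
UNGAPPED_LAMBDA = 0.318
UNGAPPED_K = 0.134
EFFECTIVE_SEARCH_SPACE = 8934348180


def bit_score(raw_score: float, lam: float, k: float) -> float:
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits: float, search_space: float) -> float:
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


# Top hit (CATL_RAT): raw score 719, reported as 281 bits, Expect = 2e-75.
bits = bit_score(719, GAPPED_LAMBDA, GAPPED_K)
print(f"bit score ~ {bits:.1f}")
print(f"expect    ~ {expect_value(bits, EFFECTIVE_SEARCH_SPACE):.1e}")

# S1 cutoff: raw score 41 with the ungapped parameters (reported as 21.7 bits).
print(f"S1 bits   ~ {bit_score(41, UNGAPPED_LAMBDA, UNGAPPED_K):.1f}")
```

The "effective length of database" line is also consistent with subtracting the effective HSP length once per sequence: 68,354,980 - 184,735 * 110 = 48,034,130.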