Planarian EST Database


Dr_sW_012_M20

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_012_M20
         (811 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q95029|CATL_DROME  Cathepsin L precursor (Cysteine protei...   239   5e-63
sp|Q26636|CATL_SARPE  Cathepsin L precursor [Contains: Cathe...   235   1e-61
sp|P25782|CYSP2_HOMAM  Digestive cysteine proteinase 2 precu...   224   2e-58
sp|Q24940|CATLP_FASHE  Cathepsin L-like proteinase precursor      223   5e-58
sp|P07711|CATL_HUMAN  Cathepsin L precursor (Major excreted ...   219   9e-57
sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   219   9e-57
sp|Q9GL24|CATL_CANFA  Cathepsin L precursor [Contains: Cathe...   216   8e-56
sp|P06797|CATL_MOUSE  Cathepsin L precursor (Major excreted ...   215   1e-55
sp|P07154|CATL_RAT  Cathepsin L precursor (Major excreted pr...   215   1e-55
sp|P25975|CATL_BOVIN  Cathepsin L precursor [Contains: Cathe...   214   2e-55

>sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine proteinase 1) [Contains: Cathepsin
           L heavy chain; Cathepsin L light chain]
          Length = 341

 Score =  239 bits (611), Expect = 5e-63
 Identities = 113/227 (49%), Positives = 157/227 (69%), Gaps = 2/227 (0%)
 Frame = +3

Query: 69  FVMPYKTKLNDDEVDWRKKGVVTPVKNQYGCNAGWAFAATGSCEGRYAMQTGVLMSFSEE 248
           F+ P    L    VDWR KG VT VK+Q  C + WAF++TG+ EG++  ++GVL+S SE+
Sbjct: 116 FISPAHVTL-PKSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQ 174

Query: 249 QLIDCSDAYGNSGCKGGSPDNSFNYYKPY-GMNNEETYPYFGIKDRCKYNNNSVIAHVVS 425
            L+DCS  YGN+GC GG  DN+F Y K   G++ E++YPY  I D C +N  +V A    
Sbjct: 175 NLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATDRG 234

Query: 426 FIDVARNNEIQVAAAVSAEGPLTAIIDASLPSFQFYREGIYNDPTCSKTSPNHAVLIVGY 605
           F D+ + +E ++A AV+  GP++  IDAS  SFQFY EG+YN+P C   + +H VL+VG+
Sbjct: 235 FTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGF 294

Query: 606 GESE-GEDYWIVKNSWGAMWGDHGYIKMIRDGNNQCGIASSTSFPIL 743
           G  E GEDYW+VKNSWG  WGD G+IKM+R+  NQCGIAS++S+P++
Sbjct: 295 GTDESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 341

>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 339

 Score =  235 bits (599), Expect = 1e-61
 Identities = 107/212 (50%), Positives = 149/212 (70%), Gaps = 2/212 (0%)
 Frame = +3

Query: 108 VDWRKKGVVTPVKNQYGCNAGWAFAATGSCEGRYAMQTGVLMSFSEEQLIDCSDAYGNSG 287
           VDWR+ G VT VK+Q  C + WAF++TG+ EG++  + GVL+S SE+ L+DCS  YGN+G
Sbjct: 126 VDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNG 185

Query: 288 CKGGSPDNSFNYYKPY-GMNNEETYPYFGIKDRCKYNNNSVIAHVVSFIDVARNNEIQVA 464
           C GG  DN+F Y K   G++ E++YPY GI D C +N  ++ A    F+D+   +E ++ 
Sbjct: 186 CNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMK 245

Query: 465 AAVSAEGPLTAIIDASLPSFQFYREGIYNDPTCSKTSPNHAVLIVGYGESE-GEDYWIVK 641
            AV+  GP++  IDAS  SFQ Y EG+YN+P C + + +H VL+VGYG  E G DYW+VK
Sbjct: 246 KAVATMGPVSVAIDASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVK 305

Query: 642 NSWGAMWGDHGYIKMIRDGNNQCGIASSTSFP 737
           NSWG  WG+ GYIKM R+ NNQCGIA+++S+P
Sbjct: 306 NSWGTTWGEQGYIKMARNQNNQCGIATASSYP 337

>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precursor
          Length = 323

 Score =  224 bits (572), Expect = 2e-58
 Identities = 109/237 (45%), Positives = 152/237 (64%), Gaps = 1/237 (0%)
 Frame = +3

Query: 36  NLMSRSQINSKFVMPYKTKLNDDEVDWRKKGVVTPVKNQYGCNAGWAFAATGSCEGRYAM 215
           N+  RS   S F    +T     EVDWR KG VTPVK+Q  C + WAF+ TGS EG++ +
Sbjct: 87  NIPRRSAPVSVFYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFL 146

Query: 216 QTGVLMSFSEEQLIDCSDAYGNSGCKGGSPDNSFNYYKPY-GMNNEETYPYFGIKDRCKY 392
           +TG L+S +E+QL+DCS  YG  GC GG  +++F+Y K   G++ E  YPY      C++
Sbjct: 147 KTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPYEARDGSCRF 206

Query: 393 NNNSVIAHVVSFIDVARNNEIQVAAAVSAEGPLTAIIDASLPSFQFYREGIYNDPTCSKT 572
           ++NSV A      ++A  +E  +  AV   GP++  IDA+  SFQFY  G+Y +P+CS +
Sbjct: 207 DSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPS 266

Query: 573 SPNHAVLIVGYGESEGEDYWIVKNSWGAMWGDHGYIKMIRDGNNQCGIASSTSFPIL 743
             +HAVL VGYG   G+D+W+VKNSW   WGD GYIKM R+ NN CGIA+  S+P++
Sbjct: 267 YLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVASYPLV 323

>sp|Q24940|CATLP_FASHE Cathepsin L-like proteinase precursor
          Length = 326

 Score =  223 bits (568), Expect = 5e-58
 Identities = 105/247 (42%), Positives = 148/247 (59%), Gaps = 2/247 (0%)
 Frame = +3

Query: 9   EVRSKYFSKNLMSRSQINSKFVMPYKT--KLNDDEVDWRKKGVVTPVKNQYGCNAGWAFA 182
           E ++KY ++  MSR+       +PY+   +   D++DWR+ G VT VK+Q  C + WAF+
Sbjct: 79  EFKAKYLTE--MSRASDILSHGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFS 136

Query: 183 ATGSCEGRYAMQTGVLMSFSEEQLIDCSDAYGNSGCKGGSPDNSFNYYKPYGMNNEETYP 362
            TG+ EG+Y       +SFSE+QL+DCS  +GN+GC GG  +N++ Y K +G+  E +YP
Sbjct: 137 TTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQFGLETESSYP 196

Query: 363 YFGIKDRCKYNNNSVIAHVVSFIDVARNNEIQVAAAVSAEGPLTAIIDASLPSFQFYREG 542
           Y  ++ +C+YN    +A V  +  V   +E+++   V A  P    +D     F  YR G
Sbjct: 197 YTAVEGQCRYNKQLGVAKVTGYYTVHSGSEVELKNLVGARRPAAVAVDVE-SDFMMYRSG 255

Query: 543 IYNDPTCSKTSPNHAVLIVGYGESEGEDYWIVKNSWGAMWGDHGYIKMIRDGNNQCGIAS 722
           IY   TCS    NHAVL VGYG   G DYWIVKNSWG  WG+ GYI+M R+  N CGIAS
Sbjct: 256 IYQSQTCSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGTYWGERGYIRMARNRGNMCGIAS 315

Query: 723 STSFPIL 743
             S P++
Sbjct: 316 LASLPMV 322

>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
           Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 333

 Score =  219 bits (557), Expect = 9e-57
 Identities = 105/215 (48%), Positives = 141/215 (65%), Gaps = 5/215 (2%)
 Frame = +3

Query: 108 VDWRKKGVVTPVKNQYGCNAGWAFAATGSCEGRYAMQTGVLMSFSEEQLIDCSDAYGNSG 287
           VDWR+KG VTPVKNQ  C + WAF+ATG+ EG+   +TG L+S SE+ L+DCS   GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG 177

Query: 288 CKGGSPDNSFNYYKPY-GMNNEETYPYFGIKDRCKYNNNSVIAHVVSFIDVARNNEIQVA 464
           C GG  D +F Y +   G+++EE+YPY   ++ CKYN    +A+   F+D+ +  E  + 
Sbjct: 178 CNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKALM 236

Query: 465 AAVSAEGPLTAIIDASLPSFQFYREGIYNDPTCSKTSPNHAVLIVGYG----ESEGEDYW 632
            AV+  GP++  IDA   SF FY+EGIY +P CS    +H VL+VGYG    ES+   YW
Sbjct: 237 KAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYW 296

Query: 633 IVKNSWGAMWGDHGYIKMIRDGNNQCGIASSTSFP 737
           +VKNSWG  WG  GY+KM +D  N CGIAS+ S+P
Sbjct: 297 LVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYP 331

>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score =  219 bits (557), Expect = 9e-57
 Identities = 103/215 (47%), Positives = 139/215 (64%), Gaps = 5/215 (2%)
 Frame = +3

Query: 108 VDWRKKGVVTPVKNQYGCNAGWAFAATGSCEGRYAMQTGVLMSFSEEQLIDCSDAYGNSG 287
           VDWRKKG VTPVKNQ  C + WAF+ATG+ EG+   +TG L+S SE+ L+DCS   GN G
Sbjct: 118 VDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177

Query: 288 CKGGSPDNSFNYYKPY-GMNNEETYPYFGIKDRCKYNNNSVIAHVVSFIDVARNNEIQVA 464
           C GG    +F Y K   G+++EE+YPY  + + CKY   + +A+   F  VA   E  + 
Sbjct: 178 CNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALM 237

Query: 465 AAVSAEGPLTAIIDASLPSFQFYREGIYNDPTCSKTSPNHAVLIVGYG----ESEGEDYW 632
            AV+  GP++  +DA   SFQFY+ GIY +P CS  + +H VL+VGYG     S    YW
Sbjct: 238 KAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYW 297

Query: 633 IVKNSWGAMWGDHGYIKMIRDGNNQCGIASSTSFP 737
           +VKNSWG  WG +GY+K+ +D NN CGIA++ S+P
Sbjct: 298 LVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYP 332

>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 333

 Score =  216 bits (549), Expect = 8e-56
 Identities = 109/216 (50%), Positives = 142/216 (65%), Gaps = 6/216 (2%)
 Frame = +3

Query: 108 VDWRKKGVVTPVKNQYGCNAGWAFAATGSCEGRYAMQTGVLMSFSEEQLIDCSDAYGNSG 287
           VDWR+KG VTPVKNQ  C + WAF+ATG+ EG+   +TG L+S SE+ L+DCS A GN G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEG 177

Query: 288 CKGGSPDNSFNYYKPY-GMNNEETYPYFG-IKDRCKYNNNSVIAHVVSFIDVARNNEIQV 461
           C GG  DN+F Y K   G+++EE+YPY G   + C Y      A+   F+D+ +  E  +
Sbjct: 178 CNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLPQ-REKAL 236

Query: 462 AAAVSAEGPLTAIIDASLPSFQFYREGIYNDPTCSKTSPNHAVLIVGYGESEGED----Y 629
             AV+  GP++  IDA   SFQFY+ GIY DP CS    +H VL+VGYG  EG D    +
Sbjct: 237 MKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYG-FEGTDSNNKF 295

Query: 630 WIVKNSWGAMWGDHGYIKMIRDGNNQCGIASSTSFP 737
           WIVKNSWG  WG +GY+KM +D NN CGIA++ S+P
Sbjct: 296 WIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYP 331

>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
           proteinase) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  215 bits (548), Expect = 1e-55
 Identities = 102/217 (47%), Positives = 143/217 (65%), Gaps = 5/217 (2%)
 Frame = +3

Query: 108 VDWRKKGVVTPVKNQYGCNAGWAFAATGSCEGRYAMQTGVLMSFSEEQLIDCSDAYGNSG 287
           VDWR+KG VTPVKNQ  C + WAF+A+G  EG+  ++TG L+S SE+ L+DCS A GN G
Sbjct: 118 VDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQG 177

Query: 288 CKGGSPDNSFNYYKPY-GMNNEETYPYFGIKDRCKYNNNSVIAHVVSFIDVARNNEIQVA 464
           C GG  D +F Y K   G+++EE+YPY      CKY     +A+   F+D+ +  E  + 
Sbjct: 178 CNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQ-QEKALM 236

Query: 465 AAVSAEGPLTAIIDASLPSFQFYREGIYNDPTCSKTSPNHAVLIVGYG----ESEGEDYW 632
            AV+  GP++  +DAS PS QFY  GIY +P CS  + +H VL+VGYG    +S    YW
Sbjct: 237 KAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYW 296

Query: 633 IVKNSWGAMWGDHGYIKMIRDGNNQCGIASSTSFPIL 743
           +VKNSWG+ WG  GYIK+ +D +N CG+A++ S+P++
Sbjct: 297 LVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333

>sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted protein) (MEP) (Cyclic
           protein 2) (CP-2) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  215 bits (547), Expect = 1e-55
 Identities = 103/217 (47%), Positives = 141/217 (64%), Gaps = 5/217 (2%)
 Frame = +3

Query: 108 VDWRKKGVVTPVKNQYGCNAGWAFAATGSCEGRYAMQTGVLMSFSEEQLIDCSDAYGNSG 287
           VDWR+KG VTPVKNQ  C + WAF+A+G  EG+  ++TG L+S SE+ L+DCS   GN G
Sbjct: 118 VDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQG 177

Query: 288 CKGGSPDNSFNYYKPY-GMNNEETYPYFGIKDRCKYNNNSVIAHVVSFIDVARNNEIQVA 464
           C GG  D +F Y K   G+++EE+YPY      CKY     +A+   F+D+ +  E  + 
Sbjct: 178 CNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDIPQ-QEKALM 236

Query: 465 AAVSAEGPLTAIIDASLPSFQFYREGIYNDPTCSKTSPNHAVLIVGYG----ESEGEDYW 632
            AV+  GP++  +DAS PS QFY  GIY +P CS    +H VL+VGYG    +S  + YW
Sbjct: 237 KAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYW 296

Query: 633 IVKNSWGAMWGDHGYIKMIRDGNNQCGIASSTSFPIL 743
           +VKNSWG  WG  GYIK+ +D NN CG+A++ S+PI+
Sbjct: 297 LVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIV 333

>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  214 bits (546), Expect = 2e-55
 Identities = 106/216 (49%), Positives = 139/216 (64%), Gaps = 6/216 (2%)
 Frame = +3

Query: 108 VDWRKKGVVTPVKNQYGCNAGWAFAATGSCEGRYAMQTGVLMSFSEEQLIDCSDAYGNSG 287
           VDW KKG VTPVKNQ  C + WAF+ATG+ EG+   +TG L+S SE+ L+DCS A GN G
Sbjct: 118 VDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQG 177

Query: 288 CKGGSPDNSFNYYKPY-GMNNEETYPYFGI-KDRCKYNNNSVIAHVVSFIDVARNNEIQV 461
           C GG  DN+F Y K   G+++EE+YPY     + C Y      A+   F+D+ +  E  +
Sbjct: 178 CNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQ-REKAL 236

Query: 462 AAAVSAEGPLTAIIDASLPSFQFYREGIYNDPTCSKTSPNHAVLIVGYG----ESEGEDY 629
             AV+  GP++  IDA   SFQFY+ GIY DP CS    +H VL+VGYG    +S    +
Sbjct: 237 MKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSCKDLDHGVLVVGYGFEGTDSNNNKF 296

Query: 630 WIVKNSWGAMWGDHGYIKMIRDGNNQCGIASSTSFP 737
           WIVKNSWG  WG +GY+KM +D NN CGIA++ S+P
Sbjct: 297 WIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYP 332

  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 84,592,352
Number of Sequences: 369166
Number of extensions: 1657554
Number of successful extensions: 5051
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4479
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4629
length of database: 68,354,980
effective HSP length: 109
effective length of database: 48,218,865
effective search space used: 7715018400
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
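
Note on the statistics above: the figures in this report can be cross-checked with the standard Karlin-Altschul relations. The sketch below is illustrative only and is not part of the BLAST output. The function names are hypothetical. The constants are copied from this report (gapped Lambda and K, database size, effective HSP length, effective search space, and the raw score 611 of the top hit). Because the report prints Lambda, K and the bit score in rounded form, the recomputed values should be expected to agree only approximately with the printed ones.

```python
import math

def effective_db_length(db_letters, n_sequences, hsp_length):
    """Database length after subtracting the effective HSP length per sequence."""
    return db_letters - n_sequences * hsp_length

def bit_score(raw_score, lam, k):
    """Normalized (bit) score: S' = (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)

def expect_value(bits, search_space):
    """Expected number of chance hits: E = search_space * 2^(-S')."""
    return search_space * 2.0 ** (-bits)

# Values copied from the report (assumed here to be the relevant inputs).
DB_LETTERS = 68_354_980
N_SEQUENCES = 184_735
HSP_LENGTH = 109
SEARCH_SPACE = 7_715_018_400
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410

eff_len = effective_db_length(DB_LETTERS, N_SEQUENCES, HSP_LENGTH)
top_hit_bits = bit_score(611, GAPPED_LAMBDA, GAPPED_K)
top_hit_e = expect_value(239, SEARCH_SPACE)

print("effective length of database:", eff_len)
print("bit score for raw score 611: %.1f" % top_hit_bits)
print("E-value for 239 bits: %.1e" % top_hit_e)
```

The first result reproduces the "effective length of database" line exactly. The recomputed bit score and E-value fall close to the printed 239 bits and 5e-63 for the CATL_DROME hit, with the small differences attributable to rounding in the printed parameters.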