Planarian EST Database


Dr_sW_001_L07

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_001_L07
         (761 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q10991|CATL_SHEEP  Cathepsin L [Contains: Cathepsin L hea...   302   6e-82
sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   301   1e-81
sp|P25975|CATL_BOVIN  Cathepsin L precursor [Contains: Cathe...   298   8e-81
sp|P06797|CATL_MOUSE  Cathepsin L precursor (Major excreted ...   297   2e-80
sp|P07154|CATL_RAT  Cathepsin L precursor (Major excreted pr...   296   4e-80
sp|Q26636|CATL_SARPE  Cathepsin L precursor [Contains: Cathe...   295   7e-80
sp|P07711|CATL_HUMAN  Cathepsin L precursor (Major excreted ...   295   9e-80
sp|Q9GL24|CATL_CANFA  Cathepsin L precursor [Contains: Cathe...   295   9e-80
sp|Q28944|CATL_PIG  Cathepsin L precursor [Contains: Catheps...   290   2e-78
sp|Q95029|CATL_DROME  Cathepsin L precursor (Cysteine protei...   290   3e-78
>sp|Q10991|CATL_SHEEP Cathepsin L [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]
          Length = 217

 Score =  302 bits (774), Expect = 6e-82
 Identities = 152/217 (70%), Positives = 166/217 (76%), Gaps = 2/217 (0%)
 Frame = +1

Query: 76  PKSVDWRKKGYVTPVKNQGQCGSCWSFSTTGALEGQHFRKRKQLVSLSEQQLVDCSKDYQ 255
           PKSVDW KKGYVTPVKNQGQCGSCW+FS TGALEGQ FRK  +LVSLSEQ LVD S+   
Sbjct: 2   PKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQG 61

Query: 256 NNGCNGGLMDNAFQYIQKY-GLESEADYPYTAMDGPCKYDSTKVVAHCTGFVDIKKGNEK 432
           N GCNGGLMDNAFQYI++  GL+SE  YPY A D  C Y      A  TGFVDI +  EK
Sbjct: 62  NQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDTGFVDIPQ-REK 120

Query: 433 DLTKAVATVGPISVAIDASRPSFQLYKGGIYNEVNCSSNNLDHGVLAVGYGAEGKNH-FW 609
            L KAVATVGPISVAIDA   SFQ YK GIY + +CSS +LDHGVL VGYG EG N+ FW
Sbjct: 121 ALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTNNKFW 180

Query: 610 IVKNSWGPTWGISGYIKMSKDKKNQCGIATMASYPTV 720
           IVKNSWGP WG  GY+KM+KD+ N CGIAT ASYPTV
Sbjct: 181 IVKNSWGPEWGNKGYVKMAKDQNNHCGIATAASYPTV 217
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score =  301 bits (772), Expect = 1e-81
 Identities = 150/231 (64%), Positives = 170/231 (73%), Gaps = 5/231 (2%)
 Frame = +1

Query: 43  IFMPPENMDNYPKSVDWRKKGYVTPVKNQGQCGSCWSFSTTGALEGQHFRKRKQLVSLSE 222
           +F  P  +D  PKSVDWRKKGYVTPVKNQ QCGSCW+FS TGALEGQ FRK  +LVSLSE
Sbjct: 105 VFREPLFLD-LPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSE 163

Query: 223 QQLVDCSKDYQNNGCNGGLMDNAFQYIQKY-GLESEADYPYTAMDGPCKYDSTKVVAHCT 399
           Q LVDCS+   N GCNGG M  AFQY+++  GL+SE  YPY A+D  CKY     VA+ T
Sbjct: 164 QNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDT 223

Query: 400 GFVDIKKGNEKDLTKAVATVGPISVAIDASRPSFQLYKGGIYNEVNCSSNNLDHGVLAVG 579
           GF  +  G EK L KAVATVGPISVA+DA   SFQ YK GIY E +CSS NLDHGVL VG
Sbjct: 224 GFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVG 283

Query: 580 YGAEGKN----HFWIVKNSWGPTWGISGYIKMSKDKKNQCGIATMASYPTV 720
           YG EG N     +W+VKNSWGP WG +GY+K++KDK N CGIAT ASYP V
Sbjct: 284 YGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  298 bits (764), Expect = 8e-81
 Identities = 152/221 (68%), Positives = 166/221 (75%), Gaps = 6/221 (2%)
 Frame = +1

Query: 76  PKSVDWRKKGYVTPVKNQGQCGSCWSFSTTGALEGQHFRKRKQLVSLSEQQLVDCSKDYQ 255
           PKSVDW KKGYVTPVKNQGQCGSCW+FS TGALEGQ FRK  +LVSLSEQ LVDCS+   
Sbjct: 115 PKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQG 174

Query: 256 NNGCNGGLMDNAFQYIQ-KYGLESEADYPYTAMD-GPCKYDSTKVVAHCTGFVDIKKGNE 429
           N GCNGGLMDNAFQYI+   GL+SE  YPY A D   C Y      A+ TGFVDI +  E
Sbjct: 175 NQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQ-RE 233

Query: 430 KDLTKAVATVGPISVAIDASRPSFQLYKGGIYNEVNCSSNNLDHGVLAVGYGAEG----K 597
           K L KAVATVGPISVAIDA   SFQ YK GIY + +CS  +LDHGVL VGYG EG     
Sbjct: 234 KALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSCKDLDHGVLVVGYGFEGTDSNN 293

Query: 598 NHFWIVKNSWGPTWGISGYIKMSKDKKNQCGIATMASYPTV 720
           N FWIVKNSWGP WG +GY+KM+KD+ N CGIAT ASYPTV
Sbjct: 294 NKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
           proteinase) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  297 bits (761), Expect = 2e-80
 Identities = 148/220 (67%), Positives = 167/220 (75%), Gaps = 5/220 (2%)
 Frame = +1

Query: 76  PKSVDWRKKGYVTPVKNQGQCGSCWSFSTTGALEGQHFRKRKQLVSLSEQQLVDCSKDYQ 255
           PKSVDWR+KG VTPVKNQGQCGSCW+FS +G LEGQ F K  +L+SLSEQ LVDCS    
Sbjct: 115 PKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQG 174

Query: 256 NNGCNGGLMDNAFQYIQKY-GLESEADYPYTAMDGPCKYDSTKVVAHCTGFVDIKKGNEK 432
           N GCNGGLMD AFQYI++  GL+SE  YPY A DG CKY +   VA+ TGFVDI +  EK
Sbjct: 175 NQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQ-QEK 233

Query: 433 DLTKAVATVGPISVAIDASRPSFQLYKGGIYNEVNCSSNNLDHGVLAVGYGAEG----KN 600
            L KAVATVGPISVA+DAS PS Q Y  GIY E NCSS NLDHGVL VGYG EG    KN
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKN 293

Query: 601 HFWIVKNSWGPTWGISGYIKMSKDKKNQCGIATMASYPTV 720
            +W+VKNSWG  WG+ GYIK++KD+ N CG+AT ASYP V
Sbjct: 294 KYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYPVV 333
>sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted protein) (MEP) (Cyclic
           protein 2) (CP-2) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  296 bits (758), Expect = 4e-80
 Identities = 146/220 (66%), Positives = 168/220 (76%), Gaps = 5/220 (2%)
 Frame = +1

Query: 76  PKSVDWRKKGYVTPVKNQGQCGSCWSFSTTGALEGQHFRKRKQLVSLSEQQLVDCSKDYQ 255
           PK+VDWR+KG VTPVKNQGQCGSCW+FS +G LEGQ F K  +L+SLSEQ LVDCS D  
Sbjct: 115 PKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQG 174

Query: 256 NNGCNGGLMDNAFQYIQKY-GLESEADYPYTAMDGPCKYDSTKVVAHCTGFVDIKKGNEK 432
           N GCNGGLMD AFQYI++  GL+SE  YPY A DG CKY +   VA+ TGFVDI +  EK
Sbjct: 175 NQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDIPQ-QEK 233

Query: 433 DLTKAVATVGPISVAIDASRPSFQLYKGGIYNEVNCSSNNLDHGVLAVGYGAEG----KN 600
            L KAVATVGPISVA+DAS PS Q Y  GIY E NCSS +LDHGVL VGYG EG    K+
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKD 293

Query: 601 HFWIVKNSWGPTWGISGYIKMSKDKKNQCGIATMASYPTV 720
            +W+VKNSWG  WG+ GYIK++KD+ N CG+AT ASYP V
Sbjct: 294 KYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYPIV 333
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 339

 Score =  295 bits (756), Expect = 7e-80
 Identities = 141/227 (62%), Positives = 172/227 (75%), Gaps = 2/227 (0%)
 Frame = +1

Query: 46  FMPPENMDNYPKSVDWRKKGYVTPVKNQGQCGSCWSFSTTGALEGQHFRKRKQLVSLSEQ 225
           ++PP ++   PKSVDWR+ G VT VK+QG CGSCW+FS+TGALEGQHFRK   LVSLSEQ
Sbjct: 114 YIPPAHV-TVPKSVDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQ 172

Query: 226 QLVDCSKDYQNNGCNGGLMDNAFQYIQ-KYGLESEADYPYTAMDGPCKYDSTKVVAHCTG 402
            LVDCS  Y NNGCNGGLMDNAF+YI+   G+++E  YPY  +D  C ++   + A  TG
Sbjct: 173 NLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDTG 232

Query: 403 FVDIKKGNEKDLTKAVATVGPISVAIDASRPSFQLYKGGIYNEVNCSSNNLDHGVLAVGY 582
           FVDI +G+E+ + KAVAT+GP+SVAIDAS  SFQLY  G+YNE  C   NLDHGVL VGY
Sbjct: 233 FVDIPEGDEEKMKKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDEQNLDHGVLVVGY 292

Query: 583 GA-EGKNHFWIVKNSWGPTWGISGYIKMSKDKKNQCGIATMASYPTV 720
           G  E    +W+VKNSWG TWG  GYIKM++++ NQCGIAT +SYPTV
Sbjct: 293 GTDESGMDYWLVKNSWGTTWGEQGYIKMARNQNNQCGIATASSYPTV 339
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
           Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 333

 Score =  295 bits (755), Expect = 9e-80
 Identities = 146/220 (66%), Positives = 167/220 (75%), Gaps = 5/220 (2%)
 Frame = +1

Query: 76  PKSVDWRKKGYVTPVKNQGQCGSCWSFSTTGALEGQHFRKRKQLVSLSEQQLVDCSKDYQ 255
           P+SVDWR+KGYVTPVKNQGQCGSCW+FS TGALEGQ FRK  +L+SLSEQ LVDCS    
Sbjct: 115 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 174

Query: 256 NNGCNGGLMDNAFQYIQ-KYGLESEADYPYTAMDGPCKYDSTKVVAHCTGFVDIKKGNEK 432
           N GCNGGLMD AFQY+Q   GL+SE  YPY A +  CKY+    VA+ TGFVDI K  EK
Sbjct: 175 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEK 233

Query: 433 DLTKAVATVGPISVAIDASRPSFQLYKGGIYNEVNCSSNNLDHGVLAVGYGAEG----KN 600
            L KAVATVGPISVAIDA   SF  YK GIY E +CSS ++DHGVL VGYG E      N
Sbjct: 234 ALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNN 293

Query: 601 HFWIVKNSWGPTWGISGYIKMSKDKKNQCGIATMASYPTV 720
            +W+VKNSWG  WG+ GY+KM+KD++N CGIA+ ASYPTV
Sbjct: 294 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 333

 Score =  295 bits (755), Expect = 9e-80
 Identities = 148/220 (67%), Positives = 167/220 (75%), Gaps = 5/220 (2%)
 Frame = +1

Query: 76  PKSVDWRKKGYVTPVKNQGQCGSCWSFSTTGALEGQHFRKRKQLVSLSEQQLVDCSKDYQ 255
           PKSVDWR+KGYVTPVKNQGQCGSCW+FS TGALEGQ FRK  +LVSLSEQ LVDCS+   
Sbjct: 115 PKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQG 174

Query: 256 NNGCNGGLMDNAFQYIQ-KYGLESEADYPYTAMD-GPCKYDSTKVVAHCTGFVDIKKGNE 429
           N GCNGGLMDNAF+Y++   GL+SE  YPY   D   C Y      A+ TGFVD+ +  E
Sbjct: 175 NEGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLPQ-RE 233

Query: 430 KDLTKAVATVGPISVAIDASRPSFQLYKGGIYNEVNCSSNNLDHGVLAVGYGAEG---KN 600
           K L KAVAT+GPISVAIDA   SFQ YK GIY + +CSS +LDHGVL VGYG EG    N
Sbjct: 234 KALMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNN 293

Query: 601 HFWIVKNSWGPTWGISGYIKMSKDKKNQCGIATMASYPTV 720
            FWIVKNSWGP WG +GY+KM+KD+ N CGIAT ASYPTV
Sbjct: 294 KFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 333
>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  290 bits (743), Expect = 2e-78
 Identities = 146/221 (66%), Positives = 166/221 (75%), Gaps = 6/221 (2%)
 Frame = +1

Query: 76  PKSVDWRKKGYVTPVKNQGQCGSCWSFSTTGALEGQHFRKRKQLVSLSEQQLVDCSKDYQ 255
           PKSVDWR+KGYVT VKNQGQCGSCW+FS TGALEGQ FRK  +LVSLSEQ LVDCS+   
Sbjct: 115 PKSVDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQG 174

Query: 256 NNGCNGGLMDNAFQYIQ-KYGLESEADYPYTAMD-GPCKYDSTKVVAHCTGFVDIKKGNE 429
           N GCNGGLMDNAFQY++   GL++E  YPY   +   C Y      A+ TGFVDI +  E
Sbjct: 175 NQGCNGGLMDNAFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFVDIPQ-RE 233

Query: 430 KDLTKAVATVGPISVAIDASRPSFQLYKGGIYNEVNCSSNNLDHGVLAVGYGAEG----K 597
           K L KAVATVGPISVAIDA   SFQ YK GIY + +CSS +LDHGVL VGYG EG     
Sbjct: 234 KALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNS 293

Query: 598 NHFWIVKNSWGPTWGISGYIKMSKDKKNQCGIATMASYPTV 720
           + FWIVKNSWGP WG +GY+KM+KD+ N CGI+T ASYPTV
Sbjct: 294 SKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGISTAASYPTV 334
>sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine proteinase 1) [Contains: Cathepsin
           L heavy chain; Cathepsin L light chain]
          Length = 341

 Score =  290 bits (742), Expect = 3e-78
 Identities = 140/239 (58%), Positives = 174/239 (72%), Gaps = 2/239 (0%)
 Frame = +1

Query: 10  LKPINRNYSRTIFMPPENMDNYPKSVDWRKKGYVTPVKNQGQCGSCWSFSTTGALEGQHF 189
           L+  + ++    F+ P ++   PKSVDWR KG VT VK+QG CGSCW+FS+TGALEGQHF
Sbjct: 104 LRAADESFKGVTFISPAHV-TLPKSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHF 162

Query: 190 RKRKQLVSLSEQQLVDCSKDYQNNGCNGGLMDNAFQYIQ-KYGLESEADYPYTAMDGPCK 366
           RK   LVSLSEQ LVDCS  Y NNGCNGGLMDNAF+YI+   G+++E  YPY A+D  C 
Sbjct: 163 RKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCH 222

Query: 367 YDSTKVVAHCTGFVDIKKGNEKDLTKAVATVGPISVAIDASRPSFQLYKGGIYNEVNCSS 546
           ++   V A   GF DI +G+EK + +AVATVGP+SVAIDAS  SFQ Y  G+YNE  C +
Sbjct: 223 FNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDA 282

Query: 547 NNLDHGVLAVGYGA-EGKNHFWIVKNSWGPTWGISGYIKMSKDKKNQCGIATMASYPTV 720
            NLDHGVL VG+G  E    +W+VKNSWG TWG  G+IKM ++K+NQCGIA+ +SYP V
Sbjct: 283 QNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 341
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 89,434,763
Number of Sequences: 369166
Number of extensions: 1868492
Number of successful extensions: 5665
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4928
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5108
length of database: 68,354,980
effective HSP length: 108
effective length of database: 48,403,600
effective search space used: 7018522000
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
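
The statistics above can be cross-checked with a little arithmetic. The sketch below is not part of the BLAST output. It applies the standard Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2, E = search space * 2^-S') to the values printed in this report. Two things are assumptions about how this BLAST version arrived at its figures: that the 761-letter nucleotide query counts as 761 // 3 residues for a translated search, and that the effective HSP length is subtracted once from the query and once per database sequence. The function and constant names are illustrative only.

```python
import math

# Values copied from the report above.
QUERY_NT = 761             # query length in nucleotides
DB_LETTERS = 68_354_980    # "length of database"
DB_SEQS = 184_735          # sequences in the SwissProt database
HSP_LEN = 108              # "effective HSP length"
UNGAPPED = (0.318, 0.134)  # (Lambda, K), ungapped
GAPPED = (0.267, 0.0410)   # (Lambda, K), gapped


def bit_score(raw, lam, k):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def effective_search_space(query_nt, db_letters, db_seqs, hsp_len):
    """Effective query length times effective database length.

    Assumption: the translated query is query_nt // 3 residues long, and
    hsp_len is removed from the query and from every database sequence.
    """
    eff_query = query_nt // 3 - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query * eff_db


def expect(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    space = effective_search_space(QUERY_NT, DB_LETTERS, DB_SEQS, HSP_LEN)
    top_bits = bit_score(774, *GAPPED)  # raw score of the CATL_SHEEP hit
    print("effective search space:", space)
    print("S1 in bits:", round(bit_score(41, *UNGAPPED), 1))
    print("top hit bits:", top_bits)
    print("top hit E-value (approx.):", expect(top_bits, space))
```

Under these assumptions the effective database length comes out to 48,403,600 and the search space to 7,018,522,000, both matching the report, and the S1 cutoff converts to 21.7 bits. The top hit's bit score and E-value agree with the reported 302 bits and 6e-82 only approximately, because the printed Lambda and K are rounded.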