Planarian EST Database


Dr_sW_002_F01

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_002_F01
         (709 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P07711|CATL_HUMAN  Cathepsin L precursor (Major excreted ...   268   1e-71
sp|Q10991|CATL_SHEEP  Cathepsin L [Contains: Cathepsin L hea...   268   1e-71
sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   265   7e-71
sp|Q9GL24|CATL_CANFA  Cathepsin L precursor [Contains: Cathe...   265   1e-70
sp|P25975|CATL_BOVIN  Cathepsin L precursor [Contains: Cathe...   263   3e-70
sp|Q95029|CATL_DROME  Cathepsin L precursor (Cysteine protei...   262   8e-70
sp|P25782|CYSP2_HOMAM  Digestive cysteine proteinase 2 precu...   261   1e-69
sp|Q26636|CATL_SARPE  Cathepsin L precursor [Contains: Cathe...   259   5e-69
sp|Q28944|CATL_PIG  Cathepsin L precursor [Contains: Catheps...   255   7e-68
sp|P25784|CYSP3_HOMAM  Digestive cysteine proteinase 3 precu...   255   9e-68

>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
           Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 333

 Score =  268 bits (684), Expect = 1e-71
 Identities = 133/218 (61%), Positives = 153/218 (70%), Gaps = 5/218 (2%)
 Frame = +2

Query: 2   SVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXX 181
           SVDWR+KGYVTPVKNQ QCGSCW+FSATG+LEGQ FRK  RLIS SEQ LVDCS      
Sbjct: 117 SVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNE 176

Query: 182 XXXXXLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDL 358
                LMD AF+Y++D  G++SE  YPY AT+ +CK NP   V   TGF DI  Q E  L
Sbjct: 177 GCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQ-EKAL 235

Query: 359 ANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQI----GKKY 526
             AVATVGP+SVAIDAGH SF  YK GIY E  CS+  +DHGVL VGYG +       KY
Sbjct: 236 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKY 295

Query: 527 WIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 640
           W+VKNSW   WG  GY+KM+KD++N CGIA+ ASYP V
Sbjct: 296 WLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333

>sp|Q10991|CATL_SHEEP Cathepsin L [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]
          Length = 217

 Score =  268 bits (684), Expect = 1e-71
 Identities = 133/215 (61%), Positives = 152/215 (70%), Gaps = 2/215 (0%)
 Frame = +2

Query: 2   SVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXX 181
           SVDW +KGYVTPVKNQ QCGSCW+FSATG+LEGQ FRK  +L+S SEQ LVD S      
Sbjct: 4   SVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQGNQ 63

Query: 182 XXXXXLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDL 358
                LMDNAF+YIK+  G++SE  YPY ATD +C   P     K TGF DI  Q E  L
Sbjct: 64  GCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDTGFVDI-PQREKAL 122

Query: 359 ANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQ-IGKKYWIV 535
             AVATVGP+SVAIDAGH+SFQ YKSGIY +  CS+  LDHGVL VGYG +    K+WIV
Sbjct: 123 MKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTNNKFWIV 182

Query: 536 KNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 640
           KNSW   WG  GY+KM+KD+ N CGIAT ASYP V
Sbjct: 183 KNSWGPEWGNKGYVKMAKDQNNHCGIATAASYPTV 217

>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score =  265 bits (678), Expect = 7e-71
 Identities = 129/218 (59%), Positives = 150/218 (68%), Gaps = 5/218 (2%)
 Frame = +2

Query: 2   SVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXX 181
           SVDWR+KGYVTPVKNQ+QCGSCW+FSATG+LEGQ FRK  +L+S SEQ LVDCS      
Sbjct: 117 SVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQ 176

Query: 182 XXXXXLMDNAFRYIKDQG-IESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDL 358
                 M  AF+Y+K+ G ++SE  YPY A D  CK  P   V   TGFT +    E  L
Sbjct: 177 GCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKAL 236

Query: 359 ANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYG----TQIGKKY 526
             AVATVGP+SVA+DAGH+SFQ YKSGIY E  CS+  LDHGVL VGYG         KY
Sbjct: 237 MKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKY 296

Query: 527 WIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 640
           W+VKNSW   WG +GY+K++KDK N CGIAT ASYP V
Sbjct: 297 WLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334

>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 333

 Score =  265 bits (676), Expect = 1e-70
 Identities = 133/218 (61%), Positives = 151/218 (69%), Gaps = 5/218 (2%)
 Frame = +2

Query: 2   SVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXX 181
           SVDWR+KGYVTPVKNQ QCGSCW+FSATG+LEGQ FRK  +L+S SEQ LVDCS      
Sbjct: 117 SVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNE 176

Query: 182 XXXXXLMDNAFRYIKDQ-GIESEGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETD 355
                LMDNAFRY+KD  G++SE  YPY   D  TC   P       TGF D+  Q E  
Sbjct: 177 GCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDL-PQREKA 235

Query: 356 LANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGY---GTQIGKKY 526
           L  AVAT+GP+SVAIDAGH SFQ YKSGIY +  CS+  LDHGVL VGY   GT    K+
Sbjct: 236 LMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKF 295

Query: 527 WIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 640
           WIVKNSW   WG +GY+KM+KD+ N CGIAT ASYP V
Sbjct: 296 WIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 333

>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  263 bits (672), Expect = 3e-70
 Identities = 134/219 (61%), Positives = 151/219 (68%), Gaps = 6/219 (2%)
 Frame = +2

Query: 2   SVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXX 181
           SVDW +KGYVTPVKNQ QCGSCW+FSATG+LEGQ FRK  +L+S SEQ LVDCS      
Sbjct: 117 SVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQ 176

Query: 182 XXXXXLMDNAFRYIKDQ-GIESEGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETD 355
                LMDNAF+YIKD  G++SE  YPY ATD  +C   P       TGF DI  Q E  
Sbjct: 177 GCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDI-PQREKA 235

Query: 356 LANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQ----IGKK 523
           L  AVATVGP+SVAIDAGH SFQ YKSGIY +  CS   LDHGVL VGYG +       K
Sbjct: 236 LMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSCKDLDHGVLVVGYGFEGTDSNNNK 295

Query: 524 YWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 640
           +WIVKNSW   WG +GY+KM+KD+ N CGIAT ASYP V
Sbjct: 296 FWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334

>sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine proteinase 1) [Contains: Cathepsin
           L heavy chain; Cathepsin L light chain]
          Length = 341

 Score =  262 bits (669), Expect = 8e-70
 Identities = 127/215 (59%), Positives = 152/215 (70%), Gaps = 2/215 (0%)
 Frame = +2

Query: 2   SVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXX 181
           SVDWR KG VT VK+Q  CGSCW+FS+TG+LEGQ+FRK+  L+S SEQ LVDCS      
Sbjct: 127 SVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNN 186

Query: 182 XXXXXLMDNAFRYIKDQG-IESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDL 358
                LMDNAFRYIKD G I++E  YPY A D +C  N   +     GFTDI   +E  +
Sbjct: 187 GCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKM 246

Query: 359 ANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGT-QIGKKYWIV 535
           A AVATVGPVSVAIDA H SFQ Y  G+YNE  C    LDHGVL VG+GT + G+ YW+V
Sbjct: 247 AEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLV 306

Query: 536 KNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 640
           KNSW  TWG+ G+IKM ++K+NQCGIA+ +SYPLV
Sbjct: 307 KNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 341

>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precursor
          Length = 323

 Score =  261 bits (667), Expect = 1e-69
 Identities = 122/213 (57%), Positives = 156/213 (73%), Gaps = 1/213 (0%)
 Frame = +2

Query: 5   VDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXX 184
           VDWR KG VTPVK+Q QCGSCW+FS TGSLEGQ+F K   LIS +EQQLVDCS       
Sbjct: 111 VDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQG 170

Query: 185 XXXXLMDNAFRYIK-DQGIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLA 361
                M++AF YIK + GI++E  YPY A DG+C+ + + +   C+G T+I S +ET L 
Sbjct: 171 CNGGWMNDAFDYIKANNGIDTEAAYPYEARDGSCRFDSNSVAATCSGHTNIASGSETGLQ 230

Query: 362 NAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQIGKKYWIVKN 541
            AV  +GP+SV IDA H+SFQ Y SG+Y E SCS + LDH VLAVGYG++ G+ +W+VKN
Sbjct: 231 QAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPSYLDHAVLAVGYGSEGGQDFWLVKN 290

Query: 542 SWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 640
           SW  +WG++GYIKMS+++ N CGIAT+ASYPLV
Sbjct: 291 SWATSWGDAGYIKMSRNRNNNCGIATVASYPLV 323

>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 339

 Score =  259 bits (662), Expect = 5e-69
 Identities = 127/215 (59%), Positives = 149/215 (69%), Gaps = 2/215 (0%)
 Frame = +2

Query: 2   SVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXX 181
           SVDWR+ G VT VK+Q  CGSCW+FS+TG+LEGQ+FRK   L+S SEQ LVDCS      
Sbjct: 125 SVDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNN 184

Query: 182 XXXXXLMDNAFRYIKDQG-IESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDL 358
                LMDNAFRYIKD G I++E  YPY   D +C  N + I    TGF DI   +E  +
Sbjct: 185 GCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKM 244

Query: 359 ANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGT-QIGKKYWIV 535
             AVAT+GPVSVAIDA H SFQLY  G+YNE  C    LDHGVL VGYGT + G  YW+V
Sbjct: 245 KKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLV 304

Query: 536 KNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 640
           KNSW  TWGE GYIKM++++ NQCGIAT +SYP V
Sbjct: 305 KNSWGTTWGEQGYIKMARNQNNQCGIATASSYPTV 339

>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  255 bits (652), Expect = 7e-68
 Identities = 128/219 (58%), Positives = 151/219 (68%), Gaps = 6/219 (2%)
 Frame = +2

Query: 2   SVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXX 181
           SVDWR+KGYVT VKNQ QCGSCW+FSATG+LEGQ FRK  +L+S SEQ LVDCS      
Sbjct: 117 SVDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQ 176

Query: 182 XXXXXLMDNAFRYIKDQ-GIESEGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQNETD 355
                LMDNAF+Y+KD  G+++E  YPY   +  +C   P       TGF DI  Q E  
Sbjct: 177 GCNGGLMDNAFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFVDI-PQREKA 235

Query: 356 LANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQ----IGKK 523
           L  AVATVGP+SVAIDAGH+SFQ YKSGIY +  CS+  LDHGVL VGYG +       K
Sbjct: 236 LMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSK 295

Query: 524 YWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 640
           +WIVKNSW   WG +GY+KM+KD+ N CGI+T ASYP V
Sbjct: 296 FWIVKNSWGPEWGWNGYVKMAKDQNNHCGISTAASYPTV 334

>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precursor
          Length = 321

 Score =  255 bits (651), Expect = 9e-68
 Identities = 123/213 (57%), Positives = 153/213 (71%), Gaps = 1/213 (0%)
 Frame = +2

Query: 5   VDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXXXXX 184
           VDWR K  VTPVK+Q+QCGSCW+FSATG+LEGQ+F KN+ L+S SEQQLVDCS       
Sbjct: 110 VDWRTKALVTPVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDG 169

Query: 185 XXXXLMDNAFRYIKDQG-IESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNETDLA 361
                M +AF YIKD G I++E  YPY A D +C+ + + I   CTG  ++Q   E  L 
Sbjct: 170 CGGGWMTSAFDYIKDNGGIDTESSYPYEAEDRSCRFDANSIGAICTGSVEVQHTEEA-LQ 228

Query: 362 NAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQIGKKYWIVKN 541
            AV+ VGP+SVAIDA H SFQ Y SG+Y E++CS T LDHGVLAVGYGT+  K YW+VKN
Sbjct: 229 EAVSGVGPISVAIDASHFSFQFYSSGVYYEQNCSPTFLDHGVLAVGYGTESTKDYWLVKN 288

Query: 542 SWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 640
           SW  +WG++GYIKMS+++ N CGIA+  SYP V
Sbjct: 289 SWGSSWGDAGYIKMSRNRDNNCGIASEPSYPTV 321

  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 77,027,040
Number of Sequences: 369166
Number of extensions: 1468601
Number of successful extensions: 4737
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4122
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4266
length of database: 68,354,980
effective HSP length: 107
effective length of database: 48,588,335
effective search space used: 6219306880
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
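
Note on the statistics above: the bit scores and E-values in this report can be approximately re-derived from the gapped Lambda and K values and the effective search space, using the standard Karlin-Altschul relations S' = (Lambda*S - ln K) / ln 2 and E = (search space) * 2^(-S'). The sketch below is illustrative only; the function and constant names are hypothetical and not part of BLAST. Because the report prints Lambda and K to only three significant digits, the results agree with the reported values only approximately.

```python
import math

# Values copied from the report footer (gapped statistics).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 6219306880


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score: (lam*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


# Top hit (sp|P07711|CATL_HUMAN): raw score 684, reported as 268 bits, E = 1e-71.
top_bits = bit_score(684, GAPPED_LAMBDA, GAPPED_K)
top_evalue = expect_value(top_bits, EFFECTIVE_SEARCH_SPACE)
print(f"bits = {top_bits:.1f}, E = {top_evalue:.1e}")
```

The same two functions can be applied to the raw scores in parentheses for the other hits (for example 678 or 651); small differences from the printed E-values are expected from rounding.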