Planarian EST Database


Dr_sW_005_G16

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_005_G16
         (793 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q26636|CATL_SARPE  Cathepsin L precursor [Contains: Cathe...   272   7e-73
sp|Q95029|CATL_DROME  Cathepsin L precursor (Cysteine protei...   271   1e-72
sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   270   3e-72
sp|Q10991|CATL_SHEEP  Cathepsin L [Contains: Cathepsin L hea...   270   3e-72
sp|P07711|CATL_HUMAN  Cathepsin L precursor (Major excreted ...   270   3e-72
sp|Q9GL24|CATL_CANFA  Cathepsin L precursor [Contains: Cathe...   268   2e-71
sp|P25975|CATL_BOVIN  Cathepsin L precursor [Contains: Cathe...   266   6e-71
sp|P25782|CYSP2_HOMAM  Digestive cysteine proteinase 2 precu...   263   4e-70
sp|P25784|CYSP3_HOMAM  Digestive cysteine proteinase 3 precu...   258   1e-68
sp|Q28944|CATL_PIG  Cathepsin L precursor [Contains: Catheps...   258   1e-68
>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 339

 Score =  272 bits (696), Expect = 7e-73
 Identities = 135/239 (56%), Positives = 164/239 (68%), Gaps = 2/239 (0%)
 Frame = +2

Query: 14  IMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYF 193
           +M+ +  L G+TY+ P ++ V P SVDWR+ G VT VK+Q  CGSCW+FS+TG+LEGQ+F
Sbjct: 102 LMRERTGLVGATYIPPAHVTV-PKSVDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQHF 160

Query: 194 RKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQG-IESEGDYPYTATDGTCK 370
           RK   L+S SEQ LVDCS           LMDNAFRYIKD G I++E  YPY   D +C 
Sbjct: 161 RKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCH 220

Query: 371 RNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCST 550
            N + I    TGF DI   +E  +  AVAT+GPVSVAIDA H SFQLY  G+YNE  C  
Sbjct: 221 FNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDE 280

Query: 551 TQLDHGVLAVGYGT-QIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 724
             LDHGVL VGYGT + G  YW+VKNSW  TWGE GYIKM++++ NQCGIAT +SYP V
Sbjct: 281 QNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMARNQNNQCGIATASSYPTV 339
>sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine proteinase 1) [Contains: Cathepsin
           L heavy chain; Cathepsin L light chain]
          Length = 341

 Score =  271 bits (693), Expect = 1e-72
 Identities = 132/233 (56%), Positives = 164/233 (70%), Gaps = 2/233 (0%)
 Frame = +2

Query: 32  TLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRL 211
           + +G T+++P ++  LP SVDWR KG VT VK+Q  CGSCW+FS+TG+LEGQ+FRK+  L
Sbjct: 110 SFKGVTFISPAHV-TLPKSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVL 168

Query: 212 ISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKI 388
           +S SEQ LVDCS           LMDNAFRYIKD  GI++E  YPY A D +C  N   +
Sbjct: 169 VSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTV 228

Query: 389 VTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHG 568
                GFTDI   +E  +A AVATVGPVSVAIDA H SFQ Y  G+YNE  C    LDHG
Sbjct: 229 GATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHG 288

Query: 569 VLAVGYGT-QIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 724
           VL VG+GT + G+ YW+VKNSW  TWG+ G+IKM ++K+NQCGIA+ +SYPLV
Sbjct: 289 VLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPLV 341
>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score =  270 bits (691), Expect = 3e-72
 Identities = 134/244 (54%), Positives = 161/244 (65%), Gaps = 5/244 (2%)
 Frame = +2

Query: 8   LGIMKTKPTLEGSTYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQ 187
           +G  + +   +G  +  P  +  LP SVDWR+KGYVTPVKNQ+QCGSCW+FSATG+LEGQ
Sbjct: 92  MGCFRNQKFRKGKVFREPLFLD-LPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQ 150

Query: 188 YFRKNNRLISFSEQQLVDCSXXXXXXXXXXXLMDNAFRYIKDQ-GIESEGDYPYTATDGT 364
            FRK  +L+S SEQ LVDCS            M  AF+Y+K+  G++SE  YPY A D  
Sbjct: 151 MFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEI 210

Query: 365 CKRNPSKIVTKCTGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESC 544
           CK  P   V   TGFT +    E  L  AVATVGP+SVA+DAGH+SFQ YKSGIY E  C
Sbjct: 211 CKYRPENSVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDC 270

Query: 545 STTQLDHGVLAVGYG----TQIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMAS 712
           S+  LDHGVL VGYG         KYW+VKNSW   WG +GY+K++KDK N CGIAT AS
Sbjct: 271 SSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAAS 330

Query: 713 YPLV 724
           YP V
Sbjct: 331 YPNV 334
>sp|Q10991|CATL_SHEEP Cathepsin L [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]
          Length = 217

 Score =  270 bits (691), Expect = 3e-72
 Identities = 134/218 (61%), Positives = 154/218 (70%), Gaps = 2/218 (0%)
 Frame = +2

Query: 77  LPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXX 256
           +P SVDW +KGYVTPVKNQ QCGSCW+FSATG+LEGQ FRK  +L+S SEQ LVD S   
Sbjct: 1   VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQ 60

Query: 257 XXXXXXXXLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNE 433
                   LMDNAF+YIK+  G++SE  YPY ATD +C   P     K TGF DI  Q E
Sbjct: 61  GNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDTGFVDI-PQRE 119

Query: 434 TDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQ-IGKKY 610
             L  AVATVGP+SVAIDAGH+SFQ YKSGIY +  CS+  LDHGVL VGYG +    K+
Sbjct: 120 KALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTNNKF 179

Query: 611 WIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 724
           WIVKNSW   WG  GY+KM+KD+ N CGIAT ASYP V
Sbjct: 180 WIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAASYPTV 217
>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
           Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 333

 Score =  270 bits (690), Expect = 3e-72
 Identities = 134/220 (60%), Positives = 154/220 (70%), Gaps = 5/220 (2%)
 Frame = +2

Query: 80  PASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXXX 259
           P SVDWR+KGYVTPVKNQ QCGSCW+FSATG+LEGQ FRK  RLIS SEQ LVDCS    
Sbjct: 115 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 174

Query: 260 XXXXXXXLMDNAFRYIKDQ-GIESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQNET 436
                  LMD AF+Y++D  G++SE  YPY AT+ +CK NP   V   TGF DI  Q E 
Sbjct: 175 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQ-EK 233

Query: 437 DLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQI----GK 604
            L  AVATVGP+SVAIDAGH SF  YK GIY E  CS+  +DHGVL VGYG +       
Sbjct: 234 ALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNN 293

Query: 605 KYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 724
           KYW+VKNSW   WG  GY+KM+KD++N CGIA+ ASYP V
Sbjct: 294 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333
>sp|Q9GL24|CATL_CANFA Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 333

 Score =  268 bits (684), Expect = 2e-71
 Identities = 134/221 (60%), Positives = 153/221 (69%), Gaps = 5/221 (2%)
 Frame = +2

Query: 77  LPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXX 256
           +P SVDWR+KGYVTPVKNQ QCGSCW+FSATG+LEGQ FRK  +L+S SEQ LVDCS   
Sbjct: 114 IPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQ 173

Query: 257 XXXXXXXXLMDNAFRYIKDQ-GIESEGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQN 430
                   LMDNAFRY+KD  G++SE  YPY   D  TC   P       TGF D+  Q 
Sbjct: 174 GNEGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDL-PQR 232

Query: 431 ETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGY---GTQIG 601
           E  L  AVAT+GP+SVAIDAGH SFQ YKSGIY +  CS+  LDHGVL VGY   GT   
Sbjct: 233 EKALMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSN 292

Query: 602 KKYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 724
            K+WIVKNSW   WG +GY+KM+KD+ N CGIAT ASYP V
Sbjct: 293 NKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 333
>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  266 bits (679), Expect = 6e-71
 Identities = 135/222 (60%), Positives = 153/222 (68%), Gaps = 6/222 (2%)
 Frame = +2

Query: 77  LPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXX 256
           +P SVDW +KGYVTPVKNQ QCGSCW+FSATG+LEGQ FRK  +L+S SEQ LVDCS   
Sbjct: 114 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQ 173

Query: 257 XXXXXXXXLMDNAFRYIKDQ-GIESEGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQN 430
                   LMDNAF+YIKD  G++SE  YPY ATD  +C   P       TGF DI  Q 
Sbjct: 174 GNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDI-PQR 232

Query: 431 ETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQ----I 598
           E  L  AVATVGP+SVAIDAGH SFQ YKSGIY +  CS   LDHGVL VGYG +     
Sbjct: 233 EKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSCKDLDHGVLVVGYGFEGTDSN 292

Query: 599 GKKYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 724
             K+WIVKNSW   WG +GY+KM+KD+ N CGIAT ASYP V
Sbjct: 293 NNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precursor
          Length = 323

 Score =  263 bits (672), Expect = 4e-70
 Identities = 124/228 (54%), Positives = 160/228 (70%), Gaps = 1/228 (0%)
 Frame = +2

Query: 44  STYMAPENIGVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFS 223
           S +   +  G     VDWR KG VTPVK+Q QCGSCW+FS TGSLEGQ+F K   LIS +
Sbjct: 96  SVFYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLA 155

Query: 224 EQQLVDCSXXXXXXXXXXXLMDNAFRYIK-DQGIESEGDYPYTATDGTCKRNPSKIVTKC 400
           EQQLVDCS            M++AF YIK + GI++E  YPY A DG+C+ + + +   C
Sbjct: 156 EQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPYEARDGSCRFDSNSVAATC 215

Query: 401 TGFTDIQSQNETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAV 580
           +G T+I S +ET L  AV  +GP+SV IDA H+SFQ Y SG+Y E SCS + LDH VLAV
Sbjct: 216 SGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPSYLDHAVLAV 275

Query: 581 GYGTQIGKKYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 724
           GYG++ G+ +W+VKNSW  +WG++GYIKMS+++ N CGIAT+ASYPLV
Sbjct: 276 GYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVASYPLV 323
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precursor
          Length = 321

 Score =  258 bits (660), Expect = 1e-68
 Identities = 125/219 (57%), Positives = 156/219 (71%), Gaps = 1/219 (0%)
 Frame = +2

Query: 71  GVLPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSX 250
           G + A VDWR K  VTPVK+Q+QCGSCW+FSATG+LEGQ+F KN+ L+S SEQQLVDCS 
Sbjct: 104 GPMAADVDWRTKALVTPVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDCST 163

Query: 251 XXXXXXXXXXLMDNAFRYIKDQG-IESEGDYPYTATDGTCKRNPSKIVTKCTGFTDIQSQ 427
                      M +AF YIKD G I++E  YPY A D +C+ + + I   CTG  ++Q  
Sbjct: 164 DYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEAEDRSCRFDANSIGAICTGSVEVQHT 223

Query: 428 NETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQIGKK 607
            E  L  AV+ VGP+SVAIDA H SFQ Y SG+Y E++CS T LDHGVLAVGYGT+  K 
Sbjct: 224 EEA-LQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQNCSPTFLDHGVLAVGYGTESTKD 282

Query: 608 YWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 724
           YW+VKNSW  +WG++GYIKMS+++ N CGIA+  SYP V
Sbjct: 283 YWLVKNSWGSSWGDAGYIKMSRNRDNNCGIASEPSYPTV 321
>sp|Q28944|CATL_PIG Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  258 bits (659), Expect = 1e-68
 Identities = 129/222 (58%), Positives = 153/222 (68%), Gaps = 6/222 (2%)
 Frame = +2

Query: 77  LPASVDWRQKGYVTPVKNQQQCGSCWSFSATGSLEGQYFRKNNRLISFSEQQLVDCSXXX 256
           +P SVDWR+KGYVT VKNQ QCGSCW+FSATG+LEGQ FRK  +L+S SEQ LVDCS   
Sbjct: 114 VPKSVDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQ 173

Query: 257 XXXXXXXXLMDNAFRYIKDQ-GIESEGDYPYTATD-GTCKRNPSKIVTKCTGFTDIQSQN 430
                   LMDNAF+Y+KD  G+++E  YPY   +  +C   P       TGF DI  Q 
Sbjct: 174 GNQGCNGGLMDNAFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFVDI-PQR 232

Query: 431 ETDLANAVATVGPVSVAIDAGHASFQLYKSGIYNEESCSTTQLDHGVLAVGYGTQ----I 598
           E  L  AVATVGP+SVAIDAGH+SFQ YKSGIY +  CS+  LDHGVL VGYG +     
Sbjct: 233 EKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSN 292

Query: 599 GKKYWIVKNSWDVTWGESGYIKMSKDKKNQCGIATMASYPLV 724
             K+WIVKNSW   WG +GY+KM+KD+ N CGI+T ASYP V
Sbjct: 293 SSKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGISTAASYPTV 334
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 88,946,723
Number of Sequences: 369166
Number of extensions: 1746369
Number of successful extensions: 5397
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4755
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4917
length of database: 68,354,980
effective HSP length: 109
effective length of database: 48,218,865
effective search space used: 7425705210
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
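
Note on the search statistics: the figures above are mutually consistent, and a short Python sketch can check them. It assumes the usual edge correction, in which the effective database length is the total number of letters minus the effective HSP length times the number of sequences, and the standard Karlin-Altschul relation E = (search space) x 2^(-bit score). The constant and function names are illustrative, and the effective query length is inferred from the printed numbers rather than reported by BLAST.

```python
# Values copied from the statistics block of this report.
DB_LETTERS = 68_354_980
DB_SEQUENCES = 184_735
EFF_HSP_LEN = 109
EFF_DB_LEN = 48_218_865
SEARCH_SPACE = 7_425_705_210


def effective_db_length(letters, sequences, hsp_len):
    """Subtract one effective HSP length per database sequence."""
    return letters - sequences * hsp_len


def expect_from_bits(bits, search_space):
    """Karlin-Altschul E-value from a bit score: E = m * n * 2**(-S')."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    # Should reproduce the printed "effective length of database".
    print(effective_db_length(DB_LETTERS, DB_SEQUENCES, EFF_HSP_LEN))
    # Effective query length implied by the printed search space.
    print(SEARCH_SPACE // EFF_DB_LEN)
    # Approximate E-value for the top hit (272 bits).
    print(expect_from_bits(272, SEARCH_SPACE))
```

Because the report rounds bit scores to whole numbers, an E-value recomputed from 272 bits matches the printed 7e-73 only in order of magnitude, not digit for digit.
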