Planarian EST Database


Dr_sW_024_P12

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_024_P12
         (728 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P07154|CATL_RAT  Cathepsin L precursor (Major excreted pr...   214   2e-55
sp|P06797|CATL_MOUSE  Cathepsin L precursor (Major excreted ...   210   4e-54
sp|Q95029|CATL_DROME  Cathepsin L precursor (Cysteine protei...   207   2e-53
sp|O60911|CATL2_HUMAN  Cathepsin L2 precursor (Cathepsin V) ...   206   7e-53
sp|Q26636|CATL_SARPE  Cathepsin L precursor [Contains: Cathe...   201   2e-51
sp|P25784|CYSP3_HOMAM  Digestive cysteine proteinase 3 precu...   197   2e-50
sp|P07711|CATL_HUMAN  Cathepsin L precursor (Major excreted ...   194   2e-49
sp|P25782|CYSP2_HOMAM  Digestive cysteine proteinase 2 precu...   194   2e-49
sp|Q10991|CATL_SHEEP  Cathepsin L [Contains: Cathepsin L hea...   193   3e-49
sp|P25975|CATL_BOVIN  Cathepsin L precursor [Contains: Cathe...   193   4e-49

>sp|P07154|CATL_RAT Cathepsin L precursor (Major excreted protein) (MEP) (Cyclic
           protein 2) (CP-2) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  214 bits (544), Expect = 2e-55
 Identities = 103/169 (60%), Positives = 127/169 (75%), Gaps = 3/169 (1%)
 Frame = +3

Query: 87  Q*LVDCSNDFGNQGCNGGLMDSAFQYIMQYG-LEKERDYPYTAQDGDCEYSKEKVVAHCQ 263
           Q LVDCS+D GNQGCNGGLMD AFQYI + G L+ E  YPY A+DG C+Y  E  VA+  
Sbjct: 164 QNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDT 223

Query: 264 GFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKAGVYDEENCSSTQLDHGVLAVG 443
           GF DI    E+ L + +ATVGPISV +DAS+PS QFY +G+Y E NCSS  LDHGVL VG
Sbjct: 224 GFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVG 282

Query: 444 YGND--EDSQQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYP 584
           YG +  + ++  YWLVKNSWGK WG++GYIK++KD++N CG+AT ASYP
Sbjct: 283 YGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLATAASYP 331

>sp|P06797|CATL_MOUSE Cathepsin L precursor (Major excreted protein) (MEP) (p39 cysteine
           proteinase) [Contains: Cathepsin L heavy chain;
           Cathepsin L light chain]
          Length = 334

 Score =  210 bits (534), Expect = 4e-54
 Identities = 105/169 (62%), Positives = 125/169 (73%), Gaps = 3/169 (1%)
 Frame = +3

Query: 87  Q*LVDCSNDFGNQGCNGGLMDSAFQYIMQYG-LEKERDYPYTAQDGDCEYSKEKVVAHCQ 263
           Q LVDCS+  GNQGCNGGLMD AFQYI + G L+ E  YPY A+DG C+Y  E  VA+  
Sbjct: 164 QNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDT 223

Query: 264 GFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKAGVYDEENCSSTQLDHGVLAVG 443
           GF DI    E+ L + +ATVGPISV +DAS+PS QFY +G+Y E NCSS  LDHGVL VG
Sbjct: 224 GFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVG 282

Query: 444 YGND-EDSQQN-YWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYP 584
           YG +  DS +N YWLVKNSWG  WG+ GYIK++KD+DN CG+AT ASYP
Sbjct: 283 YGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNHCGLATAASYP 331

>sp|Q95029|CATL_DROME Cathepsin L precursor (Cysteine proteinase 1) [Contains: Cathepsin
           L heavy chain; Cathepsin L light chain]
          Length = 341

 Score =  207 bits (528), Expect = 2e-53
 Identities = 98/167 (58%), Positives = 124/167 (74%), Gaps = 1/167 (0%)
 Frame = +3

Query: 87  Q*LVDCSNDFGNQGCNGGLMDSAFQYIMQYG-LEKERDYPYTAQDGDCEYSKEKVVAHCQ 263
           Q LVDCS  +GN GCNGGLMD+AF+YI   G ++ E+ YPY A D  C ++K  V A  +
Sbjct: 174 QNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATDR 233

Query: 264 GFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKAGVYDEENCSSTQLDHGVLAVG 443
           GFTDI  G E+ +AE +ATVGP+SV IDAS+ SFQFY  GVY+E  C +  LDHGVL VG
Sbjct: 234 GFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVG 293

Query: 444 YGNDEDSQQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYP 584
           +G DE S ++YWLVKNSWG +WG  G+IKM ++K+NQCGIA+ +SYP
Sbjct: 294 FGTDE-SGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYP 339

>sp|O60911|CATL2_HUMAN Cathepsin L2 precursor (Cathepsin V) (Cathepsin U)
          Length = 334

 Score =  206 bits (523), Expect = 7e-53
 Identities = 100/171 (58%), Positives = 122/171 (71%), Gaps = 3/171 (1%)
 Frame = +3

Query: 87  Q*LVDCSNDFGNQGCNGGLMDSAFQYIMQYG-LEKERDYPYTAQDGDCEYSKEKVVAHCQ 263
           Q LVDCS   GNQGCNGG M  AFQY+ + G L+ E  YPY A D  C+Y  E  VA+  
Sbjct: 164 QNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDT 223

Query: 264 GFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKAGVYDEENCSSTQLDHGVLAVG 443
           GFT ++ G E+ L + +ATVGPISV +DA + SFQFYK+G+Y E +CSS  LDHGVL VG
Sbjct: 224 GFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVG 283

Query: 444 YGNDEDSQQN--YWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 590
           YG +  +  N  YWLVKNSWG  WG NGY+K++KDK+N CGIAT ASYPN+
Sbjct: 284 YGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV 334

 Score = 30.4 bits (67), Expect = 5.0
 Identities = 13/17 (76%), Positives = 14/17 (82%)
 Frame = -2

Query: 52  LPDSVNWVKKGYVTQVK 2
           LP SV+W KKGYVT VK
Sbjct: 114 LPKSVDWRKKGYVTPVK 130

>sp|Q26636|CATL_SARPE Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 339

 Score =  201 bits (511), Expect = 2e-51
 Identities = 94/169 (55%), Positives = 119/169 (70%), Gaps = 1/169 (0%)
 Frame = +3

Query: 87  Q*LVDCSNDFGNQGCNGGLMDSAFQYIMQYG-LEKERDYPYTAQDGDCEYSKEKVVAHCQ 263
           Q LVDCS  +GN GCNGGLMD+AF+YI   G ++ E+ YPY   D  C ++K  + A   
Sbjct: 172 QNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDT 231

Query: 264 GFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKAGVYDEENCSSTQLDHGVLAVG 443
           GF DI  G EE + + +AT+GP+SV IDAS+ SFQ Y  GVY+E  C    LDHGVL VG
Sbjct: 232 GFVDIPEGDEEKMKKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDEQNLDHGVLVVG 291

Query: 444 YGNDEDSQQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 590
           YG DE S  +YWLVKNSWG +WG  GYIKM+++++NQCGIAT +SYP +
Sbjct: 292 YGTDE-SGMDYWLVKNSWGTTWGEQGYIKMARNQNNQCGIATASSYPTV 339

>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 precursor
          Length = 321

 Score =  197 bits (501), Expect = 2e-50
 Identities = 97/169 (57%), Positives = 119/169 (70%), Gaps = 1/169 (0%)
 Frame = +3

Query: 87  Q*LVDCSNDFGNQGCNGGLMDSAFQYIMQYG-LEKERDYPYTAQDGDCEYSKEKVVAHCQ 263
           Q LVDCS D+GN GC GG M SAF YI   G ++ E  YPY A+D  C +    + A C 
Sbjct: 156 QQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEAEDRSCRFDANSIGAICT 215

Query: 264 GFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKAGVYDEENCSSTQLDHGVLAVG 443
           G  ++ H +EE L E ++ VGPISV IDAS+ SFQFY +GVY E+NCS T LDHGVLAVG
Sbjct: 216 GSVEVQH-TEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQNCSPTFLDHGVLAVG 274

Query: 444 YGNDEDSQQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 590
           YG   +S ++YWLVKNSWG SWG  GYIKMS+++DN CGIA+  SYP +
Sbjct: 275 YGT--ESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNNCGIASEPSYPTV 321

>sp|P07711|CATL_HUMAN Cathepsin L precursor (Major excreted protein) (MEP) [Contains:
           Cathepsin L heavy chain; Cathepsin L light chain]
          Length = 333

 Score =  194 bits (493), Expect = 2e-49
 Identities = 95/171 (55%), Positives = 118/171 (69%), Gaps = 3/171 (1%)
 Frame = +3

Query: 87  Q*LVDCSNDFGNQGCNGGLMDSAFQYIMQYG-LEKERDYPYTAQDGDCEYSKEKVVAHCQ 263
           Q LVDCS   GN+GCNGGLMD AFQY+   G L+ E  YPY A +  C+Y+ +  VA+  
Sbjct: 164 QNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDT 223

Query: 264 GFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKAGVYDEENCSSTQLDHGVLAVG 443
           GF DI    E+ L + +ATVGPISV IDA + SF FYK G+Y E +CSS  +DHGVL VG
Sbjct: 224 GFVDIPK-QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVG 282

Query: 444 YG--NDEDSQQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 590
           YG  + E     YWLVKNSWG+ WG+ GY+KM+KD+ N CGIA+ ASYP +
Sbjct: 283 YGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 333

>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 precursor
          Length = 323

 Score =  194 bits (493), Expect = 2e-49
 Identities = 93/167 (55%), Positives = 118/167 (70%), Gaps = 1/167 (0%)
 Frame = +3

Query: 87  Q*LVDCSNDFGNQGCNGGLMDSAFQYIM-QYGLEKERDYPYTAQDGDCEYSKEKVVAHCQ 263
           Q LVDCS  +G QGCNGG M+ AF YI    G++ E  YPY A+DG C +    V A C 
Sbjct: 157 QQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPYEARDGSCRFDSNSVAATCS 216

Query: 264 GFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKAGVYDEENCSSTQLDHGVLAVG 443
           G T+I+ GSE  L + +  +GPISV IDA++ SFQFY +GVY E +CS + LDH VLAVG
Sbjct: 217 GHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPSYLDHAVLAVG 276

Query: 444 YGNDEDSQQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYP 584
           YG+  +  Q++WLVKNSW  SWG  GYIKMS++++N CGIAT+ASYP
Sbjct: 277 YGS--EGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVASYP 321

>sp|Q10991|CATL_SHEEP Cathepsin L [Contains: Cathepsin L heavy chain; Cathepsin L light
           chain]
          Length = 217

 Score =  193 bits (491), Expect = 3e-49
 Identities = 97/169 (57%), Positives = 116/169 (68%), Gaps = 1/169 (0%)
 Frame = +3

Query: 87  Q*LVDCSNDFGNQGCNGGLMDSAFQYIMQYG-LEKERDYPYTAQDGDCEYSKEKVVAHCQ 263
           Q LVD S   GNQGCNGGLMD+AFQYI + G L+ E  YPY A D  C Y  E   A   
Sbjct: 51  QNLVDSSRPQGNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSCNYKPEYSAAKDT 110

Query: 264 GFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKAGVYDEENCSSTQLDHGVLAVG 443
           GF DI    E+ L + +ATVGPISV IDA + SFQFYK+G+Y + +CSS  LDHGVL VG
Sbjct: 111 GFVDIPQ-REKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVG 169

Query: 444 YGNDEDSQQNYWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 590
           YG  E +   +W+VKNSWG  WG  GY+KM+KD++N CGIAT ASYP +
Sbjct: 170 YG-FEGTNNKFWIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAASYPTV 217

 Score = 30.4 bits (67), Expect = 5.0
 Identities = 12/17 (70%), Positives = 14/17 (82%)
 Frame = -2

Query: 52 LPDSVNWVKKGYVTQVK 2
          +P SV+W KKGYVT VK
Sbjct: 1  VPKSVDWTKKGYVTPVK 17

>sp|P25975|CATL_BOVIN Cathepsin L precursor [Contains: Cathepsin L heavy chain; Cathepsin
           L light chain]
          Length = 334

 Score =  193 bits (490), Expect = 4e-49
 Identities = 100/172 (58%), Positives = 119/172 (69%), Gaps = 4/172 (2%)
 Frame = +3

Query: 87  Q*LVDCSNDFGNQGCNGGLMDSAFQYIMQYG-LEKERDYPYTAQD-GDCEYSKEKVVAHC 260
           Q LVDCS   GNQGCNGGLMD+AFQYI   G L+ E  YPY A D   C Y  E   A+ 
Sbjct: 164 QNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAAND 223

Query: 261 QGFTDISHGSEEDLAEKLATVGPISVGIDASNPSFQFYKAGVYDEENCSSTQLDHGVLAV 440
            GF DI    E+ L + +ATVGPISV IDA + SFQFYK+G+Y + +CS   LDHGVL V
Sbjct: 224 TGFVDIPQ-REKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSCKDLDHGVLVV 282

Query: 441 GYGND-EDSQQN-YWLVKNSWGKSWGINGYIKMSKDKDNQCGIATMASYPNM 590
           GYG +  DS  N +W+VKNSWG  WG NGY+KM+KD++N CGIAT ASYP +
Sbjct: 283 GYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPTV 334

 Score = 30.4 bits (67), Expect = 5.0
 Identities = 12/17 (70%), Positives = 14/17 (82%)
 Frame = -2

Query: 52  LPDSVNWVKKGYVTQVK 2
           +P SV+W KKGYVT VK
Sbjct: 114 VPKSVDWTKKGYVTPVK 130

  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 84,844,051
Number of Sequences: 369166
Number of extensions: 1762976
Number of successful extensions: 5095
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4433
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4646
length of database: 68,354,980
effective HSP length: 108
effective length of database: 48,403,600
effective search space used: 6486082400
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
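
Note on the statistics: the figures in this footer are enough to approximately reproduce the bit scores and Expect values reported for each alignment. The Python sketch below is illustrative only and is not part of the BLAST output. It uses the standard Karlin-Altschul relations, S' = (lambda * S - ln K) / ln 2 and E = (search space) * 2^(-S'), with the gapped Lambda and K printed above. The function names are hypothetical, and dividing the 728-letter nucleotide query by 3 to get a protein-space query length is an assumption, although it does reproduce the reported effective search space. BLAST applies further adjustments that this sketch does not model, so computed Expect values can differ slightly from the printed ones.

```python
import math

# Constants copied from this report (gapped Lambda and K, database totals).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
QUERY_NT = 728             # "Query= ... (728 letters)", nucleotides
DB_LETTERS = 68_354_980    # "length of database"
DB_SEQUENCES = 184_735     # "Number of sequences in database"
EFF_HSP_LEN = 108          # "effective HSP length"


def bit_score(raw_score: int) -> float:
    """Convert a raw score (the number in parentheses) to a bit score."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def effective_search_space() -> int:
    """Effective query length times effective database length.

    Assumption: the translated query length is QUERY_NT // 3 residues.
    """
    eff_query = QUERY_NT // 3 - EFF_HSP_LEN
    eff_db = DB_LETTERS - DB_SEQUENCES * EFF_HSP_LEN
    return eff_query * eff_db


def expect(bits: float) -> float:
    """Approximate Expect value for a single HSP with the given bit score."""
    return effective_search_space() * 2.0 ** (-bits)


if __name__ == "__main__":
    print(effective_search_space())
    for raw in (544, 534, 67):
        bits = bit_score(raw)
        print(raw, round(bits, 1), f"{expect(bits):.1e}")
```

With these inputs the effective search space evaluates to 6486082400, matching the "effective search space used" line, and raw scores 544, 534 and 67 map to roughly 214, 210 and 30.4 bits, matching the alignments above.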