Planarian EST Database


Dr_sW_022_H05

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_022_H05
         (646 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q26534|CATL_SCHMA  Cathepsin L precursor (SMCL1)               240   3e-63
sp|Q9R013|CATF_MOUSE  Cathepsin F precursor                       231   1e-60
sp|Q9UBX1|CATF_HUMAN  Cathepsin F precursor (CATSF)               226   4e-59
sp|Q9VN93|CPR1_DROME  Putative cysteine proteinase CG12163 p...   224   1e-58
sp|P04988|CYSP1_DICDI  Cysteine proteinase 1 precursor            182   6e-46
sp|P25804|CYSP_PEA  Cysteine proteinase 15A precursor (Turgo...   173   4e-43
sp|P43296|RD19A_ARATH  Cysteine proteinase RD19a precursor (...   169   4e-42
sp|Q10716|CYSP1_MAIZE  Cysteine proteinase 1 precursor            169   7e-42
sp|P43295|A494_ARATH  Probable cysteine proteinase A494 prec...   167   2e-41
sp|P14658|CYSP_TRYBB  Cysteine proteinase precursor               167   2e-41

>sp|Q26534|CATL_SCHMA Cathepsin L precursor (SMCL1)
          Length = 319

 Score =  240 bits (612), Expect = 3e-63
 Identities = 109/190 (57%), Positives = 140/190 (73%)
 Frame = +3

Query: 3   WAFSTTGNIEGQWFIRTKRLVSLSEQQLVDCDTVDEGCNGGLPSQAYKVIQQMGGLETES 182
           WAFSTTGN+E QWF +T +L+SLSEQQLVDCD +D+GCNGGLPS AY+ I +MGGL  E 
Sbjct: 130 WAFSTTGNVESQWFRKTGKLLSLSEQQLVDCDGLDDGCNGGLPSNAYESIIKMGGLMLED 189

Query: 183 DYPYKADRKTCMLDKSKIAVYINGSESIDSSETTMAAWCSINGPISIGINAFAMQFYRGG 362
           +YPY A  + C L    +AVYIN S ++   ET +AAW   N  IS+G+NA  +QFY+ G
Sbjct: 190 NYPYDAKNEKCHLKTDGVAVYINSSVNLTQDETELAAWLYHNSTISVGMNALLLQFYQHG 249

Query: 363 ISHPFKIFCNPDHLDHGVLIVGFNTTSSG*PFWIVKNSWGPGWGEDGYYRVFRGTGVCGL 542
           ISHP+ IFC+   LDH VL+VG+  +    PFWIVKNSWG  WGE+GY+R++RG G CG+
Sbjct: 250 ISHPWWIFCSKYLLDHAVLLVGYGVSEKNEPFWIVKNSWGVEWGENGYFRMYRGDGSCGI 309

Query: 543 NKMPTSAIIH 572
           N + TSA+I+
Sbjct: 310 NTVATSAMIY 319

>sp|Q9R013|CATF_MOUSE Cathepsin F precursor
          Length = 462

 Score =  231 bits (589), Expect = 1e-60
 Identities = 102/190 (53%), Positives = 135/190 (71%)
 Frame = +3

Query: 3   WAFSTTGNIEGQWFIRTKRLVSLSEQQLVDCDTVDEGCNGGLPSQAYKVIQQMGGLETES 182
           WAFS TGN+EGQWF+    L+SLSEQ+L+DCD VD+ C GGLPS AY  I+ +GGLETE 
Sbjct: 274 WAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSNAYAAIKNLGGLETED 333

Query: 183 DYPYKADRKTCMLDKSKIAVYINGSESIDSSETTMAAWCSINGPISIGINAFAMQFYRGG 362
           DY Y+   +TC        VYIN S  +  +E  +AAW +  GPIS+ INAF MQFYR G
Sbjct: 334 DYGYQGHVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKGPISVAINAFGMQFYRHG 393

Query: 363 ISHPFKIFCNPDHLDHGVLIVGFNTTSSG*PFWIVKNSWGPGWGEDGYYRVFRGTGVCGL 542
           I+HPF+  C+P  +DH VL+VG+   S+  P+W +KNSWG  WGE+GYY ++RG+G CG+
Sbjct: 394 IAHPFRPLCSPWFIDHAVLLVGYGNRSN-IPYWAIKNSWGSDWGEEGYYYLYRGSGACGV 452

Query: 543 NKMPTSAIIH 572
           N M +SA+++
Sbjct: 453 NTMASSAVVN 462

>sp|Q9UBX1|CATF_HUMAN Cathepsin F precursor (CATSF)
          Length = 484

 Score =  226 bits (576), Expect = 4e-59
 Identities = 101/189 (53%), Positives = 130/189 (68%)
 Frame = +3

Query: 3   WAFSTTGNIEGQWFIRTKRLVSLSEQQLVDCDTVDEGCNGGLPSQAYKVIQQMGGLETES 182
           WAFS TGN+EGQWF+    L+SLSEQ+L+DCD +D+ C GGLPS AY  I+ +GGLETE 
Sbjct: 296 WAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETED 355

Query: 183 DYPYKADRKTCMLDKSKIAVYINGSESIDSSETTMAAWCSINGPISIGINAFAMQFYRGG 362
           DY Y+   ++C     K  VYIN S  +  +E  +AAW +  GPIS+ INAF MQFYR G
Sbjct: 356 DYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHG 415

Query: 363 ISHPFKIFCNPDHLDHGVLIVGFNTTSSG*PFWIVKNSWGPGWGEDGYYRVFRGTGVCGL 542
           IS P +  C+P  +DH VL+VG+   S   PFW +KNSWG  WGE GYY + RG+G CG+
Sbjct: 416 ISRPLRPLCSPWLIDHAVLLVGYGNRSDV-PFWAIKNSWGTDWGEKGYYYLHRGSGACGV 474

Query: 543 NKMPTSAII 569
           N M +SA++
Sbjct: 475 NTMASSAVV 483

>sp|Q9VN93|CPR1_DROME Putative cysteine proteinase CG12163 precursor
          Length = 614

 Score =  224 bits (572), Expect = 1e-58
 Identities = 104/195 (53%), Positives = 134/195 (68%), Gaps = 6/195 (3%)
 Frame = +3

Query: 3    WAFSTTGNIEGQWFIRTKRLVSLSEQQLVDCDTVDEGCNGGLPSQAYKVIQQMGGLETES 182
            WAFS TGNIEG + ++T  L   SEQ+L+DCDT D  CNGGL   AYK I+ +GGLE E+
Sbjct: 419  WAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDIGGLEYEA 478

Query: 183  DYPYKADRKTCMLDKSKIAVYINGSESI-DSSETTMAAWCSINGPISIGINAFAMQFYRG 359
            +YPYKA +  C  +++   V + G   +   +ET M  W   NGPISIGINA AMQFYRG
Sbjct: 479  EYPYKAKKNQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPISIGINANAMQFYRG 538

Query: 360  GISHPFKIFCNPDHLDHGVLIVGFNTTS-----SG*PFWIVKNSWGPGWGEDGYYRVFRG 524
            G+SHP+K  C+  +LDHGVL+VG+  +         P+WIVKNSWGP WGE GYYRV+RG
Sbjct: 539  GVSHPWKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYRVYRG 598

Query: 525  TGVCGLNKMPTSAII 569
               CG+++M TSA++
Sbjct: 599  DNTCGVSEMATSAVL 613

>sp|P04988|CYSP1_DICDI Cysteine proteinase 1 precursor
          Length = 343

 Score =  182 bits (462), Expect = 6e-46
 Identities = 95/204 (46%), Positives = 124/204 (60%), Gaps = 15/204 (7%)
 Frame = +3

Query: 3   WAFSTTGNIEGQWFIRTKRLVSLSEQQLVDCD----------TVDEGCNGGLPSQAYKVI 152
           W+FSTTGN+EGQ FI   +LVSLSEQ LVDCD            DEGCNGGL   AY  I
Sbjct: 143 WSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYI 202

Query: 153 QQMGGLETESDYPYKADRKT-CMLDKSKIAVYINGSESIDSSETTMAAWCSINGPISIGI 329
            + GG++TES YPY A+  T C  + + I   I+    I  +ET MA +    GP++I  
Sbjct: 203 IKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAA 262

Query: 330 NAFAMQFYRGGISHPFKIFCNPDHLDHGVLIVGFNTTSS----G*PFWIVKNSWGPGWGE 497
           +A   QFY GG+   F I CNP+ LDHG+LIVG++  ++      P+WIVKNSWG  WGE
Sbjct: 263 DAVEWQFYIGGV---FDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGE 319

Query: 498 DGYYRVFRGTGVCGLNKMPTSAII 569
            GY  + RG   CG++   +++II
Sbjct: 320 QGYIYLRRGKNTCGVSNFVSTSII 343

>sp|P25804|CYSP_PEA Cysteine proteinase 15A precursor (Turgor-responsive protein 15A)
          Length = 363

 Score =  173 bits (438), Expect = 4e-43
 Identities = 88/198 (44%), Positives = 117/198 (59%), Gaps = 15/198 (7%)
 Frame = +3

Query: 3   WAFSTTGNIEGQWFIRTKRLVSLSEQQLVDCDTV---------DEGCNGGLPSQAYKVIQ 155
           WAFSTTG +EG  ++ T +LVSLSEQQLVDCD V         D GCNGGL + A++ + 
Sbjct: 157 WAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLL 216

Query: 156 QMGGLETESDYPYKADRKTCMLDKSKIAVYINGSESIDSSETTMAAWCSINGPISIGINA 335
           + GG+  E DY Y     +C  DKSK+   ++    +   E  +AA    NGP+++ INA
Sbjct: 217 ESGGVVQEKDYAYTGRDGSCKFDKSKVVASVSNFSVVTLDEDQIAANLVKNGPLAVAINA 276

Query: 336 FAMQFYRGGISHPFKIFCNPDHLDHGVLIVGFNTTS------SG*PFWIVKNSWGPGWGE 497
             MQ Y  G+S P+   C    LDHGVL+VGF   +         P+WI+KNSWG  WGE
Sbjct: 277 AWMQTYMSGVSCPY--VCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIIKNSWGQNWGE 334

Query: 498 DGYYRVFRGTGVCGLNKM 551
            GYY++ RG  VCG++ M
Sbjct: 335 QGYYKICRGRNVCGVDSM 352

>sp|P43296|RD19A_ARATH Cysteine proteinase RD19a precursor (RD19)
          Length = 368

 Score =  169 bits (429), Expect = 4e-42
 Identities = 90/199 (45%), Positives = 120/199 (60%), Gaps = 16/199 (8%)
 Frame = +3

Query: 3   WAFSTTGNIEGQWFIRTKRLVSLSEQQLVDCD---------TVDEGCNGGLPSQAYKVIQ 155
           W+FS TG +EG  F+ T +LVSLSEQQLVDCD         + D GCNGGL + A++   
Sbjct: 160 WSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTL 219

Query: 156 QMGGLETESDYPYKA-DRKTCMLDKSKIAVYINGSESIDSSETTMAAWCSINGPISIGIN 332
           + GGL  E DYPY   D KTC LDKSKI   ++    I   E  +AA    NGP+++ IN
Sbjct: 220 KTGGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAIN 279

Query: 333 AFAMQFYRGGISHPFKIFCNPDHLDHGVLIVGFNTTSSG*------PFWIVKNSWGPGWG 494
           A  MQ Y GG+S P+   C    L+HGVL+VG+             P+WI+KNSWG  WG
Sbjct: 280 AGYMQTYIGGVSCPY--ICTR-RLNHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWG 336

Query: 495 EDGYYRVFRGTGVCGLNKM 551
           E+G+Y++ +G  +CG++ M
Sbjct: 337 ENGFYKICKGRNICGVDSM 355

>sp|Q10716|CYSP1_MAIZE Cysteine proteinase 1 precursor
          Length = 371

 Score =  169 bits (427), Expect = 7e-42
 Identities = 89/209 (42%), Positives = 123/209 (58%), Gaps = 19/209 (9%)
 Frame = +3

Query: 3   WAFSTTGNIEGQWFIRTKRLVSLSEQQLVDCD---------TVDEGCNGGLPSQAYKVIQ 155
           W+FS +G +EG  ++ T +L  LSEQQ VDCD         + D GCNGGL + A+  +Q
Sbjct: 162 WSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLMTTAFSYLQ 221

Query: 156 QMGGLETESDYPYKADRKTCMLDKSKIAVYINGSESIDSSETTMAAWCSINGPISIGINA 335
           + GGLE+E DYPY      C  DKSKI   +     +   E  ++A    +GP++IGINA
Sbjct: 222 KAGGLESEKDYPYTGSDGKCKFDKSKIVASVQNFSVVSVDEAQISANLIKHGPLAIGINA 281

Query: 336 FAMQFYRGGISHPFKIFCNPDHLDHGVLIVGFNTTS------SG*PFWIVKNSWGPGWGE 497
             MQ Y GG+S P+   C   HLDHGVL+VG+  +          P+WI+KNSWG  WGE
Sbjct: 282 AYMQTYIGGVSCPY--ICG-RHLDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENWGE 338

Query: 498 DGYYRVFRGTGV---CGLNKM-PTSAIIH 572
           +GYY++ RG+ V   CG++ M  T + +H
Sbjct: 339 NGYYKICRGSNVRNKCGVDSMVSTVSAVH 367

>sp|P43295|A494_ARATH Probable cysteine proteinase A494 precursor
          Length = 361

 Score =  167 bits (424), Expect = 2e-41
 Identities = 86/199 (43%), Positives = 123/199 (61%), Gaps = 16/199 (8%)
 Frame = +3

Query: 3   WAFSTTGNIEGQWFIRTKRLVSLSEQQLVDCD---------TVDEGCNGGLPSQAYKVIQ 155
           W+FSTTG +EG  F+ T +LVSLSEQQLVDCD         + D GCNGGL + A++   
Sbjct: 157 WSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTL 216

Query: 156 QMGGLETESDYPYKA-DRKTCMLDKSKIAVYINGSESIDSSETTMAAWCSINGPISIGIN 332
           + GGL  E DYPY   D  +C LD+SKI   ++    +  +E  +AA    NGP+++ IN
Sbjct: 217 KTGGLMREKDYPYTGTDGGSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAIN 276

Query: 333 AFAMQFYRGGISHPFKIFCNPDHLDHGVLIVGFNTTSSG*------PFWIVKNSWGPGWG 494
           A  MQ Y GG+S P+   C+   L+HGVL+VG+ +           P+WI+KNSWG  WG
Sbjct: 277 AAYMQTYIGGVSCPY--ICSR-RLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWG 333

Query: 495 EDGYYRVFRGTGVCGLNKM 551
           E+G+Y++ +G  +CG++ +
Sbjct: 334 ENGFYKICKGRNICGVDSL 352

>sp|P14658|CYSP_TRYBB Cysteine proteinase precursor
          Length = 450

 Score =  167 bits (423), Expect = 2e-41
 Identities = 85/194 (43%), Positives = 118/194 (60%), Gaps = 5/194 (2%)
 Frame = +3

Query: 3   WAFSTTGNIEGQWFIRTKRLVSLSEQQLVDCDTVDEGCNGGLPSQAYK-VIQQMGG-LET 176
           WAFST GNIEGQW +    LVSLSEQ LV CDT+D GCNGGL   A+  ++   GG + T
Sbjct: 151 WAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDSGCNGGLMDNAFNWIVNSNGGNVFT 210

Query: 177 ESDYPY---KADRKTCMLDKSKIAVYINGSESIDSSETTMAAWCSINGPISIGINAFAMQ 347
           E+ YPY     ++  C ++  +I   I     +   E  +AA+ + NGP++I ++A +  
Sbjct: 211 EASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDAESFM 270

Query: 348 FYRGGISHPFKIFCNPDHLDHGVLIVGFNTTSSG*PFWIVKNSWGPGWGEDGYYRVFRGT 527
            Y GGI       C    LDHGVL+VG+N  S+  P+WI+KNSW   WGEDGY R+ +GT
Sbjct: 271 DYNGGI----LTSCTSKQLDHGVLLVGYNDNSNP-PYWIIKNSWSNMWGEDGYIRIEKGT 325

Query: 528 GVCGLNKMPTSAII 569
             C +N+  +SA++
Sbjct: 326 NQCLMNQAVSSAVV 339
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 74,104,626
Number of Sequences: 369166
Number of extensions: 1454530
Number of successful extensions: 3467
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3062
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3133
length of database: 68,354,980
effective HSP length: 106
effective length of database: 48,773,070
effective search space used: 5267491560
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
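
Note on the statistics footer: the Expect values in this report can be approximately reproduced from the bit scores and the "effective search space used" figure. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relation E = (search space) * 2^(-bit score), and it assumes the effective database length is the raw length minus one effective HSP length per sequence. The function and constant names are illustrative only. Because the report prints bit scores rounded to whole numbers, recomputed E-values may differ slightly from the printed ones.

```python
# Values copied from the statistics footer of this report.
DB_LETTERS = 68_354_980          # "length of database"
DB_SEQUENCES = 184_735           # "Number of sequences in database"
EFFECTIVE_HSP_LENGTH = 106       # "effective HSP length"
SEARCH_SPACE = 5_267_491_560     # "effective search space used"


def effective_db_length(letters: int, sequences: int, hsp_len: int) -> int:
    """Raw database length minus one effective HSP length per sequence (assumed correction)."""
    return letters - sequences * hsp_len


def expect_from_bits(bit_score: float, search_space: int) -> float:
    """Karlin-Altschul expectation for a normalized (bit) score."""
    return search_space * 2.0 ** (-bit_score)


eff_db = effective_db_length(DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH)
# Effective query length implied by the reported search space.
eff_query = SEARCH_SPACE // eff_db

print("effective length of database:", eff_db)
print("implied effective query length:", eff_query)

# Bit scores of the best and weakest hits listed in the summary table.
for bits in (240, 167):
    print(bits, "bits ->", f"{expect_from_bits(bits, SEARCH_SPACE):.1e}")
```

The computed effective database length agrees with the "effective length of database" line above, and the best hit (240 bits) comes out close to the reported 3e-63.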