Planaria EST Database


DrC_00583

BLASTX 2.2.13 [Nov-27-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= DrC_00583
         (700 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P07688|CATB_BOVIN  Cathepsin B precursor [Contains: Cathe...   330   2e-90
sp|P07858|CATB_HUMAN  Cathepsin B precursor (Cathepsin B1) (...   325   9e-89
sp|P00787|CATB_RAT  Cathepsin B precursor (Cathepsin B1) (RS...   323   4e-88
sp|P10605|CATB_MOUSE  Cathepsin B precursor (Cathepsin B1) [...   318   9e-87
sp|P43233|CATB_CHICK  Cathepsin B precursor (Cathepsin B1) [...   306   3e-83
sp|P43157|CYSP_SCHJA  Cathepsin B-like cysteine proteinase p...   285   6e-77
sp|P25792|CYSP_SCHMA  Cathepsin B-like cysteine proteinase p...   276   3e-74
sp|P43510|CPR6_CAEEL  Cathepsin B-like cysteine proteinase 6...   258   8e-69
sp|P43509|CPR5_CAEEL  Cathepsin B-like cysteine proteinase 5...   254   2e-67
sp|P25807|CPR1_CAEEL  Gut-specific cysteine proteinase precu...   250   3e-66
>sp|P07688|CATB_BOVIN Cathepsin B precursor [Contains: Cathepsin B light chain; Cathepsin
           B heavy chain]
          Length = 335

 Score =  330 bits (846), Expect = 2e-90
 Identities = 144/213 (67%), Positives = 167/213 (78%)
 Frame = +1

Query: 1   TDRYCIHSNGTQTPRISAEDLLTCCGFRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGAHL 180
           +DR CIHSNG     +SAED+LTCCG  CGDGCNGGFPSGAW++W   GLV+GG Y +H+
Sbjct: 118 SDRICIHSNGRVNVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHV 177

Query: 181 GCQDYAFPKCSHHVIGPYPNCTGEFPTPKCKKACQAGYSKTYAEDKQYGKSSYSVDSNQQ 360
           GC+ Y+ P C HHV G  P CTGE  TPKC K C+ GYS +Y EDK +G SSYSV +N++
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPKCNKTCEPGYSPSYKEDKHFGCSSYSVANNEK 237

Query: 361 AIMQEILTNGPVEAAFSVYADFPNYKSGVYQHVSGGMLGGHAIKILGWGVENNTPYWLVA 540
            IM EI  NGPVE AFSVY+DF  YKSGVYQHVSG ++GGHAI+ILGWGVEN TPYWLV 
Sbjct: 238 EIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENGTPYWLVG 297

Query: 541 NSWNPTWGDNGYFKILRGSDECGIEDEVVAGIP 639
           NSWN  WGDNG+FKILRG D CGIE E+VAG+P
Sbjct: 298 NSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 330
>sp|P07858|CATB_HUMAN Cathepsin B precursor (Cathepsin B1) (APP secretase) (APPS)
           [Contains: Cathepsin B light chain; Cathepsin B heavy
           chain]
          Length = 339

 Score =  325 bits (832), Expect = 9e-89
 Identities = 142/214 (66%), Positives = 168/214 (78%)
 Frame = +1

Query: 1   TDRYCIHSNGTQTPRISAEDLLTCCGFRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGAHL 180
           +DR CIH+N   +  +SAEDLLTCCG  CGDGCNGG+P+ AW++W   GLV+GG Y +H+
Sbjct: 118 SDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 177

Query: 181 GCQDYAFPKCSHHVIGPYPNCTGEFPTPKCKKACQAGYSKTYAEDKQYGKSSYSVDSNQQ 360
           GC+ Y+ P C HHV G  P CTGE  TPKC K C+ GYS TY +DK YG +SYSV ++++
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEK 237

Query: 361 AIMQEILTNGPVEAAFSVYADFPNYKSGVYQHVSGGMLGGHAIKILGWGVENNTPYWLVA 540
            IM EI  NGPVE AFSVY+DF  YKSGVYQHV+G M+GGHAI+ILGWGVEN TPYWLVA
Sbjct: 238 DIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMMGGHAIRILGWGVENGTPYWLVA 297

Query: 541 NSWNPTWGDNGYFKILRGSDECGIEDEVVAGIPK 642
           NSWN  WGDNG+FKILRG D CGIE EVVAGIP+
Sbjct: 298 NSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPR 331
>sp|P00787|CATB_RAT Cathepsin B precursor (Cathepsin B1) (RSG-2) [Contains: Cathepsin B
           light chain; Cathepsin B heavy chain]
          Length = 339

 Score =  323 bits (827), Expect = 4e-88
 Identities = 137/214 (64%), Positives = 166/214 (77%)
 Frame = +1

Query: 1   TDRYCIHSNGTQTPRISAEDLLTCCGFRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGAHL 180
           +DR CIH+NG     +SAEDLLTCCG +CGDGCNGG+PSGAW++W   GLV+GG Y +H+
Sbjct: 118 SDRICIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHI 177

Query: 181 GCQDYAFPKCSHHVIGPYPNCTGEFPTPKCKKACQAGYSKTYAEDKQYGKSSYSVDSNQQ 360
           GC  Y  P C HHV G  P CTGE  TPKC K C+AGYS +Y EDK YG +SYSV  +++
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEK 237

Query: 361 AIMQEILTNGPVEAAFSVYADFPNYKSGVYQHVSGGMLGGHAIKILGWGVENNTPYWLVA 540
            IM EI  NGPVE AF+V++DF  YKSGVY+H +G ++GGHAI+ILGWG+EN  PYWLVA
Sbjct: 238 EIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWGIENGVPYWLVA 297

Query: 541 NSWNPTWGDNGYFKILRGSDECGIEDEVVAGIPK 642
           NSWN  WGDNG+FKILRG + CGIE E+VAGIP+
Sbjct: 298 NSWNVDWGDNGFFKILRGENHCGIESEIVAGIPR 331
>sp|P10605|CATB_MOUSE Cathepsin B precursor (Cathepsin B1) [Contains: Cathepsin B light
           chain; Cathepsin B heavy chain]
          Length = 339

 Score =  318 bits (815), Expect = 9e-87
 Identities = 136/214 (63%), Positives = 165/214 (77%)
 Frame = +1

Query: 1   TDRYCIHSNGTQTPRISAEDLLTCCGFRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGAHL 180
           +DR CIH+NG     +SAEDLLTCCG +CGDGCNGG+PSGAW +W   GLV+GG Y +H+
Sbjct: 118 SDRTCIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHV 177

Query: 181 GCQDYAFPKCSHHVIGPYPNCTGEFPTPKCKKACQAGYSKTYAEDKQYGKSSYSVDSNQQ 360
           GC  Y  P C HHV G  P CTGE  TP+C K+C+AGYS +Y EDK +G +SYSV ++ +
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPCTGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVK 237

Query: 361 AIMQEILTNGPVEAAFSVYADFPNYKSGVYQHVSGGMLGGHAIKILGWGVENNTPYWLVA 540
            IM EI  NGPVE AF+V++DF  YKSGVY+H +G M+GGHAI+ILGWGVEN  PYWL A
Sbjct: 238 EIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHAIRILGWGVENGVPYWLAA 297

Query: 541 NSWNPTWGDNGYFKILRGSDECGIEDEVVAGIPK 642
           NSWN  WGDNG+FKILRG + CGIE E+VAGIP+
Sbjct: 298 NSWNLDWGDNGFFKILRGENHCGIESEIVAGIPR 331
>sp|P43233|CATB_CHICK Cathepsin B precursor (Cathepsin B1) [Contains: Cathepsin B light
           chain; Cathepsin B heavy chain]
          Length = 340

 Score =  306 bits (784), Expect = 3e-83
 Identities = 134/216 (62%), Positives = 160/216 (74%), Gaps = 1/216 (0%)
 Frame = +1

Query: 1   TDRYCIHSNGTQTPRISAEDLLTCCGFRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGAHL 180
           +DR C+H+N   +  +SAEDLL+CCGF CG GCNGG+PSGAW YW   GLV+GG Y +H+
Sbjct: 118 SDRICVHTNAKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHV 177

Query: 181 GCQDYAFPKCSHHVIGPYPNCTGEF-PTPKCKKACQAGYSKTYAEDKQYGKSSYSVDSNQ 357
           GC+ Y  P C HHV G  P CTGE   TP+C + C+ GYS +Y EDK YG +SY V  ++
Sbjct: 178 GCRAYTIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSE 237

Query: 358 QAIMQEILTNGPVEAAFSVYADFPNYKSGVYQHVSGGMLGGHAIKILGWGVENNTPYWLV 537
           + IM EI  NGPVE AF VY DF  YKSGVYQHVSG  +GGHAI+ILGWGVEN TPYWL 
Sbjct: 238 KEIMAEIYKNGPVEGAFIVYEDFLMYKSGVYQHVSGEQVGGHAIRILGWGVENGTPYWLA 297

Query: 538 ANSWNPTWGDNGYFKILRGSDECGIEDEVVAGIPKL 645
           ANSWN  WG  G+FKILRG D CGIE E+VAG+P++
Sbjct: 298 ANSWNTDWGITGFFKILRGEDHCGIESEIVAGVPRM 333
>sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase precursor (Antigen Sj31)
          Length = 342

 Score =  285 bits (730), Expect = 6e-77
 Identities = 125/215 (58%), Positives = 155/215 (72%), Gaps = 1/215 (0%)
 Frame = +1

Query: 1   TDRYCIHSNGTQTPRISAEDLLTCCGFRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGAHL 180
           TDR CI S G Q+  +SA DL++CC   CGDGC GGFP  AW YWV  G+VTGG    H 
Sbjct: 128 TDRICIQSGGGQSAELSALDLISCCK-DCGDGCQGGFPGVAWDYWVKRGIVTGGSKENHT 186

Query: 181 GCQDYAFPKCSHHVIGPYPNC-TGEFPTPKCKKACQAGYSKTYAEDKQYGKSSYSVDSNQ 357
           GCQ Y FPKC HH  G YP C T  + TP+CK+ CQ GY   Y +DK YG  SY+V +N+
Sbjct: 187 GCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQNNE 246

Query: 358 QAIMQEILTNGPVEAAFSVYADFPNYKSGVYQHVSGGMLGGHAIKILGWGVENNTPYWLV 537
           + I ++I+  GPVEAAF VY DF NYKSG+Y+HV+G ++GGHAI+I+GWGVE  TPYWL+
Sbjct: 247 KVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTPYWLI 306

Query: 538 ANSWNPTWGDNGYFKILRGSDECGIEDEVVAGIPK 642
           ANSWN  WG+ G F+++RG DEC IE +VVAG+ K
Sbjct: 307 ANSWNEDWGEKGLFRMVRGRDECSIESDVVAGLIK 341
>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase precursor (Antigen Sm31)
          Length = 340

 Score =  276 bits (707), Expect = 3e-74
 Identities = 122/212 (57%), Positives = 152/212 (71%), Gaps = 1/212 (0%)
 Frame = +1

Query: 1   TDRYCIHSNGTQTPRISAEDLLTCCGFRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGAHL 180
           +DR CI S G Q   +SA DLLTCC   CG GC GG    AW YWV +G+VT      H 
Sbjct: 127 SDRSCIQSGGKQNVELSAVDLLTCCE-SCGLGCEGGILGPAWDYWVKEGIVTASSKENHT 185

Query: 181 GCQDYAFPKCSHHVIGPYPNCTGE-FPTPKCKKACQAGYSKTYAEDKQYGKSSYSVDSNQ 357
           GC+ Y FPKC HH  G YP C  + + TP+CK+ CQ  Y   Y +DK  GKSSY+V +++
Sbjct: 186 GCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRGKSSYNVKNDE 245

Query: 358 QAIMQEILTNGPVEAAFSVYADFPNYKSGVYQHVSGGMLGGHAIKILGWGVENNTPYWLV 537
           +AI +EI+  GPVEA+F+VY DF NYKSG+Y+H++G  LGGHAI+I+GWGVEN TPYWL+
Sbjct: 246 KAIQKEIMKYGPVEASFTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVENKTPYWLI 305

Query: 538 ANSWNPTWGDNGYFKILRGSDECGIEDEVVAG 633
           ANSWN  WG+NGYF+I+RG DEC IE EV+AG
Sbjct: 306 ANSWNEDWGENGYFRIVRGRDECSIESEVIAG 337
>sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6 precursor (Cysteine
           protease-related 6)
          Length = 379

 Score =  258 bits (660), Expect = 8e-69
 Identities = 119/220 (54%), Positives = 153/220 (69%), Gaps = 5/220 (2%)
 Frame = +1

Query: 1   TDRYCIHSNGTQTPRISAEDLLTCCGFRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGAHL 180
           +DR CI S+G     +SA+DLL+CC   CG GCNGG P  AW YWV DG+VTG  Y A+ 
Sbjct: 143 SDRICIASHGELQVTLSADDLLSCCK-SCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANN 201

Query: 181 GCQDYAFPKCSHHV----IGPYPNCTGEFPTPKCKKACQAGYS-KTYAEDKQYGKSSYSV 345
           GC+ Y FP C HH       P P+    +PTPKC+K C + Y+ KTY+EDK +G S+Y V
Sbjct: 202 GCKPYPFPPCEHHSKKTHFDPCPHDL--YPTPKCEKKCVSDYTDKTYSEDKFFGASAYGV 259

Query: 346 DSNQQAIMQEILTNGPVEAAFSVYADFPNYKSGVYQHVSGGMLGGHAIKILGWGVENNTP 525
             + +AI +E++T+GP+E AF VY DF NY  GVY H  G + GGHA+K++GWG+++  P
Sbjct: 260 KDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIDDGIP 319

Query: 526 YWLVANSWNPTWGDNGYFKILRGSDECGIEDEVVAGIPKL 645
           YW VANSWN  WG++G+F+ILRG DECGIE  VV GIPKL
Sbjct: 320 YWTVANSWNTDWGEDGFFRILRGVDECGIESGVVGGIPKL 359
>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 precursor (Cysteine
           protease-related 5)
          Length = 344

 Score =  254 bits (649), Expect = 2e-67
 Identities = 118/221 (53%), Positives = 148/221 (66%), Gaps = 6/221 (2%)
 Frame = +1

Query: 1   TDRYCIHSNGTQTPRISAEDLLTCCG--FRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGA 174
           +DR CI SNG     +S+EDLL+CC   F CG+GC GG+P  AW +WV  GLVTGG Y  
Sbjct: 120 SDRTCIASNGAVNTLLSSEDLLSCCTGMFSCGNGCEGGYPIQAWKWWVKHGLVTGGSYET 179

Query: 175 HLGCQDYAFPKCSHHVIG-PYPNCTGEF-PTPKCKKACQA--GYSKTYAEDKQYGKSSYS 342
             GC+ Y+   C   V G  +P C  +  PTPKC  +C +   Y+  Y +DK +G ++Y+
Sbjct: 180 QFGCKPYSIAPCGETVNGVKWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFGSTAYA 239

Query: 343 VDSNQQAIMQEILTNGPVEAAFSVYADFPNYKSGVYQHVSGGMLGGHAIKILGWGVENNT 522
           V    + I  EILTNGP+E AF+VY DF  Y +GVY H +G  LGGHA+KILGWGV+N T
Sbjct: 240 VGKKVEQIQTEILTNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGVDNGT 299

Query: 523 PYWLVANSWNPTWGDNGYFKILRGSDECGIEDEVVAGIPKL 645
           PYWLVANSWN  WG+ GYF+I+RG +ECGIE   VAGIP L
Sbjct: 300 PYWLVANSWNVAWGEKGYFRIIRGLNECGIEHSAVAGIPDL 340
>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase precursor
          Length = 329

 Score =  250 bits (638), Expect = 3e-66
 Identities = 117/215 (54%), Positives = 146/215 (67%)
 Frame = +1

Query: 1   TDRYCIHSNGTQTPRISAEDLLTCCGFRCGDGCNGGFPSGAWHYWVTDGLVTGGEYGAHL 180
           +DR CI + G Q P IS +DLL+CCG  CG+GC GG+P  A  +W + G+VTGG+Y    
Sbjct: 123 SDRTCIETKGAQQPIISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHG-A 181

Query: 181 GCQDYAFPKCSHHVIGPYPNCTGEFPTPKCKKACQAGYSKTYAEDKQYGKSSYSVDSNQQ 360
           GC+ Y    C+        NC  E  TP C  +CQ+GYS  YA+DK +G S+Y+V  N  
Sbjct: 182 GCKPYPIAPCTSG------NCP-ESKTPSCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAA 234

Query: 361 AIMQEILTNGPVEAAFSVYADFPNYKSGVYQHVSGGMLGGHAIKILGWGVENNTPYWLVA 540
           +I  EI  NGPVEAAFSVY DF  YKSGVY+H +G  LGGHAIKI+GWG E+ +PYWLVA
Sbjct: 235 SIQAEIYANGPVEAAFSVYEDFYKYKSGVYKHTAGKYLGGHAIKIIGWGTESGSPYWLVA 294

Query: 541 NSWNPTWGDNGYFKILRGSDECGIEDEVVAGIPKL 645
           NSW   WG++G+FKI RG D+CGIE  VVAG  K+
Sbjct: 295 NSWGVNWGESGFFKIYRGDDQCGIESAVVAGKAKV 329
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 87,750,369
Number of Sequences: 369166
Number of extensions: 2084444
Number of successful extensions: 6530
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 5986
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 6406
length of database: 68,354,980
effective HSP length: 107
effective length of database: 48,588,335
effective search space used: 6073541875
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)

Cluster detail

DrC_00583

  1. Dr_sW_001_F03
  2. Dr_sW_015_E06
  3. Dr_sW_005_O18
  4. Dr_sW_014_P19
  5. Dr_sW_013_A14