Planarian EST Database


Dr_sW_001_L16

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_001_L16
         (884 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P97821|CATC_MOUSE  Dipeptidyl-peptidase I precursor (DPP-...   273   4e-73
sp|P53634|CATC_HUMAN  Dipeptidyl-peptidase I precursor (DPP-...   272   8e-73
sp|P80067|CATC_RAT  Dipeptidyl-peptidase I precursor (DPP-I)...   268   2e-71
sp|Q60HG6|CATC_MACFA  Dipeptidyl-peptidase I precursor (DPP-...   264   2e-70
sp|O97578|CATC_CANFA  Dipeptidyl-peptidase I precursor (DPP-...   261   1e-69
sp|Q26563|CATC_SCHMA  Cathepsin C precursor                       231   3e-60
sp|Q90686|CATK_CHICK  Cathepsin K precursor (JTAP-1)               70   8e-12
sp|Q91BH1|CATV_NPVST  Viral cathepsin (V-cath) (Cysteine pro...    69   2e-11
sp|P41715|CATV_NPVCF  Viral cathepsin (V-cath) (Cysteine pro...    66   1e-10
sp|P55097|CATK_MOUSE  Cathepsin K precursor                        66   1e-10
>sp|P97821|CATC_MOUSE Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 462

 Score =  273 bits (699), Expect = 4e-73
 Identities = 143/300 (47%), Positives = 186/300 (62%), Gaps = 12/300 (4%)
 Frame = +3

Query: 21  SDTPANCTYQDVIGKWQVFTG----NFNVSCSTSKLVATKTLTLIYP-NWAVDEFGNYGK 185
           SDTPANCTY D++G W    G      +++CS  +    K +  +   + A DE GN G 
Sbjct: 24  SDTPANCTYPDLLGTWVFQVGPRSSRSDINCSVMEATEEKVVVHLKKLDTAYDELGNSGH 83

Query: 186 WTLIYNQGFEVTITNKKYFGFFDYKKINSTYTISYCDRLQPSWFHDVLIRQWQCFKAQRT 365
           +TLIYNQGFE+ + + K+F FF Y+    T  ISYC      W HDVL R W CF  ++ 
Sbjct: 84  FTLIYNQGFEIVLNDYKWFAFFKYEVRGHT-AISYCHETMTGWVHDVLGRNWACFVGKKV 142

Query: 366 TTLKEKNNVLPHSNIFALTSLR-------YGSQKRIVDKINLENNGWTAKDYPEFHEKTL 524
            +  EK N+    N   L  L+       Y      V  IN     WTA  Y E+ + +L
Sbjct: 143 ESHIEKVNM----NAAHLGGLQERYSERLYTHNHNFVKAINTVQKSWTATAYKEYEKMSL 198

Query: 525 YEVINMAGGSRSKLERPKPAPITKSILDSVKLIPKSFDWRNVNGLNYVSPVRNQGGCGSC 704
            ++I  +G S+ ++ RPKPAP+T  I   +  +P+S+DWRNV G+NYVSPVRNQ  CGSC
Sbjct: 199 RDLIRRSGHSQ-RIPRPKPAPMTDEIQQQILNLPESWDWRNVQGVNYVSPVRNQESCGSC 257

Query: 705 YSFASAGMLEARYRIRSNNTVRPILSPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMAQ 884
           YSFAS GMLEAR RI +NN+  PILSPQ+VV CSPY+QGCDGGFPYLIAGK+A+DFG+ +
Sbjct: 258 YSFASMGMLEARIRILTNNSQTPILSPQEVVSCSPYAQGCDGGFPYLIAGKYAQDFGVVE 317
>sp|P53634|CATC_HUMAN Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 463

 Score =  272 bits (696), Expect = 8e-73
 Identities = 140/301 (46%), Positives = 183/301 (60%), Gaps = 14/301 (4%)
 Frame = +3

Query: 24  DTPANCTYQDVIGKWQVFTGNF----NVSCSTSKLVATKTLTLIYP-NWAVDEFGNYGKW 188
           DTPANCTY D++G W    G+     +V+CS       K +  +   + A D+ GN G +
Sbjct: 25  DTPANCTYLDLLGTWVFQVGSSGSQRDVNCSVMGPQEKKVVVYLQKLDTAYDDLGNSGHF 84

Query: 189 TLIYNQGFEVTITNKKYFGFFDYKKINSTYTISYCDRLQPSWFHDVLIRQWQCFKAQRTT 368
           T+IYNQGFE+ + + K+F FF YK+  S  T +YC+     W HDVL R W CF  ++  
Sbjct: 85  TIIYNQGFEIVLNDYKWFAFFKYKEEGSKVT-TYCNETMTGWVHDVLGRNWACFTGKKVG 143

Query: 369 TLKE---------KNNVLPHSNIFALTSLRYGSQKRIVDKINLENNGWTAKDYPEFHEKT 521
           T  E         KN+   +SN        Y      V  IN     WTA  Y E+   T
Sbjct: 144 TASENVYVNTAHLKNSQEKYSNRL------YKYDHNFVKAINAIQKSWTATTYMEYETLT 197

Query: 522 LYEVINMAGGSRSKLERPKPAPITKSILDSVKLIPKSFDWRNVNGLNYVSPVRNQGGCGS 701
           L ++I  +GG   K+ RPKPAP+T  I   +  +P S+DWRNV+G+N+VSPVRNQ  CGS
Sbjct: 198 LGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILHLPTSWDWRNVHGINFVSPVRNQASCGS 257

Query: 702 CYSFASAGMLEARYRIRSNNTVRPILSPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMA 881
           CYSFAS GMLEAR RI +NN+  PILSPQ+VV CS Y+QGC+GGFPYLIAGK+A+DFG+ 
Sbjct: 258 CYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFGLV 317

Query: 882 Q 884
           +
Sbjct: 318 E 318
>sp|P80067|CATC_RAT Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 462

 Score =  268 bits (684), Expect = 2e-71
 Identities = 139/300 (46%), Positives = 183/300 (61%), Gaps = 12/300 (4%)
 Frame = +3

Query: 21  SDTPANCTYQDVIGKWQVFTG----NFNVSCSTSKLVATKTLTLIYP-NWAVDEFGNYGK 185
           SDTPANCTY D++G W    G      +++CS  +    K +  +   + A DE GN G 
Sbjct: 24  SDTPANCTYPDLLGTWVFQVGPRHPRSHINCSVMEPTEEKVVIHLKKLDTAYDEVGNSGY 83

Query: 186 WTLIYNQGFEVTITNKKYFGFFDYKKINSTYTISYCDRLQPSWFHDVLIRQWQCFKAQRT 365
           +TLIYNQGFE+ + + K+F FF Y+ +  +  ISYC      W HD L R W CF  ++ 
Sbjct: 84  FTLIYNQGFEIVLNDYKWFAFFKYE-VKGSRAISYCHETMTGWVHDYLGRNWACFVGKKM 142

Query: 366 TTLKEKNNVLPHSNIFALTSLR-------YGSQKRIVDKINLENNGWTAKDYPEFHEKTL 524
               EK  V    N+  L  L+       Y      V  IN     WTA  Y  + + ++
Sbjct: 143 ANHSEKVYV----NVAHLGGLQEKYSERLYSHHHNFVKAINSVQKSWTATTYRRYEKLSI 198

Query: 525 YEVINMAGGSRSKLERPKPAPITKSILDSVKLIPKSFDWRNVNGLNYVSPVRNQGGCGSC 704
            ++I  +G S  ++ RPKPAPIT  I   +  +P+S+DWRNV G+N+VSPVRNQ  CGSC
Sbjct: 199 RDLIRRSGHS-GRILRPKPAPITDEIQQQILSLPESWDWRNVRGINFVSPVRNQESCGSC 257

Query: 705 YSFASAGMLEARYRIRSNNTVRPILSPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMAQ 884
           YSFAS GMLEAR RI +NN+  PILSPQ+VV CSPY+QGCDGGFPYLIAGK+A+DFG+ +
Sbjct: 258 YSFASIGMLEARIRILTNNSQTPILSPQEVVSCSPYAQGCDGGFPYLIAGKYAQDFGVVE 317
>sp|Q60HG6|CATC_MACFA Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain; Dipeptidyl-peptidase
           I light chain]
          Length = 463

 Score =  264 bits (675), Expect = 2e-70
 Identities = 137/301 (45%), Positives = 179/301 (59%), Gaps = 14/301 (4%)
 Frame = +3

Query: 24  DTPANCTYQDVIGKWQVFTGNF----NVSCSTSKLVATKTLTLIYP-NWAVDEFGNYGKW 188
           DTPANCTY D++G W    G+     +V+CS       K +  +   + A D+ GN G +
Sbjct: 25  DTPANCTYLDLLGTWVFQVGSSGSLRDVNCSVMGPPEKKVVVHLQKLDTAYDDLGNSGHF 84

Query: 189 TLIYNQGFEVTITNKKYFGFFDYKKINSTYTISYCDRLQPSWFHDVLIRQWQCFKAQRTT 368
           T+IYNQGFE+ + + K+F FF YK+     TI YC+     W HDVL R W CF  ++  
Sbjct: 85  TIIYNQGFEIVLNDYKWFAFFKYKEEGIKVTI-YCNETMTGWVHDVLGRNWACFTGKKVG 143

Query: 369 TLKE---------KNNVLPHSNIFALTSLRYGSQKRIVDKINLENNGWTAKDYPEFHEKT 521
           T  E         KN+   +SN        Y      V  IN     WTA  Y E+   T
Sbjct: 144 TASENVYVNTAHLKNSQEKYSNRL------YKYDHNFVKAINAIQKSWTATTYMEYETLT 197

Query: 522 LYEVINMAGGSRSKLERPKPAPITKSILDSVKLIPKSFDWRNVNGLNYVSPVRNQGGCGS 701
           L ++I  +GG   K+ RPKP P+T  I   +  +P S+DWRNV+G+N+VSPVRNQ  CGS
Sbjct: 198 LGDMIKRSGGHSRKIPRPKPTPLTAEIQQKILHLPTSWDWRNVHGINFVSPVRNQASCGS 257

Query: 702 CYSFASAGMLEARYRIRSNNTVRPILSPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMA 881
           CYSFAS GMLEAR RI +NN+  PILS Q+VV CS Y+QGC+GGFPYL AGK+A+DFG+ 
Sbjct: 258 CYSFASVGMLEARIRILTNNSQTPILSSQEVVSCSQYAQGCEGGFPYLTAGKYAQDFGLV 317

Query: 882 Q 884
           +
Sbjct: 318 E 318
>sp|O97578|CATC_CANFA Dipeptidyl-peptidase I precursor (DPP-I) (DPPI) (Cathepsin C)
           (Cathepsin J) (Dipeptidyl transferase) [Contains:
           Dipeptidyl-peptidase I exclusion domain chain;
           Dipeptidyl-peptidase I heavy chain 1;
           Dipeptidyl-peptidase I heavy chain 2;
           Dipeptidyl-peptidase I heavy chain 3;
           Dipeptidyl-peptidase I heavy chain 4;
           Dipeptidyl-peptidase I light chain]
          Length = 435

 Score =  261 bits (668), Expect = 1e-69
 Identities = 138/299 (46%), Positives = 179/299 (59%), Gaps = 12/299 (4%)
 Frame = +3

Query: 24  DTPANCTYQDVIGKWQVF----TGNFNVSCSTSKLVATKTLTLIYP-NWAVDEFGNYGKW 188
           DTPANCT+ +++G W VF     G+ +V+CS       K +  +   + A D FGN G +
Sbjct: 1   DTPANCTHPELLGTW-VFQVGPAGSRSVNCSVMGPPEKKVVVHLEKLDTAYDNFGNTGHF 59

Query: 189 TLIYNQGFEVTITNKKYFGFFDYKKINSTYTISYCDRLQPSWFHDVLIRQWQCFKAQRTT 368
           T+IYNQGFE+ + + K+F FF YK+     T SYC+     W HDVL R W CF   +  
Sbjct: 60  TIIYNQGFEIVLNDYKWFAFFKYKEEGHKVT-SYCNETMTGWVHDVLGRNWACFTGTKMG 118

Query: 369 TLKEKNNV-------LPHSNIFALTSLRYGSQKRIVDKINLENNGWTAKDYPEFHEKTLY 527
           T  EK  V       L  +N   L    Y      V  IN     WTA  Y E+   TL 
Sbjct: 119 TTSEKAKVNTKHIERLQENNSNRLYKYNY----EFVKAINTIQKSWTATRYIEYETLTLR 174

Query: 528 EVINMAGGSRSKLERPKPAPITKSILDSVKLIPKSFDWRNVNGLNYVSPVRNQGGCGSCY 707
           +++   GG   K+ RPKP P+T  I + +  +P S+DWRNV G N+VSPVRNQ  CGSCY
Sbjct: 175 DMMTRVGGR--KIPRPKPTPLTAEIHEEISRLPTSWDWRNVRGTNFVSPVRNQASCGSCY 232

Query: 708 SFASAGMLEARYRIRSNNTVRPILSPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMAQ 884
           +FAS  MLEAR RI +NNT  PILSPQ++V CS Y+QGC+GGFPYLIAGK+A+DFG+ +
Sbjct: 233 AFASTAMLEARIRILTNNTQTPILSPQEIVSCSQYAQGCEGGFPYLIAGKYAQDFGLVE 291
>sp|Q26563|CATC_SCHMA Cathepsin C precursor
          Length = 454

 Score =  231 bits (588), Expect = 3e-60
 Identities = 129/301 (42%), Positives = 177/301 (58%), Gaps = 8/301 (2%)
 Frame = +3

Query: 6   VQLSVSDTPANCTYQDVIGKWQVFTGNFNVSCSTSKLVATKT--LTLIYPNWAVDEFGNY 179
           ++ + +DTPANCTY+D  G+W+   G++   C   KL + ++  ++L+YP+ A+DEFGN 
Sbjct: 15  LRFTCADTPANCTYEDAHGRWKFHIGDYQSKCP-EKLNSKQSVVISLLYPDIAIDEFGNR 73

Query: 180 GKWTLIYNQGFEVTITNKKYFGFFDYKKINSTYTISYCDRLQPSWFHDVLIRQWQCFKAQ 359
           G WTLIYNQGFEVTI ++K+   F YK  N  +    C +  P W HD LI        +
Sbjct: 74  GHWTLIYNQGFEVTINHRKWLVIFAYKS-NGEFN---CHKSMPMWTHDTLIDSGSVCSGK 129

Query: 360 RTTTLKEKNNVLPHSNIFALTSLRYGSQKRIVDKINLENNGWTAKDYPEFHEKTLYEVIN 539
                K   N L  S  F  T   Y      V KIN     W  + YPE  + T+ E+ N
Sbjct: 130 IGVHDKFHINKLFGSKSFGRTL--YHINPSFVGKINAHQKSWRGEIYPELSKYTIDELRN 187

Query: 540 MAGGSRSKLERP----KPAPITKSILDSVKLIPKSFDWRNV--NGLNYVSPVRNQGGCGS 701
            AGG +S + RP    +  P +K ++     +P  FDW +      + V+P+RNQG CGS
Sbjct: 188 RAGGVKSMVTRPSVLNRKTP-SKELISLTGNLPLEFDWTSPPDGSRSPVTPIRNQGICGS 246

Query: 702 CYSFASAGMLEARYRIRSNNTVRPILSPQDVVECSPYSQGCDGGFPYLIAGKFAEDFGMA 881
           CY+  SA  LEAR R+ SN + +PILSPQ VV+CSPYS+GC+GGFP+LIAGK+ EDFG+ 
Sbjct: 247 CYASPSAAALEARIRLVSNFSEQPILSPQTVVDCSPYSEGCNGGFPFLIAGKYGEDFGLP 306

Query: 882 Q 884
           Q
Sbjct: 307 Q 307
>sp|Q90686|CATK_CHICK Cathepsin K precursor (JTAP-1)
          Length = 334

 Score = 70.1 bits (170), Expect = 8e-12
 Identities = 40/108 (37%), Positives = 59/108 (54%)
 Frame = +3

Query: 513 EKTLYEVINMAGGSRSKLERPKPAPITKSILDSVKLIPKSFDWRNVNGLNYVSPVRNQGG 692
           + T  EV+    G R    RP+P   T  + D     P + DWR      YV+PV++QG 
Sbjct: 85  DMTSEEVVRTMTGLRVPRSRPRPNG-TLYVPDWSSRAPAAVDWRRKG---YVTPVKDQGQ 140

Query: 693 CGSCYSFASAGMLEARYRIRSNNTVRPILSPQDVVECSPYSQGCDGGF 836
           CGSC++F+S G LE + + R+   +   LSPQ++V C   + GC GG+
Sbjct: 141 CGSCWAFSSVGALEGQLKRRTGKLLS--LSPQNLVYCVSNNNGCGGGY 186
>sp|Q91BH1|CATV_NPVST Viral cathepsin (V-cath) (Cysteine proteinase) (CP)
          Length = 337

 Score = 68.9 bits (167), Expect = 2e-11
 Identities = 33/74 (44%), Positives = 49/74 (66%)
 Frame = +3

Query: 624 PKSFDWRNVNGLNYVSPVRNQGGCGSCYSFASAGMLEARYRIRSNNTVRPILSPQDVVEC 803
           P+SFDWR    LN V+ V+ QG CGSC++FA+ G +E++Y I  ++ +   LS Q +++C
Sbjct: 127 PESFDWRK---LNKVTKVKEQGVCGSCWAFAAIGNIESQYAIMHDSLID--LSEQQLLDC 181

Query: 804 SPYSQGCDGGFPYL 845
               QGCDGG  +L
Sbjct: 182 DRVDQGCDGGLMHL 195
>sp|P41715|CATV_NPVCF Viral cathepsin (V-cath) (Cysteine proteinase) (CP)
 sp|O41479|CATV_NPVCD Viral cathepsin (V-cath) (Cysteine proteinase) (CP)
          Length = 324

 Score = 66.2 bits (160), Expect = 1e-10
 Identities = 31/70 (44%), Positives = 45/70 (64%)
 Frame = +3

Query: 624 PKSFDWRNVNGLNYVSPVRNQGGCGSCYSFASAGMLEARYRIRSNNTVRPILSPQDVVEC 803
           P  FDWR    LN V+ V+NQG CG+C++FA+ G LE+++ I+ N  +   LS Q +++C
Sbjct: 114 PLEFDWRR---LNKVTSVKNQGMCGACWAFATLGSLESQFAIKHNQFIN--LSEQQLIDC 168

Query: 804 SPYSQGCDGG 833
                GCDGG
Sbjct: 169 DFVDAGCDGG 178
>sp|P55097|CATK_MOUSE Cathepsin K precursor
          Length = 329

 Score = 65.9 bits (159), Expect = 1e-10
 Identities = 31/72 (43%), Positives = 48/72 (66%)
 Frame = +3

Query: 621 IPKSFDWRNVNGLNYVSPVRNQGGCGSCYSFASAGMLEARYRIRSNNTVRPILSPQDVVE 800
           +P S D+R      YV+PV+NQG CGSC++F+SAG LE + + ++   +   LSPQ++V+
Sbjct: 115 VPDSIDYRKKG---YVTPVKNQGQCGSCWAFSSAGALEGQLKKKTGKLL--ALSPQNLVD 169

Query: 801 CSPYSQGCDGGF 836
           C   + GC GG+
Sbjct: 170 CVTENYGCGGGY 181
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 108,667,983
Number of Sequences: 369166
Number of extensions: 2277402
Number of successful extensions: 5849
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 5423
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5688
length of database: 68,354,980
effective HSP length: 110
effective length of database: 48,034,130
effective search space used: 8838279920
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
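
The effective-length figures in the trailer above can be checked by hand. The sketch below assumes the usual BLAST edge correction: the effective HSP length is subtracted once from the query and once per database sequence, and for BLASTX the query length is counted in translated residues (884 nt / 3). The function name and the choice of 184,735 as the sequence count are assumptions made for illustration; all numeric inputs are copied from the report.

```python
# Values copied from the report trailer above.
QUERY_NT = 884            # "(884 letters)"
DB_LETTERS = 68_354_980   # "length of database"
DB_SEQS = 184_735         # "Number of sequences in database" (first database entry)
HSP_LEN = 110             # "effective HSP length"


def effective_lengths(query_nt, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Assumption: the query is measured in translated residues (nt // 3),
    as is conventional for a BLASTX search against a protein database.
    """
    query_aa = query_nt // 3
    eff_query = query_aa - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


eff_query, eff_db, search_space = effective_lengths(
    QUERY_NT, DB_LETTERS, DB_SEQS, HSP_LEN
)
print(eff_query)      # 184
print(eff_db)         # 48034130, matches "effective length of database"
print(search_space)   # 8838279920, matches "effective search space used"
```

Under these assumptions both derived values agree with the trailer, which suggests the reported search space is based on the 184,735-sequence count rather than the 369166 listed under "Number of Sequences".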