Planarian EST Database


Dr_sW_009_B08

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_009_B08
         (758 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P40427|EXD_DROME  Homeobox protein extradenticle               204   3e-52
sp|P40426|PBX3_HUMAN  Pre-B-cell leukemia transcription fact...   200   4e-51
sp|O35317|PBX3_MOUSE  Pre-B-cell leukemia transcription fact...   200   4e-51
sp|P40425|PBX2_HUMAN  Pre-B-cell leukemia transcription fact...   197   2e-50
sp|O35984|PBX2_MOUSE  Pre-B-cell leukemia transcription fact...   197   2e-50
sp|P41778|PBX1_MOUSE  Pre-B-cell leukemia transcription fact...   194   2e-49
sp|Q99NE9|PBX4_MOUSE  Pre-B-cell leukemia transcription fact...   176   6e-44
sp|Q9BYU1|PBX4_HUMAN  Pre-B-cell leukemia transcription fact...   161   2e-39
sp|P41779|HM20_CAEEL  Homeobox protein ceh-20                     134   2e-31
sp|Q19503|HM40_CAEEL  Homeobox protein ceh-40                      50   5e-06
>sp|P40427|EXD_DROME Homeobox protein extradenticle
          Length = 376

 Score =  204 bits (518), Expect = 3e-52
 Identities = 111/180 (61%), Positives = 129/180 (71%), Gaps = 8/180 (4%)
 Frame = +2

Query: 239 GRELQNILSVAHQSLDEAQERKHSLNNHRLKPALYSVFCEIKEKTSLSLRNTAQINSEDD 418
           G  LQ I+S++ QSLDEAQ RKH+LN HR+KPAL+SV CEIKEKT LS+RNT       +
Sbjct: 44  GEILQQIMSISEQSLDEAQARKHTLNCHRMKPALFSVLCEIKEKTVLSIRNT------QE 97

Query: 419 SNSPDPQLLRLDKMLIAEGVTGNNSSNIGDIDSE-----YGGNQS---ESNQIEHADYRA 574
              PDPQL+RLD MLIAEGV G      G   +       GG+ S     N IEH+DYRA
Sbjct: 98  EEPPDPQLMRLDNMLIAEGVAGPEKGGGGAAAASAAAASQGGSLSIDGADNAIEHSDYRA 157

Query: 575 KLAQIRQIYHSELEKYKNACNEFTGHVINLLREQSRSRPISPAEIELMVGIIKKKFRAIE 754
           KLAQIRQIYH ELEKY+ ACNEFT HV+NLLREQSR+RPI+P EIE MV II KKF +I+
Sbjct: 158 KLAQIRQIYHQELEKYEQACNEFTTHVMNLLREQSRTRPITPKEIERMVQIIHKKFSSIQ 217
>sp|P40426|PBX3_HUMAN Pre-B-cell leukemia transcription factor 3 (Homeobox protein PBX3)
          Length = 434

 Score =  200 bits (508), Expect = 4e-51
 Identities = 107/183 (58%), Positives = 131/183 (71%), Gaps = 4/183 (2%)
 Frame = +2

Query: 218 DQKIQCTGRELQNILSVAHQSLDEAQERKHSLNNHRLKPALYSVFCEIKEKTSLSLRNTA 397
           D + Q  G  L  I+++  QSLDEAQ +KH+LN HR+KPAL+SV CEIKEKT LS+R   
Sbjct: 40  DGRKQDIGDILHQIMTITDQSLDEAQAKKHALNCHRMKPALFSVLCEIKEKTGLSIRGA- 98

Query: 398 QINSEDDSNSPDPQLLRLDKMLIAEGVTG----NNSSNIGDIDSEYGGNQSESNQIEHAD 565
                 + + PDPQL+RLD ML+AEGV+G      S+      +  GG  S  N IEH+D
Sbjct: 99  -----QEEDPPDPQLMRLDNMLLAEGVSGPEKGGGSAAAAAAAAASGG--SSDNSIEHSD 151

Query: 566 YRAKLAQIRQIYHSELEKYKNACNEFTGHVINLLREQSRSRPISPAEIELMVGIIKKKFR 745
           YRAKL QIRQIYH+ELEKY+ ACNEFT HV+NLLREQSR+RPISP EIE MVGII +KF 
Sbjct: 152 YRAKLTQIRQIYHTELEKYEQACNEFTTHVMNLLREQSRTRPISPKEIERMVGIIHRKFS 211

Query: 746 AIE 754
           +I+
Sbjct: 212 SIQ 214
>sp|O35317|PBX3_MOUSE Pre-B-cell leukemia transcription factor 3 (Homeobox protein PBX3)
          Length = 434

 Score =  200 bits (508), Expect = 4e-51
 Identities = 107/183 (58%), Positives = 131/183 (71%), Gaps = 4/183 (2%)
 Frame = +2

Query: 218 DQKIQCTGRELQNILSVAHQSLDEAQERKHSLNNHRLKPALYSVFCEIKEKTSLSLRNTA 397
           D + Q  G  L  I+++  QSLDEAQ +KH+LN HR+KPAL+SV CEIKEKT LS+R   
Sbjct: 40  DGRKQDIGDILHQIMTITDQSLDEAQAKKHALNCHRMKPALFSVLCEIKEKTGLSIRGA- 98

Query: 398 QINSEDDSNSPDPQLLRLDKMLIAEGVTG----NNSSNIGDIDSEYGGNQSESNQIEHAD 565
                 + + PDPQL+RLD ML+AEGV+G      S+      +  GG  S  N IEH+D
Sbjct: 99  -----QEEDPPDPQLMRLDNMLLAEGVSGPEKGGGSAAAAAAAAASGG--SSDNSIEHSD 151

Query: 566 YRAKLAQIRQIYHSELEKYKNACNEFTGHVINLLREQSRSRPISPAEIELMVGIIKKKFR 745
           YRAKL QIRQIYH+ELEKY+ ACNEFT HV+NLLREQSR+RPISP EIE MVGII +KF 
Sbjct: 152 YRAKLTQIRQIYHTELEKYEQACNEFTTHVMNLLREQSRTRPISPKEIERMVGIIHRKFS 211

Query: 746 AIE 754
           +I+
Sbjct: 212 SIQ 214
>sp|P40425|PBX2_HUMAN Pre-B-cell leukemia transcription factor 2 (Homeobox protein PBX2)
           (G17 protein)
          Length = 430

 Score =  197 bits (502), Expect = 2e-50
 Identities = 104/179 (58%), Positives = 127/179 (70%), Gaps = 4/179 (2%)
 Frame = +2

Query: 230 QCTGRELQNILSVAHQSLDEAQERKHSLNNHRLKPALYSVFCEIKEKTSLSLRNTAQINS 409
           Q  G  LQ I+++  QSLDEAQ +KH+LN HR+KPAL+SV CEIKEKT LS+R      S
Sbjct: 51  QDIGDILQQIMTITDQSLDEAQAKKHALNCHRMKPALFSVLCEIKEKTGLSIR------S 104

Query: 410 EDDSNSPDPQLLRLDKMLIAEGVTG----NNSSNIGDIDSEYGGNQSESNQIEHADYRAK 577
             +    DPQL+RLD ML+AEGV G      S+      +  GG  S  N IEH+DYR+K
Sbjct: 105 SQEEEPVDPQLMRLDNMLLAEGVAGPEKGGGSAAAAAAAAASGGGVSPDNSIEHSDYRSK 164

Query: 578 LAQIRQIYHSELEKYKNACNEFTGHVINLLREQSRSRPISPAEIELMVGIIKKKFRAIE 754
           LAQIR IYHSELEKY+ ACNEFT HV+NLLREQSR+RP++P E+E MV II +KF AI+
Sbjct: 165 LAQIRHIYHSELEKYEQACNEFTTHVMNLLREQSRTRPVAPKEMERMVSIIHRKFSAIQ 223
>sp|O35984|PBX2_MOUSE Pre-B-cell leukemia transcription factor 2 (Homeobox protein PBX2)
          Length = 430

 Score =  197 bits (502), Expect = 2e-50
 Identities = 104/179 (58%), Positives = 127/179 (70%), Gaps = 4/179 (2%)
 Frame = +2

Query: 230 QCTGRELQNILSVAHQSLDEAQERKHSLNNHRLKPALYSVFCEIKEKTSLSLRNTAQINS 409
           Q  G  LQ I+++  QSLDEAQ +KH+LN HR+KPAL+SV CEIKEKT LS+R      S
Sbjct: 51  QDIGDILQQIMTITDQSLDEAQAKKHALNCHRMKPALFSVLCEIKEKTGLSIR------S 104

Query: 410 EDDSNSPDPQLLRLDKMLIAEGVTG----NNSSNIGDIDSEYGGNQSESNQIEHADYRAK 577
             +    DPQL+RLD ML+AEGV G      S+      +  GG  S  N IEH+DYR+K
Sbjct: 105 SQEEEPVDPQLMRLDNMLLAEGVAGPEKGGGSAAAAAAAAASGGGVSPDNSIEHSDYRSK 164

Query: 578 LAQIRQIYHSELEKYKNACNEFTGHVINLLREQSRSRPISPAEIELMVGIIKKKFRAIE 754
           LAQIR IYHSELEKY+ ACNEFT HV+NLLREQSR+RP++P E+E MV II +KF AI+
Sbjct: 165 LAQIRHIYHSELEKYEQACNEFTTHVMNLLREQSRTRPVAPKEMERMVSIIHRKFSAIQ 223
>sp|P41778|PBX1_MOUSE Pre-B-cell leukemia transcription factor 1 (Homeobox protein PBX1)
 sp|P40424|PBX1_HUMAN Pre-B-cell leukemia transcription factor 1 (Homeobox protein PBX1)
           (Homeobox protein PRL)
          Length = 430

 Score =  194 bits (494), Expect = 2e-49
 Identities = 102/178 (57%), Positives = 124/178 (69%), Gaps = 3/178 (1%)
 Frame = +2

Query: 230 QCTGRELQNILSVAHQSLDEAQERKHSLNNHRLKPALYSVFCEIKEKTSLSLRNTAQINS 409
           Q  G  LQ I+++  QSLDEAQ RKH+LN HR+KPAL++V CEIKEKT LS+R       
Sbjct: 41  QDIGDILQQIMTITDQSLDEAQARKHALNCHRMKPALFNVLCEIKEKTVLSIRGA----- 95

Query: 410 EDDSNSPDPQLLRLDKMLIAEGVTG---NNSSNIGDIDSEYGGNQSESNQIEHADYRAKL 580
             +    DPQL+RLD ML+AEGV G      S      +   G     N +EH+DYRAKL
Sbjct: 96  -QEEEPTDPQLMRLDNMLLAEGVAGPEKGGGSAAAAAAAAASGGAGSDNSVEHSDYRAKL 154

Query: 581 AQIRQIYHSELEKYKNACNEFTGHVINLLREQSRSRPISPAEIELMVGIIKKKFRAIE 754
           +QIRQIYH+ELEKY+ ACNEFT HV+NLLREQSR+RPISP EIE MV II +KF +I+
Sbjct: 155 SQIRQIYHTELEKYEQACNEFTTHVMNLLREQSRTRPISPKEIERMVSIIHRKFSSIQ 212
>sp|Q99NE9|PBX4_MOUSE Pre-B-cell leukemia transcription factor 4 (Homeobox protein PBX4)
          Length = 378

 Score =  176 bits (446), Expect = 6e-44
 Identities = 99/175 (56%), Positives = 117/175 (66%), Gaps = 2/175 (1%)
 Frame = +2

Query: 236 TGRELQNILSVAHQSLDEAQERKHSLNNHRLKPALYSVFCEIKEKTSLSLRNTAQINSED 415
           T   LQ I+++  QSLDEAQ RKH+LN HR+K AL+SV CEIK KT++S+    Q   ED
Sbjct: 27  TSDVLQQIMAITDQSLDEAQARKHALNCHRMKSALFSVLCEIKGKTAVSI----QFQEED 82

Query: 416 DSNSPDPQLLRLDKMLIAEGVTGNNSSNIGDIDSEYG--GNQSESNQIEHADYRAKLAQI 589
               PD QLLRLD ML+AEGV+       G         G     N IEH+DYRAKL+QI
Sbjct: 83  P---PDAQLLRLDNMLLAEGVSRPEKRGRGAAAGSTATPGGCPNDNSIEHSDYRAKLSQI 139

Query: 590 RQIYHSELEKYKNACNEFTGHVINLLREQSRSRPISPAEIELMVGIIKKKFRAIE 754
           RQIYHSELEKY+ AC EFT HV NLLREQSR RP+S  E+E MV  I+ KF AI+
Sbjct: 140 RQIYHSELEKYEQACREFTTHVTNLLREQSRVRPVSCREMEHMVNTIQSKFSAIQ 194
>sp|Q9BYU1|PBX4_HUMAN Pre-B-cell leukemia transcription factor 4 (Homeobox protein PBX4)
          Length = 330

 Score =  161 bits (407), Expect = 2e-39
 Identities = 88/151 (58%), Positives = 101/151 (66%), Gaps = 4/151 (2%)
 Frame = +2

Query: 314 NNHRLKPALYSVFCEIKEKTSLSLRNTAQINSEDDSNSPDPQLLRLDKMLIAEGVTGNNS 493
           N HR+KPAL+SV CEIKEKT +S+R         D + PD QLLRLD ML+AEGV     
Sbjct: 1   NCHRMKPALFSVLCEIKEKTVVSIRGI------QDEDPPDAQLLRLDNMLLAEGVCRPEK 54

Query: 494 SNIGDIDSEYG----GNQSESNQIEHADYRAKLAQIRQIYHSELEKYKNACNEFTGHVIN 661
              G   +  G    G     N IEH+DYRAKL+QIRQIYHSELEKY+ AC EFT HV N
Sbjct: 55  RGRGGAVARAGTATPGGCPNDNSIEHSDYRAKLSQIRQIYHSELEKYEQACREFTTHVTN 114

Query: 662 LLREQSRSRPISPAEIELMVGIIKKKFRAIE 754
           LL+EQSR RP+SP EIE MVG I  KF AI+
Sbjct: 115 LLQEQSRMRPVSPKEIERMVGAIHGKFSAIQ 145
>sp|P41779|HM20_CAEEL Homeobox protein ceh-20
          Length = 338

 Score =  134 bits (338), Expect = 2e-31
 Identities = 77/170 (45%), Positives = 104/170 (61%), Gaps = 1/170 (0%)
 Frame = +2

Query: 248 LQNILSVAHQSLDEAQE-RKHSLNNHRLKPALYSVFCEIKEKTSLSLRNTAQINSEDDSN 424
           L  +L +  Q+LD+    +K  L  H ++ AL+ V CE KEKT L++RN        D  
Sbjct: 13  LDAVLKINEQTLDDNDSAKKQELQCHPMRQALFDVLCETKEKTVLTVRNQV------DET 66

Query: 425 SPDPQLLRLDKMLIAEGVTGNNSSNIGDIDSEYGGNQSESNQIEHADYRAKLAQIRQIYH 604
             DPQL+RLD ML+AEGV G +    G + S+  G        + ADYR KL QIR +Y+
Sbjct: 67  PEDPQLMRLDNMLVAEGVAGPDKG--GSLGSDASGG-------DQADYRQKLHQIRVLYN 117

Query: 605 SELEKYKNACNEFTGHVINLLREQSRSRPISPAEIELMVGIIKKKFRAIE 754
            EL KY+ ACNEFT HV +LL++QS+ RPI+  EIE MV II++KF  I+
Sbjct: 118 EELRKYEEACNEFTQHVRSLLKDQSQVRPIAHKEIERMVYIIQRKFNGIQ 167
>sp|Q19503|HM40_CAEEL Homeobox protein ceh-40
          Length = 329

 Score = 50.4 bits (119), Expect = 5e-06
 Identities = 40/170 (23%), Positives = 76/170 (44%), Gaps = 2/170 (1%)
 Frame = +2

Query: 248 LQNILSVAHQSLDE--AQERKHSLNNHRLKPALYSVFCEIKEKTSLSLRNTAQINSEDDS 421
           L  ++ +   ++D     + K  +  +    A+  V  E K K  LS +    + ++++ 
Sbjct: 12  LSEVVKITDMTMDNEAVNKLKPQIKINPFYRAVQDVLVEQKSKIDLSTKMMKDLEAQEND 71

Query: 422 NSPDPQLLRLDKMLIAEGVTGNNSSNIGDIDSEYGGNQSESNQIEHADYRAKLAQIRQIY 601
                   RLD ML AEGV G + S +  I    G +Q E        YR +L ++R+  
Sbjct: 72  E-------RLDTMLKAEGVAGPDDSLL-RIQEAAGTDQYE--------YRQQLLKVRREL 115

Query: 602 HSELEKYKNACNEFTGHVINLLREQSRSRPISPAEIELMVGIIKKKFRAI 751
            +E + +   C ++  +V ++L++Q   RPI+    E  +  +  KF  +
Sbjct: 116 ENETKAFDKHCKKWCEYVEDVLQQQGEFRPITQQSTEKFMNKMSGKFNKV 165
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 70,604,228
Number of Sequences: 369166
Number of extensions: 1249173
Number of successful extensions: 3403
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3294
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3379
length of database: 68,354,980
effective HSP length: 108
effective length of database: 48,403,600
effective search space used: 6970118400
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)