Planaria EST Database


DrC_00457

BLASTX 2.2.13 [Nov-27-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= DrC_00457
         (725 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q9CQ88|SAS_MOUSE  Sarcoma amplified sequence homolog           123   6e-28
sp|Q7ZUB3|SAS_BRARE  Sarcoma amplified sequence homolog           121   2e-27
sp|Q7ZWW7|SAS2_XENLA  Sarcoma amplified sequence homolog 2        119   8e-27
sp|Q5U1V9|SAS_RAT  Sarcoma amplified sequence homolog             119   8e-27
sp|Q5FVL6|TSN13_RAT  Tetraspanin-13 (Tspan-13) (Transmembran...   118   2e-26
sp|Q5RAP8|SAS_PONPY  Sarcoma amplified sequence homolog           117   4e-26
sp|O95857|TSN13_HUMAN  Tetraspanin-13 (Tspan-13) (Transmembr...   115   9e-26
sp|Q9D8C2|TSN13_MOUSE  Tetraspanin-13 (Tspan-13) (Transmembr...   115   9e-26
sp|Q12999|SAS_HUMAN  Sarcoma amplified sequence (Tetraspanin...   114   2e-25
sp|Q5XHG6|SAS1_XENLA  Sarcoma amplified sequence homolog 1        113   5e-25
>sp|Q9CQ88|SAS_MOUSE Sarcoma amplified sequence homolog
          Length = 210

 Score =  123 bits (308), Expect = 6e-28
 Identities = 72/219 (32%), Positives = 112/219 (51%), Gaps = 1/219 (0%)
 Frame = +1

Query: 1   SSLFALNIIYILVSFLLILFGAYARLSAIVTXXXXXXXXXXXXVFLAIVASLGIFATSKH 180
           ++L ALN++Y+LV FLLI   A+ +   +V+            VFL ++A  G+   + H
Sbjct: 11  NALCALNVVYMLVGFLLIGVAAWGKGLGVVSSIHIIGGVIAVGVFLLLIAVAGLVGAANH 70

Query: 181 NQVAXXXXXXXXXXXXXVQFSIACVCLAVNDKIEENLVRQGWKN-SDHAQIIVMKTFNCC 357
           +QV               QF I+C CLA+N   + +++   W   S+  +  + ++F+CC
Sbjct: 71  HQVLLFFYMIILGLVFIFQFGISCSCLAINRNTQADVINASWSVLSNSTRHELERSFDCC 130

Query: 358 GLMESDLDNPSELKCKLANLTCCKGFPANETRCCTGVISRYEDCPCQQCWKMLQYSLNKA 537
           GL                NLT  +    ++T C    + + +   CQ C +      +KA
Sbjct: 131 GLF---------------NLTTLR--LQDDTSC--SAVCKTKSSTCQMCGERFLKHSDKA 171

Query: 538 VKVTGAVGLVFSFTEIIGVWLAIRFRNMRDPIADPKLFL 654
           +K+ G VGL FSFTEI+GVWLA+RFRN +DP A+P  FL
Sbjct: 172 LKILGGVGLFFSFTEILGVWLAMRFRNQKDPRANPSAFL 210
>sp|Q7ZUB3|SAS_BRARE Sarcoma amplified sequence homolog
          Length = 212

 Score =  121 bits (303), Expect = 2e-27
 Identities = 73/219 (33%), Positives = 116/219 (52%), Gaps = 1/219 (0%)
 Frame = +1

Query: 1   SSLFALNIIYILVSFLLILFGAYARLSAIVTXXXXXXXXXXXXVFLAIVASLGIFATSKH 180
           ++L +LN++Y+LV  LLI+  A+ +   IV+             FL ++A +G+     H
Sbjct: 11  NALCSLNVVYMLVGLLLIVVAAWGKGFGIVSSIHIIGGVIAVGFFLQLIAIVGLIGAVHH 70

Query: 181 NQVAXXXXXXXXXXXXXVQFSIACVCLAVNDKIEENLVRQGWK-NSDHAQIIVMKTFNCC 357
           +QV               QF ++C CLA+N   +E L+   WK  S+  +I + K  +CC
Sbjct: 71  HQVMLFFYMVILFVVFLFQFGVSCSCLAMNQGQQEKLLESSWKIMSNDTRISLEKKLDCC 130

Query: 358 GLMESDLDNPSELKCKLANLTCCKGFPANETRCCTGVISRYEDCPCQQCWKMLQYSLNKA 537
           GL      N + L+  + +          +   CT   ++ ++C      KMLQ+S ++A
Sbjct: 131 GLF-----NSTNLQADIMS----------DLHLCTSPCTQKKECVTCGL-KMLQHS-SEA 173

Query: 538 VKVTGAVGLVFSFTEIIGVWLAIRFRNMRDPIADPKLFL 654
           +K+ G VGL FSFTEI+GVWLA+R+RN +DP A+P  FL
Sbjct: 174 LKILGGVGLFFSFTEILGVWLAMRYRNQKDPRANPSAFL 212
>sp|Q7ZWW7|SAS2_XENLA Sarcoma amplified sequence homolog 2
          Length = 212

 Score =  119 bits (298), Expect = 8e-27
 Identities = 72/219 (32%), Positives = 109/219 (49%), Gaps = 1/219 (0%)
 Frame = +1

Query: 1   SSLFALNIIYILVSFLLILFGAYARLSAIVTXXXXXXXXXXXXVFLAIVASLGIFATSKH 180
           ++L ALN++Y+LV  LLI+  A+ +   IV+            VFL ++A +G+     H
Sbjct: 11  NALCALNVVYMLVGVLLIIVAAWGKGFGIVSSIHIIGGVIAIGVFLLLIAIIGLIGAVSH 70

Query: 181 NQVAXXXXXXXXXXXXXVQFSIACVCLAVNDKIEENLVRQGWKN-SDHAQIIVMKTFNCC 357
           +QV               QF ++C CLA+N   +E L+   W   S+  ++ + KT +CC
Sbjct: 71  HQVMLFIYMVVLILVFIFQFIVSCSCLAMNRSQQEYLLNTTWNRMSNETRLNLEKTLDCC 130

Query: 358 GLMESDLDNPSELKCKLANLTCCKGFPANETRCCTGVISRYEDCPCQQCWKMLQYSLNKA 537
           G + +  +   E K  +A               C  V S    CP   C   +    ++A
Sbjct: 131 GFLNTT-EGRDEFKQDVA--------------LCIQVCSDPHKCP--SCGDKMLNHADEA 173

Query: 538 VKVTGAVGLVFSFTEIIGVWLAIRFRNMRDPIADPKLFL 654
           +K+ G VGL FSFTEI+GVWLA R+RN +DP A+P  FL
Sbjct: 174 LKILGGVGLFFSFTEILGVWLAFRYRNQKDPRANPSAFL 212
>sp|Q5U1V9|SAS_RAT Sarcoma amplified sequence homolog
          Length = 210

 Score =  119 bits (298), Expect = 8e-27
 Identities = 72/221 (32%), Positives = 109/221 (49%), Gaps = 3/221 (1%)
 Frame = +1

Query: 1   SSLFALNIIYILVSFLLILFGAYARLSAIVTXXXXXXXXXXXXVFLAIVASLGIFATSKH 180
           ++L ALN++Y+LV  LLI   A+ +   +V+            VFL ++A  G+     H
Sbjct: 11  NALCALNVVYMLVGLLLIGVAAWGKGLGVVSSIHIIGGVIAVGVFLLLIAVAGLVGAVNH 70

Query: 181 NQVAXXXXXXXXXXXXXVQFSIACVCLAVNDKIEENLVRQGW---KNSDHAQIIVMKTFN 351
           +QV               QF I+C CLA+N   + +++   W    NS   ++   ++F+
Sbjct: 71  HQVLLFFYMIILGLVFIFQFGISCSCLAINRNTQADVINASWWVLSNSTRHEL--ERSFD 128

Query: 352 CCGLMESDLDNPSELKCKLANLTCCKGFPANETRCCTGVISRYEDCPCQQCWKMLQYSLN 531
           CCGL                NLT       ++   C+  + +    PCQ C +      +
Sbjct: 129 CCGLF---------------NLTTVH---LHDDASCSA-LCKTRSSPCQMCGETFLKHSD 169

Query: 532 KAVKVTGAVGLVFSFTEIIGVWLAIRFRNMRDPIADPKLFL 654
           KA+K+ G VGL FSFTEI+GVWLA+RFRN +DP A+P  FL
Sbjct: 170 KALKILGGVGLFFSFTEILGVWLAMRFRNQKDPRANPSAFL 210
>sp|Q5FVL6|TSN13_RAT Tetraspanin-13 (Tspan-13) (Transmembrane 4 superfamily member 13)
          Length = 204

 Score =  118 bits (295), Expect = 2e-26
 Identities = 66/216 (30%), Positives = 101/216 (46%)
 Frame = +1

Query: 7   LFALNIIYILVSFLLILFGAYARLSAIVTXXXXXXXXXXXXVFLAIVASLGIFATSKHNQ 186
           L ALN++Y LVS LLI   A+     +++            +FL ++A +G+    KH+Q
Sbjct: 13  LCALNLLYTLVSLLLIGIAAWGIGFGLISSLRVVGVVIAVGIFLFLIALVGLIGAVKHHQ 72

Query: 187 VAXXXXXXXXXXXXXVQFSIACVCLAVNDKIEENLVRQGWKNSDHAQIIVMKTFNCCGLM 366
           V              VQFS++C CLA+N + +  L+  GW N+  A+  + +  NCCG  
Sbjct: 73  VLLFFYMIILLLVFIVQFSVSCACLALNREQQGQLLEVGWNNTASARNDIQRNLNCCGFR 132

Query: 367 ESDLDNPSELKCKLANLTCCKGFPANETRCCTGVISRYEDCPCQQCWKMLQYSLNKAVKV 546
             + ++     C  +N  C           C  +I  Y                 + ++ 
Sbjct: 133 NYNPNDTCPASCAKSNQKCSS---------CAPIIGEY---------------AGEVLRF 168

Query: 547 TGAVGLVFSFTEIIGVWLAIRFRNMRDPIADPKLFL 654
            G +GL FSFTEI+GVWL  R+RN +DP A+P  FL
Sbjct: 169 VGGIGLFFSFTEILGVWLTYRYRNQKDPRANPSAFL 204
>sp|Q5RAP8|SAS_PONPY Sarcoma amplified sequence homolog
          Length = 210

 Score =  117 bits (292), Expect = 4e-26
 Identities = 76/220 (34%), Positives = 114/220 (51%), Gaps = 2/220 (0%)
 Frame = +1

Query: 1   SSLFALNIIYILVSFLLILFGAYARLSAIVTXXXXXXXXXXXXVFLAIVASLGIFATSKH 180
           ++L ALN++Y+LVS LLI   A+ +   +V+            VFL ++A  G+     H
Sbjct: 11  NALCALNVVYMLVSLLLIGVAAWGKGLGLVSSIHIIGGVIAVGVFLLLIAVAGLVGAVNH 70

Query: 181 NQVAXXXXXXXXXXXXXVQFSIACVCLAVNDKIEENLVRQGW-KNSDHAQIIVMKTFNCC 357
           +QV               QF I+C CLA+N   + +++   W   S+  +  + ++F+CC
Sbjct: 71  HQVLLFFYMIILGLVFIFQFVISCSCLAINRSKQADVINASWWVMSNKTRDELERSFDCC 130

Query: 358 GLMESDLDNPSELKCKLANLTCCKGFPANETRCCTGVISRYEDCPCQQCW-KMLQYSLNK 534
           GL                NLT        +   CT  I + +   CQ C  K L+YS ++
Sbjct: 131 GLF---------------NLTTLY---QQDYDFCTA-ICKSQSPTCQMCGEKFLKYS-DE 170

Query: 535 AVKVTGAVGLVFSFTEIIGVWLAIRFRNMRDPIADPKLFL 654
           A+K+ G VGL FSFTEI+GVWLA+RFRN++DP A+P  FL
Sbjct: 171 ALKILGGVGLFFSFTEILGVWLAMRFRNLKDPRANPSAFL 210
>sp|O95857|TSN13_HUMAN Tetraspanin-13 (Tspan-13) (Transmembrane 4 superfamily member 13)
           (Tetraspan NET-6)
          Length = 204

 Score =  115 bits (289), Expect = 9e-26
 Identities = 65/216 (30%), Positives = 102/216 (47%)
 Frame = +1

Query: 7   LFALNIIYILVSFLLILFGAYARLSAIVTXXXXXXXXXXXXVFLAIVASLGIFATSKHNQ 186
           L ALN++Y LVS LLI   A+     +++            +FL ++A +G+    KH+Q
Sbjct: 13  LCALNLLYTLVSLLLIGIAAWGIGFGLISSLRVVGVVIAVGIFLFLIALVGLIGAVKHHQ 72

Query: 187 VAXXXXXXXXXXXXXVQFSIACVCLAVNDKIEENLVRQGWKNSDHAQIIVMKTFNCCGLM 366
           V              VQFS++C CLA+N + +  L+  GW N+  A+  + +  NCCG  
Sbjct: 73  VLLFFYMIILLLVFIVQFSVSCACLALNQEQQGQLLEVGWNNTASARNDIQRNLNCCGFR 132

Query: 367 ESDLDNPSELKCKLANLTCCKGFPANETRCCTGVISRYEDCPCQQCWKMLQYSLNKAVKV 546
             + ++     C  ++ +C           C  +I  Y                 + ++ 
Sbjct: 133 SVNPNDTCLASCVKSDHSCSP---------CAPIIGEY---------------AGEVLRF 168

Query: 547 TGAVGLVFSFTEIIGVWLAIRFRNMRDPIADPKLFL 654
            G +GL FSFTEI+GVWL  R+RN +DP A+P  FL
Sbjct: 169 VGGIGLFFSFTEILGVWLTYRYRNQKDPRANPSAFL 204
>sp|Q9D8C2|TSN13_MOUSE Tetraspanin-13 (Tspan-13) (Transmembrane 4 superfamily member 13)
          Length = 204

 Score =  115 bits (289), Expect = 9e-26
 Identities = 65/216 (30%), Positives = 100/216 (46%)
 Frame = +1

Query: 7   LFALNIIYILVSFLLILFGAYARLSAIVTXXXXXXXXXXXXVFLAIVASLGIFATSKHNQ 186
           L ALN++Y LVS LLI   A+     +++            +FL ++A +G+    KH+Q
Sbjct: 13  LCALNLLYTLVSLLLIGIAAWGIGFGLISSLRVVGVVIAVGIFLFLIALVGLIGAVKHHQ 72

Query: 187 VAXXXXXXXXXXXXXVQFSIACVCLAVNDKIEENLVRQGWKNSDHAQIIVMKTFNCCGLM 366
           V              VQFS++C CLA+N + +  L+  GW N+  A+  + +  NCCG  
Sbjct: 73  VLLFFYMIILLLVFIVQFSVSCACLALNREQQGQLLEVGWNNTASARNDIQRNLNCCGFR 132

Query: 367 ESDLDNPSELKCKLANLTCCKGFPANETRCCTGVISRYEDCPCQQCWKMLQYSLNKAVKV 546
             + ++     C  +   C           C  +I  Y                 + ++ 
Sbjct: 133 SYNPNDTCPASCAKSTQKCSS---------CAPIIGEY---------------AGEVLRF 168

Query: 547 TGAVGLVFSFTEIIGVWLAIRFRNMRDPIADPKLFL 654
            G +GL FSFTEI+GVWL  R+RN +DP A+P  FL
Sbjct: 169 VGGIGLFFSFTEILGVWLTYRYRNQKDPRANPSAFL 204
>sp|Q12999|SAS_HUMAN Sarcoma amplified sequence (Tetraspanin-31) (Tspan-31)
          Length = 210

 Score =  114 bits (286), Expect = 2e-25
 Identities = 72/219 (32%), Positives = 109/219 (49%), Gaps = 1/219 (0%)
 Frame = +1

Query: 1   SSLFALNIIYILVSFLLILFGAYARLSAIVTXXXXXXXXXXXXVFLAIVASLGIFATSKH 180
           ++L ALN++Y+LVS LLI   A+ +   +V+            VFL ++A  G+     H
Sbjct: 11  NALCALNVVYMLVSLLLIGVAAWGKGLGLVSSIHIIGGVIAVGVFLLLIAVAGLVGAVNH 70

Query: 181 NQVAXXXXXXXXXXXXXVQFSIACVCLAVNDKIEENLVRQGW-KNSDHAQIIVMKTFNCC 357
           +QV               QF I+C CLA+N   + +++   W   S+  +  + ++F+CC
Sbjct: 71  HQVLLFFYMIILGLVFIFQFVISCSCLAINRSKQTDVINASWWVMSNKTRDELERSFDCC 130

Query: 358 GLMESDLDNPSELKCKLANLTCCKGFPANETRCCTGVISRYEDCPCQQCWKMLQYSLNKA 537
           GL                NLT        +   CT  I + +   CQ C +      ++A
Sbjct: 131 GLF---------------NLTTLY---QQDYDFCTA-ICKSQSPTCQMCGEKFLKHSDEA 171

Query: 538 VKVTGAVGLVFSFTEIIGVWLAIRFRNMRDPIADPKLFL 654
           +K+ G VGL FSFTEI+GVWLA+RFRN +DP A+P  FL
Sbjct: 172 LKILGGVGLFFSFTEILGVWLAMRFRNQKDPRANPSAFL 210
>sp|Q5XHG6|SAS1_XENLA Sarcoma amplified sequence homolog 1
          Length = 212

 Score =  113 bits (283), Expect = 5e-25
 Identities = 70/223 (31%), Positives = 106/223 (47%), Gaps = 5/223 (2%)
 Frame = +1

Query: 1   SSLFALNIIYILVSFLLILFGAYARLSAIVTXXXXXXXXXXXXVFLAIVASLGIFATSKH 180
           ++L ALN++Y+LV  LLI   A+ +   IV+            VFL ++A +G+     H
Sbjct: 11  NALCALNVVYMLVGLLLIGVAAWGKGFGIVSSIHIIGGVIAIGVFLLLIAIIGLIGAVSH 70

Query: 181 NQVAXXXXXXXXXXXXXVQFSIACVCLAVNDKIEENLVRQGWKN-SDHAQIIVMKTFNCC 357
           +QV               QF ++C CLA+N   +E  +   W+  S+  ++ + +T  CC
Sbjct: 71  HQVMLFIYMVVLILVFIFQFIVSCSCLAMNRSQQEYFLNTTWRRMSNETRLNLEETLECC 130

Query: 358 GLMESDLDNPSELKCKLANLTCCKGFPANETRCCTGVISRYEDCP----CQQCWKMLQYS 525
           G +               N T  +     +   C+ V      CP    C  C   +   
Sbjct: 131 GFL---------------NTTEARELFNKDVALCSHV------CPDPHKCLSCGDKMLNH 169

Query: 526 LNKAVKVTGAVGLVFSFTEIIGVWLAIRFRNMRDPIADPKLFL 654
            ++A+K+ G VGL FSFTEI+GVWLA RFRN +DP A+P  FL
Sbjct: 170 ADEALKILGGVGLFFSFTEILGVWLAFRFRNQKDPRANPSAFL 212
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 77,853,452
Number of Sequences: 369166
Number of extensions: 1515874
Number of successful extensions: 3158
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 3039
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3145
length of database: 68,354,980
effective HSP length: 107
effective length of database: 48,588,335
effective search space used: 6510836890
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)

Cluster detail

DrC_00457

  1. Dr_sW_011_P22
  2. Dr_sW_025_G04