Planarian EST Database


Dr_sW_023_C02

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_023_C02
         (933 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P46504|YLX8_CAEEL  Hypothetical protein F23F12.8 in chrom...    42   0.002
sp|Q9NQS7|INCE_HUMAN  Inner centromere protein                     41   0.004
sp|P13392|MYSP_DIRIM  Paramyosin                                   41   0.006
sp|Q9NJA9|MYSP_ANISI  Paramyosin (Allergen Ani s 2)                40   0.012
sp|Q01202|MYSP_BRUMA  Paramyosin                                   39   0.016
sp|Q02171|MYSP_ONCVO  Paramyosin                                   39   0.027
sp|P08089|M6B_STRP6  M protein, serotype 6 precursor               39   0.027
sp|P24788|CD2L1_MOUSE  PITSLRE serine/threonine-protein kina...    38   0.047
sp|P21127|CD2L1_HUMAN  PITSLRE serine/threonine-protein kina...    38   0.047
sp|Q01397|DYNA_NEUCR  Dynactin, 150 kDa isoform (150 kDa dyn...    38   0.047
>sp|P46504|YLX8_CAEEL Hypothetical protein F23F12.8 in chromosome III precursor
          Length = 887

 Score = 42.0 bits (97), Expect = 0.002
 Identities = 66/305 (21%), Positives = 120/305 (39%), Gaps = 25/305 (8%)
 Frame = +1

Query: 91   RQIGIDKQALDQQVKDKKERECTERNVELAYA--------SDSIRNDLIATLLSKRQQDD 246
            RQ  +D+QA    +  ++ER   ERN EL           ++ +R + IA  +SK     
Sbjct: 349  RQAELDRQAT---IYAEQERMAMERNRELERIRLEEKKRENERVRQEEIAMEISK----- 400

Query: 247  IRELNKILNEFRA----FHQQPETRREWDLYDPDALKKDKPARVCDDDPRCGISSMQKFD 414
            IREL ++  E +       Q+ E  R++ L + +  +K +                QK +
Sbjct: 401  IRELERLQLERQRKNERVRQELEAARKYKLQEEERQRKIQ---------------QQKVE 445

Query: 415  GEDLNNKARTKYQKEQMNDWNERQKADKERAEQAQKEADRLYDL----------KRREMD 564
             E +  +   +  +EQ+    E +  + ER  Q + E     ++          K+ E D
Sbjct: 446  MEQIRQQEEAR--QEQLRVLEEERARELERVRQEELERQHQMEILRQQEEDQKKKKLEKD 503

Query: 565  ERACDLAAAEEACRRAMNMANSQYNVALFNEMNERARLKKQQENDDTLTAMANALYSD-- 738
                +   AEE  R  +     +    +  E N+R  L+K+ E+        NA+Y +  
Sbjct: 504  REQREQQEAEELNRMIIEKEMKENKQKMIEEKNKRKMLEKEMED------RQNAIYEEEE 557

Query: 739  -LLTENPAQATSSFGPGRVIPDRFKGMTPEQLNEIYKQQEDQMKDNKRRQMEEDLRNEEW 915
              + E   +        R I  +    T E+      ++E +M    R+  E + + +E 
Sbjct: 558  RRIAEEERRKQIEIEERRRIQQQIMIATEERSRLDAMEREREM---LRQIKESEKQRKEL 614

Query: 916  DRQRL 930
            +RQ L
Sbjct: 615  ERQEL 619
>sp|Q9NQS7|INCE_HUMAN Inner centromere protein
          Length = 919

 Score = 41.2 bits (95), Expect = 0.004
 Identities = 47/221 (21%), Positives = 91/221 (41%), Gaps = 2/221 (0%)
 Frame = +1

Query: 127  QVKDKKERECTERNVELAYASDSIRNDLIATLLSKRQQDDIRELNKILNEFRAFHQQPET 306
            Q+ +K E+   ER  E      +    +      ++Q++D R L       R   Q+ E 
Sbjct: 601  QIDEKTEKAKEERLAEEKAKKKAAAKKMEEVEARRKQEEDARRL-------RWLQQEEEE 653

Query: 307  RREWDLYDPDALKKDKPARVCDDDPRCGISSMQ-KFDGEDLNNKARTKYQKEQMNDWNER 483
            RR  +L      ++ +  R   +  R      Q + + E    + R + ++EQ     ER
Sbjct: 654  RRHQELLQKKKEEEQERLRKAAEAKRLAEQREQERREQERREQERREQERREQERREQER 713

Query: 484  QKADKERAEQ-AQKEADRLYDLKRREMDERACDLAAAEEACRRAMNMANSQYNVALFNEM 660
            ++ +++ AEQ  ++E +RL     RE+ ER   L   +E  +R +     +        +
Sbjct: 714  REQERQLAEQERRREQERLQ--AERELQEREKALRLQKEQLQRELEEKKKKEEQ---QRL 768

Query: 661  NERARLKKQQENDDTLTAMANALYSDLLTENPAQATSSFGP 783
             ER   ++Q++        + AL   +  ++PA  +S   P
Sbjct: 769  AERQLQEEQEKKAKEAAGASKALNVTVDVQSPACTSSPITP 809
>sp|P13392|MYSP_DIRIM Paramyosin
          Length = 848

 Score = 40.8 bits (94), Expect = 0.006
 Identities = 69/302 (22%), Positives = 125/302 (41%), Gaps = 37/302 (12%)
 Frame = +1

Query: 109  KQALDQQVKDKKERECTERNVELAYASDSIRNDLIATLLSKRQQDDIRELNKILNEFRAF 288
            K+AL ++ K K + +  E    LA A+  +       L + R   +IREL   L E  A 
Sbjct: 400  KEALARENK-KLQDDLHEAKEALADANRKLHE---LDLENARLAGEIRELQTALKESEAA 455

Query: 289  HQQPETRREWDLYDPDALKKDKPARVCDDDPRC-GISSMQKFDGEDLN------------ 429
             +  E R +  L +   L+ +   R+ + +     +    +F+ + L             
Sbjct: 456  RRDAENRAQRALAELQQLRIEMERRLQEKEEEMEALRKNMQFEIDRLTAALADAEARMKA 515

Query: 430  --NKARTKYQKE---------QMNDWN-ERQKADKERAEQAQKEADRLYDLKRR---EMD 564
              ++ + KYQ E          +N  N E QK  K+++EQ +     L D +R+    +D
Sbjct: 516  EISRLKKKYQAEIAELEMTVDNLNRANIEAQKTIKKQSEQLKILQASLEDTQRQLQQTLD 575

Query: 565  ERACD------LAAAEEACRRAMNMANSQYNVALFNEMNERARLKKQQENDDTLTAMANA 726
            + A        L+A  E C+ A++ A      A  +     AR+      ++ LTA+ N 
Sbjct: 576  QYALAQRKVSALSAELEECKVALDNAIRARKQAEIDLEEANARITDLVSINNNLTAIKNK 635

Query: 727  LYSDLLTENP--AQATSSFGPGRVIPDRFKGMTPEQLNEIYKQQEDQMK-DNKRRQMEED 897
            L ++L T      +AT          +R        + +++++QE  MK D  R+ +EE 
Sbjct: 636  LETELSTAQADLDEATKELHAADERANRALADAARAVEQLHEEQEHSMKIDALRKSLEEQ 695

Query: 898  LR 903
            ++
Sbjct: 696  VK 697
>sp|Q9NJA9|MYSP_ANISI Paramyosin (Allergen Ani s 2)
          Length = 869

 Score = 39.7 bits (91), Expect = 0.012
 Identities = 68/302 (22%), Positives = 122/302 (40%), Gaps = 37/302 (12%)
 Frame = +1

Query: 109  KQALDQQVKDKKERECTERNVELAYASDSIRNDLIATLLSKRQQDDIRELNKILNEFRAF 288
            K+AL ++ K K + +  E N  LA A+  +       L + R   +IR+L   L E  A 
Sbjct: 421  KEALARENK-KLQDDLHEANEALADANRKLHE---LDLENARLAGEIRDLQVALKESEAA 476

Query: 289  HQQPETRREWDLYDPDALKKDKPARVCDDDPRC-GISSMQKFDGEDLNN----------- 432
             +  E R +  L +   ++ +   R+ + +     +    +F+ + L             
Sbjct: 477  RRDAEARAQRALAELQQVRIEMERRLQEKEEEMEALRKSMQFEIDRLTAALADAEARMKA 536

Query: 433  ---KARTKYQKE---------QMNDWN-ERQKADKERAEQAQKEADRLYDLKRR---EMD 564
               + R KYQ E          +N  N E QK  K+++EQ  +    L D +R+    +D
Sbjct: 537  EIARLRKKYQAEIAELEMTVDNLNRANLEAQKTIKKQSEQIIQLQANLEDTQRQLQQTLD 596

Query: 565  ERACD------LAAAEEACRRAMNMANSQYNVALFNEMNERARLKKQQENDDTLTAMANA 726
            + A        L+A  E C+ A++ A      A  +      R+      +  LTA+ N 
Sbjct: 597  QYALAQRKISALSAELEECKTALDNAIRARKQAEADLEEAHVRISDLTSINSNLTAIKNK 656

Query: 727  LYSDLLTENP--AQATSSFGPGRVIPDRFKGMTPEQLNEIYKQQEDQMK-DNKRRQMEED 897
            L ++L T      + T          +R        + E++++QE  MK D  R+ +EE 
Sbjct: 657  LETELSTAQADLDEVTKELHAADERANRALADAARAVQELHEEQEHSMKIDALRKSLEEQ 716

Query: 898  LR 903
            ++
Sbjct: 717  VK 718
>sp|Q01202|MYSP_BRUMA Paramyosin
          Length = 880

 Score = 39.3 bits (90), Expect = 0.016
 Identities = 68/302 (22%), Positives = 124/302 (41%), Gaps = 37/302 (12%)
 Frame = +1

Query: 109  KQALDQQVKDKKERECTERNVELAYASDSIRNDLIATLLSKRQQDDIRELNKILNEFRAF 288
            K+AL ++ K K + +  E    LA A+  +       L + R   +IREL   L E  A 
Sbjct: 427  KEALARENK-KLQDDLHEAKEALADANRKLHE---LDLENARLAGEIRELQTALKESEAA 482

Query: 289  HQQPETRREWDLYDPDALKKDKPARVCDDDPRC-GISSMQKFDGEDLN------------ 429
             +  E R +  L +   L+ +   R+ + +     +    +F+ + L             
Sbjct: 483  RRDAENRAQRALAELQQLRIEMERRLQEKEEEMEALRKNMQFEIDRLTAALADAEARMKA 542

Query: 430  --NKARTKYQKE---------QMNDWN-ERQKADKERAEQAQKEADRLYDLKRR---EMD 564
              ++ + KYQ E          +N  N E QK  K+++EQ +     L D +R+    +D
Sbjct: 543  EISRLKKKYQAEIAELEMTVDNLNRANIEAQKTIKKQSEQLKILQASLEDTQRQLQQTLD 602

Query: 565  ERACD------LAAAEEACRRAMNMANSQYNVALFNEMNERARLKKQQENDDTLTAMANA 726
            + A        L+A  E C+ A++ A      A  +      R+      ++ LTA+ N 
Sbjct: 603  QYALAQRKVSALSAELEECKVALDNAIRARKQAEIDLEEANGRITDLVSVNNNLTAIKNK 662

Query: 727  LYSDLLTENP--AQATSSFGPGRVIPDRFKGMTPEQLNEIYKQQEDQMK-DNKRRQMEED 897
            L ++L T      +AT          +R        + +++++QE  MK D  R+ +EE 
Sbjct: 663  LETELSTAQADLDEATKELHAADERANRALADAARAVEQLHEEQEHSMKIDALRKSLEEQ 722

Query: 898  LR 903
            ++
Sbjct: 723  VK 724
>sp|Q02171|MYSP_ONCVO Paramyosin
          Length = 879

 Score = 38.5 bits (88), Expect = 0.027
 Identities = 68/302 (22%), Positives = 123/302 (40%), Gaps = 37/302 (12%)
 Frame = +1

Query: 109  KQALDQQVKDKKERECTERNVELAYASDSIRNDLIATLLSKRQQDDIRELNKILNEFRAF 288
            K+AL ++ K K + +  E    LA A+  +       L + R   +IREL   L E  A 
Sbjct: 427  KEALARENK-KLQDDLHEAKEALADANRKLHE---LDLENARLAGEIRELQTALKESEAA 482

Query: 289  HQQPETRREWDLYDPDALKKDKPARVCDDDPRC-GISSMQKFDGEDLNN----------- 432
             +  E R +  L +   L+ +   R+ + +     +    +F+ + L             
Sbjct: 483  RRDAENRAQRALAELQQLRIEMERRLQEKEEEMEALRKNMQFEIDRLTAALADAEARMKA 542

Query: 433  ---KARTKYQKE---------QMNDWN-ERQKADKERAEQAQKEADRLYDLKRR---EMD 564
               + + KYQ E          +N  N E QK  K+++EQ +     L D +R+    +D
Sbjct: 543  EIARLKKKYQAEIAELEMTVDNLNRANIEAQKTIKKQSEQLKVLQASLEDTQRQLQQTLD 602

Query: 565  ERACD------LAAAEEACRRAMNMANSQYNVALFNEMNERARLKKQQENDDTLTAMANA 726
            + A        L+A  E C+ A++ A      A  +      R+      ++ LTA+ N 
Sbjct: 603  QYALAQRKVSALSAELEECKVALDNAIRARKQAEIDLEEANGRITDLVSINNNLTAIKNK 662

Query: 727  LYSDLLTENP--AQATSSFGPGRVIPDRFKGMTPEQLNEIYKQQEDQMK-DNKRRQMEED 897
            L ++L T      +AT          +R        + +++++QE  MK D  R+ +EE 
Sbjct: 663  LETELSTAQADLDEATKELHAADERANRALADAARAVEQLHEEQEHSMKIDALRKSLEEQ 722

Query: 898  LR 903
            ++
Sbjct: 723  VK 724
>sp|P08089|M6B_STRP6 M protein, serotype 6 precursor
          Length = 483

 Score = 38.5 bits (88), Expect = 0.027
 Identities = 78/296 (26%), Positives = 118/296 (39%), Gaps = 34/296 (11%)
 Frame = +1

Query: 97   IGIDKQALDQQVKDK--KERECTERNVELAYASDSIRNDLIATLLSKRQQDDIRELNKIL 270
            IG  K+ LD+ VKDK  KE+E  E    L    D    D IA    +  ++ I  L K L
Sbjct: 162  IGTLKKTLDETVKDKIAKEQESKETIGTLKKTLDETVKDKIAK--EQESKETIGTLKKTL 219

Query: 271  NEF--RAFHQQPETRREWDLYDP--DALKKDKPARVCDDDPRCGISSMQKFDGEDLNNKA 438
            +E       ++ E++          D   KDK AR        G +  Q+   +D  NK 
Sbjct: 220  DETVKDKIAKEQESKETIGTLKKILDETVKDKIAREQKSKQDIG-ALKQELAKKDEGNKV 278

Query: 439  RTKYQKEQMNDWNERQKADKERAEQAQ---------KEADRLYDLKRREMDERACDLAAA 591
                +K    D +  ++A K+  +            KE  ++ D  R+ +     DL A+
Sbjct: 279  SEASRKGLRRDLDASREAKKQVEKDLANLTAELDKVKEEKQISDASRQGLRR---DLDAS 335

Query: 592  EEA---CRRAMNMANSQYNV--ALFNEMNERARL--KKQQENDDTLTAMANALYSDLLTE 750
             EA     +A+  ANS+      L  E+ E  +L  K++ E    L A A AL   L  +
Sbjct: 336  REAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEQLAKQ 395

Query: 751  -------NPAQATSSFGP-----GRVIPDRFKGMTPEQLNEIYKQQEDQMKDNKRR 882
                      +A+ S  P      +V+P   KG  P Q      Q +  MK+ KR+
Sbjct: 396  AEELAKLRAGKASDSQTPDAKPGNKVVPG--KGQAP-QAGTKPNQNKAPMKETKRQ 448
>sp|P24788|CD2L1_MOUSE PITSLRE serine/threonine-protein kinase CDC2L1
           (Galactosyltransferase-associated protein kinase
           p58/GTA) (Cell division cycle 2-like protein kinase 1)
          Length = 784

 Score = 37.7 bits (86), Expect = 0.047
 Identities = 40/177 (22%), Positives = 81/177 (45%), Gaps = 11/177 (6%)
 Frame = +1

Query: 73  IFNSKTRQIGIDKQALDQQVKDKKERECTERNVELAYASD-----SIRNDLIA---TLLS 228
           I   K R+   +++A  +++K+  +R+    ++E     D     +IRN       ++  
Sbjct: 16  ILQEKKRRKEQEEKAEIKRLKNSDDRDSKRDSLEEGELRDHRMEITIRNSPYRREDSMED 75

Query: 229 KRQQDD---IRELNKILNEFRAFHQQPETRREWDLYDPDALKKDKPARVCDDDPRCGISS 399
           + ++DD   I+   ++  + +A H++ E R+E   +   + +  K ARV           
Sbjct: 76  RGEEDDSLAIKPPQQMSRKEKAHHRKDEKRKEKRRHRSHSAEGGKHARV----------- 124

Query: 400 MQKFDGEDLNNKARTKYQKEQMNDWNERQKADKERAEQAQKEADRLYDLKRREMDER 570
            ++ + E    K   + Q +   +W ERQK  +   E +++E DRL  L+R+   ER
Sbjct: 125 -KEKEREHERRKRHREEQDKARREW-ERQKRREMAREHSRRERDRLEQLERKRERER 179
>sp|P21127|CD2L1_HUMAN PITSLRE serine/threonine-protein kinase CDC2L1
           (Galactosyltransferase-associated protein kinase
           p58/GTA) (Cell division cycle 2-like protein kinase 1)
           (CLK-1) (CDK11) (p58 CLK-1)
          Length = 795

 Score = 37.7 bits (86), Expect = 0.047
 Identities = 63/295 (21%), Positives = 120/295 (40%), Gaps = 17/295 (5%)
 Frame = +1

Query: 73  IFNSKTRQIGIDKQALDQQVKDKKERECTERNVELAYASD-----SIRNDLIA---TLLS 228
           I   K R+   +++A  +++K+  +R+    ++E     D     +IRN       ++  
Sbjct: 16  ILQEKKRRKEQEEKAEIKRLKNSDDRDSKRDSLEEGELRDHRMEITIRNSPYRREDSMED 75

Query: 229 KRQQDD---IRELNKILNEFRAFHQQPETRREWDLYDPDALKKDKPARVCDDDPRCGISS 399
           + ++DD   I+   ++  + +  H++ E R+E   +   + +  K ARV           
Sbjct: 76  RGEEDDSLAIKPPQQMSRKEKVHHRKDEKRKEKRRHRSHSAEGGKHARV----------- 124

Query: 400 MQKFDGEDLNNKARTKYQKEQMNDWNERQKADKERAEQAQKEADRLYDLKRREMDERACD 579
            ++ + E    K   + Q +   +W ERQK  +   E +++E DRL  L+R+   ER   
Sbjct: 125 -KEKEREHERRKRHREEQDKARREW-ERQKRREMAREHSRRERDRLEQLERKRERER--- 179

Query: 580 LAAAEEACRRAMNMANSQYNVALFNEMNERA-RLKKQQENDDTLTAMANALYSDLLTENP 756
                        M   Q       E   RA   +K++E    ++A    +  D   +  
Sbjct: 180 ------------KMREQQKEQREQKERERRAEERRKEREARREVSAHHRTMREDYSDKVK 227

Query: 757 AQATSSFGPGRVIPDRFK---GMTPEQLNEIYKQQEDQMKDNK--RRQMEEDLRN 906
           A   S   P R   +RF+   G  P +      Q+  Q+K+ K   R +  DL++
Sbjct: 228 ASHWSR-SPPRPpreRFELGDGRKPGEARPARAQKPAQLKEEKMEERDLLSDLQD 281
>sp|Q01397|DYNA_NEUCR Dynactin, 150 kDa isoform (150 kDa dynein-associated polypeptide)
           (DP-150) (DAP-150) (p150-glued)
          Length = 1300

 Score = 37.7 bits (86), Expect = 0.047
 Identities = 49/203 (24%), Positives = 92/203 (45%), Gaps = 25/203 (12%)
 Frame = +1

Query: 379 PRCGISSMQKFDGEDLNNKARTKY-QKEQMNDWN-----ERQKADKERAEQA-------- 516
           PR  I++    + E  + KA+ K  +K++M D       E+ KA++++ E+         
Sbjct: 250 PRSSITNTATMNKEIEDLKAKLKVLEKKRMEDREKLNSLEKVKAERDKFERIIQTLQIKY 309

Query: 517 QKEADRLYDLKR--REMDERACDLAAAEEACRRAMNMANSQYNVALFNEMNERARLKKQQ 690
           Q +   + DLKR  +E + R  ++   +     AM +A     +A       +  L   +
Sbjct: 310 QPQQQEIQDLKRQLKEAENRLYNVEELQAEHDTAMELATLDREMAEETAEVLKVELDALK 369

Query: 691 ENDDTLTAMANALYSDLLTENPAQATSSFGP------GRVIPDRFKGMTPE---QLNEIY 843
           + ++ L      L  ++L E  ++ T+   P      G +  +R      E   +L +I 
Sbjct: 370 QKNEELE-----LEVEVLREENSEFTNGMSPEERASTGWLQMERNNERLREALIRLRDIT 424

Query: 844 KQQEDQMKDNKRRQMEEDLRNEE 912
           +QQE+++KD + + MEEDLR  E
Sbjct: 425 QQQEEELKD-QIKSMEEDLREFE 446
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.313    0.128    0.361 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 85,736,667
Number of Sequences: 369166
Number of extensions: 1532223
Number of successful extensions: 5426
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4859
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5297
length of database: 68,354,980
effective HSP length: 110
effective length of database: 48,034,130
effective search space used: 9606826000
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)