Planaria EST Database


DrC_00409

BLASTX 2.2.13 [Nov-27-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= DrC_00409
         (1127 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q61545|EWS_MOUSE  RNA-binding protein EWS                       79   2e-14
sp|Q92804|RBP56_HUMAN  TATA-binding protein associated facto...    79   2e-14
sp|Q01844|EWS_HUMAN  RNA-binding protein EWS (EWS oncogene) ...    78   4e-14
sp|Q28009|FUS_BOVIN  RNA-binding protein FUS (Pigpen protein)      74   8e-13
sp|P56959|FUS_MOUSE  RNA-binding protein FUS (Pigpen protein)      73   1e-12
sp|P35637|FUS_HUMAN  RNA-binding protein FUS (Oncogene FUS) ...    73   1e-12
sp|Q27294|CAZ_DROME  RNA-binding protein cabeza (Sarcoma-ass...    69   2e-11
sp|O43120|UAP2_SCHPO  Splicing factor U2AF-associated protein 2    54   8e-07
sp|P53830|YN26_YEAST  Hypothetical 32.3 kDa protein in SEC21...    54   1e-06
sp|Q9P2K5|MYEF2_HUMAN  Myelin expression factor 2 (MyEF-2) (...    47   1e-04
>sp|Q61545|EWS_MOUSE RNA-binding protein EWS
          Length = 655

 Score = 79.3 bits (194), Expect = 2e-14
 Identities = 69/258 (26%), Positives = 99/258 (38%), Gaps = 34/258 (13%)
 Frame = +2

Query: 155  GGSDAMNMAFQGMMSAQRGQLSNSERDNYNS-----------------NNDGALRLNTVY 283
            GG D   M+ +G     RG L   ER  +N                  + D     + +Y
Sbjct: 305  GGFDRGGMS-RGGRGGGRGGLGAGERGGFNKPGGPMDEGPDLDLGLPIDPDEDSDNSAIY 363

Query: 284  VSNLPQNLDHEMLKTQFSKAGSIKLNSKSGMPMIWIFKDR--GVPKGDALVTYDSNSSAS 457
            V  L  N+  + L   F + G +K+N ++G PMI I+ D+  G PKGDA V+Y+   +A 
Sbjct: 364  VQGLNDNVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAK 423

Query: 458  NAVSYFKDHDFNGRRIEVRIA-----TNAERPILAPPSSXXXXXXXXXXXXXXXYKIESI 622
             AV +F   DF G +++V +A      N+ R  + P                       +
Sbjct: 424  AAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGMPpreGRGMPPPLRGGPGGPGGPGGPM 483

Query: 623  SRNNDYESDSYDWQKR--------EDGGFNKQREFXXXXXXXXXXXXGPDWICGK--CNN 772
             R      D   +  R          GG N Q                 DW C    C N
Sbjct: 484  GRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHR-------------AGDWQCPNPGCGN 530

Query: 773  NNFSWRESCNRCQEPKAE 826
             NF+WR  CN+C+ PK E
Sbjct: 531  QNFAWRTECNQCKAPKPE 548
>sp|Q92804|RBP56_HUMAN TATA-binding protein associated factor 2N (RNA-binding protein 56)
           (TAFII68) (TAF(II)68)
          Length = 592

 Score = 79.3 bits (194), Expect = 2e-14
 Identities = 57/212 (26%), Positives = 91/212 (42%), Gaps = 4/212 (1%)
 Frame = +2

Query: 209 GQLSNSERDNYNSNNDGALRLNTVYVSNLPQNLDHEMLKTQFSKAGSIKLNSKSGMPMIW 388
           G  ++++ ++ NS+N      NT++V  L + +  + +   F + G IK N K+G PMI 
Sbjct: 219 GPRTDADSESDNSDN------NTIFVQGLGEGVSTDQVGEFFKQIGIIKTNKKTGKPMIN 272

Query: 389 IF--KDRGVPKGDALVTYDSNSSASNAVSYFKDHDFNGRRIEVRIATNAERPILAPPSSX 562
           ++  KD G PKG+A V++D   SA  A+ +F   +F+G  I+V  AT             
Sbjct: 273 LYTDKDTGKPKGEATVSFDDPPSAKAAIDWFDGKEFHGNIIKVSFATR------------ 320

Query: 563 XXXXXXXXXXXXXXYKIESISRNNDYESDSYDWQKREDGGFNKQREFXXXXXXXXXXXXG 742
                                R            +R  GG+  +  F             
Sbjct: 321 ---------------------RPEFMRGGGSGGGRRGRGGYRGRGGF----QGRGGDPKS 355

Query: 743 PDWIC--GKCNNNNFSWRESCNRCQEPKAENA 832
            DW+C    C N NF+ R SCN+C EP+ E++
Sbjct: 356 GDWVCPNPSCGNMNFARRNSCNQCNEPRPEDS 387
>sp|Q01844|EWS_HUMAN RNA-binding protein EWS (EWS oncogene) (Ewing sarcoma breakpoint
            region 1 protein)
          Length = 656

 Score = 78.2 bits (191), Expect = 4e-14
 Identities = 68/258 (26%), Positives = 98/258 (37%), Gaps = 34/258 (13%)
 Frame = +2

Query: 155  GGSDAMNMAFQGMMSAQRGQLSNSERDNYNS-----------------NNDGALRLNTVY 283
            GG D   M+  G    + G  S  ER  +N                  + D     + +Y
Sbjct: 305  GGFDRGGMSRGGRGGGRGGMGSAGERGGFNKPGGPMDEGPDLDLGPPVDPDEDSDNSAIY 364

Query: 284  VSNLPQNLDHEMLKTQFSKAGSIKLNSKSGMPMIWIFKDR--GVPKGDALVTYDSNSSAS 457
            V  L  ++  + L   F + G +K+N ++G PMI I+ D+  G PKGDA V+Y+   +A 
Sbjct: 365  VQGLNDSVTLDDLADFFKQCGVVKMNKRTGQPMIHIYLDKETGKPKGDATVSYEDPPTAK 424

Query: 458  NAVSYFKDHDFNGRRIEVRIA-----TNAERPILAPPSSXXXXXXXXXXXXXXXYKIESI 622
             AV +F   DF G +++V +A      N+ R  L P                       +
Sbjct: 425  AAVEWFDGKDFQGSKLKVSLARKKPPMNSMRGGLPpreGRGMPPPLRGGPGGPGGPGGPM 484

Query: 623  SRNNDYESDSYDWQKR--------EDGGFNKQREFXXXXXXXXXXXXGPDWICGK--CNN 772
             R      D   +  R          GG N Q                 DW C    C N
Sbjct: 485  GRMGGRGGDRGGFPPRGPRGSRGNPSGGGNVQHR-------------AGDWQCPNPGCGN 531

Query: 773  NNFSWRESCNRCQEPKAE 826
             NF+WR  CN+C+ PK E
Sbjct: 532  QNFAWRTECNQCKAPKPE 549
>sp|Q28009|FUS_BOVIN RNA-binding protein FUS (Pigpen protein)
          Length = 512

 Score = 73.9 bits (180), Expect = 8e-13
 Identities = 71/289 (24%), Positives = 107/289 (37%), Gaps = 24/289 (8%)
 Frame = +2

Query: 32  PIIPSFGGVWDANQLQQMNSYGGMNPLLAQSLQGAQMQNMYGGSDAMNMAFQ-------- 187
           P + S GG        Q   YGG      Q  +G + +   GG +  +  ++        
Sbjct: 180 PSMSSGGGGGGYGNQDQSGGYGG-----GQQDRGGRGRGGGGGYNRSSGGYEPRGRGGGR 234

Query: 188 ----GMMSAQRGQLSN--------SERDNYNSNNDGALRLNTVYVSNLPQNLDHEMLKTQ 331
               GM  + RG  +         S  D+   N+D     NT++V  L +N+  E +   
Sbjct: 235 GGRGGMGGSDRGGFNKFGGPRDQGSRHDSEQDNSDN----NTIFVQGLGENVTIESVADY 290

Query: 332 FSKAGSIKLNSKSGMPMIWIFKDR--GVPKGDALVTYDSNSSASNAVSYFKDHDFNGRRI 505
           F + G IK N K+G PMI ++ DR  G  KG+A V++D   SA  A+ +F   +F+G   
Sbjct: 291 FKQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGN-- 348

Query: 506 EVRIATNAERPILAPPSSXXXXXXXXXXXXXXXYKIESISRNNDYESDSYDWQKREDGGF 685
            ++++    R                              R        Y       GG 
Sbjct: 349 PIKVSFATRRADFNRGGGNGRGGR---------------GRGGPMGRGGYGGGGSGGGG- 392

Query: 686 NKQREFXXXXXXXXXXXXGPDWICGK--CNNNNFSWRESCNRCQEPKAE 826
             +  F              DW C    C N NFSWR  CN+C+ PK +
Sbjct: 393 --RGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPD 439
>sp|P56959|FUS_MOUSE RNA-binding protein FUS (Pigpen protein)
          Length = 518

 Score = 73.2 bits (178), Expect = 1e-12
 Identities = 61/228 (26%), Positives = 88/228 (38%), Gaps = 4/228 (1%)
 Frame = +2

Query: 155 GGSDAMNMAFQGMMSAQRGQLSNSERDNYNSNNDGALRLNTVYVSNLPQNLDHEMLKTQF 334
           GGSD             R Q S  + +  NS+N      NT++V  L +N+  E +   F
Sbjct: 248 GGSDRGGF---NKFGGPRDQGSRHDSEQDNSDN------NTIFVQGLGENVTIESVADYF 298

Query: 335 SKAGSIKLNSKSGMPMIWIFKDR--GVPKGDALVTYDSNSSASNAVSYFKDHDFNGRRIE 508
            + G IK N K+G PMI ++ DR  G  KG+A V++D   SA  A+ +F   +F+G    
Sbjct: 299 KQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGN--P 356

Query: 509 VRIATNAERPILAPPSSXXXXXXXXXXXXXXXYKIESISRNNDYESDSYDWQKREDGGFN 688
           ++++    R                              R        Y       GG  
Sbjct: 357 IKVSFATRRADFNRGGGNGRGGR---------------GRGGPMGRGGYGGGGSGGGG-- 399

Query: 689 KQREFXXXXXXXXXXXXGPDWICGK--CNNNNFSWRESCNRCQEPKAE 826
            +  F              DW C    C N NFSWR  CN+C+ PK +
Sbjct: 400 -RGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPD 446
>sp|P35637|FUS_HUMAN RNA-binding protein FUS (Oncogene FUS) (Oncogene TLS) (Translocated
           in liposarcoma protein) (POMp75) (75 kDa DNA-pairing
           protein)
          Length = 526

 Score = 73.2 bits (178), Expect = 1e-12
 Identities = 61/228 (26%), Positives = 88/228 (38%), Gaps = 4/228 (1%)
 Frame = +2

Query: 155 GGSDAMNMAFQGMMSAQRGQLSNSERDNYNSNNDGALRLNTVYVSNLPQNLDHEMLKTQF 334
           GGSD             R Q S  + +  NS+N      NT++V  L +N+  E +   F
Sbjct: 255 GGSDRGGF---NKFGGPRDQGSRHDSEQDNSDN------NTIFVQGLGENVTIESVADYF 305

Query: 335 SKAGSIKLNSKSGMPMIWIFKDR--GVPKGDALVTYDSNSSASNAVSYFKDHDFNGRRIE 508
            + G IK N K+G PMI ++ DR  G  KG+A V++D   SA  A+ +F   +F+G    
Sbjct: 306 KQIGIIKTNKKTGQPMINLYTDRETGKLKGEATVSFDDPPSAKAAIDWFDGKEFSGN--P 363

Query: 509 VRIATNAERPILAPPSSXXXXXXXXXXXXXXXYKIESISRNNDYESDSYDWQKREDGGFN 688
           ++++    R                              R        Y       GG  
Sbjct: 364 IKVSFATRRADFNRGGGNGRGGR---------------GRGGPMGRGGYGGGGSGGGG-- 406

Query: 689 KQREFXXXXXXXXXXXXGPDWICGK--CNNNNFSWRESCNRCQEPKAE 826
            +  F              DW C    C N NFSWR  CN+C+ PK +
Sbjct: 407 -RGGFPSGGGGGGGQQRAGDWKCPNPTCENMNFSWRNECNQCKAPKPD 453
>sp|Q27294|CAZ_DROME RNA-binding protein cabeza (Sarcoma-associated RNA-binding fly
           homolog) (P19)
          Length = 399

 Score = 69.3 bits (168), Expect = 2e-11
 Identities = 32/95 (33%), Positives = 57/95 (60%), Gaps = 2/95 (2%)
 Frame = +2

Query: 242 NSNNDGALRLNTVYVSNLPQNLDHEMLKTQFSKAGSIKLNSKSGMPMIWIFKDR--GVPK 415
           +  ND   + +T++VS +  +   + ++T F   G IK + ++  P IW++K++  G  K
Sbjct: 110 SGGNDMITQEDTIFVSGMDPSTTEQDIETHFGAIGIIKKDKRTMKPKIWLYKNKETGASK 169

Query: 416 GDALVTYDSNSSASNAVSYFKDHDFNGRRIEVRIA 520
           G+A VTYD  ++A +A+ +F   DFNG  I+V +A
Sbjct: 170 GEATVTYDDTNAAQSAIEWFDGRDFNGNAIKVSLA 204

 Score = 49.7 bits (117), Expect = 2e-05
 Identities = 16/28 (57%), Positives = 20/28 (71%)
 Frame = +2

Query: 746 DWICGKCNNNNFSWRESCNRCQEPKAEN 829
           DW C  CNN NF+WR  CNRC+ PK ++
Sbjct: 278 DWKCNSCNNTNFAWRNECNRCKTPKGDD 305
>sp|O43120|UAP2_SCHPO Splicing factor U2AF-associated protein 2
          Length = 367

 Score = 53.9 bits (128), Expect = 8e-07
 Identities = 49/169 (28%), Positives = 82/169 (48%), Gaps = 15/169 (8%)
 Frame = +2

Query: 59  WDANQLQQMNSYGG----MNPLLAQSLQGAQMQNMYGGSDAM------NMAFQGMMSAQR 208
           +D N L+ MN  G     ++ + A++ +G +  N   G D        + + +G  S  R
Sbjct: 36  YDPNSLK-MNKAGSTGAEVSDVTAEATEGKESSN---GEDRHTKRLYESTSAEGYPSGSR 91

Query: 209 GQLSNSERDNYNSNNDGALRLN-TVYVSNLPQNLDHEMLKTQFSKAGSIKLNSKSGMPMI 385
            + S SE    NS    A  +N  VY+  LP ++  + ++  F K G I  N  +G P I
Sbjct: 92  NKKSKSE----NSEASPAPVINKAVYIQGLPLDVTVDEIEEVFKKCGVIAKNIDNGTPRI 147

Query: 386 WIFK-DRGVPKGDALVTYDSNSSASNAVSYFKDHDF---NGRRIEVRIA 520
            I++ + G PKGDAL+ +  + S   A   F D +F   +G+++ V+ A
Sbjct: 148 KIYRTEDGTPKGDALIVFFRSESVELAEQLFDDTEFRYGSGQKMRVQKA 196
>sp|P53830|YN26_YEAST Hypothetical 32.3 kDa protein in SEC21-MRPL10 intergenic region
          Length = 285

 Score = 53.5 bits (127), Expect = 1e-06
 Identities = 35/108 (32%), Positives = 54/108 (50%), Gaps = 2/108 (1%)
 Frame = +2

Query: 203 QRGQLSNSERDNYNSNNDGALRLNTVYVSNLPQN-LDHEMLKTQFSKAGSIKLNSKSGMP 379
           +R QL  S         + A +  ++Y+S LP +    E L  QF K G I+ N + G P
Sbjct: 22  RRKQLKESNLQKRELEYNNASKNTSIYISGLPTDKTTKEGLTEQFCKYGMIRTN-RDGEP 80

Query: 380 MIWIF-KDRGVPKGDALVTYDSNSSASNAVSYFKDHDFNGRRIEVRIA 520
           +  ++  D+G  KGDAL+TY    S + A+    +  F G++I V  A
Sbjct: 81  LCKLYVNDKGAFKGDALITYSKEESVTLAIEMMNESIFLGKQIRVERA 128
>sp|Q9P2K5|MYEF2_HUMAN Myelin expression factor 2 (MyEF-2) (MST156)
          Length = 600

 Score = 47.0 bits (110), Expect = 1e-04
 Identities = 52/189 (27%), Positives = 77/189 (40%), Gaps = 17/189 (8%)
 Frame = +2

Query: 14  FGANGNPIIPSFGGVWDANQLQQMNS--YGGMNPLLAQSLQGAQMQNMYGGSDAMNMAFQ 187
           FG  G+ +I  F G   ++ +  + S   GGM  +   S+ G     M  G D M+ +F 
Sbjct: 432 FGRLGSAMIGGFAGRIGSSNMGPVGSGISGGMGSM--NSVTGG----MGMGLDRMSSSFD 485

Query: 188 GM-----------MSAQRGQLS----NSERDNYNSNNDGALRLNTVYVSNLPQNLDHEML 322
            M           +   RG LS    +  R+   S        N ++V NLP +L  + L
Sbjct: 486 RMGPGIGAILERSIDMDRGFLSGPMGSGMRERIGSKG------NQIFVRNLPFDLTWQKL 539

Query: 323 KTQFSKAGSIKLNSKSGMPMIWIFKDRGVPKGDALVTYDSNSSASNAVSYFKDHDFNGRR 502
           K +FS+ G +            I  + G  KG   V +DS  SA  A         +GR 
Sbjct: 540 KEKFSQCGHVMFAE--------IKMENGKSKGCGTVRFDSPESAEKACRIMNGIKISGRE 591

Query: 503 IEVRIATNA 529
           I+VR+  NA
Sbjct: 592 IDVRLDRNA 600
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 124,014,304
Number of Sequences: 369166
Number of extensions: 2358081
Number of successful extensions: 5695
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 5425
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5673
length of database: 68,354,980
effective HSP length: 112
effective length of database: 47,664,660
effective search space used: 12535805580
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)

Cluster detail

DrC_00409

  1. Dr_sW_011_B18
  2. Dr_sW_022_G12
  3. Dr_sW_021_E15