Planaria EST Database


DrC_01871

BLASTX 2.2.13 [Nov-27-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= DrC_01871
         (833 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q9EPU4|CPSF1_MOUSE  Cleavage and polyadenylation specific...   241   2e-63
sp|Q10569|CPSF1_BOVIN  Cleavage and polyadenylation specific...   239   5e-63
sp|Q10570|CPSF1_HUMAN  Cleavage and polyadenylation specific...   239   5e-63
sp|Q9V726|CPSF1_DROME  Cleavage and polyadenylation specific...   224   3e-58
sp|Q9N4C2|CPSF1_CAEEL  Probable cleavage and polyadenylation...   200   5e-51
sp|Q9FGR0|CPSF1_ARATH  Probable cleavage and polyadenylation...    97   5e-20
sp|Q9M0V3|DB1A_ARATH  DNA damage binding protein 1a (UV-dama...    47   5e-05
sp|O49552|DDB1B_ARATH  DNA damage binding protein 1b (UV-dam...    46   1e-04
sp|Q6QNU4|DDB1_LYCES  DNA damage binding protein 1 (UV-damag...    45   2e-04
sp|Q6E7D1|DDB1_LYCCS  DNA damage binding protein 1 (UV-damag...    45   2e-04
>sp|Q9EPU4|CPSF1_MOUSE Cleavage and polyadenylation specificity factor, 160 kDa subunit
            (CPSF 160 kDa subunit)
          Length = 1441

 Score =  241 bits (614), Expect = 2e-63
 Identities = 115/250 (46%), Positives = 174/250 (69%)
 Frame = +3

Query: 33   KEEEPIERDSDIIRAPLPIYTLEFYANESNGMWELVPHAKVVFEDWDKVSCMTTVQLFSQ 212
            KE E IERD   I      ++++  +  S   WE +P+A++  E+W+ V+CM TV L S+
Sbjct: 1053 KEFEAIERDDRYIHPQQEAFSIQLISPVS---WEAIPNARIELEEWEHVTCMKTVSLRSE 1109

Query: 213  VTYSGTKDFLAVGSNMSYGEEIPVRGKIMIVDIIDVVPEPGQVLTRHKMKIIYDVEQKGP 392
             T SG K ++A G+ +  GEE+  RG+I+I+D+I+VVPEPGQ LT++K K++Y+ EQKGP
Sbjct: 1110 ETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGP 1169

Query: 393  VTALTSCKGYLVSAIGQKMYIWELKDNELVGVAFVDSEIYVQQLHCMRNLILTGDFLKSI 572
            VTAL  C G+LVSAIGQK+++W L+ +EL G+AF+D+++Y+ Q+  ++N IL  D +KSI
Sbjct: 1170 VTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSI 1229

Query: 573  QLLRFQSDLRVLSLVSKDSHSRGVTCSNFFVDNDNLGFMLADMHANIIIYSYNPLIEESR 752
             LLR+Q + + LSLVS+D+    V   +F VDN  LGF+++D   N+++Y Y P  +ES 
Sbjct: 1230 SLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESF 1289

Query: 753  NGKQLVRNAD 782
             G +L+R AD
Sbjct: 1290 GGMRLLRRAD 1299
>sp|Q10569|CPSF1_BOVIN Cleavage and polyadenylation specificity factor, 160 kDa subunit
            (CPSF 160 kDa subunit)
          Length = 1444

 Score =  239 bits (611), Expect = 5e-63
 Identities = 114/250 (45%), Positives = 173/250 (69%)
 Frame = +3

Query: 33   KEEEPIERDSDIIRAPLPIYTLEFYANESNGMWELVPHAKVVFEDWDKVSCMTTVQLFSQ 212
            KE E IERD   +      + ++  +  S   WE +P+A++  E+W+ V+CM TV L S+
Sbjct: 1056 KEFETIERDERYVHPQQEAFCIQLISPVS---WEAIPNARIELEEWEHVTCMKTVSLRSE 1112

Query: 213  VTYSGTKDFLAVGSNMSYGEEIPVRGKIMIVDIIDVVPEPGQVLTRHKMKIIYDVEQKGP 392
             T SG K ++A G+ +  GEE+  RG+I+I+D+I+VVPEPGQ LT++K K++Y+ EQKGP
Sbjct: 1113 ETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGP 1172

Query: 393  VTALTSCKGYLVSAIGQKMYIWELKDNELVGVAFVDSEIYVQQLHCMRNLILTGDFLKSI 572
            VTAL  C G+LVSAIGQK+++W L+ +EL G+AF+D+++Y+ Q+  ++N IL  D +KSI
Sbjct: 1173 VTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSI 1232

Query: 573  QLLRFQSDLRVLSLVSKDSHSRGVTCSNFFVDNDNLGFMLADMHANIIIYSYNPLIEESR 752
             LLR+Q + + LSLVS+D+    V   +F VDN  LGF+++D   N+++Y Y P  +ES 
Sbjct: 1233 SLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESF 1292

Query: 753  NGKQLVRNAD 782
             G +L+R AD
Sbjct: 1293 GGMRLLRRAD 1302
>sp|Q10570|CPSF1_HUMAN Cleavage and polyadenylation specificity factor, 160 kDa subunit
            (CPSF 160 kDa subunit)
          Length = 1443

 Score =  239 bits (611), Expect = 5e-63
 Identities = 114/250 (45%), Positives = 174/250 (69%)
 Frame = +3

Query: 33   KEEEPIERDSDIIRAPLPIYTLEFYANESNGMWELVPHAKVVFEDWDKVSCMTTVQLFSQ 212
            KE E IERD   I      ++++  +  S   WE +P+A++  ++W+ V+CM TV L S+
Sbjct: 1055 KEFETIERDERYIHPQQEAFSIQLISPVS---WEAIPNARIELQEWEHVTCMKTVSLRSE 1111

Query: 213  VTYSGTKDFLAVGSNMSYGEEIPVRGKIMIVDIIDVVPEPGQVLTRHKMKIIYDVEQKGP 392
             T SG K ++A G+ +  GEE+  RG+I+I+D+I+VVPEPGQ LT++K K++Y+ EQKGP
Sbjct: 1112 ETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGP 1171

Query: 393  VTALTSCKGYLVSAIGQKMYIWELKDNELVGVAFVDSEIYVQQLHCMRNLILTGDFLKSI 572
            VTAL  C G+LVSAIGQK+++W L+ +EL G+AF+D+++Y+ Q+  ++N IL  D +KSI
Sbjct: 1172 VTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSI 1231

Query: 573  QLLRFQSDLRVLSLVSKDSHSRGVTCSNFFVDNDNLGFMLADMHANIIIYSYNPLIEESR 752
             LLR+Q + + LSLVS+D+    V   +F VDN  LGF+++D   N+++Y Y P  +ES 
Sbjct: 1232 SLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESF 1291

Query: 753  NGKQLVRNAD 782
             G +L+R AD
Sbjct: 1292 GGMRLLRRAD 1301
>sp|Q9V726|CPSF1_DROME Cleavage and polyadenylation specificity factor, 160 kDa subunit
            (CPSF 160 kDa subunit) (CPSF-160) (dCPSF)
          Length = 1455

 Score =  224 bits (570), Expect = 3e-58
 Identities = 109/231 (47%), Positives = 154/231 (66%)
 Frame = +3

Query: 129  WELVPHAKVVFEDWDKVSCMTTVQLFSQVTYSGTKDFLAVGSNMSYGEEIPVRGKIMIVD 308
            WE+VP A + FE W+ V+    V+L  + T SG K++L +G+N +Y E+I  RG I I D
Sbjct: 1097 WEIVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYD 1156

Query: 309  IIDVVPEPGQVLTRHKMKIIYDVEQKGPVTALTSCKGYLVSAIGQKMYIWELKDNELVGV 488
            II+VVPEPG+ +T+ K+K I+  EQKGPV+A++   G+LV+ +GQK+YIW+L+D +L+GV
Sbjct: 1157 IIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGV 1216

Query: 489  AFVDSEIYVQQLHCMRNLILTGDFLKSIQLLRFQSDLRVLSLVSKDSHSRGVTCSNFFVD 668
            AF+D+ IYV Q+  +++LI   D  KSI LLRFQ + R LSL S+D +   V    F VD
Sbjct: 1217 AFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVD 1276

Query: 669  NDNLGFMLADMHANIIIYSYNPLIEESRNGKQLVRNADIKLPIPVTCSLRI 821
            N NLGF++ D   NII+Y Y P   ES  G++L+R AD  L   V    R+
Sbjct: 1277 NSNLGFLVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRV 1327
>sp|Q9N4C2|CPSF1_CAEEL Probable cleavage and polyadenylation specificity factor, 160 kDa
            subunit (CPSF 160 kDa subunit)
          Length = 1454

 Score =  200 bits (508), Expect = 5e-51
 Identities = 99/267 (37%), Positives = 166/267 (62%), Gaps = 2/267 (0%)
 Frame = +3

Query: 27   NSKEEEPIERDSDIIRAPLPIYTLEFYANESNGMWELVPHAKVVFEDWDKVSCMTTVQLF 206
            + K+EE  E+D + +    P YTL  ++++    W  VP+ ++ FED + V+    V L 
Sbjct: 1059 DDKQEEIHEKDENFVLPAPPKYTLNLFSSQD---WAAVPNTEISFEDMEAVTACEDVALK 1115

Query: 207  SQVTYSGTKDFLAVGSNMSYGEEIPVRGKIMIVDIIDVVPEPGQVLTRHKMKIIYDVEQK 386
            S+ T SG +  LA+G+  +YGEE+ VRG+I++ ++I+VVPEP Q  +  K+K+++D EQK
Sbjct: 1116 SESTISGLETLLAMGTVNNYGEEVLVRGRIILCEVIEVVPEPDQPTSNRKIKVLFDKEQK 1175

Query: 387  GPVTALTSCKGYLVSAIGQKMYIWELKDNELVGVAFVDSEIYVQQLHCMRNLILTGDFLK 566
            GPVT L +  G L+  +GQK++IW+ KDN+L+G++F+D   YV QLH +R + +  D  +
Sbjct: 1176 GPVTGLCAINGLLLCGMGQKVFIWQFKDNDLMGISFLDMHYYVYQLHSLRTIAIACDARE 1235

Query: 567  SIQLLRFQSDLRVLSLVSKDSH--SRGVTCSNFFVDNDNLGFMLADMHANIIIYSYNPLI 740
            S+ L+RFQ D + +S+ S+D    ++    S   VD  ++GF+L+D   NI +++Y P  
Sbjct: 1236 SMSLIRFQEDNKAMSIASRDDRKCAQPPMASQLVVDGAHVGFLLSDETGNITMFNYAPEA 1295

Query: 741  EESRNGKQLVRNADIKLPIPVTCSLRI 821
             ES  G++L   A I +   +   +R+
Sbjct: 1296 PESNGGERLTVRAAINIGTNINAFVRL 1322
>sp|Q9FGR0|CPSF1_ARATH Probable cleavage and polyadenylation specificity factor, 160 kDa
            subunit (CPSF 160 kDa subunit)
          Length = 1448

 Score = 97.1 bits (240), Expect = 5e-20
 Identities = 68/236 (28%), Positives = 117/236 (49%), Gaps = 1/236 (0%)
 Frame = +3

Query: 117  SNGMWELVPHAKVVFEDWDKVSCMTTVQLFSQVTYSGTKDFLAVGSNMSYGEEIPVRGKI 296
            S G WE    AK+  +  +    +  V L +  T    +  LAVG+    GE++  RG++
Sbjct: 1099 SGGPWET--KAKIPMQTSEHALTVRVVTLLNAST-GENETLLAVGTAYVQGEDVAARGRV 1155

Query: 297  MIVDIIDVVPEPGQVLTRHKMKIIYDVEQKGPVTALTSCKGYLVSAIGQKMYIWELKDNE 476
            ++            V+T      +Y  E KG ++A+ S +G+L+ + G K+ + +    E
Sbjct: 1156 LLFSFGKNGDNSQNVVTE-----VYSRELKGAISAVASIQGHLLISSGPKIILHKWNGTE 1210

Query: 477  LVGVAFVDSE-IYVQQLHCMRNLILTGDFLKSIQLLRFQSDLRVLSLVSKDSHSRGVTCS 653
            L GVAF D+  +YV  ++ +++ IL GD  KSI  L ++     LSL++KD  S     +
Sbjct: 1211 LNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSLLAKDFESLDCFAT 1270

Query: 654  NFFVDNDNLGFMLADMHANIIIYSYNPLIEESRNGKQLVRNADIKLPIPVTCSLRI 821
             F +D   L   ++D   NI ++ Y P + ES  G +L+  A+  +   V+  LR+
Sbjct: 1271 EFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKFLRL 1326
>sp|Q9M0V3|DB1A_ARATH DNA damage binding protein 1a (UV-damaged DNA-binding protein 1a)
            (DDB1a)
          Length = 1088

 Score = 47.4 bits (111), Expect = 5e-05
 Identities = 39/159 (24%), Positives = 75/159 (47%), Gaps = 5/159 (3%)
 Frame = +3

Query: 270  EEIPVRGKIMIVDIIDVVPEPGQVLTRHKMKIIYDVEQKGPVTALTSCKGYLVSAIGQK- 446
            E  P +G+I++  + D            ++++I + E KG V +L +  G L++AI QK 
Sbjct: 795  ENEPTKGRILVFIVED-----------GRLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 843

Query: 447  -MYIWELKDN---ELVGVAFVDSEIYVQQLHCMRNLILTGDFLKSIQLLRFQSDLRVLSL 614
             +Y W L+D+   EL         I    +    + I+ GD +KSI LL ++ +   +  
Sbjct: 844  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLLYKHEEGAIEE 903

Query: 615  VSKDSHSRGVTCSNFFVDNDNLGFMLADMHANIIIYSYN 731
             ++D ++  ++      D+  LG   A+ + N++    N
Sbjct: 904  RARDYNANWMSAVEILDDDIYLG---AENNFNLLTVKKN 939
>sp|O49552|DDB1B_ARATH DNA damage binding protein 1b (UV-damaged DNA-binding protein 1b)
            (DDB1b)
          Length = 1088

 Score = 46.2 bits (108), Expect = 1e-04
 Identities = 35/143 (24%), Positives = 67/143 (46%), Gaps = 5/143 (3%)
 Frame = +3

Query: 270  EEIPVRGKIMIVDIIDVVPEPGQVLTRHKMKIIYDVEQKGPVTALTSCKGYLVSAIGQK- 446
            E  P +G+I++            ++   ++++I + E KG V +L +  G L+++I QK 
Sbjct: 795  ENEPTKGRILVF-----------IVEEGRLQLITEKETKGAVYSLNAFNGKLLASINQKI 843

Query: 447  -MYIWELKDN---ELVGVAFVDSEIYVQQLHCMRNLILTGDFLKSIQLLRFQSDLRVLSL 614
             +Y W L+D+   EL         I    +    + I  GD +KSI LL ++ +   +  
Sbjct: 844  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIAVGDLMKSISLLIYKHEEGAIEE 903

Query: 615  VSKDSHSRGVTCSNFFVDNDNLG 683
             ++D ++  +T      D+  LG
Sbjct: 904  RARDYNANWMTAVEILNDDIYLG 926
>sp|Q6QNU4|DDB1_LYCES DNA damage binding protein 1 (UV-damaged DNA-binding protein 1) (High
            pigmentation protein 1)
          Length = 1090

 Score = 45.4 bits (106), Expect = 2e-04
 Identities = 37/147 (25%), Positives = 70/147 (47%), Gaps = 9/147 (6%)
 Frame = +3

Query: 270  EEIPVRGKIMIVDIIDVVPEPGQVLTRHKMKIIYDVEQKGPVTALTSCKGYLVSAIGQKM 449
            E  P +G+I++  + D            K+++I + E KG V +L +  G L++AI QK+
Sbjct: 796  ENEPTKGRILVFIVED-----------GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 844

Query: 450  YIWELKDNELVGVAFVDSE---------IYVQQLHCMRNLILTGDFLKSIQLLRFQSDLR 602
             +++    E  G   + +E         +YVQ      + I+ GD +KSI LL F+ +  
Sbjct: 845  QLYKWASREDGGSRELQTECGHHGHILALYVQT---RGDFIVVGDLMKSISLLIFKHEEG 901

Query: 603  VLSLVSKDSHSRGVTCSNFFVDNDNLG 683
             +   ++D ++  ++      D+  LG
Sbjct: 902  AIEERARDYNANWMSAVEILDDDIYLG 928
>sp|Q6E7D1|DDB1_LYCCS DNA damage binding protein 1 (UV-damaged DNA-binding protein 1)
          Length = 1095

 Score = 45.4 bits (106), Expect = 2e-04
 Identities = 37/147 (25%), Positives = 70/147 (47%), Gaps = 9/147 (6%)
 Frame = +3

Query: 270  EEIPVRGKIMIVDIIDVVPEPGQVLTRHKMKIIYDVEQKGPVTALTSCKGYLVSAIGQKM 449
            E  P +G+I++  + D            K+++I + E KG V +L +  G L++AI QK+
Sbjct: 801  ENEPTKGRILVFIVED-----------GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 849

Query: 450  YIWELKDNELVGVAFVDSE---------IYVQQLHCMRNLILTGDFLKSIQLLRFQSDLR 602
             +++    E  G   + +E         +YVQ      + I+ GD +KSI LL F+ +  
Sbjct: 850  QLYKWASREDGGSRELQTECGHHGHILALYVQT---RGDFIVVGDLMKSISLLIFKHEEG 906

Query: 603  VLSLVSKDSHSRGVTCSNFFVDNDNLG 683
             +   ++D ++  ++      D+  LG
Sbjct: 907  AIEERARDYNANWMSAVEILDDDIYLG 933
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 100,536,711
Number of Sequences: 369166
Number of extensions: 2161970
Number of successful extensions: 5134
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4977
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5128
length of database: 68,354,980
effective HSP length: 109
effective length of database: 48,218,865
effective search space used: 8100769320
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)