Planaria EST Database


DrC_00631

BLASTX 2.2.13 [Nov-27-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= DrC_00631
         (1050 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q8TC41|IBR1_HUMAN  IBR domain containing protein 1             132   2e-30
sp|O95376|ARI2_HUMAN  Ariadne-2 protein homolog (ARI-2) (Tri...    79   2e-14
sp|Q9Z1K6|ARI2_MOUSE  Ariadne-2 protein homolog (ARI-2) (Tri...    75   4e-13
sp|Q9Y4X5|ARI1_HUMAN  Ariadne-1 protein homolog (ARI-1) (Ubi...    72   2e-12
sp|Q9Z1K5|ARI1_MOUSE  Ariadne-1 protein homolog (ARI-1) (Ubi...    72   2e-12
sp|Q22431|ARI2_CAEEL  Probable ariadne-2 protein (Ari-2)           68   4e-11
sp|O76924|ARI2_DROME  Ariadne-2 protein (Ari-2)                    67   9e-11
sp|Q8IWT3|PARC_HUMAN  p53-associated parkin-like cytoplasmic...    67   1e-10
sp|P36113|YKZ7_YEAST  Hypothetical 63.6 kDa protein in YPT52...    66   1e-10
sp|Q80TT8|PARC_MOUSE  p53-associated parkin-like cytoplasmic...    64   6e-10
>sp|Q8TC41|IBR1_HUMAN IBR domain containing protein 1
          Length = 275

 Score =  132 bits (332), Expect = 2e-30
 Identities = 69/211 (32%), Positives = 106/211 (50%), Gaps = 1/211 (0%)
 Frame = +1

Query: 193 LGRIHVECPSMNCHKPIRFQSIWIRLSENNRKIYEKMKNSHEQTDYKKLCPHCMQFYELT 372
           LG++ ++CP   C + +   ++   L+  +   Y+            K CP C  F   T
Sbjct: 5   LGQVEIKCPITECFEFLEETTVVYNLTHEDSIKYKYFLELGRIDSSTKPCPQCKHF--TT 62

Query: 373 NEEKNQLTHXXXXXXXXXGFNVKCHMCNWEWCFKCQTPVHN-LSCKKNLSTDKLLMKWAN 549
            ++K    H          + ++C  C + WCFKC +P H  ++CK+    DKLL  WA+
Sbjct: 63  FKKKG---HIPTPSRSESKYKIQCPTCQFVWCFKCHSPWHEGVNCKEYKKGDKLLRHWAS 119

Query: 550 SPAETIQSSVKKARKCPKCHVLVERNGGCPHMECSKCKCSWCYDCGRRRIKVKHIPFMNH 729
                I+   + A+KCPKC + ++R  GC HM CS+C  ++CY CG R  +++   F +H
Sbjct: 120 E----IEHGQRNAQKCPKCKIHIQRTEGCDHMTCSQCNTNFCYRCGERYRQLRF--FGDH 173

Query: 730 DSKFFILGCSKNFLASKPHLRRFVRISVFCG 822
            S   I GC   +L  +PHLRR VR SV  G
Sbjct: 174 TSNLSIFGCKYRYLPERPHLRRLVRGSVCAG 204
>sp|O95376|ARI2_HUMAN Ariadne-2 protein homolog (ARI-2) (Triad1 protein)
          Length = 493

 Score = 79.3 bits (194), Expect = 2e-14
 Identities = 52/212 (24%), Positives = 94/212 (44%), Gaps = 13/212 (6%)
 Frame = +1

Query: 88  CSICYETHGAFVKR------CCDMIVCEQCYTQYLNYQVSNLGRIHVECPSMNCHKPIRF 249
           C++C +    FV++       C    C  C+ Q+ +  V +   + V C + +C  P+R 
Sbjct: 139 CAVCMQ----FVRKENLLSLACQHQFCRSCWEQHCSVLVKDGVGVGVSCMAQDC--PLRT 192

Query: 250 QSIWIRLSENNRKIYEKMKN----SHEQTDYK-KLCP--HCMQFYELTNEEKNQLTHXXX 408
              ++     N ++ EK +      + ++ Y+ +LCP   C     +      +      
Sbjct: 193 PEDFVFPLLPNEELREKYRRYLFRDYVESHYQLQLCPGADCPMVIRVQEPRARR------ 246

Query: 409 XXXXXXGFNVKCHMCNWEWCFKCQTPVHNLSCKKNLSTDKLLMKWANSPAETIQSSVKKA 588
                    V+C+ CN  +CFKC+   H      + +T +  +      +ET        
Sbjct: 247 ---------VQCNRCNEVFCFKCRQMYH---APTDCATIRKWLTKCADDSETANYISAHT 294

Query: 589 RKCPKCHVLVERNGGCPHMECSKCKCSWCYDC 684
           + CPKC++ +E+NGGC HM+CSKCK  +C+ C
Sbjct: 295 KDCPKCNICIEKNGGCNHMQCSKCKHDFCWMC 326
>sp|Q9Z1K6|ARI2_MOUSE Ariadne-2 protein homolog (ARI-2) (Triad1 protein)
           (UbcM4-interacting protein 48)
          Length = 492

 Score = 74.7 bits (182), Expect = 4e-13
 Identities = 51/215 (23%), Positives = 92/215 (42%), Gaps = 16/215 (7%)
 Frame = +1

Query: 88  CSICYETHGAFVKR------CCDMIVCEQCYTQYLNYQVSNLGRIHVECPSMNCHKPIRF 249
           C++C +    FV++       C    C  C+ Q+ +  V +   + + C + +C  P+R 
Sbjct: 138 CAVCMQ----FVRKENLLSLACQHQFCRSCWEQHCSVLVKDGVGVGISCMAQDC--PLRT 191

Query: 250 QSIWIRLSENNRKIYEKMKN--------SHEQTDYKKLCP--HCMQFYELTNEEKNQLTH 399
              ++     N ++ +K +         SH Q    +LCP   C     +      +   
Sbjct: 192 PEDFVFPLLPNEELRDKYRRYLFRDYVESHFQL---QLCPGADCPMVIRVQEPRARR--- 245

Query: 400 XXXXXXXXXGFNVKCHMCNWEWCFKCQTPVHNLSCKKNLSTDKLLMKWANSPAETIQSSV 579
                       V+C+ C+  +CFKC+   H      + +T +  +      +ET     
Sbjct: 246 ------------VQCNRCSEVFCFKCRQMYH---APTDCATIRKWLTKCADDSETANYIS 290

Query: 580 KKARKCPKCHVLVERNGGCPHMECSKCKCSWCYDC 684
              + CPKC++ +E+NGGC HM+CSKCK  +C+ C
Sbjct: 291 AHTKDCPKCNICIEKNGGCNHMQCSKCKHDFCWMC 325
>sp|Q9Y4X5|ARI1_HUMAN Ariadne-1 protein homolog (ARI-1) (Ubiquitin-conjugating enzyme
           E2-binding protein 1) (UbcH7-binding protein)
           (UbcM4-interacting protein) (HHARI) (H7-AP2) (MOP-6)
          Length = 557

 Score = 72.4 bits (176), Expect = 2e-12
 Identities = 53/211 (25%), Positives = 88/211 (41%), Gaps = 9/211 (4%)
 Frame = +1

Query: 79  DATCSICYETH--GAFVKRCCDMIVCEQCYTQYLNYQVSNLGRIH-VECPSMNCHKPIRF 249
           D  C ICY  +    F    C    C QC+++YL  ++   G    + CP+  C   +  
Sbjct: 183 DMPCQICYLNYPNSYFTGLECGHKFCMQCWSEYLTTKIMEEGMGQTISCPAHGCDILVDD 242

Query: 250 QSIWIRLSENNRKIYEKMKNSHEQTDYKKLCPHCMQFYELTNEEKNQLTHXXXXXXXXXG 429
            ++   ++++      K+K  H  T+    C   +++    +       H          
Sbjct: 243 NTVMRLITDSK----VKLKYQHLITNSFVECNRLLKWCPAPD------CHHVVKVQYPDA 292

Query: 430 FNVKCHMCNWEWCFKCQTPVHN-LSCKKNLSTDKLLMKW---ANSPAETIQSSVKKARKC 597
             V+C  C  ++CF C    H+ + CK        L KW    +  +ET        ++C
Sbjct: 293 KPVRCK-CGRQFCFNCGENWHDPVKCK-------WLKKWIKKCDDDSETSNWIAANTKEC 344

Query: 598 PKCHVLVERNGGCPHMEC--SKCKCSWCYDC 684
           PKCHV +E++GGC HM C    CK  +C+ C
Sbjct: 345 PKCHVTIEKDGGCNHMVCRNQNCKAEFCWVC 375
>sp|Q9Z1K5|ARI1_MOUSE Ariadne-1 protein homolog (ARI-1) (Ubiquitin-conjugating enzyme
           E2-binding protein 1) (UbcH7-binding protein)
           (UbcM4-interacting protein 77)
          Length = 555

 Score = 72.4 bits (176), Expect = 2e-12
 Identities = 53/211 (25%), Positives = 88/211 (41%), Gaps = 9/211 (4%)
 Frame = +1

Query: 79  DATCSICYETH--GAFVKRCCDMIVCEQCYTQYLNYQVSNLGRIH-VECPSMNCHKPIRF 249
           D  C ICY  +    F    C    C QC+++YL  ++   G    + CP+  C   +  
Sbjct: 181 DMPCQICYLNYPNSYFTGLECGHKFCMQCWSEYLTTKIMEEGMGQTISCPAHGCDILVDD 240

Query: 250 QSIWIRLSENNRKIYEKMKNSHEQTDYKKLCPHCMQFYELTNEEKNQLTHXXXXXXXXXG 429
            ++   ++++      K+K  H  T+    C   +++    +       H          
Sbjct: 241 NTVMRLITDSK----VKLKYQHLITNSFVECNRLLKWCPAPD------CHHVVKVQYPDA 290

Query: 430 FNVKCHMCNWEWCFKCQTPVHN-LSCKKNLSTDKLLMKW---ANSPAETIQSSVKKARKC 597
             V+C  C  ++CF C    H+ + CK        L KW    +  +ET        ++C
Sbjct: 291 KPVRCK-CGRQFCFNCGENWHDPVKCK-------WLKKWIKKCDDDSETSNWIAANTKEC 342

Query: 598 PKCHVLVERNGGCPHMEC--SKCKCSWCYDC 684
           PKCHV +E++GGC HM C    CK  +C+ C
Sbjct: 343 PKCHVTIEKDGGCNHMVCRNQNCKAEFCWVC 373
>sp|Q22431|ARI2_CAEEL Probable ariadne-2 protein (Ari-2)
          Length = 482

 Score = 68.2 bits (165), Expect = 4e-11
 Identities = 51/210 (24%), Positives = 84/210 (40%), Gaps = 11/210 (5%)
 Frame = +1

Query: 88  CSIC-YETHGAFVKRCCDMIVCEQCYTQYLNYQVSNLGRIHVECPSMNC--HKPIRFQSI 258
           CS+C  + +       C    CE C+  ++  ++S      +EC    C  + P  F   
Sbjct: 129 CSVCAMDGYTELPHLTCGHCFCEHCWKSHVESRLSEGVASRIECMESECEVYAPSEFVLS 188

Query: 259 WIRLSENNRKIYEK-----MKNSHEQTDY--KKLCPHCMQFYELTNEEKNQLTHXXXXXX 417
            I+ S   +  YE+     M NSH    +     CP  ++  E+  +             
Sbjct: 189 IIKNSPVIKLKYERFLLRDMVNSHPHLKFCVGNECPVIIRSTEVKPKR------------ 236

Query: 418 XXXGFNVKCHMCNWEWCFKCQTPVHN-LSCKKNLSTDKLLMKWANSPAETIQSSVKKARK 594
                 V C  C+  +C KC    H   SC+    T K  M      +ET        + 
Sbjct: 237 ------VTCMQCHTSFCVKCGADYHAPTSCE----TIKQWMTKCADDSETANYISAHTKD 286

Query: 595 CPKCHVLVERNGGCPHMECSKCKCSWCYDC 684
           CP+CH  +E+ GGC H++C++C+  +C+ C
Sbjct: 287 CPQCHSCIEKAGGCNHIQCTRCRHHFCWMC 316
>sp|O76924|ARI2_DROME Ariadne-2 protein (Ari-2)
          Length = 509

 Score = 67.0 bits (162), Expect = 9e-11
 Identities = 49/205 (23%), Positives = 76/205 (37%), Gaps = 6/205 (2%)
 Frame = +1

Query: 88  CSICYETH--GAFVKRCCDMIVCEQCYTQYLNYQVSNLGRIHVECPSMNCHKPIRFQSIW 261
           C +C  +     F    C    C+ C+T Y   Q+       + C +  C+  +    + 
Sbjct: 153 CPVCASSQLGDKFYSLACGHSFCKDCWTIYFETQIFQGISTQIGCMAQMCNVRVPEDLV- 211

Query: 262 IRLSENNRKIYEKMKNSHEQTDYKKLCPHCMQFYELTNEEKNQLTHXXXXXXXXXGFNVK 441
             L+   R +           DY K  P  ++F    N                      
Sbjct: 212 --LTLVTRPVMRDKYQQFAFKDYVKSHPE-LRFCPGPN------CQIIVQSSEISAKRAI 262

Query: 442 CHMCNWEWCFKCQTPVHNLSCKKNLSTD-KLLMKWANSPA---ETIQSSVKKARKCPKCH 609
           C  C+  +CF+C    H         TD +++ KW    A   ET        + CPKCH
Sbjct: 263 CKACHTGFCFRCGMDYH-------APTDCQVIKKWLTKCADDSETANYISAHTKDCPKCH 315

Query: 610 VLVERNGGCPHMECSKCKCSWCYDC 684
           + +E+NGGC HM+C  CK  +C+ C
Sbjct: 316 ICIEKNGGCNHMQCFNCKHDFCWMC 340
>sp|Q8IWT3|PARC_HUMAN p53-associated parkin-like cytoplasmic protein (UbcH7 associated
            protein 1)
          Length = 2517

 Score = 66.6 bits (161), Expect = 1e-10
 Identities = 47/213 (22%), Positives = 79/213 (37%), Gaps = 14/213 (6%)
 Frame = +1

Query: 88   CSICYETHGA---FVKRCCDMIVCEQCYTQYLNYQVSNLGRIHVECPSMNCHKPIRFQSI 258
            C +C    G        CC    C+ C+ +YL  ++     ++  CP  +C  P +    
Sbjct: 2070 CPVCVSPLGCDDDLPSLCCMHYCCKSCWNEYLTTRIEQNLVLNCTCPIADC--PAQPTGA 2127

Query: 259  WIRLSENNRKIYEKMKNSHEQTDYKKLCPHCMQFYELTNEEKNQLTHXXXXXXXXXGFNV 438
            +IR   ++ ++  K    +E+   +     C      TN +               G   
Sbjct: 2128 FIRAIVSSPEVISK----YEKALLRGYVESCSNLTWCTNPQGCD----RILCRQGLGCGT 2179

Query: 439  KCHMCNWEWCFKCQTPV--HNLSCKKNLSTDKLLMKWANSPAETIQSSVKK--------- 585
             C  C W  CF C  P   +  SC         + +W +        SV+          
Sbjct: 2180 TCSKCGWASCFNCSFPEAHYPASCGH-------MSQWVDDGGYYDGMSVEAQSKHLAKLI 2232

Query: 586  ARKCPKCHVLVERNGGCPHMECSKCKCSWCYDC 684
            +++CP C   +E+N GC HM C+KC   +C+ C
Sbjct: 2233 SKRCPSCQAPIEKNEGCLHMTCAKCNHGFCWRC 2265
>sp|P36113|YKZ7_YEAST Hypothetical 63.6 kDa protein in YPT52-GCN3 intergenic region
          Length = 551

 Score = 66.2 bits (160), Expect = 1e-10
 Identities = 66/255 (25%), Positives = 104/255 (40%), Gaps = 11/255 (4%)
 Frame = +1

Query: 34  NDRLRPIEYEGLWESDATCSICYETHGAFVKRC-CDMIVCEQCYTQYLNYQVSNLGRIHV 210
           N   R +E++    +D TC IC +          C    C  CY  Y+  ++   G I +
Sbjct: 165 NSHFREVEFK----NDFTCIICCDKKDTETFALECGHEYCINCYRHYIKDKLHE-GNI-I 218

Query: 211 ECPSMNCHKPIRFQSI-WIRLSENNRKIYEKMKNSHEQTDYK--KLCPH--CMQFYELTN 375
            C  M+C   ++ + I  +    ++ K+ +    S  Q   +  K CP   C     L +
Sbjct: 219 TC--MDCSLALKNEDIDKVMGHPSSSKLMDSSIKSFVQKHNRNYKWCPFADCKSIVHLRD 276

Query: 376 E----EKNQLTHXXXXXXXXXGFNVKCHMCNWEWCFKCQTPVHN-LSCKKNLSTDKLLMK 540
                E  +L +            VKC+  +  +CF C   VH+   CK   +     +K
Sbjct: 277 TSSLPEYTRLHYSPF---------VKCNSFH-RFCFNCGFEVHSPADCKITTAW----VK 322

Query: 541 WANSPAETIQSSVKKARKCPKCHVLVERNGGCPHMECSKCKCSWCYDCGRRRIKVKHIPF 720
            A   +E +   +   ++CPKC V +E+NGGC HM CS CK  +C+ C          P+
Sbjct: 323 KARKESEILNWVLSHTKECPKCSVNIEKNGGCNHMVCSSCKYEFCWIC--------EGPW 374

Query: 721 MNHDSKFFILGCSKN 765
             H   FF     KN
Sbjct: 375 APHGKNFFQCTMYKN 389
>sp|Q80TT8|PARC_MOUSE p53-associated parkin-like cytoplasmic protein
          Length = 1865

 Score = 64.3 bits (155), Expect = 6e-10
 Identities = 46/213 (21%), Positives = 79/213 (37%), Gaps = 14/213 (6%)
 Frame = +1

Query: 88   CSICYET---HGAFVKRCCDMIVCEQCYTQYLNYQVSNLGRIHVECPSMNCHKPIRFQSI 258
            C +C      H      CC    C+ C+ +YL  ++     ++  CP  +C  P +    
Sbjct: 1409 CPVCVTPLGPHDDSPSLCCLHCCCKSCWNEYLTTRIEQNFVLNCTCPIADC--PAQPTGA 1466

Query: 259  WIRLSENNRKIYEKMKNSHEQTDYKKLCPHCMQFYELTNEEKNQLTHXXXXXXXXXGFNV 438
            +IR   ++ ++  K    +E+   +     C      TN +               G   
Sbjct: 1467 FIRNIVSSPEVISK----YEKALLRGYVESCSNLTWCTNPQGCD----RILCRQGLGSGT 1518

Query: 439  KCHMCNWEWCFKCQTPV--HNLSCKKNLSTDKLLMKWANSPAETIQSSVKK--------- 585
             C  C W  CF C  P   +  SC         + +W +        SV+          
Sbjct: 1519 TCSKCGWASCFSCSFPEAHYPASCGH-------MSQWVDDGGYYDGMSVEAQSKHLAKLI 1571

Query: 586  ARKCPKCHVLVERNGGCPHMECSKCKCSWCYDC 684
            +++CP C   +E+N GC HM C++C   +C+ C
Sbjct: 1572 SKRCPSCQAPIEKNEGCLHMTCARCNHGFCWRC 1604
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 116,337,128
Number of Sequences: 369166
Number of extensions: 2329990
Number of successful extensions: 6102
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 5758
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 6065
length of database: 68,354,980
effective HSP length: 111
effective length of database: 47,849,395
effective search space used: 11388156010
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
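
Note on the statistics above: the bit scores and E-values in this report can be roughly recomputed from the gapped Lambda and K values and the effective search space, using the standard Karlin-Altschul relations S' = (Lambda*S - ln K) / ln 2 and E = N * 2^(-S'). The sketch below is only an illustration, not the code BLAST itself runs. The function and constant names are hypothetical, and all numbers are copied from this report. With these inputs the bit scores agree with the reported ones (for example raw score 194 gives 79.3 bits), while the E-values come out slightly smaller than the reported ones, presumably because BLAST applies further corrections that are not modelled here.

```python
import math

# Values copied from the statistics block of this report (gapped parameters).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
DB_LETTERS = 68_354_980
DB_SEQUENCES = 184_735
EFFECTIVE_HSP_LENGTH = 111
EFFECTIVE_SEARCH_SPACE = 11_388_156_010


def bit_score(raw_score):
    """Convert a raw alignment score to a bit score: (Lambda*S - ln K) / ln 2."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def approx_evalue(bits, search_space=EFFECTIVE_SEARCH_SPACE):
    """Approximate E-value as search_space * 2^(-bits).

    Assumption: one search space for every hit; any per-subject
    adjustment BLAST may apply is ignored.
    """
    return search_space * 2.0 ** (-bits)


def effective_db_length():
    """Database letters minus one effective HSP length per sequence."""
    return DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH


if __name__ == "__main__":
    # Raw scores are the parenthesised values on the "Score =" lines.
    for raw in (332, 194, 182, 176, 165):
        bits = bit_score(raw)
        print(f"raw={raw:4d}  bits={bits:6.1f}  E~{approx_evalue(bits):.1e}")
    print("effective database length:", effective_db_length())
```

The last function reproduces the "effective length of database" line exactly: 68,354,980 - 184,735 * 111 = 47,849,395.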

Cluster detail

DrC_00631

  1. Dr_sW_016_K20
  2. Dr_sW_006_N23