Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= Dr_sW_019_H08
(837 letters)
Database: Non-redundant SwissProt sequences
184,735 sequences; 68,354,980 total letters
Score E
Sequences producing significant alignments: (bits) Value
sp|P16356|RPB1_CAEEL DNA-directed RNA polymerase II largest... 39 0.014
sp|P35074|RPB1_CAEBR DNA-directed RNA polymerase II largest... 39 0.018
sp|P04052|RPB1_DROME DNA-directed RNA polymerase II largest... 35 0.26
sp|Q25434|FP1_MYTCO Adhesive plaque matrix protein precurso... 33 0.97
sp|O15018|PDZK3_HUMAN PDZ domain containing protein 3 (PDZ ... 33 0.97
sp|P20823|HNF1A_HUMAN Hepatocyte nuclear factor 1-alpha (HN... 33 1.3
sp|P35085|CBPA_DICDI Calcium-binding protein 33 1.3
sp|P35084|RPB1_DICDI DNA-directed RNA polymerase II largest... 32 1.7
sp|O14686|MLL2_HUMAN Myeloid/lymphoid or mixed-lineage leuk... 32 1.7
sp|P56945|BCA1_HUMAN CRK-associated substrate (p130Cas) (Br... 32 1.7
>sp|P16356|RPB1_CAEEL DNA-directed RNA polymerase II largest subunit
Length = 1852
Score = 39.3 bits (90), Expect = 0.014
Identities = 40/134 (29%), Positives = 48/134 (35%), Gaps = 2/134 (1%)
Frame = +1
Query: 37 YIPGPGAYKPESTIKKVYSK-APEYPFGIRHMERRTDDTPGPNVYS-VDPMLSRTVRSQK 210
Y P Y P S YS +P Y G + +P YS P S T
Sbjct: 1689 YSPTSPTYSPTSP---TYSPTSPSYESGGGYSPSSPKYSPSSPTYSPTSPSYSPTSPQYS 1745
Query: 211 PSAPMISMSSRSKIGGFHEDLQKTPGPGTYKVTDPGVFKSRYPMYSMTSRNPMPGDTTQK 390
P++P S SS + TP TY T P F S P YS TS P +
Sbjct: 1746 PTSPQYSPSSPTY----------TPSSPTYNPTSPRGFSS--PQYSPTSPTYSPTSPSYT 1793
Query: 391 PGPGAYSAELVTFT 432
P YS T+T
Sbjct: 1794 PSSPQYSPTSPTYT 1807
>sp|P35074|RPB1_CAEBR DNA-directed RNA polymerase II largest subunit
Length = 1853
Score = 38.9 bits (89), Expect = 0.018
Identities = 39/134 (29%), Positives = 48/134 (35%), Gaps = 2/134 (1%)
Frame = +1
Query: 37 YIPGPGAYKPESTIKKVYS-KAPEYPFGIRHMERRTDDTPGPNVYS-VDPMLSRTVRSQK 210
Y P Y P S + YS +P+Y +P YS P S T
Sbjct: 1700 YSPTSPTYSPTSPSYEGYSPSSPKY-------------SPSSPTYSPTSPSYSPTSPQYS 1746
Query: 211 PSAPMISMSSRSKIGGFHEDLQKTPGPGTYKVTDPGVFKSRYPMYSMTSRNPMPGDTTQK 390
P++P S SS + TP TY T P F S P YS TS P +
Sbjct: 1747 PTSPQYSPSSPTY----------TPSSPTYNPTSPRAFSS--PQYSPTSPTYSPTSPSYT 1794
Query: 391 PGPGAYSAELVTFT 432
P YS T+T
Sbjct: 1795 PSSPQYSPTSPTYT 1808
Score = 35.8 bits (81), Expect = 0.15
Identities = 36/129 (27%), Positives = 48/129 (37%), Gaps = 1/129 (0%)
Frame = +1
Query: 28 SLDYIPGPGAYKPESTIKKVYSKAPEYPFGIRHMERRTDDTPGPNVYS-VDPMLSRTVRS 204
S Y P +Y P S S +P P R + +P YS P S T +
Sbjct: 1648 SPSYSPTSPSYSPTSP-----SYSPTSPSYSPSSPRYSPTSP---TYSPTSPTYSPTSPT 1699
Query: 205 QKPSAPMISMSSRSKIGGFHEDLQKTPGPGTYKVTDPGVFKSRYPMYSMTSRNPMPGDTT 384
P++P S +S S G + +P TY T P + P YS TS P T
Sbjct: 1700 YSPTSPTYSPTSPSYEGYSPSSPKYSPSSPTYSPTSPS-YSPTSPQYSPTSPQYSPSSPT 1758
Query: 385 QKPGPGAYS 411
P Y+
Sbjct: 1759 YTPSSPTYN 1767
>sp|P04052|RPB1_DROME DNA-directed RNA polymerase II largest subunit
Length = 1887
Score = 35.0 bits (79), Expect = 0.26
Identities = 41/161 (25%), Positives = 67/161 (41%), Gaps = 4/161 (2%)
Frame = +1
Query: 4 LYSRPK--SLSLDYIPGPGAYKPESTIKKVYSK-APEYPFGIRHMERRTDDTPGPNVYSV 174
LY+ P+ S + ++ P Y P S+ YS +P Y ++ + G N+YS
Sbjct: 1611 LYASPRYASTTPNFNPQSTGYSPSSS---GYSPTSPVYSPTVQFQSSPSFAGSGSNIYSP 1667
Query: 175 DPMLSRTVRSQKPSAPMISMSSRSKIGGFHEDLQKTPGPGTYKVTDPGVFKSRYPMYSMT 354
S + + P++P S +S S +P +Y T P + P YS T
Sbjct: 1668 GNAYSPSSSNYSPNSPSYSPTSPSY----------SPSSPSYSPTSP-CYSPTSPSYSPT 1716
Query: 355 SRNPMPGDTTQKPGPGAYSAELVTFTRPGAPKFT-FGIRHS 474
S N P + P YSA P +P ++ G+++S
Sbjct: 1717 SPNYTPVTPSYSPTSPNYSAS--PQYSPASPAYSQTGVKYS 1755
Score = 30.8 bits (68), Expect = 4.8
Identities = 39/154 (25%), Positives = 50/154 (32%), Gaps = 15/154 (9%)
Frame = +1
Query: 37 YIPGPGAYKPESTIKKVYSKAPEYPFGIRHMERRTDDTPGPNVYS-VDPMLSRTVRSQKP 213
Y P Y P S +P+Y TPG YS P S T P
Sbjct: 1754 YSPTSPTYSPPSPSYDGSPGSPQY-------------TPGSPQYSPASPKYSPTSPLYSP 1800
Query: 214 SAPMISMSSRSKIGGFHEDLQKTPGPGTYKVTDPGV-------------FKSRYPMYSMT 354
S+P S S+ Q +P TY T P + P Y+ T
Sbjct: 1801 SSPQHSPSN-----------QYSPTGSTYSATSPRYSPNMSIYSPSSTKYSPTSPTYTPT 1849
Query: 355 SRNPMPGDTTQKP-GPGAYSAELVTFTRPGAPKF 453
+RN P P P YS ++ P +P F
Sbjct: 1850 ARNYSPTSPMYSPTAPSHYSPTSPAYS-PSSPTF 1882
>sp|Q25434|FP1_MYTCO Adhesive plaque matrix protein precursor (Foot protein 1) (MCFP1)
Length = 872
Score = 33.1 bits (74), Expect = 0.97
Identities = 38/128 (29%), Positives = 45/128 (35%), Gaps = 11/128 (8%)
Frame = +1
Query: 49 PGAYKPESTIKKVYSKAPEYPFGIRHMERRTDDTPGPNVYSVDPMLSRTVRSQKPSAPMI 228
P YKP+ T Y P YP P Y P T QKPS P I
Sbjct: 279 PPTYKPKVTYPPTYKPKPSYP------PTYKPKITYPPTYKPKPSYP-TPYKQKPSYPPI 331
Query: 229 SMSSRSKIGGFHEDLQKTPGPGTY--KVTDPGVFKSR--YP-------MYSMTSRNPMPG 375
S S + K P TY K+T P +K + YP YS T + +
Sbjct: 332 YKSKSSYPTSYK---SKKTYPPTYKPKITYPPTYKPKPSYPPSYKPKKTYSPTYKPKITY 388
Query: 376 DTTQKPGP 399
T KP P
Sbjct: 389 PPTYKPKP 396
Score = 31.2 bits (69), Expect = 3.7
Identities = 34/117 (29%), Positives = 41/117 (35%), Gaps = 2/117 (1%)
Frame = +1
Query: 49 PGAYKPESTIKKVYSKAPEYPFGIRHMERRTDDTPGPNVYSVDPMLSRTVRSQKPSAPMI 228
P YKP+ T Y P YP P Y P T QKPS P I
Sbjct: 459 PPTYKPKITYPPTYKPKPSYP------PTYKPKITYPPTYKRKPSYP-TPYKQKPSYPPI 511
Query: 229 SMSSRSKIGGFHEDLQKTPGPGTY--KVTDPGVFKSRYPMYSMTSRNPMPGDTTQKP 393
S S + K P TY K+T P +K + P Y + + T KP
Sbjct: 512 YKSKSSYPTSYK---SKKTYPPTYKPKITYPPTYKPK-PSYPPSYKPKTTYPPTYKP 564
Score = 30.8 bits (68), Expect = 4.8
Identities = 37/132 (28%), Positives = 50/132 (37%), Gaps = 9/132 (6%)
Frame = +1
Query: 49 PGAYKPESTIKKVYSKAPEYPFGIR----HMERRTDDTPGPNVYSVDPMLSRTVRSQKPS 216
P +YKP++T Y YP + + P Y P T QKPS
Sbjct: 549 PPSYKPKTTYPPTYKPKIRYPPTYKPKASYPPTYKPKITYPPTYKPKPSYP-TPYKQKPS 607
Query: 217 APMISMSSRSKIGGFHEDLQKTPGPGTY--KVTDPGVFKSRYPMYSMTSRNPMPGDTTQK 390
P I S S + K P TY K+T P +K + P Y + R + T K
Sbjct: 608 YPPIYKSKSSYPTAYK---SKKTYPPTYKPKITYPPTYKPK-PSYPPSYRPKITYPPTYK 663
Query: 391 PG---PGAYSAE 417
P P AY ++
Sbjct: 664 PKKSYPQAYKSK 675
>sp|O15018|PDZK3_HUMAN PDZ domain containing protein 3 (PDZ domain containing protein 2)
(Activated in prostate cancer protein)
Length = 2839
Score = 33.1 bits (74), Expect = 0.97
Identities = 28/101 (27%), Positives = 41/101 (40%), Gaps = 2/101 (1%)
Frame = +1
Query: 151 PGPNVYSVDP-MLSRTVRSQKPSAPMISM-SSRSKIGGFHEDLQKTPGPGTYKVTDPGVF 324
P P+ SVD +SR +P++P ++ +RS + HE +P PG P
Sbjct: 1247 PDPSKTSVDTGQVSRPENPSQPASPRVAKCKARSPVRLPHEG---SPSPGEKAAAPPDYS 1303
Query: 325 KSRYPMYSMTSRNPMPGDTTQKPGPGAYSAELVTFTRPGAP 447
K+R + T N + GPGA PG P
Sbjct: 1304 KTRSASETSTPHNTRRVAALRGAGPGAEGMTPAGAVLPGDP 1344
>sp|P20823|HNF1A_HUMAN Hepatocyte nuclear factor 1-alpha (HNF-1A) (Liver-specific
transcription factor LF-B1) (LFB1) (Transcription factor
1) (TCF-1)
Length = 631
Score = 32.7 bits (73), Expect = 1.3
Identities = 38/165 (23%), Positives = 67/165 (40%), Gaps = 11/165 (6%)
Frame = +1
Query: 43 PGPGAYKPESTIKKVYSKA--PEYPFGIRHMERRTDDTP-------GPNVYSVDPMLSRT 195
PGPG P + + A P G+R+ + T +T GP V P+ +
Sbjct: 293 PGPGPALPAHSSPGLPPPALSPSKVHGVRYGQPATSETAEVPSSSGGPLVTVSTPLHQVS 352
Query: 196 VRSQKPSAPMISMSSR--SKIGGFHEDLQKTPGPGTYKVTDPGVFKSRYPMYSMTSRNPM 369
+PS ++S ++ S GG + + + T PG+ + + + +
Sbjct: 353 PTGLEPSHSLLSTEAKLVSAAGGPLPPVSTLTALHSLEQTSPGLNQQPQNLIMAS----L 408
Query: 370 PGDTTQKPGPGAYSAELVTFTRPGAPKFTFGIRHSQYKGEMIVNN 504
PG T GPG ++ TFT GA G+ +Q + ++N+
Sbjct: 409 PGVMTI--GPGEPASLGPTFTNTGASTLVIGLASTQAQSVPVINS 451
>sp|P35085|CBPA_DICDI Calcium-binding protein
Length = 467
Score = 32.7 bits (73), Expect = 1.3
Identities = 24/64 (37%), Positives = 27/64 (42%), Gaps = 6/64 (9%)
Frame = +1
Query: 274 QKTPG----PGTYKVTDPGVFKSRYPMYSMTSRNPMPGDTTQKP--GPGAYSAELVTFTR 435
Q TPG PG Y PG S P Y T + PG Q P PG Y + +
Sbjct: 34 QSTPGAPGAPGQYPPQQPGAPGSNLPPYPGTQQPGAPGAPGQYPPQQPGQYPPQ-----Q 88
Query: 436 PGAP 447
PGAP
Sbjct: 89 PGAP 92
>sp|P35084|RPB1_DICDI DNA-directed RNA polymerase II largest subunit
Length = 902
Score = 32.3 bits (72), Expect = 1.7
Identities = 37/142 (26%), Positives = 51/142 (35%), Gaps = 7/142 (4%)
Frame = +1
Query: 28 SLDYIPGPGAYKPES-----TIKKVYSKAPEY-PFGIRHMERRTDDTPGPNVYS-VDPML 186
S Y P +Y P S T +P Y P + +P YS P
Sbjct: 760 SPSYSPTSPSYSPTSPFYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSY 819
Query: 187 SRTVRSQKPSAPMISMSSRSKIGGFHEDLQKTPGPGTYKVTDPGVFKSRYPMYSMTSRNP 366
S T S P++P S +S S +P +Y T P + P YS +S +
Sbjct: 820 SPTSPSYSPTSPSYSPTSPSY----------SPTSPSYSPTSPS-YSPTSPSYSPSSPSY 868
Query: 367 MPGDTTQKPGPGAYSAELVTFT 432
P + P +YS TFT
Sbjct: 869 SPSSPSYSPSSPSYSPSSPTFT 890
>sp|O14686|MLL2_HUMAN Myeloid/lymphoid or mixed-lineage leukemia protein 2 (ALL1-related
protein)
Length = 5262
Score = 32.3 bits (72), Expect = 1.7
Identities = 31/134 (23%), Positives = 49/134 (36%), Gaps = 23/134 (17%)
Frame = +1
Query: 154 GPNVYSVDPMLSRTVRSQKPSAPMISMSSRSKIGGFHEDLQKTPGPGTYKVTDPG----- 318
GP + D LSR PS+ + ++SR +GG Q+ P PG+ +
Sbjct: 2487 GPGSFPSDDRLSRPPPPATPSS--MDVNSRQLVGGSQAFYQRAPYPGSLPLQQQQQQLWQ 2544
Query: 319 ------------VFKSRYP------MYSMTSRNPMPGDTTQKPGPGAYSAELVTFTRPGA 444
+R+P + +P+ G +T+ PGPG P
Sbjct: 2545 QQQATAATSMRFAMSARFPSTPGPELGRQALGSPLAGISTRLPGPGE------PVPGPAG 2598
Query: 445 PKFTFGIRHSQYKG 486
P +RH+ KG
Sbjct: 2599 PAQFIELRHNVQKG 2612
>sp|P56945|BCA1_HUMAN CRK-associated substrate (p130Cas) (Breast cancer anti-estrogen
resistance 1 protein)
Length = 870
Score = 32.3 bits (72), Expect = 1.7
Identities = 24/94 (25%), Positives = 36/94 (38%), Gaps = 4/94 (4%)
Frame = +1
Query: 151 PGPNVYSVDPMLSRTVRSQKPSAPMISMSSRSKIGGFHEDLQKTPGPGTYKVTDPGVFKS 330
P P PML T + Q S ++ S+++ G L + PGP + P K
Sbjct: 92 PAPPASQYTPMLPNTYQPQPDSVYLVPTPSKAQQG-----LYQVPGPSPQFQSPPA--KQ 144
Query: 331 RYPMYSMTSRNPMPGDTTQ----KPGPGAYSAEL 420
T +P P T PGPG + ++
Sbjct: 145 TSTFSKQTPHHPFPSPATDLYQVPPGPGGPAQDI 178
Database: Non-redundant SwissProt sequences
Posted date: Dec 6, 2005 7:40 AM
Number of letters in database: 68,354,980
Number of sequences in database: 184,735
Database: swissprot.01
Posted date: Dec 6, 2005 8:18 AM
Number of letters in database: 66,202,850
Number of sequences in database: 184,431
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 103,728,254
Number of Sequences: 369166
Number of extensions: 2425341
Number of successful extensions: 6539
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 6119
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 6479
length of database: 68,354,980
effective HSP length: 109
effective length of database: 48,218,865
effective search space used: 8148988185
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)