Planarian EST Database


Dr_sW_026_E22

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_026_E22
         (557 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P08121|CO3A1_MOUSE  Collagen alpha 1(III) chain precursor       85   1e-16
sp|P13941|CO3A1_RAT  Collagen alpha 1(III) chain precursor         84   2e-16
sp|P02461|CO3A1_HUMAN  Collagen alpha 1(III) chain precursor       83   5e-16
sp||P02459_2  [Segment 2 of 2] Collagen alpha 1(II) chain pr...    81   2e-15
sp|O93484|CO1A2_ONCMY  Collagen alpha 2(I) chain precursor         79   6e-15
sp|P02458|CO2A1_HUMAN  Collagen alpha 1(II) chain precursor ...    79   1e-14
sp|Q9XSJ7|CO1A1_CANFA  Collagen alpha 1(I) chain precursor         79   1e-14
sp|P02452|CO1A1_HUMAN  Collagen alpha 1(I) chain precursor         79   1e-14
sp|P02460|CA12_CHICK  Collagen alpha 1(II) chain precursor         79   1e-14
sp|P28481|CO2A1_MOUSE  Collagen alpha 1(II) chain precursor ...    78   2e-14

>sp|P08121|CO3A1_MOUSE Collagen alpha 1(III) chain precursor
          Length = 1464

 Score = 84.7 bits (208), Expect = 1e-16
 Identities = 60/150 (40%), Positives = 80/150 (53%), Gaps = 16/150 (10%)
 Frame = +3

Query: 42   WIDEASEDQS-WFGEA-TGIFKFDY-------QIESSQLIFLKLLSSHAKQKLIIHCKN- 191
            W D  +E +  WFGE+  G F+F Y        +   QL FL+LLSS A Q +  HCKN 
Sbjct: 1314 WTDSGAEKKHVWFGESMNGGFQFSYGPPDLPEDVVDVQLAFLRLLSSRASQNITYHCKNS 1373

Query: 192  LAVVE----NSPKPLILYSDHDEEVMKDGDL-FRYKVLQDGCKNSEGIVSLTELEMDT-E 353
            +A ++    N  K L L   ++ E   +G+  F Y VL+DGC    G  S T  E  T +
Sbjct: 1374 IAYMDQASGNVKKSLKLMGSNEGEFKAEGNSKFTYTVLEDGCTKHTGEWSKTVFEYQTRK 1433

Query: 354  AHRLPIRDVALHMGRSKKQQFGLDIGQVCF 443
            A RLPI D+A +      Q+FG+DIG VCF
Sbjct: 1434 AMRLPIIDIAPYDIGGPDQEFGVDIGPVCF 1463
>sp|P13941|CO3A1_RAT Collagen alpha 1(III) chain precursor
          Length = 636

 Score = 84.0 bits (206), Expect = 2e-16
 Identities = 60/150 (40%), Positives = 80/150 (53%), Gaps = 16/150 (10%)
 Frame = +3

Query: 42  WIDEASEDQS-WFGEA-TGIFKFDY-------QIESSQLIFLKLLSSHAKQKLIIHCKN- 191
           W D  +E +  WFGE+  G F+F Y        +   QL FL+LLSS A Q +  HCKN 
Sbjct: 486 WTDAGAEKKHVWFGESMNGGFQFSYGNPDLPEDVLDVQLAFLRLLSSRASQNITYHCKNS 545

Query: 192 LAVVE----NSPKPLILYSDHDEEVMKDGDL-FRYKVLQDGCKNSEGIVSLTELEMDT-E 353
           +A ++    N  K L L   ++ E   +G+  F Y VL+DGC    G  S T  E  T +
Sbjct: 546 IAYMDQANGNVKKSLKLMGSNEGEFKAEGNSKFTYTVLEDGCTKHTGEWSKTVFEYQTRK 605

Query: 354 AHRLPIRDVALHMGRSKKQQFGLDIGQVCF 443
           A RLPI D+A +      Q+FG+DIG VCF
Sbjct: 606 AMRLPIIDIAPYDIGGPDQEFGVDIGPVCF 635
>sp|P02461|CO3A1_HUMAN Collagen alpha 1(III) chain precursor
          Length = 1466

 Score = 82.8 bits (203), Expect = 5e-16
 Identities = 59/150 (39%), Positives = 81/150 (54%), Gaps = 16/150 (10%)
 Frame = +3

Query: 42   WIDEASEDQS-WFGEAT-GIFKFDY-------QIESSQLIFLKLLSSHAKQKLIIHCKN- 191
            W D ++E +  WFGE+  G F+F Y        +   QL FL+LLSS A Q +  HCKN 
Sbjct: 1316 WTDSSAEKKHVWFGESMDGGFQFSYGNPELPEDVLDVQLAFLRLLSSRASQNITYHCKNS 1375

Query: 192  LAVVE----NSPKPLILYSDHDEEVMKDGD-LFRYKVLQDGCKNSEGIVSLTELEMDT-E 353
            +A ++    N  K L L   ++ E   +G+  F Y VL+DGC    G  S T  E  T +
Sbjct: 1376 IAYMDQASGNVKKALKLMGSNEGEFKAEGNSKFTYTVLEDGCTKHTGEWSKTVFEYRTRK 1435

Query: 354  AHRLPIRDVALHMGRSKKQQFGLDIGQVCF 443
            A RLPI D+A +      Q+FG+D+G VCF
Sbjct: 1436 AVRLPIVDIAPYDIGGPDQEFGVDVGPVCF 1465
>sp||P02459_2 [Segment 2 of 2] Collagen alpha 1(II) chain precursor
          Length = 181

 Score = 81.3 bits (199), Expect = 2e-15
 Identities = 56/153 (36%), Positives = 81/153 (52%), Gaps = 17/153 (11%)
 Frame = +3

Query: 36  KNWIDEASEDQS--WFGEA-TGIFKFDYQIESS-------QLIFLKLLSSHAKQKLIIHC 185
           KNW    S+D+   WFGE   G F F Y  ++        Q+ FL+LLS+   Q +  HC
Sbjct: 28  KNWWSSKSKDKKHIWFGETINGGFHFSYGDDNLAPNTADVQMTFLRLLSTEGSQNITYHC 87

Query: 186 KN-LAVVE----NSPKPLILYSDHDEEVMKDGDL-FRYKVLQDGCKNSEGIVSLTELEMD 347
           KN +A ++    N  K L++   +D E+  +G+  F Y VL+DGC    G    T +E  
Sbjct: 88  KNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTVLKDGCTKHTGKWGKTMIEYR 147

Query: 348 TE-AHRLPIRDVALHMGRSKKQQFGLDIGQVCF 443
           ++   RLPI D+A       +Q+FG+DIG VCF
Sbjct: 148 SQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 180
>sp|O93484|CO1A2_ONCMY Collagen alpha 2(I) chain precursor
          Length = 1356

 Score = 79.3 bits (194), Expect = 6e-15
 Identities = 60/154 (38%), Positives = 88/154 (57%), Gaps = 18/154 (11%)
 Frame = +3

Query: 36   KNWIDEASEDQS--WFGEA-TGIFKFDYQIES-------SQLIFLKLLSSHAKQKLIIHC 185
            KNW   +SE++   WFGE   G  +F Y  E+       +QL F++LL++ A Q +  HC
Sbjct: 1204 KNWY-RSSENKKHVWFGETINGGTEFAYNDETLSPQSMATQLAFMRLLANQATQNITYHC 1262

Query: 186  KN-LAVVE----NSPKPLILYSDHDEEVMKDGDL-FRYKVLQDGCKNSEGIVSLTELEMD 347
            KN +A ++    N  K ++L   +D E+  +G+  F + VL+DGC    G  S T +E  
Sbjct: 1263 KNSVAYMDGENGNLKKAVLLQGSNDVELRAEGNSRFTFNVLEDGCTRHTGQWSKTVIEYR 1322

Query: 348  T-EAHRLPIRDVA-LHMGRSKKQQFGLDIGQVCF 443
            T +  RLPI D+A L +G +  Q+FGLDIG VCF
Sbjct: 1323 TNKPSRLPILDIAPLDIGEAD-QEFGLDIGPVCF 1355
>sp|P02458|CO2A1_HUMAN Collagen alpha 1(II) chain precursor [Contains: Chondrocalcin]
          Length = 1418

 Score = 78.6 bits (192), Expect = 1e-14
 Identities = 54/153 (35%), Positives = 79/153 (51%), Gaps = 17/153 (11%)
 Frame = +3

Query: 36   KNWIDEASEDQS--WFGEA-TGIFKFDY-------QIESSQLIFLKLLSSHAKQKLIIHC 185
            KNW    S+++   WFGE   G F F Y          + Q+ FL+LLS+   Q +  HC
Sbjct: 1265 KNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHC 1324

Query: 186  KN-LAVVE----NSPKPLILYSDHDEEVMKDGDL-FRYKVLQDGCKNSEGIVSLTELEMD 347
            KN +A ++    N  K L++   +D E+  +G+  F Y  L+DGC    G    T +E  
Sbjct: 1325 KNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYR 1384

Query: 348  TE-AHRLPIRDVALHMGRSKKQQFGLDIGQVCF 443
            ++   RLPI D+A       +Q+FG+DIG VCF
Sbjct: 1385 SQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1417
>sp|Q9XSJ7|CO1A1_CANFA Collagen alpha 1(I) chain precursor
          Length = 1460

 Score = 78.6 bits (192), Expect = 1e-14
 Identities = 55/154 (35%), Positives = 78/154 (50%), Gaps = 18/154 (11%)
 Frame = +3

Query: 36   KNWIDEASEDQS---WFGEA-TGIFKFDYQIESS-------QLIFLKLLSSHAKQKLIIH 182
            KNW    +  +    W+GE+ T  F+F+Y  + S       QL FL+L+S+ A Q +  H
Sbjct: 1306 KNWYISKNPKEKRHVWYGESMTDGFQFEYGGQGSDPADVAIQLTFLRLMSTEASQNITYH 1365

Query: 183  CKNLAV-----VENSPKPLILYSDHDEEVMKDGDL-FRYKVLQDGCKNSEGIVSLTELEM 344
            CKN          N  K L+L   ++ E+  +G+  F Y V  DGC +  G    T +E 
Sbjct: 1366 CKNSVAYMDQQTGNLKKALLLQGSNEIEIRAEGNSRFTYSVTYDGCTSHTGAWGKTVIEY 1425

Query: 345  DT-EAHRLPIRDVALHMGRSKKQQFGLDIGQVCF 443
             T +  RLPI DVA     +  Q+FG+DIG VCF
Sbjct: 1426 KTTKTSRLPIIDVAPLDVGAPDQEFGMDIGPVCF 1459
>sp|P02452|CO1A1_HUMAN Collagen alpha 1(I) chain precursor
          Length = 1464

 Score = 78.6 bits (192), Expect = 1e-14
 Identities = 57/154 (37%), Positives = 80/154 (51%), Gaps = 18/154 (11%)
 Frame = +3

Query: 36   KNW-IDEASEDQS--WFGEA-TGIFKFDYQIESS-------QLIFLKLLSSHAKQKLIIH 182
            KNW I +  +D+   WFGE+ T  F+F+Y  + S       QL FL+L+S+ A Q +  H
Sbjct: 1310 KNWYISKNPKDKRHVWFGESMTDGFQFEYGGQGSDPADVAIQLTFLRLMSTEASQNITYH 1369

Query: 183  CKNLAV-----VENSPKPLILYSDHDEEVMKDGDL-FRYKVLQDGCKNSEGIVSLTELEM 344
            CKN          N  K L+L   ++ E+  +G+  F Y V  DGC +  G    T +E 
Sbjct: 1370 CKNSVAYMDQQTGNLKKALLLKGSNEIEIRAEGNSRFTYSVTVDGCTSHTGAWGKTVIEY 1429

Query: 345  DT-EAHRLPIRDVALHMGRSKKQQFGLDIGQVCF 443
             T +  RLPI DVA     +  Q+FG D+G VCF
Sbjct: 1430 KTTKTSRLPIIDVAPLDVGAPDQEFGFDVGPVCF 1463
>sp|P02460|CA12_CHICK Collagen alpha 1(II) chain precursor
          Length = 369

 Score = 78.6 bits (192), Expect = 1e-14
 Identities = 55/153 (35%), Positives = 79/153 (51%), Gaps = 17/153 (11%)
 Frame = +3

Query: 36  KNWIDEASEDQS--WFGEA-TGIFKFDYQIE-------SSQLIFLKLLSSHAKQKLIIHC 185
           KNW    ++D+   WF E   G F F Y  E       S Q+ FL+LLS+   Q +  HC
Sbjct: 216 KNWWTSKTKDKKHVWFAETINGGFHFSYGDENLSPNTASIQMTFLRLLSTEGSQNVTYHC 275

Query: 186 KN-LAVVE----NSPKPLILYSDHDEEVMKDGD-LFRYKVLQDGCKNSEGIVSLTELEMD 347
           KN +A ++    N  K +++   +D E+  +G+  F Y VL+DGC    G    T +E  
Sbjct: 276 KNSIAYMDEETGNLKKAILIQGSNDVEIRAEGNSRFTYSVLEDGCTKHTGKWGKTVIEYR 335

Query: 348 TE-AHRLPIRDVALHMGRSKKQQFGLDIGQVCF 443
           ++   RLPI D+A        Q+FG+DIG VCF
Sbjct: 336 SQKTSRLPIVDIAPMDIGGADQEFGVDIGPVCF 368
>sp|P28481|CO2A1_MOUSE Collagen alpha 1(II) chain precursor [Contains: Chondrocalcin]
          Length = 1459

 Score = 77.8 bits (190), Expect = 2e-14
 Identities = 54/153 (35%), Positives = 79/153 (51%), Gaps = 17/153 (11%)
 Frame = +3

Query: 36   KNWIDEASEDQS--WFGEA-TGIFKFDY-------QIESSQLIFLKLLSSHAKQKLIIHC 185
            KNW    S+++   WFGE   G F F Y          + Q+ FL+LLS+   Q +  HC
Sbjct: 1306 KNWWSSKSKEKKHIWFGETMNGGFHFSYGDGNLAPNTANVQMTFLRLLSTEGSQNITYHC 1365

Query: 186  KN-LAVVE----NSPKPLILYSDHDEEVMKDGDL-FRYKVLQDGCKNSEGIVSLTELEMD 347
            KN +A ++    N  K L++   +D E+  +G+  F Y  L+DGC    G    T +E  
Sbjct: 1366 KNSIAYLDEAAGNLKKALLIQGSNDVEMRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYR 1425

Query: 348  TE-AHRLPIRDVALHMGRSKKQQFGLDIGQVCF 443
            ++   RLPI D+A       +Q+FG+DIG VCF
Sbjct: 1426 SQKTSRLPIIDIAPMDIGGAEQEFGVDIGPVCF 1458

  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 58,790,007
Number of Sequences: 369166
Number of extensions: 1112208
Number of successful extensions: 2342
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 2260
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2302
length of database: 68,354,980
effective HSP length: 104
effective length of database: 49,142,540
effective search space used: 3980545740
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
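
The bit scores and E-values in the alignments above can be related to the statistics block with the standard Karlin-Altschul formulas. The following is a minimal sketch, assuming the gapped Lambda and K values and the "effective search space used" figure from this report; the function and constant names are hypothetical and are not part of BLAST. It reproduces the reported bit scores to one decimal place. The E-values it gives are unadjusted estimates that come out somewhat smaller than the reported ones, presumably because BLAST applies further corrections that are not modeled here.

```python
import math

# Values copied from the "Gapped" statistics block and the search summary
# of this report. The names are illustrative only.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 3980545740


def bit_score(raw_score):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect_value(raw_score):
    """Unadjusted Karlin-Altschul estimate: search_space * 2 ** (-bits)."""
    return EFFECTIVE_SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores are the parenthesised values on the "Score =" lines.
    for raw in (208, 206, 203, 199, 194, 192, 190):
        print(f"raw={raw}  bits={bit_score(raw):.1f}  E~{expect_value(raw):.1e}")
```

For the top hit (raw score 208) this gives 84.7 bits, matching the "Score = 84.7 bits (208)" line, and an E-value estimate of the same order as the reported 1e-16.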