Planaria EST Database


DrC_00048

BLASTX 2.2.13 [Nov-27-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= DrC_00048
         (833 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P05997|CO5A2_HUMAN  Collagen alpha 2(V) chain precursor         94   4e-19
sp|O93484|CO1A2_ONCMY  Collagen alpha 2(I) chain precursor         88   3e-17
sp|P02458|CO2A1_HUMAN  Collagen alpha 1(II) chain precursor ...    86   2e-16
sp|Q28668|CO1A2_RABIT  Collagen alpha 2(I) chain precursor         85   2e-16
sp|P02460|CA12_CHICK  Collagen alpha 1(II) chain precursor         85   2e-16
sp|P02465|CO1A2_BOVIN  Collagen alpha 2(I) chain precursor         85   3e-16
sp||P12105_3  [Segment 3 of 3] Collagen alpha 1(III) chain p...    84   4e-16
sp|P02461|CO3A1_HUMAN  Collagen alpha 1(III) chain precursor       84   5e-16
sp|O46392|CO1A2_CANFA  Collagen alpha 2(I) chain precursor         84   5e-16
sp|O42350|CO1A2_RANCA  Collagen alpha 2(I) chain precursor         84   5e-16
>sp|P05997|CO5A2_HUMAN Collagen alpha 2(V) chain precursor
          Length = 1496

 Score = 94.4 bits (233), Expect = 4e-19
 Identities = 60/185 (32%), Positives = 92/185 (49%), Gaps = 25/185 (13%)
 Frame = +1

Query: 352  DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCKIKERKTCIKS 531
            + +  PDG++  PARTC  L   + + + G YWIDPN G ++DA++VYC ++  +TCI +
Sbjct: 1277 ETMRSPDGSKKHPARTCDDLKLCHSAKQSGEYWIDPNQGSVEDAIKVYCNMETGETCISA 1336

Query: 532  IYRETSLEKPRFNWY-SQGNDNKFINYALDQQ------------------QLTFLKMISN 654
                     PR  W+ S+  DNK + Y LD                    Q+TFL+++S 
Sbjct: 1337 ----NPSSVPRKTWWASKSPDNKPVWYGLDMNRGSQFAYGDHQSPNTAITQMTFLRLLSK 1392

Query: 655  KASQFVTINCQNM-----PIIKNSVKPLRIFTDNDIILDNSDQI-FSYKILQDNCQYNSP 816
            +ASQ +T  C+N         KN  K + +   ND+ +     I F Y +LQD C   + 
Sbjct: 1393 EASQNITYICKNSVGYMDDQAKNLKKAVVLKGANDLDIKAEGNIRFRYIVLQDTCSKRNG 1452

Query: 817  NLSST 831
            N+  T
Sbjct: 1453 NVGKT 1457
>sp|O93484|CO1A2_ONCMY Collagen alpha 2(I) chain precursor
          Length = 1356

 Score = 87.8 bits (216), Expect = 3e-17
 Identities = 72/260 (27%), Positives = 113/260 (43%), Gaps = 33/260 (12%)
 Frame = +1

Query: 151  GFQGERGPTGNPGLQGVDXXXXXXXXXXXXXXXXXXXXXXXASMISRPQGYMIIQADQPT 330
            G  G  GP G+PGL G                              +  GY   +ADQP+
Sbjct: 1078 GHLGPAGPPGSPGLPG--------------------PAGPAGGGYDQSGGYDEYRADQPS 1117

Query: 331  I-AKYLGNDA-----------ITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKI 474
              AK    DA           +  P+G++  PARTC  +   +P +  G YWIDPN G I
Sbjct: 1118 FRAKDYEVDATIKSLNSQIENLLTPEGSKKNPARTCRDIRLSHPDWSSGFYWIDPNQGCI 1177

Query: 475  DDAVQVYCKIKERKTCI----KSIYRET---SLEKPRFNWYSQ----GNDNKFINYALDQ 621
             DA++ YC      TCI    +SI R+    S E  +  W+ +    G +  + +  L  
Sbjct: 1178 ADAIKAYCDFSTGHTCIHPHPESIARKNWYRSSENKKHVWFGETINGGTEFAYNDETLSP 1237

Query: 622  Q----QLTFLKMISNKASQFVTINCQNMPIIK-----NSVKPLRIFTDNDIIL-DNSDQI 771
            Q    QL F+++++N+A+Q +T +C+N          N  K + +   ND+ L    +  
Sbjct: 1238 QSMATQLAFMRLLANQATQNITYHCKNSVAYMDGENGNLKKAVLLQGSNDVELRAEGNSR 1297

Query: 772  FSYKILQDNCQYNSPNLSST 831
            F++ +L+D C  ++   S T
Sbjct: 1298 FTFNVLEDGCTRHTGQWSKT 1317
>sp|P02458|CO2A1_HUMAN Collagen alpha 1(II) chain precursor [Contains: Chondrocalcin]
          Length = 1418

 Score = 85.5 bits (210), Expect = 2e-16
 Identities = 73/248 (29%), Positives = 104/248 (41%), Gaps = 31/248 (12%)
 Frame = +1

Query: 151  GFQGERGPTGNPGLQGVDXXXXXXXXXXXXXXXXXXXXXXXASMISRPQ----GYMIIQA 318
            G  G  GP GNPG  G                              R      G     A
Sbjct: 1125 GETGPAGPPGNPGPPGPPGPPGPGIDMSAFAGLGpreKGPDPLQYMRADQAAGGLRQHDA 1184

Query: 319  DQPTIAKYLGN--DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQV 492
            +     K L N  ++I  P+G++  PARTC  L   +P +K G YWIDPN G   DA++V
Sbjct: 1185 EVDATLKSLNNQIESIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKV 1244

Query: 493  YCKIKERKTCI---------KSIYRETSLEKPRFNW----------YSQGNDNKFINYAL 615
            +C ++  +TC+         K+ +   S EK    W          +S G+DN   N A 
Sbjct: 1245 FCNMETGETCVYPNPANVPKKNWWSSKSKEKKHI-WFGETINGGFHFSYGDDNLAPNTA- 1302

Query: 616  DQQQLTFLKMISNKASQFVTINCQNM-----PIIKNSVKPLRIFTDNDI-ILDNSDQIFS 777
               Q+TFL+++S + SQ +T +C+N          N  K L I   ND+ I    +  F+
Sbjct: 1303 -NVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFT 1361

Query: 778  YKILQDNC 801
            Y  L+D C
Sbjct: 1362 YTALKDGC 1369
>sp|Q28668|CO1A2_RABIT Collagen alpha 2(I) chain precursor
          Length = 526

 Score = 85.1 bits (209), Expect = 2e-16
 Identities = 66/245 (26%), Positives = 102/245 (41%), Gaps = 27/245 (11%)
 Frame = +1

Query: 148 QGFQGERGPTGNPGLQGVDXXXXXXXXXXXXXXXXXXXXXXXASMISRPQGYMIIQADQP 327
           QG QG  GP G PG  G                            + RP+ Y +      
Sbjct: 243 QGSQGPAGPPGPPGPPGPPGASGGGYDFGYDGDFYRADQPRSPPSL-RPKDYEV-----D 296

Query: 328 TIAKYLGN--DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCK 501
              K L N  + +  P+G++  PARTC  L   +P +  G YWIDPN G   DA++VYC 
Sbjct: 297 ATLKSLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCD 356

Query: 502 IKERKTCIKSIYRETSLEKPRFNWYSQGNDNKFI------------NYALD-------QQ 624
               +TCI++     S++    NWY      K +             Y ++         
Sbjct: 357 FSTGETCIRAQPENISVK----NWYKSSKAKKHVWLGETINGGTQFEYNVEGVTSKEMAT 412

Query: 625 QLTFLKMISNKASQFVTINCQNMPIIK-----NSVKPLRIFTDNDI-ILDNSDQIFSYKI 786
           QL F+++++N ASQ +T +C+N          N  K + +   ND+ ++   +  F+Y +
Sbjct: 413 QLAFMRLLANHASQNITYHCKNSIAYMDEETGNLNKAVILQGSNDVELVAEGNSRFTYTV 472

Query: 787 LQDNC 801
           L D C
Sbjct: 473 LVDGC 477
>sp|P02460|CA12_CHICK Collagen alpha 1(II) chain precursor
          Length = 369

 Score = 85.1 bits (209), Expect = 2e-16
 Identities = 55/176 (31%), Positives = 87/176 (49%), Gaps = 26/176 (14%)
 Frame = +1

Query: 352 DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCKIKERKTCIKS 531
           ++I  P+G++  PARTC  +   +P +K G YWIDPN G   DA++V+C ++  +TC+  
Sbjct: 149 ESIRSPEGSKKNPARTCRDIKLCHPEWKSGDYWIDPNQGCTLDAIKVFCNMETGETCV-- 206

Query: 532 IYRETSLEKPRFNWY-SQGNDNKFINYA-------------------LDQQQLTFLKMIS 651
               T    PR NW+ S+  D K + +A                       Q+TFL+++S
Sbjct: 207 --YPTPSSIPRKNWWTSKTKDKKHVWFAETINGGFHFSYGDENLSPNTASIQMTFLRLLS 264

Query: 652 NKASQFVTINCQNMPIIK-----NSVKPLRIFTDNDI-ILDNSDQIFSYKILQDNC 801
            + SQ VT +C+N          N  K + I   ND+ I    +  F+Y +L+D C
Sbjct: 265 TEGSQNVTYHCKNSIAYMDEETGNLKKAILIQGSNDVEIRAEGNSRFTYSVLEDGC 320
>sp|P02465|CO1A2_BOVIN Collagen alpha 2(I) chain precursor
          Length = 1364

 Score = 84.7 bits (208), Expect = 3e-16
 Identities = 67/245 (27%), Positives = 101/245 (41%), Gaps = 27/245 (11%)
 Frame = +1

Query: 148  QGFQGERGPTGNPGLQGVDXXXXXXXXXXXXXXXXXXXXXXXASMISRPQGYMIIQADQP 327
            QG QG  GP G PG  G                          + + RP+ Y +      
Sbjct: 1081 QGSQGPAGPPGPPGPPGPPGPSGGGYEFGFDGDFYRADQPRSPTSL-RPKDYEV-----D 1134

Query: 328  TIAKYLGN--DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCK 501
               K L N  + +  P+G++  PARTC  L   +P +  G YWIDPN G   DA++VYC 
Sbjct: 1135 ATLKSLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCD 1194

Query: 502  IKERKTCIKSIYRETSLEKPRFNWYSQGNDNKFI------------NYALD-------QQ 624
                +TCI    R    + P  NWY      K +             Y ++         
Sbjct: 1195 FSTGETCI----RAQPEDIPVKNWYRNSKAKKHVWVGETINGGTQFEYNVEGVTTKEMAT 1250

Query: 625  QLTFLKMISNKASQFVTINCQNMPIIK-----NSVKPLRIFTDNDI-ILDNSDQIFSYKI 786
            QL F+++++N ASQ +T +C+N          N  K + +   ND+ ++   +  F+Y +
Sbjct: 1251 QLAFMRLLANHASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTV 1310

Query: 787  LQDNC 801
            L D C
Sbjct: 1311 LVDGC 1315
>sp||P12105_3 [Segment 3 of 3] Collagen alpha 1(III) chain precursor
          Length = 340

 Score = 84.3 bits (207), Expect = 4e-16
 Identities = 67/258 (25%), Positives = 110/258 (42%), Gaps = 40/258 (15%)
 Frame = +1

Query: 148 QGFQGERGPTGNPGLQGVDXXXXXXXXXXXXXXXXXXXXXXXASMISRPQGYMIIQADQP 327
           +G +GE GP G PG  G+                              P GY     D+P
Sbjct: 45  RGNRGESGPAGPPGQPGLPGPSGPPGPCCGGGVASLGAGEKG------PVGYGYEYRDEP 98

Query: 328 -----------TIAKYLGN--DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGG 468
                      +  K + N  + I  PDG++  PAR C  L   +P  K G YWIDPN G
Sbjct: 99  KENEINLGEIMSSMKSINNQIENILSPDGSRKNPARNCRDLKFCHPELKSGEYWIDPNQG 158

Query: 469 KIDDAVQVYCKIKERKTCIKSIYRETSLEKPRFNWY-SQGNDNKFINYA----------- 612
              DA++VYC ++  +TC+ +         PR NW+ ++ +  K + +            
Sbjct: 159 CKMDAIKVYCNMETGETCLSA----NPATVPRKNWWTTESSGKKHVWFGESMKGGFQFSY 214

Query: 613 --------LDQQQLTFLKMISNKASQFVTINCQNMPIIKNSV-----KPLRIFT--DNDI 747
                   + + QL FL+++S++ASQ +T +C+N     N       K L++ +  + DI
Sbjct: 215 GDPDLPEDVSEVQLAFLRILSSRASQNITYHCKNSIAYMNQASGNVKKALKLMSSVETDI 274

Query: 748 ILDNSDQIFSYKILQDNC 801
             + + + + Y +L+D C
Sbjct: 275 KAEGNSK-YMYAVLEDGC 291
>sp|P02461|CO3A1_HUMAN Collagen alpha 1(III) chain precursor
          Length = 1466

 Score = 84.0 bits (206), Expect = 5e-16
 Identities = 54/183 (29%), Positives = 92/183 (50%), Gaps = 23/183 (12%)
 Frame = +1

Query: 352  DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCKIKERKTCI-- 525
            +++  PDG++  PAR C  L   +P  K G YW+DPN G   DA++V+C ++  +TCI  
Sbjct: 1246 ESLISPDGSRKNPARNCRDLKFCHPELKSGEYWVDPNQGCKLDAIKVFCNMETGETCISA 1305

Query: 526  -------KSIYRETSLEKPRFNWYSQGNDNKF-INYALDQ-------QQLTFLKMISNKA 660
                   K  + ++S EK +  W+ +  D  F  +Y   +        QL FL+++S++A
Sbjct: 1306 NPLNVPRKHWWTDSSAEK-KHVWFGESMDGGFQFSYGNPELPEDVLDVQLAFLRLLSSRA 1364

Query: 661  SQFVTINCQNM-----PIIKNSVKPLRIFTDND-IILDNSDQIFSYKILQDNCQYNSPNL 822
            SQ +T +C+N          N  K L++   N+       +  F+Y +L+D C  ++   
Sbjct: 1365 SQNITYHCKNSIAYMDQASGNVKKALKLMGSNEGEFKAEGNSKFTYTVLEDGCTKHTGEW 1424

Query: 823  SST 831
            S T
Sbjct: 1425 SKT 1427
>sp|O46392|CO1A2_CANFA Collagen alpha 2(I) chain precursor
          Length = 1366

 Score = 84.0 bits (206), Expect = 5e-16
 Identities = 68/245 (27%), Positives = 99/245 (40%), Gaps = 27/245 (11%)
 Frame = +1

Query: 148  QGFQGERGPTGNPGLQGVDXXXXXXXXXXXXXXXXXXXXXXXASMISRPQGYMIIQADQP 327
            QG QG  GP G PG  G                            + RP+ Y +      
Sbjct: 1083 QGSQGPAGPPGPPGPPGPPGPSGGGYDFGYEGDFYRADQPRSPPSL-RPKDYEV-----D 1136

Query: 328  TIAKYLGN--DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCK 501
               K L N  + +  P+G++  PARTC  L   +P +  G YWIDPN G   DA++VYC 
Sbjct: 1137 ATLKSLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCD 1196

Query: 502  IKERKTCIKSIYRETSLEKPRFNWYSQGNDNKFI------------NYALD-------QQ 624
                +TCI    R      P  NWY      K I             Y ++         
Sbjct: 1197 FSTGETCI----RAQPENIPAKNWYRNSKVKKHIWLGETINGGTQFEYNVEGVTTKEMAT 1252

Query: 625  QLTFLKMISNKASQFVTINCQNMPIIK-----NSVKPLRIFTDNDI-ILDNSDQIFSYKI 786
            QL F+++++N ASQ +T +C+N          N  K + +   ND+ ++   +  F+Y +
Sbjct: 1253 QLAFMRLLANHASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTV 1312

Query: 787  LQDNC 801
            L D C
Sbjct: 1313 LVDGC 1317
>sp|O42350|CO1A2_RANCA Collagen alpha 2(I) chain precursor
          Length = 1355

 Score = 84.0 bits (206), Expect = 5e-16
 Identities = 52/173 (30%), Positives = 87/173 (50%), Gaps = 23/173 (13%)
 Frame = +1

Query: 352  DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCKIKERKTCI-- 525
            + I  P+G++  PARTC  L   +P +  G YWIDPN G   DA++V+C     +TCI  
Sbjct: 1134 EVILTPEGSRKNPARTCRDLRLSHPEWTSGFYWIDPNQGCTSDAIRVFCDFSSGETCIHA 1193

Query: 526  -------KSIYRETSLEKPRFNWYSQ----GNDNKFINYALDQQ----QLTFLKMISNKA 660
                   K+ Y  TS +  +  W+ +    G   ++ +  L  +    QL F+++++N+A
Sbjct: 1194 NPDEITQKNWYINTSNKDKKHLWFGEILNGGTQFEYHDEGLTAKDMATQLAFMRLLANQA 1253

Query: 661  SQFVTINCQNMPIIK-----NSVKPLRIFTDNDIIL-DNSDQIFSYKILQDNC 801
            SQ +T +C+N          N  K + +   ND+ L    +  F+Y +L+D C
Sbjct: 1254 SQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELRAEGNTRFTYSVLEDGC 1306
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.317    0.136    0.407 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 86,019,762
Number of Sequences: 369166
Number of extensions: 1622128
Number of successful extensions: 7133
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4479
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 7076
length of database: 68,354,980
effective HSP length: 109
effective length of database: 48,218,865
effective search space used: 8100769320
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)

Cluster detail

DrC_00048

  1. Dr_sW_024_M04
  2. Dr_sW_002_C14