Planarian EST Database


Dr_sW_024_M04

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_024_M04
         (554 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|P05997|CO5A2_HUMAN  Collagen alpha 2(V) chain precursor         84   3e-16
sp|O93484|CO1A2_ONCMY  Collagen alpha 2(I) chain precursor         79   1e-14
sp|P02458|CO2A1_HUMAN  Collagen alpha 1(II) chain precursor ...    78   1e-14
sp|O42350|CO1A2_RANCA  Collagen alpha 2(I) chain precursor         78   1e-14
sp|P20909|CA1B_RAT  Collagen alpha 1(XI) chain precursor           78   2e-14
sp|P02461|CO3A1_HUMAN  Collagen alpha 1(III) chain precursor       77   2e-14
sp|P02460|CA12_CHICK  Collagen alpha 1(II) chain precursor         77   3e-14
sp|Q61245|COBA1_MOUSE  Collagen alpha 1(XI) chain precursor        77   4e-14
sp|Q28668|CO1A2_RABIT  Collagen alpha 2(I) chain precursor         77   4e-14
sp|P28481|CO2A1_MOUSE  Collagen alpha 1(II) chain precursor ...    77   4e-14
>sp|P05997|CO5A2_HUMAN Collagen alpha 2(V) chain precursor
          Length = 1496

 Score = 83.6 bits (205), Expect = 3e-16
 Identities = 46/132 (34%), Positives = 71/132 (53%), Gaps = 19/132 (14%)
 Frame = +3

Query: 156  DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCKIKERKTCIKS 335
            + +  PDG++  PARTC  L   + + + G YWIDPN G ++DA++VYC ++  +TCI +
Sbjct: 1277 ETMRSPDGSKKHPARTCDDLKLCHSAKQSGEYWIDPNQGSVEDAIKVYCNMETGETCISA 1336

Query: 336  IYRETSLEKPRFNWY-SQGNDNKFINYALDQQ------------------QLTFLKMISN 458
                     PR  W+ S+  DNK + Y LD                    Q+TFL+++S 
Sbjct: 1337 ----NPSSVPRKTWWASKSPDNKPVWYGLDMNRGSQFAYGDHQSPNTAITQMTFLRLLSK 1392

Query: 459  KASQFVTINCQN 494
            +ASQ +T  C+N
Sbjct: 1393 EASQNITYICKN 1404
>sp|O93484|CO1A2_ONCMY Collagen alpha 2(I) chain precursor
          Length = 1356

 Score = 78.6 bits (192), Expect = 1e-14
 Identities = 52/158 (32%), Positives = 81/158 (51%), Gaps = 27/158 (17%)
 Frame = +3

Query: 102  GYMIIQADQPTI-AKYLGNDA-----------ITQPDGTQNLPARTCLHLAEINPSFKDG 245
            GY   +ADQP+  AK    DA           +  P+G++  PARTC  +   +P +  G
Sbjct: 1107 GYDEYRADQPSFRAKDYEVDATIKSLNSQIENLLTPEGSKKNPARTCRDIRLSHPDWSSG 1166

Query: 246  LYWIDPNGGKIDDAVQVYCKIKERKTCI----KSIYRET---SLEKPRFNWYSQ----GN 392
             YWIDPN G I DA++ YC      TCI    +SI R+    S E  +  W+ +    G 
Sbjct: 1167 FYWIDPNQGCIADAIKAYCDFSTGHTCIHPHPESIARKNWYRSSENKKHVWFGETINGGT 1226

Query: 393  DNKFINYALDQQ----QLTFLKMISNKASQFVTINCQN 494
            +  + +  L  Q    QL F+++++N+A+Q +T +C+N
Sbjct: 1227 EFAYNDETLSPQSMATQLAFMRLLANQATQNITYHCKN 1264
>sp|P02458|CO2A1_HUMAN Collagen alpha 1(II) chain precursor [Contains: Chondrocalcin]
          Length = 1418

 Score = 78.2 bits (191), Expect = 1e-14
 Identities = 50/146 (34%), Positives = 77/146 (52%), Gaps = 21/146 (14%)
 Frame = +3

Query: 120  ADQPTIAKYLGN--DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQ 293
            A+     K L N  ++I  P+G++  PARTC  L   +P +K G YWIDPN G   DA++
Sbjct: 1184 AEVDATLKSLNNQIESIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMK 1243

Query: 294  VYCKIKERKTCI---------KSIYRETSLEKPRFNW----------YSQGNDNKFINYA 416
            V+C ++  +TC+         K+ +   S EK    W          +S G+DN   N A
Sbjct: 1244 VFCNMETGETCVYPNPANVPKKNWWSSKSKEKKHI-WFGETINGGFHFSYGDDNLAPNTA 1302

Query: 417  LDQQQLTFLKMISNKASQFVTINCQN 494
                Q+TFL+++S + SQ +T +C+N
Sbjct: 1303 --NVQMTFLRLLSTEGSQNITYHCKN 1326
>sp|O42350|CO1A2_RANCA Collagen alpha 2(I) chain precursor
          Length = 1355

 Score = 78.2 bits (191), Expect = 1e-14
 Identities = 42/130 (32%), Positives = 70/130 (53%), Gaps = 17/130 (13%)
 Frame = +3

Query: 156  DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCKIKERKTCI-- 329
            + I  P+G++  PARTC  L   +P +  G YWIDPN G   DA++V+C     +TCI  
Sbjct: 1134 EVILTPEGSRKNPARTCRDLRLSHPEWTSGFYWIDPNQGCTSDAIRVFCDFSSGETCIHA 1193

Query: 330  -------KSIYRETSLEKPRFNWYSQ----GNDNKFINYALDQQ----QLTFLKMISNKA 464
                   K+ Y  TS +  +  W+ +    G   ++ +  L  +    QL F+++++N+A
Sbjct: 1194 NPDEITQKNWYINTSNKDKKHLWFGEILNGGTQFEYHDEGLTAKDMATQLAFMRLLANQA 1253

Query: 465  SQFVTINCQN 494
            SQ +T +C+N
Sbjct: 1254 SQNITYHCKN 1263
>sp|P20909|CA1B_RAT Collagen alpha 1(XI) chain precursor
          Length = 482

 Score = 77.8 bits (190), Expect = 2e-14
 Identities = 52/145 (35%), Positives = 72/145 (49%), Gaps = 19/145 (13%)
 Frame = +3

Query: 171 PDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCKIKE-RKTCIKSIYRE 347
           P GTQ  PARTC  L   +P F DG YWIDPN G   D+ +VYC      +TCI    + 
Sbjct: 272 PMGTQTNPARTCKDLQLSHPDFPDGEYWIDPNQGCSGDSFKVYCNFTAGGETCIYPDKKS 331

Query: 348 TSL-------EKPRFNWYSQGNDNKFINY------ALDQQQLTFLKMISNKASQFVTINC 488
             +       EKP  +WYS+    K ++Y      +++  Q+TFLK++++ A Q  T NC
Sbjct: 332 EGVRLSSWPKEKPG-SWYSEFKRGKLLSYLDVEGNSINMVQMTFLKLLTSSARQNFTYNC 390

Query: 489 QN----MPIIKNSV-KPLRIFTDND 548
                   ++  S  K LR    ND
Sbjct: 391 HQSTAWYDVLSGSYDKALRFLGSND 415
>sp|P02461|CO3A1_HUMAN Collagen alpha 1(III) chain precursor
          Length = 1466

 Score = 77.4 bits (189), Expect = 2e-14
 Identities = 43/130 (33%), Positives = 72/130 (55%), Gaps = 17/130 (13%)
 Frame = +3

Query: 156  DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCKIKERKTCI-- 329
            +++  PDG++  PAR C  L   +P  K G YW+DPN G   DA++V+C ++  +TCI  
Sbjct: 1246 ESLISPDGSRKNPARNCRDLKFCHPELKSGEYWVDPNQGCKLDAIKVFCNMETGETCISA 1305

Query: 330  -------KSIYRETSLEKPRFNWYSQGNDNKF-INYALDQ-------QQLTFLKMISNKA 464
                   K  + ++S EK +  W+ +  D  F  +Y   +        QL FL+++S++A
Sbjct: 1306 NPLNVPRKHWWTDSSAEK-KHVWFGESMDGGFQFSYGNPELPEDVLDVQLAFLRLLSSRA 1364

Query: 465  SQFVTINCQN 494
            SQ +T +C+N
Sbjct: 1365 SQNITYHCKN 1374
>sp|P02460|CA12_CHICK Collagen alpha 1(II) chain precursor
          Length = 369

 Score = 77.0 bits (188), Expect = 3e-14
 Identities = 44/133 (33%), Positives = 70/133 (52%), Gaps = 20/133 (15%)
 Frame = +3

Query: 156 DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCKIKERKTCIKS 335
           ++I  P+G++  PARTC  +   +P +K G YWIDPN G   DA++V+C ++  +TC+  
Sbjct: 149 ESIRSPEGSKKNPARTCRDIKLCHPEWKSGDYWIDPNQGCTLDAIKVFCNMETGETCV-- 206

Query: 336 IYRETSLEKPRFNWY-SQGNDNKFINYA-------------------LDQQQLTFLKMIS 455
               T    PR NW+ S+  D K + +A                       Q+TFL+++S
Sbjct: 207 --YPTPSSIPRKNWWTSKTKDKKHVWFAETINGGFHFSYGDENLSPNTASIQMTFLRLLS 264

Query: 456 NKASQFVTINCQN 494
            + SQ VT +C+N
Sbjct: 265 TEGSQNVTYHCKN 277
>sp|Q61245|COBA1_MOUSE Collagen alpha 1(XI) chain precursor
          Length = 1804

 Score = 76.6 bits (187), Expect = 4e-14
 Identities = 52/145 (35%), Positives = 71/145 (48%), Gaps = 19/145 (13%)
 Frame = +3

Query: 171  PDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCKIKE-RKTCIKSIYRE 347
            P GTQ  PARTC  L   +P F DG YWIDPN G   D+ +VYC      +TCI    + 
Sbjct: 1594 PMGTQTNPARTCKDLQLSHPDFPDGEYWIDPNQGCSGDSFKVYCNFTAGGETCIYPDKKS 1653

Query: 348  TSL-------EKPRFNWYSQGNDNKFINY------ALDQQQLTFLKMISNKASQFVTINC 488
              +       EKP  +WYS+    K ++Y      +++  Q+TFLK+++  A Q  T NC
Sbjct: 1654 EGVRISSWPKEKPG-SWYSEFKRGKLLSYLDVEGNSINMVQMTFLKLLTASARQNFTYNC 1712

Query: 489  QN----MPIIKNSV-KPLRIFTDND 548
                    ++  S  K LR    ND
Sbjct: 1713 HQSAAWYDVLSGSYDKALRFLGSND 1737
>sp|Q28668|CO1A2_RABIT Collagen alpha 2(I) chain precursor
          Length = 526

 Score = 76.6 bits (187), Expect = 4e-14
 Identities = 47/155 (30%), Positives = 74/155 (47%), Gaps = 21/155 (13%)
 Frame = +3

Query: 93  RPQGYMIIQADQPTIAKYLGN--DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPN 266
           RP+ Y +         K L N  + +  P+G++  PARTC  L   +P +  G YWIDPN
Sbjct: 289 RPKDYEV-----DATLKSLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPN 343

Query: 267 GGKIDDAVQVYCKIKERKTCIKSIYRETSLEKPRFNWYSQGNDNKFI------------N 410
            G   DA++VYC     +TCI++     S++    NWY      K +             
Sbjct: 344 QGCTMDAIKVYCDFSTGETCIRAQPENISVK----NWYKSSKAKKHVWLGETINGGTQFE 399

Query: 411 YALD-------QQQLTFLKMISNKASQFVTINCQN 494
           Y ++         QL F+++++N ASQ +T +C+N
Sbjct: 400 YNVEGVTSKEMATQLAFMRLLANHASQNITYHCKN 434
>sp|P28481|CO2A1_MOUSE Collagen alpha 1(II) chain precursor [Contains: Chondrocalcin]
          Length = 1459

 Score = 76.6 bits (187), Expect = 4e-14
 Identities = 46/132 (34%), Positives = 71/132 (53%), Gaps = 19/132 (14%)
 Frame = +3

Query: 156  DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCKIKERKTCI-- 329
            ++I  PDG++  PARTC  L   +P +K G YWIDPN G   DA++V+C ++  +TC+  
Sbjct: 1239 ESIRSPDGSRKNPARTCQDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYP 1298

Query: 330  -------KSIYRETSLEKPRFNW----------YSQGNDNKFINYALDQQQLTFLKMISN 458
                   K+ +   S EK    W          +S G+ N   N A    Q+TFL+++S 
Sbjct: 1299 NPATVPRKNWWSSKSKEKKHI-WFGETMNGGFHFSYGDGNLAPNTA--NVQMTFLRLLST 1355

Query: 459  KASQFVTINCQN 494
            + SQ +T +C+N
Sbjct: 1356 EGSQNITYHCKN 1367
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 58,774,401
Number of Sequences: 369166
Number of extensions: 1131530
Number of successful extensions: 2721
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 2661
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2691
length of database: 68,354,980
effective HSP length: 104
effective length of database: 49,142,540
effective search space used: 3931403200
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
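
Note: the bit score and Expect value reported for the top hit (CO5A2_HUMAN, raw score 205) can be cross-checked against the gapped Lambda and K and the "effective search space used" printed above. The sketch below assumes the standard Karlin-Altschul relations, S' = (Lambda*S - ln K) / ln 2 and E = (search space) * 2^(-S'). The function names are illustrative and are not part of BLAST. Because the report prints rounded constants, agreement with the listed values is approximate.

```python
import math

# Constants copied from the report above (gapped statistics).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 3931403200  # "effective search space used"


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=SEARCH_SPACE):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


# Top hit: "Score = 83.6 bits (205), Expect = 3e-16"
bits = bit_score(205)
e_value = expect_value(bits)
print(f"{bits:.1f} bits, E = {e_value:.0e}")
```

Under these assumptions, the result for the top hit comes out close to the 83.6 bits and 3e-16 printed in the report. Small differences for other hits are expected from rounding of Lambda and K.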