Planarian EST Database


Dr_sW_027_G07

BLASTX 2.2.12 [Aug-07-2005]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dr_sW_027_G07
         (619 letters)

Database: Non-redundant SwissProt sequences 
           184,735 sequences; 68,354,980 total letters



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

sp|Q28668|CO1A2_RABIT  Collagen alpha 2(I) chain precursor        103   4e-22
sp|P05997|CO5A2_HUMAN  Collagen alpha 2(V) chain precursor        102   1e-21
sp|Q01149|CO1A2_MOUSE  Collagen alpha 2(I) chain precursor        100   5e-21
sp|O46392|CO1A2_CANFA  Collagen alpha 2(I) chain precursor         99   9e-21
sp|P02465|CO1A2_BOVIN  Collagen alpha 2(I) chain precursor         98   1e-20
sp|O93484|CO1A2_ONCMY  Collagen alpha 2(I) chain precursor         98   2e-20
sp|O42350|CO1A2_RANCA  Collagen alpha 2(I) chain precursor         97   3e-20
sp|P02452|CO1A1_HUMAN  Collagen alpha 1(I) chain precursor         96   6e-20
sp|P02457|CA11_CHICK  Collagen alpha 1(I) chain precursor          95   1e-19
sp|Q9XSJ7|CO1A1_CANFA  Collagen alpha 1(I) chain precursor         95   2e-19
>sp|Q28668|CO1A2_RABIT Collagen alpha 2(I) chain precursor
          Length = 526

 Score =  103 bits (257), Expect = 4e-22
 Identities = 70/215 (32%), Positives = 108/215 (50%), Gaps = 13/215 (6%)
 Frame = +1

Query: 7   EDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQRDYNNKSNGMYYIDPNGGHWKDA 186
           +D ++   L  L+  ++ L  P GS+  PAR+C D++  +   S+G Y+IDPN G   DA
Sbjct: 291 KDYEVDATLKSLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDA 350

Query: 187 IYVFCNFEKLETCI--EPE---VSNYKKTSYMKSHSSWFSILASLNKQVSYKI----PKE 339
           I V+C+F   ETCI  +PE   V N+ K+S  K H  W     +   Q  Y +     KE
Sbjct: 351 IKVYCDFSTGETCIRAQPENISVKNWYKSSKAKKH-VWLGETINGGTQFEYNVEGVTSKE 409

Query: 340 ---QLVFLQLSSESTSQNFTLSCDNIGLVSDNSVNQENKYNNSLQLLGDDNQILTVNNDD 510
              QL F++L +   SQN T  C N    S   +++E    N   +L   N +  V   +
Sbjct: 410 MATQLAFMRLLANHASQNITYHCKN----SIAYMDEETGNLNKAVILQGSNDVELVAEGN 465

Query: 511 DLFQYQIIEDNC-KGESSSGRVVIKVELDRPRRLP 612
             F Y ++ D C K  +  G+ +I+ + ++P RLP
Sbjct: 466 SRFTYTVLVDGCTKKTNEWGKTIIEYKTNKPSRLP 500
>sp|P05997|CO5A2_HUMAN Collagen alpha 2(V) chain precursor
          Length = 1496

 Score =  102 bits (253), Expect = 1e-21
 Identities = 68/222 (30%), Positives = 107/222 (48%), Gaps = 17/222 (7%)
 Frame = +1

Query: 1    NDEDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQRDYNNKSNGMYYIDPNGGHWK 180
            N  D  +   L  LS+ ++ +  P GSK +PAR+C D++  ++ K +G Y+IDPN G  +
Sbjct: 1259 NKTDPGVHATLKSLSSQIETMRSPDGSKKHPARTCDDLKLCHSAKQSGEYWIDPNQGSVE 1318

Query: 181  DAIYVFCNFEKLETCIEPEVSNYKKTSYMKSHSS-----WFSILASLNKQVSY------K 327
            DAI V+CN E  ETCI    S+  + ++  S S      W+ +  +   Q +Y       
Sbjct: 1319 DAIKVYCNMETGETCISANPSSVPRKTWWASKSPDNKPVWYGLDMNRGSQFAYGDHQSPN 1378

Query: 328  IPKEQLVFLQLSSESTSQNFTLSCDN-IGLVSDNSVNQEN----KYNNSLQLLGDDNQIL 492
                Q+ FL+L S+  SQN T  C N +G + D + N +     K  N L +  + N   
Sbjct: 1379 TAITQMTFLRLLSKEASQNITYICKNSVGYMDDQAKNLKKAVVLKGANDLDIKAEGN--- 1435

Query: 493  TVNNDDDLFQYQIIEDNC-KGESSSGRVVIKVELDRPRRLPI 615
                    F+Y +++D C K   + G+ V +       RLPI
Sbjct: 1436 ------IRFRYIVLQDTCSKRNGNVGKTVFEYRTQNVARLPI 1471
>sp|Q01149|CO1A2_MOUSE Collagen alpha 2(I) chain precursor
          Length = 1372

 Score = 99.8 bits (247), Expect = 5e-21
 Identities = 64/214 (29%), Positives = 108/214 (50%), Gaps = 12/214 (5%)
 Frame = +1

Query: 7    EDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQRDYNNKSNGMYYIDPNGGHWKDA 186
            +D ++   L  L+  ++ L  P GS+  PAR+C D++  +   ++  Y+IDPN G   DA
Sbjct: 1137 KDYEVDATLKSLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWNSDYYWIDPNQGCTMDA 1196

Query: 187  IYVFCNFEKLETCIEPE-VSNYKKTSYMKSHSS---WFSILASLNKQVSYKI----PKE- 339
            I V+C+F   ETCI+ + V+   K SY ++ ++   W     +   Q  Y +     KE 
Sbjct: 1197 IKVYCDFSTGETCIQAQPVNTPAKNSYSRAQANKHVWLGETINGGSQFEYNVEGVSSKEM 1256

Query: 340  --QLVFLQLSSESTSQNFTLSCDNIGLVSDNSVNQENKYNNSLQLLGDDNQILTVNNDDD 513
              QL F++L +   SQN T  C N    S   +++E    N   LL   N +  V   + 
Sbjct: 1257 ATQLAFMRLLANRASQNITYHCKN----SIAYLDEETGSLNKAVLLQGSNDVELVAEGNS 1312

Query: 514  LFQYQIIEDNCKGESSS-GRVVIKVELDRPRRLP 612
             F Y ++ D C  +++  G+ +I+ + ++P RLP
Sbjct: 1313 RFTYSVLVDGCSKKTNEWGKTIIEYKTNKPSRLP 1346
>sp|O46392|CO1A2_CANFA Collagen alpha 2(I) chain precursor
          Length = 1366

 Score = 99.0 bits (245), Expect = 9e-21
 Identities = 67/216 (31%), Positives = 107/216 (49%), Gaps = 13/216 (6%)
 Frame = +1

Query: 7    EDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQRDYNNKSNGMYYIDPNGGHWKDA 186
            +D ++   L  L+  ++ L  P GS+  PAR+C D++  +   S+G Y+IDPN G   DA
Sbjct: 1131 KDYEVDATLKSLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDA 1190

Query: 187  IYVFCNFEKLETCI--EPE---VSNYKKTSYMKSHSSWFSILASLNKQVSYKI----PKE 339
            I V+C+F   ETCI  +PE     N+ + S +K H  W     +   Q  Y +     KE
Sbjct: 1191 IKVYCDFSTGETCIRAQPENIPAKNWYRNSKVKKH-IWLGETINGGTQFEYNVEGVTTKE 1249

Query: 340  ---QLVFLQLSSESTSQNFTLSCDNIGLVSDNSVNQENKYNNSLQLLGDDNQILTVNNDD 510
               QL F++L +   SQN T  C N    S   +++E        +L   N +  V   +
Sbjct: 1250 MATQLAFMRLLANHASQNITYHCKN----SIAYMDEETGNLKKAVILQGSNDVELVAEGN 1305

Query: 511  DLFQYQIIEDNCKGESSSGR-VVIKVELDRPRRLPI 615
              F Y ++ D C  +++  R  +I+ + ++P RLPI
Sbjct: 1306 SRFTYTVLVDGCSKKTNEWRKTIIEYKTNKPSRLPI 1341
>sp|P02465|CO1A2_BOVIN Collagen alpha 2(I) chain precursor
          Length = 1364

 Score = 98.2 bits (243), Expect = 1e-20
 Identities = 67/216 (31%), Positives = 107/216 (49%), Gaps = 13/216 (6%)
 Frame = +1

Query: 7    EDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQRDYNNKSNGMYYIDPNGGHWKDA 186
            +D ++   L  L+  ++ L  P GS+  PAR+C D++  +   S+G Y+IDPN G   DA
Sbjct: 1129 KDYEVDATLKSLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDA 1188

Query: 187  IYVFCNFEKLETCI--EPE---VSNYKKTSYMKSHSSWFSILASLNKQVSYKI----PKE 339
            I V+C+F   ETCI  +PE   V N+ + S  K H  W     +   Q  Y +     KE
Sbjct: 1189 IKVYCDFSTGETCIRAQPEDIPVKNWYRNSKAKKH-VWVGETINGGTQFEYNVEGVTTKE 1247

Query: 340  ---QLVFLQLSSESTSQNFTLSCDNIGLVSDNSVNQENKYNNSLQLLGDDNQILTVNNDD 510
               QL F++L +   SQN T  C N    S   +++E        +L   N +  V   +
Sbjct: 1248 MATQLAFMRLLANHASQNITYHCKN----SIAYMDEETGNLKKAVILQGSNDVELVAEGN 1303

Query: 511  DLFQYQIIEDNCKGESSS-GRVVIKVELDRPRRLPI 615
              F Y ++ D C  +++   + +I+ + ++P RLPI
Sbjct: 1304 SRFTYTVLVDGCSKKTNEWQKTIIEYKTNKPSRLPI 1339
>sp|O93484|CO1A2_ONCMY Collagen alpha 2(I) chain precursor
          Length = 1356

 Score = 97.8 bits (242), Expect = 2e-20
 Identities = 62/215 (28%), Positives = 103/215 (47%), Gaps = 12/215 (5%)
 Frame = +1

Query: 7    EDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQRDYNNKSNGMYYIDPNGGHWKDA 186
            +D ++   +  L++ ++NL  P GSK  PAR+C DI+  + + S+G Y+IDPN G   DA
Sbjct: 1121 KDYEVDATIKSLNSQIENLLTPEGSKKNPARTCRDIRLSHPDWSSGFYWIDPNQGCIADA 1180

Query: 187  IYVFCNFEKLETCIEPEVSNYKKTSYMKSHSS----WF-------SILASLNKQVSYKIP 333
            I  +C+F    TCI P   +  + ++ +S  +    WF       +  A  ++ +S +  
Sbjct: 1181 IKAYCDFSTGHTCIHPHPESIARKNWYRSSENKKHVWFGETINGGTEFAYNDETLSPQSM 1240

Query: 334  KEQLVFLQLSSESTSQNFTLSCDNIGLVSDNSVNQENKYNNSLQLLGDDNQILTVNNDDD 513
              QL F++L +   +QN T  C N     D     EN       LL   N +      + 
Sbjct: 1241 ATQLAFMRLLANQATQNITYHCKNSVAYMDG----ENGNLKKAVLLQGSNDVELRAEGNS 1296

Query: 514  LFQYQIIEDNCKGESSS-GRVVIKVELDRPRRLPI 615
             F + ++ED C   +    + VI+   ++P RLPI
Sbjct: 1297 RFTFNVLEDGCTRHTGQWSKTVIEYRTNKPSRLPI 1331
>sp|O42350|CO1A2_RANCA Collagen alpha 2(I) chain precursor
          Length = 1355

 Score = 97.1 bits (240), Expect = 3e-20
 Identities = 65/217 (29%), Positives = 98/217 (45%), Gaps = 14/217 (6%)
 Frame = +1

Query: 7    EDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQRDYNNKSNGMYYIDPNGGHWKDA 186
            +D ++   L  L+  ++ +  P GS+  PAR+C D++  +   ++G Y+IDPN G   DA
Sbjct: 1118 KDYEVDATLKSLNQQIEVILTPEGSRKNPARTCRDLRLSHPEWTSGFYWIDPNQGCTSDA 1177

Query: 187  IYVFCNFEKLETCIEPEVSN------YKKTSYMKSHSSWFSILASLNKQVSY-------K 327
            I VFC+F   ETCI            Y  TS       WF  + +   Q  Y       K
Sbjct: 1178 IRVFCDFSSGETCIHANPDEITQKNWYINTSNKDKKHLWFGEILNGGTQFEYHDEGLTAK 1237

Query: 328  IPKEQLVFLQLSSESTSQNFTLSCDNIGLVSDNSVNQENKYNNSLQLLGDDNQILTVNND 507
                QL F++L +   SQN T  C N    S   +++E        +L   N +      
Sbjct: 1238 DMATQLAFMRLLANQASQNITYHCKN----SIAYMDEETGNLKKAVILQGSNDVELRAEG 1293

Query: 508  DDLFQYQIIEDNC-KGESSSGRVVIKVELDRPRRLPI 615
            +  F Y ++ED C K     G+ VI+   ++P RLPI
Sbjct: 1294 NTRFTYSVLEDGCTKHTGEWGKTVIEYRTNKPSRLPI 1330
>sp|P02452|CO1A1_HUMAN Collagen alpha 1(I) chain precursor
          Length = 1464

 Score = 96.3 bits (238), Expect = 6e-20
 Identities = 70/218 (32%), Positives = 101/218 (46%), Gaps = 14/218 (6%)
 Frame = +1

Query: 4    DEDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQRDYNNKSNGMYYIDPNGGHWKD 183
            D D ++   L  LS  ++N+  P GS+  PAR+C D++  +++  +G Y+IDPN G   D
Sbjct: 1226 DRDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCRDLKMCHSDWKSGEYWIDPNQGCNLD 1285

Query: 184  AIYVFCNFEKLETCIEP-EVSNYKKTSYMKSHSS-----WFSILASLNKQVSY----KIP 333
            AI VFCN E  ETC+ P + S  +K  Y+  +       WF    +   Q  Y      P
Sbjct: 1286 AIKVFCNMETGETCVYPTQPSVAQKNWYISKNPKDKRHVWFGESMTDGFQFEYGGQGSDP 1345

Query: 334  KE---QLVFLQLSSESTSQNFTLSCDNIGLVSDNSVNQENKYNNSLQLLGDDNQILTVNN 504
             +   QL FL+L S   SQN T  C N     D       K      LL   N+I     
Sbjct: 1346 ADVAIQLTFLRLMSTEASQNITYHCKNSVAYMDQQTGNLKK----ALLLKGSNEIEIRAE 1401

Query: 505  DDDLFQYQIIEDNCKGESSS-GRVVIKVELDRPRRLPI 615
             +  F Y +  D C   + + G+ VI+ +  +  RLPI
Sbjct: 1402 GNSRFTYSVTVDGCTSHTGAWGKTVIEYKTTKTSRLPI 1439
>sp|P02457|CA11_CHICK Collagen alpha 1(I) chain precursor
          Length = 1453

 Score = 95.1 bits (235), Expect = 1e-19
 Identities = 67/218 (30%), Positives = 101/218 (46%), Gaps = 14/218 (6%)
 Frame = +1

Query: 4    DEDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQRDYNNKSNGMYYIDPNGGHWKD 183
            D D ++   L  LS  ++N+  P G++  PAR+C D++  + +  +G Y+IDPN G   D
Sbjct: 1215 DRDLEVDTTLKSLSQQIENIRSPEGTRKNPARTCRDLKMCHGDWKSGEYWIDPNQGCNLD 1274

Query: 184  AIYVFCNFEKLETCIEPEVSNYKKTSYMKSHSS------WFSILASLNKQVSY----KIP 333
            AI V+CN E  ETC+ P  +   + ++  S +       WF    S   Q  Y      P
Sbjct: 1275 AIKVYCNMETGETCVYPTQATIAQKNWYLSKNPKEKKHVWFGETMSDGFQFEYGGEGSNP 1334

Query: 334  KE---QLVFLQLSSESTSQNFTLSCDNIGLVSDNSVNQENKYNNSLQLLGDDNQILTVNN 504
             +   QL FL+L S   +QN T  C N     D+      K      LL   N+I     
Sbjct: 1335 ADVAIQLTFLRLMSTEATQNVTYHCKNSVAYMDHDTGNLKK----ALLLQGANEIEIRAE 1390

Query: 505  DDDLFQYQIIEDNCKGESSS-GRVVIKVELDRPRRLPI 615
             +  F Y + ED C   + + G+ VI+ +  +  RLPI
Sbjct: 1391 GNSRFTYGVTEDGCTSHTGAWGKTVIEYKTTKTSRLPI 1428
>sp|Q9XSJ7|CO1A1_CANFA Collagen alpha 1(I) chain precursor
          Length = 1460

 Score = 94.7 bits (234), Expect = 2e-19
 Identities = 68/218 (31%), Positives = 101/218 (46%), Gaps = 14/218 (6%)
 Frame = +1

Query: 4    DEDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQRDYNNKSNGMYYIDPNGGHWKD 183
            D D ++   L  LS  ++N+  P GS+  PAR+C D++  +++  +G Y+IDPN G   D
Sbjct: 1222 DRDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCRDLKMCHSDWKSGEYWIDPNQGCNLD 1281

Query: 184  AIYVFCNFEKLETCI---EPEVSN---YKKTSYMKSHSSWFSILASLNKQVSY----KIP 333
            AI VFCN E  ETC+   +P+V+    Y   +  +    W+    +   Q  Y      P
Sbjct: 1282 AIKVFCNMETGETCVYPTQPQVAQKNWYISKNPKEKRHVWYGESMTDGFQFEYGGQGSDP 1341

Query: 334  KE---QLVFLQLSSESTSQNFTLSCDNIGLVSDNSVNQENKYNNSLQLLGDDNQILTVNN 504
             +   QL FL+L S   SQN T  C N     D       K      LL   N+I     
Sbjct: 1342 ADVAIQLTFLRLMSTEASQNITYHCKNSVAYMDQQTGNLKK----ALLLQGSNEIEIRAE 1397

Query: 505  DDDLFQYQIIEDNCKGESSS-GRVVIKVELDRPRRLPI 615
             +  F Y +  D C   + + G+ VI+ +  +  RLPI
Sbjct: 1398 GNSRFTYSVTYDGCTSHTGAWGKTVIEYKTTKTSRLPI 1435
  Database: Non-redundant SwissProt sequences
    Posted date:  Dec 6, 2005  7:40 AM
  Number of letters in database: 68,354,980
  Number of sequences in database:  184,735
  
  Database: swissprot.01
    Posted date:  Dec 6, 2005  8:18 AM
  Number of letters in database: 66,202,850
  Number of sequences in database:  184,431
  
Lambda     K      H
   0.314    0.133    0.386 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 66,222,247
Number of Sequences: 369166
Number of extensions: 1362631
Number of successful extensions: 2941
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 2802
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2910
length of database: 68,354,980
effective HSP length: 105
effective length of database: 48,957,805
effective search space used: 4895780500
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
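
Note on the statistics above: the bit scores and E-values in this report can be approximately reproduced from the footer values using the standard Karlin-Altschul relations, S' = (Lambda*S - ln K) / ln 2 and E = N * 2^(-S'), where S is the raw score shown in parentheses on each "Score =" line and N is the effective search space. The sketch below is illustrative only. The function and constant names are made up for this example and are not part of BLAST, and because Lambda and K are printed here to only three significant digits, the results agree with the report only to within rounding (for example, the formula gives about 103.6 bits for raw score 257, which the report prints as 103).

```python
import math

# Values copied from the report footer (gapped Karlin-Altschul parameters).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
SEARCH_SPACE = 4_895_780_500   # "effective search space used"

DB_LETTERS = 68_354_980        # "length of database"
DB_SEQUENCES = 184_735         # "Number of sequences in database"
EFFECTIVE_HSP_LENGTH = 105     # "effective HSP length"


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: S' = (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=SEARCH_SPACE):
    """Expected number of chance hits scoring at least `bits`: E = N * 2^(-S')."""
    return search_space * 2.0 ** (-bits)


def effective_db_length(letters=DB_LETTERS, sequences=DB_SEQUENCES,
                        hsp_len=EFFECTIVE_HSP_LENGTH):
    """Database letters minus one effective HSP length per database sequence."""
    return letters - sequences * hsp_len


if __name__ == "__main__":
    # Raw scores taken from the first three "Score =" lines of the report.
    for name, raw in [("CO1A2_RABIT", 257), ("CO5A2_HUMAN", 253), ("CO1A2_MOUSE", 247)]:
        bits = bit_score(raw)
        print(f"{name}: raw={raw} bits={bits:.1f} E={expect_value(bits):.1e}")
    print("effective database length:", effective_db_length())
```

The effective database length computed this way matches the 48,957,805 reported above, and dividing the effective search space by it gives 100, the effective query length implied by the footer.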