Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= Dr_sW_002_C14
(833 letters)
Database: Non-redundant SwissProt sequences
184,735 sequences; 68,354,980 total letters
Score E
Sequences producing significant alignments: (bits) Value
sp|P05997|CO5A2_HUMAN Collagen alpha 2(V) chain precursor 94 4e-19
sp|O93484|CO1A2_ONCMY Collagen alpha 2(I) chain precursor 88 3e-17
sp|P02458|CO2A1_HUMAN Collagen alpha 1(II) chain precursor ... 86 2e-16
sp|Q28668|CO1A2_RABIT Collagen alpha 2(I) chain precursor 85 2e-16
sp|P02460|CA12_CHICK Collagen alpha 1(II) chain precursor 85 2e-16
sp|P02465|CO1A2_BOVIN Collagen alpha 2(I) chain precursor 85 3e-16
sp||P12105_3 [Segment 3 of 3] Collagen alpha 1(III) chain p... 84 4e-16
sp|P02461|CO3A1_HUMAN Collagen alpha 1(III) chain precursor 84 5e-16
sp|O46392|CO1A2_CANFA Collagen alpha 2(I) chain precursor 84 5e-16
sp|O42350|CO1A2_RANCA Collagen alpha 2(I) chain precursor 84 5e-16
>sp|P05997|CO5A2_HUMAN Collagen alpha 2(V) chain precursor
Length = 1496
Score = 94.4 bits (233), Expect = 4e-19
Identities = 60/185 (32%), Positives = 92/185 (49%), Gaps = 25/185 (13%)
Frame = +1
Query: 352 DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCKIKERKTCIKS 531
+ + PDG++ PARTC L + + + G YWIDPN G ++DA++VYC ++ +TCI +
Sbjct: 1277 ETMRSPDGSKKHPARTCDDLKLCHSAKQSGEYWIDPNQGSVEDAIKVYCNMETGETCISA 1336
Query: 532 IYRETSLEKPRFNWY-SQGNDNKFINYALDQQ------------------QLTFLKMISN 654
PR W+ S+ DNK + Y LD Q+TFL+++S
Sbjct: 1337 ----NPSSVPRKTWWASKSPDNKPVWYGLDMNRGSQFAYGDHQSPNTAITQMTFLRLLSK 1392
Query: 655 KASQFVTINCQNM-----PIIKNSVKPLRIFTDNDIILDNSDQI-FSYKILQDNCQYNSP 816
+ASQ +T C+N KN K + + ND+ + I F Y +LQD C +
Sbjct: 1393 EASQNITYICKNSVGYMDDQAKNLKKAVVLKGANDLDIKAEGNIRFRYIVLQDTCSKRNG 1452
Query: 817 NLSST 831
N+ T
Sbjct: 1453 NVGKT 1457
>sp|O93484|CO1A2_ONCMY Collagen alpha 2(I) chain precursor
Length = 1356
Score = 87.8 bits (216), Expect = 3e-17
Identities = 72/260 (27%), Positives = 113/260 (43%), Gaps = 33/260 (12%)
Frame = +1
Query: 151 GFQGERGPTGNPGLQGVDXXXXXXXXXXXXXXXXXXXXXXXASMISRPQGYMIIQADQPT 330
G G GP G+PGL G + GY +ADQP+
Sbjct: 1078 GHLGPAGPPGSPGLPG--------------------PAGPAGGGYDQSGGYDEYRADQPS 1117
Query: 331 I-AKYLGNDA-----------ITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKI 474
AK DA + P+G++ PARTC + +P + G YWIDPN G I
Sbjct: 1118 FRAKDYEVDATIKSLNSQIENLLTPEGSKKNPARTCRDIRLSHPDWSSGFYWIDPNQGCI 1177
Query: 475 DDAVQVYCKIKERKTCI----KSIYRET---SLEKPRFNWYSQ----GNDNKFINYALDQ 621
DA++ YC TCI +SI R+ S E + W+ + G + + + L
Sbjct: 1178 ADAIKAYCDFSTGHTCIHPHPESIARKNWYRSSENKKHVWFGETINGGTEFAYNDETLSP 1237
Query: 622 Q----QLTFLKMISNKASQFVTINCQNMPIIK-----NSVKPLRIFTDNDIIL-DNSDQI 771
Q QL F+++++N+A+Q +T +C+N N K + + ND+ L +
Sbjct: 1238 QSMATQLAFMRLLANQATQNITYHCKNSVAYMDGENGNLKKAVLLQGSNDVELRAEGNSR 1297
Query: 772 FSYKILQDNCQYNSPNLSST 831
F++ +L+D C ++ S T
Sbjct: 1298 FTFNVLEDGCTRHTGQWSKT 1317
>sp|P02458|CO2A1_HUMAN Collagen alpha 1(II) chain precursor [Contains: Chondrocalcin]
Length = 1418
Score = 85.5 bits (210), Expect = 2e-16
Identities = 73/248 (29%), Positives = 104/248 (41%), Gaps = 31/248 (12%)
Frame = +1
Query: 151 GFQGERGPTGNPGLQGVDXXXXXXXXXXXXXXXXXXXXXXXASMISRPQ----GYMIIQA 318
G G GP GNPG G R G A
Sbjct: 1125 GETGPAGPPGNPGPPGPPGPPGPGIDMSAFAGLGpreKGPDPLQYMRADQAAGGLRQHDA 1184
Query: 319 DQPTIAKYLGN--DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQV 492
+ K L N ++I P+G++ PARTC L +P +K G YWIDPN G DA++V
Sbjct: 1185 EVDATLKSLNNQIESIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKV 1244
Query: 493 YCKIKERKTCI---------KSIYRETSLEKPRFNW----------YSQGNDNKFINYAL 615
+C ++ +TC+ K+ + S EK W +S G+DN N A
Sbjct: 1245 FCNMETGETCVYPNPANVPKKNWWSSKSKEKKHI-WFGETINGGFHFSYGDDNLAPNTA- 1302
Query: 616 DQQQLTFLKMISNKASQFVTINCQNM-----PIIKNSVKPLRIFTDNDI-ILDNSDQIFS 777
Q+TFL+++S + SQ +T +C+N N K L I ND+ I + F+
Sbjct: 1303 -NVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFT 1361
Query: 778 YKILQDNC 801
Y L+D C
Sbjct: 1362 YTALKDGC 1369
>sp|Q28668|CO1A2_RABIT Collagen alpha 2(I) chain precursor
Length = 526
Score = 85.1 bits (209), Expect = 2e-16
Identities = 66/245 (26%), Positives = 102/245 (41%), Gaps = 27/245 (11%)
Frame = +1
Query: 148 QGFQGERGPTGNPGLQGVDXXXXXXXXXXXXXXXXXXXXXXXASMISRPQGYMIIQADQP 327
QG QG GP G PG G + RP+ Y +
Sbjct: 243 QGSQGPAGPPGPPGPPGPPGASGGGYDFGYDGDFYRADQPRSPPSL-RPKDYEV-----D 296
Query: 328 TIAKYLGN--DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCK 501
K L N + + P+G++ PARTC L +P + G YWIDPN G DA++VYC
Sbjct: 297 ATLKSLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCD 356
Query: 502 IKERKTCIKSIYRETSLEKPRFNWYSQGNDNKFI------------NYALD-------QQ 624
+TCI++ S++ NWY K + Y ++
Sbjct: 357 FSTGETCIRAQPENISVK----NWYKSSKAKKHVWLGETINGGTQFEYNVEGVTSKEMAT 412
Query: 625 QLTFLKMISNKASQFVTINCQNMPIIK-----NSVKPLRIFTDNDI-ILDNSDQIFSYKI 786
QL F+++++N ASQ +T +C+N N K + + ND+ ++ + F+Y +
Sbjct: 413 QLAFMRLLANHASQNITYHCKNSIAYMDEETGNLNKAVILQGSNDVELVAEGNSRFTYTV 472
Query: 787 LQDNC 801
L D C
Sbjct: 473 LVDGC 477
>sp|P02460|CA12_CHICK Collagen alpha 1(II) chain precursor
Length = 369
Score = 85.1 bits (209), Expect = 2e-16
Identities = 55/176 (31%), Positives = 87/176 (49%), Gaps = 26/176 (14%)
Frame = +1
Query: 352 DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCKIKERKTCIKS 531
++I P+G++ PARTC + +P +K G YWIDPN G DA++V+C ++ +TC+
Sbjct: 149 ESIRSPEGSKKNPARTCRDIKLCHPEWKSGDYWIDPNQGCTLDAIKVFCNMETGETCV-- 206
Query: 532 IYRETSLEKPRFNWY-SQGNDNKFINYA-------------------LDQQQLTFLKMIS 651
T PR NW+ S+ D K + +A Q+TFL+++S
Sbjct: 207 --YPTPSSIPRKNWWTSKTKDKKHVWFAETINGGFHFSYGDENLSPNTASIQMTFLRLLS 264
Query: 652 NKASQFVTINCQNMPIIK-----NSVKPLRIFTDNDI-ILDNSDQIFSYKILQDNC 801
+ SQ VT +C+N N K + I ND+ I + F+Y +L+D C
Sbjct: 265 TEGSQNVTYHCKNSIAYMDEETGNLKKAILIQGSNDVEIRAEGNSRFTYSVLEDGC 320
>sp|P02465|CO1A2_BOVIN Collagen alpha 2(I) chain precursor
Length = 1364
Score = 84.7 bits (208), Expect = 3e-16
Identities = 67/245 (27%), Positives = 101/245 (41%), Gaps = 27/245 (11%)
Frame = +1
Query: 148 QGFQGERGPTGNPGLQGVDXXXXXXXXXXXXXXXXXXXXXXXASMISRPQGYMIIQADQP 327
QG QG GP G PG G + + RP+ Y +
Sbjct: 1081 QGSQGPAGPPGPPGPPGPPGPSGGGYEFGFDGDFYRADQPRSPTSL-RPKDYEV-----D 1134
Query: 328 TIAKYLGN--DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCK 501
K L N + + P+G++ PARTC L +P + G YWIDPN G DA++VYC
Sbjct: 1135 ATLKSLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCD 1194
Query: 502 IKERKTCIKSIYRETSLEKPRFNWYSQGNDNKFI------------NYALD-------QQ 624
+TCI R + P NWY K + Y ++
Sbjct: 1195 FSTGETCI----RAQPEDIPVKNWYRNSKAKKHVWVGETINGGTQFEYNVEGVTTKEMAT 1250
Query: 625 QLTFLKMISNKASQFVTINCQNMPIIK-----NSVKPLRIFTDNDI-ILDNSDQIFSYKI 786
QL F+++++N ASQ +T +C+N N K + + ND+ ++ + F+Y +
Sbjct: 1251 QLAFMRLLANHASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTV 1310
Query: 787 LQDNC 801
L D C
Sbjct: 1311 LVDGC 1315
>sp||P12105_3 [Segment 3 of 3] Collagen alpha 1(III) chain precursor
Length = 340
Score = 84.3 bits (207), Expect = 4e-16
Identities = 67/258 (25%), Positives = 110/258 (42%), Gaps = 40/258 (15%)
Frame = +1
Query: 148 QGFQGERGPTGNPGLQGVDXXXXXXXXXXXXXXXXXXXXXXXASMISRPQGYMIIQADQP 327
+G +GE GP G PG G+ P GY D+P
Sbjct: 45 RGNRGESGPAGPPGQPGLPGPSGPPGPCCGGGVASLGAGEKG------PVGYGYEYRDEP 98
Query: 328 -----------TIAKYLGN--DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGG 468
+ K + N + I PDG++ PAR C L +P K G YWIDPN G
Sbjct: 99 KENEINLGEIMSSMKSINNQIENILSPDGSRKNPARNCRDLKFCHPELKSGEYWIDPNQG 158
Query: 469 KIDDAVQVYCKIKERKTCIKSIYRETSLEKPRFNWY-SQGNDNKFINYA----------- 612
DA++VYC ++ +TC+ + PR NW+ ++ + K + +
Sbjct: 159 CKMDAIKVYCNMETGETCLSA----NPATVPRKNWWTTESSGKKHVWFGESMKGGFQFSY 214
Query: 613 --------LDQQQLTFLKMISNKASQFVTINCQNMPIIKNSV-----KPLRIFT--DNDI 747
+ + QL FL+++S++ASQ +T +C+N N K L++ + + DI
Sbjct: 215 GDPDLPEDVSEVQLAFLRILSSRASQNITYHCKNSIAYMNQASGNVKKALKLMSSVETDI 274
Query: 748 ILDNSDQIFSYKILQDNC 801
+ + + + Y +L+D C
Sbjct: 275 KAEGNSK-YMYAVLEDGC 291
>sp|P02461|CO3A1_HUMAN Collagen alpha 1(III) chain precursor
Length = 1466
Score = 84.0 bits (206), Expect = 5e-16
Identities = 54/183 (29%), Positives = 92/183 (50%), Gaps = 23/183 (12%)
Frame = +1
Query: 352 DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCKIKERKTCI-- 525
+++ PDG++ PAR C L +P K G YW+DPN G DA++V+C ++ +TCI
Sbjct: 1246 ESLISPDGSRKNPARNCRDLKFCHPELKSGEYWVDPNQGCKLDAIKVFCNMETGETCISA 1305
Query: 526 -------KSIYRETSLEKPRFNWYSQGNDNKF-INYALDQ-------QQLTFLKMISNKA 660
K + ++S EK + W+ + D F +Y + QL FL+++S++A
Sbjct: 1306 NPLNVPRKHWWTDSSAEK-KHVWFGESMDGGFQFSYGNPELPEDVLDVQLAFLRLLSSRA 1364
Query: 661 SQFVTINCQNM-----PIIKNSVKPLRIFTDND-IILDNSDQIFSYKILQDNCQYNSPNL 822
SQ +T +C+N N K L++ N+ + F+Y +L+D C ++
Sbjct: 1365 SQNITYHCKNSIAYMDQASGNVKKALKLMGSNEGEFKAEGNSKFTYTVLEDGCTKHTGEW 1424
Query: 823 SST 831
S T
Sbjct: 1425 SKT 1427
>sp|O46392|CO1A2_CANFA Collagen alpha 2(I) chain precursor
Length = 1366
Score = 84.0 bits (206), Expect = 5e-16
Identities = 68/245 (27%), Positives = 99/245 (40%), Gaps = 27/245 (11%)
Frame = +1
Query: 148 QGFQGERGPTGNPGLQGVDXXXXXXXXXXXXXXXXXXXXXXXASMISRPQGYMIIQADQP 327
QG QG GP G PG G + RP+ Y +
Sbjct: 1083 QGSQGPAGPPGPPGPPGPPGPSGGGYDFGYEGDFYRADQPRSPPSL-RPKDYEV-----D 1136
Query: 328 TIAKYLGN--DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCK 501
K L N + + P+G++ PARTC L +P + G YWIDPN G DA++VYC
Sbjct: 1137 ATLKSLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCD 1196
Query: 502 IKERKTCIKSIYRETSLEKPRFNWYSQGNDNKFI------------NYALD-------QQ 624
+TCI R P NWY K I Y ++
Sbjct: 1197 FSTGETCI----RAQPENIPAKNWYRNSKVKKHIWLGETINGGTQFEYNVEGVTTKEMAT 1252
Query: 625 QLTFLKMISNKASQFVTINCQNMPIIK-----NSVKPLRIFTDNDI-ILDNSDQIFSYKI 786
QL F+++++N ASQ +T +C+N N K + + ND+ ++ + F+Y +
Sbjct: 1253 QLAFMRLLANHASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTV 1312
Query: 787 LQDNC 801
L D C
Sbjct: 1313 LVDGC 1317
>sp|O42350|CO1A2_RANCA Collagen alpha 2(I) chain precursor
Length = 1355
Score = 84.0 bits (206), Expect = 5e-16
Identities = 52/173 (30%), Positives = 87/173 (50%), Gaps = 23/173 (13%)
Frame = +1
Query: 352 DAITQPDGTQNLPARTCLHLAEINPSFKDGLYWIDPNGGKIDDAVQVYCKIKERKTCI-- 525
+ I P+G++ PARTC L +P + G YWIDPN G DA++V+C +TCI
Sbjct: 1134 EVILTPEGSRKNPARTCRDLRLSHPEWTSGFYWIDPNQGCTSDAIRVFCDFSSGETCIHA 1193
Query: 526 -------KSIYRETSLEKPRFNWYSQ----GNDNKFINYALDQQ----QLTFLKMISNKA 660
K+ Y TS + + W+ + G ++ + L + QL F+++++N+A
Sbjct: 1194 NPDEITQKNWYINTSNKDKKHLWFGEILNGGTQFEYHDEGLTAKDMATQLAFMRLLANQA 1253
Query: 661 SQFVTINCQNMPIIK-----NSVKPLRIFTDNDIIL-DNSDQIFSYKILQDNC 801
SQ +T +C+N N K + + ND+ L + F+Y +L+D C
Sbjct: 1254 SQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELRAEGNTRFTYSVLEDGC 1306
Database: Non-redundant SwissProt sequences
Posted date: Dec 6, 2005 7:40 AM
Number of letters in database: 68,354,980
Number of sequences in database: 184,735
Database: swissprot.01
Posted date: Dec 6, 2005 8:18 AM
Number of letters in database: 66,202,850
Number of sequences in database: 184,431
Lambda K H
0.317 0.136 0.407
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 86,019,762
Number of Sequences: 369166
Number of extensions: 1622128
Number of successful extensions: 7133
Number of sequences better than 10.0: 10
Number of HSP's better than 10.0 without gapping: 4479
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 7076
length of database: 68,354,980
effective HSP length: 109
effective length of database: 48,218,865
effective search space used: 8100769320
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)