Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dr_sW_001_I24 (846 letters) Database: Non-redundant SwissProt sequences 184,735 sequences; 68,354,980 total letters Score E Sequences producing significant alignments: (bits) Value sp|P05997|CO5A2_HUMAN Collagen alpha 2(V) chain precursor 120 6e-27 sp|Q28668|CO1A2_RABIT Collagen alpha 2(I) chain precursor 118 2e-26 sp|Q01149|CO1A2_MOUSE Collagen alpha 2(I) chain precursor 117 3e-26 sp|O46392|CO1A2_CANFA Collagen alpha 2(I) chain precursor 114 3e-25 sp|O42350|CO1A2_RANCA Collagen alpha 2(I) chain precursor 114 3e-25 sp|P02465|CO1A2_BOVIN Collagen alpha 2(I) chain precursor 114 4e-25 sp|O93484|CO1A2_ONCMY Collagen alpha 2(I) chain precursor 112 1e-24 sp|P02466|CO1A2_RAT Collagen alpha 2(I) chain precursor 110 5e-24 sp|P02467|CO1A2_CHICK Collagen alpha 2(I) chain precursor 110 6e-24 sp|P02457|CA11_CHICK Collagen alpha 1(I) chain precursor 110 6e-24
>sp|P05997|CO5A2_HUMAN Collagen alpha 2(V) chain precursor Length = 1496 Score = 120 bits (300), Expect = 6e-27 Identities = 81/255 (31%), Positives = 125/255 (49%), Gaps = 17/255 (6%) Frame = +3 Query: 39 EQKSSIDPNDEDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQRDYNNKSNGMYYI 218 +Q + D N D + L LS+ ++ + P GSK +PAR+C D++ ++ K +G Y+I Sbjct: 1251 DQAAPDDKNKTDPGVHATLKSLSSQIETMRSPDGSKKHPARTCDDLKLCHSAKQSGEYWI 1310 Query: 219 DPNGGHWKDAIYVFCNFEKLETCIEPEVSNYKKTSYMKSHSS-----WFSILASLNKQVS 383 DPN G +DAI V+CN E ETCI S+ + ++ S S W+ + + Q + Sbjct: 1311 DPNQGSVEDAIKVYCNMETGETCISANPSSVPRKTWWASKSPDNKPVWYGLDMNRGSQFA 1370 Query: 384 Y------KIPKEQLVFLQLSSESTSQNFTLSCDN-IGLVSDNSVNQEN----KYNNSLQL 530 Y Q+ FL+L S+ SQN T C N +G + D + N + K N L + Sbjct: 1371 YGDHQSPNTAITQMTFLRLLSKEASQNITYICKNSVGYMDDQAKNLKKAVVLKGANDLDI 1430 Query: 531 LGDDNQILTVNNDDDLFQYQIIEDNC-KGKSSSGRVVIKVELDRPRRLPIRDFNFKSLES 707 + N F+Y +++D C K + G+ V + RLPI D + Sbjct: 1431 KAEGN---------IRFRYIVLQDTCSKRNGNVGKTVFEYRTQNVARLPIIDLAPVDVGG 1481 Query: 708 SPQAKIGVEIGPVCF 752 + Q + GVEIGPVCF Sbjct: 1482 TDQ-EFGVEIGPVCF 1495
>sp|Q28668|CO1A2_RABIT Collagen alpha 2(I) chain precursor Length = 526 Score = 118 bits (295), Expect = 2e-26 Identities = 83/259 (32%), Positives = 131/259 (50%), Gaps = 13/259 (5%) Frame = +3 Query: 15 FQRDEPQAEQKSSIDPNDEDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQRDYNN 194 ++ D+P++ S+ P D + D L L+ ++ L P GS+ PAR+C D++ + Sbjct: 277 YRADQPRSPP--SLRPKDYEVDA--TLKSLNNQIETLLTPEGSRKNPARTCRDLRLSHPE 332 Query: 195 KSNGMYYIDPNGGHWKDAIYVFCNFEKLETCI--EPE---VSNYKKTSYMKSHSSWFSIL 359 S+G Y+IDPN G DAI V+C+F ETCI +PE V N+ K+S K H W Sbjct: 333 WSSGYYWIDPNQGCTMDAIKVYCDFSTGETCIRAQPENISVKNWYKSSKAKKH-VWLGET 391 Query: 360 ASLNKQVSYKI----PKE---QLVFLQLSSESTSQNFTLSCDNIGLVSDNSVNQENKYNN 518 + Q Y + KE QL F++L + SQN T C N S +++E N Sbjct: 392 INGGTQFEYNVEGVTSKEMATQLAFMRLLANHASQNITYHCKN----SIAYMDEETGNLN 447 Query: 519 SLQLLGDDNQILTVNNDDDLFQYQIIEDNCKGKSSS-GRVVIKVELDRPRRLPIRDFNFK 695 +L N + V + F Y ++ D C K++ G+ +I+ + ++P RLP D Sbjct: 448 KAVILQGSNDVELVAEGNSRFTYTVLVDGCTKKTNEWGKTIIEYKTNKPSRLPFLDIAPL 507 Query: 696 SLESSPQAKIGVEIGPVCF 752 + + Q + V++GPVCF Sbjct: 508 DIGGADQ-EFYVDVGPVCF 525
>sp|Q01149|CO1A2_MOUSE Collagen alpha 2(I) chain precursor Length = 1372 Score = 117 bits (294), Expect = 3e-26 Identities = 80/262 (30%), Positives = 133/262 (50%), Gaps = 12/262 (4%) Frame = +3 Query: 3 EEEQFQRDEPQAEQKSSIDPNDEDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQR 182 E + ++ D+P+++ S+ P D + D L L+ ++ L P GS+ PAR+C D++ Sbjct: 1119 EGDFYRADQPRSQP--SLRPKDYEVDA--TLKSLNNQIETLLTPEGSRKNPARTCRDLRL 1174 Query: 183 DYNNKSNGMYYIDPNGGHWKDAIYVFCNFEKLETCIEPE-VSNYKKTSYMKSHSS---WF 350 + ++ Y+IDPN G DAI V+C+F ETCI+ + V+ K SY ++ ++ W Sbjct: 1175 SHPEWNSDYYWIDPNQGCTMDAIKVYCDFSTGETCIQAQPVNTPAKNSYSRAQANKHVWL 1234 Query: 351 SILASLNKQVSYKI----PKE---QLVFLQLSSESTSQNFTLSCDNIGLVSDNSVNQENK 509 + Q Y + KE QL F++L + SQN T C N S +++E Sbjct: 1235 GETINGGSQFEYNVEGVSSKEMATQLAFMRLLANRASQNITYHCKN----SIAYLDEETG 1290 Query: 510 YNNSLQLLGDDNQILTVNNDDDLFQYQIIEDNCKGKSSS-GRVVIKVELDRPRRLPIRDF 686 N LL N + V + F Y ++ D C K++ G+ +I+ + ++P RLP D Sbjct: 1291 SLNKAVLLQGSNDVELVAEGNSRFTYSVLVDGCSKKTNEWGKTIIEYKTNKPSRLPFLDI 1350 Query: 687 NFKSLESSPQAKIGVEIGPVCF 752 + + Q + VE+GPVCF Sbjct: 1351 APLDIGGADQ-EFRVEVGPVCF 1371
>sp|O46392|CO1A2_CANFA Collagen alpha 2(I) chain precursor Length = 1366 Score = 114 bits (286), Expect = 3e-25 Identities = 82/263 (31%), Positives = 131/263 (49%), Gaps = 13/263 (4%) Frame = +3 Query: 3 EEEQFQRDEPQAEQKSSIDPNDEDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQR 182 E + ++ D+P++ S+ P D + D L L+ ++ L P GS+ PAR+C D++ Sbjct: 1113 EGDFYRADQPRSPP--SLRPKDYEVDA--TLKSLNNQIETLLTPEGSRKNPARTCRDLRL 1168 Query: 183 DYNNKSNGMYYIDPNGGHWKDAIYVFCNFEKLETCI--EPE---VSNYKKTSYMKSHSSW 347 + S+G Y+IDPN G DAI V+C+F ETCI +PE N+ + S +K H W Sbjct: 1169 SHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTGETCIRAQPENIPAKNWYRNSKVKKH-IW 1227 Query: 348 FSILASLNKQVSYKI----PKE---QLVFLQLSSESTSQNFTLSCDNIGLVSDNSVNQEN 506 + Q Y + KE QL F++L + SQN T C N S +++E Sbjct: 1228 LGETINGGTQFEYNVEGVTTKEMATQLAFMRLLANHASQNITYHCKN----SIAYMDEET 1283 Query: 507 KYNNSLQLLGDDNQILTVNNDDDLFQYQIIEDNCKGKSSSGR-VVIKVELDRPRRLPIRD 683 +L N + V + F Y ++ D C K++ R +I+ + ++P RLPI D Sbjct: 1284 GNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKKTNEWRKTIIEYKTNKPSRLPILD 1343 Query: 684 FNFKSLESSPQAKIGVEIGPVCF 752 + + Q + V++GPVCF Sbjct: 1344 IAPLDIGDADQ-EFRVDVGPVCF 1365
>sp|O42350|CO1A2_RANCA Collagen alpha 2(I) chain precursor Length = 1355 Score = 114 bits (286), Expect = 3e-25 Identities = 80/262 (30%), Positives = 119/262 (45%), Gaps = 14/262 (5%) Frame = +3 Query: 9 EQFQRDEPQAEQKSSIDPNDEDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQRDY 188 E ++ D+P+ + K D ++ L L+ ++ + P GS+ PAR+C D++ + Sbjct: 1106 EYYRADQPERKPK--------DYEVDATLKSLNQQIEVILTPEGSRKNPARTCRDLRLSH 1157 Query: 189 NNKSNGMYYIDPNGGHWKDAIYVFCNFEKLETCIEPEVSN------YKKTSYMKSHSSWF 350 ++G Y+IDPN G DAI VFC+F ETCI Y TS WF Sbjct: 1158 PEWTSGFYWIDPNQGCTSDAIRVFCDFSSGETCIHANPDEITQKNWYINTSNKDKKHLWF 1217 Query: 351 SILASLNKQVSY-------KIPKEQLVFLQLSSESTSQNFTLSCDNIGLVSDNSVNQENK 509 + + Q Y K QL F++L + SQN T C N S +++E Sbjct: 1218 GEILNGGTQFEYHDEGLTAKDMATQLAFMRLLANQASQNITYHCKN----SIAYMDEETG 1273 Query: 510 YNNSLQLLGDDNQILTVNNDDDLFQYQIIEDNC-KGKSSSGRVVIKVELDRPRRLPIRDF 686 +L N + + F Y ++ED C K G+ VI+ ++P RLPI D Sbjct: 1274 NLKKAVILQGSNDVELRAEGNTRFTYSVLEDGCTKHTGEWGKTVIEYRTNKPSRLPILDI 1333 Query: 687 NFKSLESSPQAKIGVEIGPVCF 752 + Q +IG EIGPVCF Sbjct: 1334 APLDIGGHDQ-EIGFEIGPVCF 1354
>sp|P02465|CO1A2_BOVIN Collagen alpha 2(I) chain precursor Length = 1364 Score = 114 bits (284), Expect = 4e-25 Identities = 82/259 (31%), Positives = 130/259 (50%), Gaps = 13/259 (5%) Frame = +3 Query: 15 FQRDEPQAEQKSSIDPNDEDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQRDYNN 194 ++ D+P++ +S+ P D + D L L+ ++ L P GS+ PAR+C D++ + Sbjct: 1115 YRADQPRSP--TSLRPKDYEVDA--TLKSLNNQIETLLTPEGSRKNPARTCRDLRLSHPE 1170 Query: 195 KSNGMYYIDPNGGHWKDAIYVFCNFEKLETCI--EPE---VSNYKKTSYMKSHSSWFSIL 359 S+G Y+IDPN G DAI V+C+F ETCI +PE V N+ + S K H W Sbjct: 1171 WSSGYYWIDPNQGCTMDAIKVYCDFSTGETCIRAQPEDIPVKNWYRNSKAKKH-VWVGET 1229 Query: 360 ASLNKQVSYKI----PKE---QLVFLQLSSESTSQNFTLSCDNIGLVSDNSVNQENKYNN 518 + Q Y + KE QL F++L + SQN T C N S +++E Sbjct: 1230 INGGTQFEYNVEGVTTKEMATQLAFMRLLANHASQNITYHCKN----SIAYMDEETGNLK 1285 Query: 519 SLQLLGDDNQILTVNNDDDLFQYQIIEDNCKGKSSS-GRVVIKVELDRPRRLPIRDFNFK 695 +L N + V + F Y ++ D C K++ + +I+ + ++P RLPI D Sbjct: 1286 KAVILQGSNDVELVAEGNSRFTYTVLVDGCSKKTNEWQKTIIEYKTNKPSRLPILDIAPL 1345 Query: 696 SLESSPQAKIGVEIGPVCF 752 + + Q +I + IGPVCF Sbjct: 1346 DIGGADQ-EIRLNIGPVCF 1363
>sp|O93484|CO1A2_ONCMY Collagen alpha 2(I) chain precursor Length = 1356 Score = 112 bits (281), Expect = 1e-24 Identities = 76/255 (29%), Positives = 124/255 (48%), Gaps = 12/255 (4%) Frame = +3 Query: 24 DEPQAEQKSSIDPNDEDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQRDYNNKSN 203 DE +A+Q S +D ++ + L++ ++NL P GSK PAR+C DI+ + + S+ Sbjct: 1109 DEYRADQPSF---RAKDYEVDATIKSLNSQIENLLTPEGSKKNPARTCRDIRLSHPDWSS 1165 Query: 204 GMYYIDPNGGHWKDAIYVFCNFEKLETCIEPEVSNYKKTSYMKSHSS----WF------- 350 G Y+IDPN G DAI +C+F TCI P + + ++ +S + WF Sbjct: 1166 GFYWIDPNQGCIADAIKAYCDFSTGHTCIHPHPESIARKNWYRSSENKKHVWFGETINGG 1225 Query: 351 SILASLNKQVSYKIPKEQLVFLQLSSESTSQNFTLSCDNIGLVSDNSVNQENKYNNSLQL 530 + A ++ +S + QL F++L + +QN T C N D EN L Sbjct: 1226 TEFAYNDETLSPQSMATQLAFMRLLANQATQNITYHCKNSVAYMDG----ENGNLKKAVL 1281 Query: 531 LGDDNQILTVNNDDDLFQYQIIEDNCKGKSSS-GRVVIKVELDRPRRLPIRDFNFKSLES 707 L N + + F + ++ED C + + VI+ ++P RLPI D + Sbjct: 1282 LQGSNDVELRAEGNSRFTFNVLEDGCTRHTGQWSKTVIEYRTNKPSRLPILDIAPLDIGE 1341 Query: 708 SPQAKIGVEIGPVCF 752 + Q + G++IGPVCF Sbjct: 1342 ADQ-EFGLDIGPVCF 1355
>sp|P02466|CO1A2_RAT Collagen alpha 2(I) chain precursor Length = 1372 Score = 110 bits (275), Expect = 5e-24 Identities = 77/258 (29%), Positives = 128/258 (49%), Gaps = 12/258 (4%) Frame = +3 Query: 15 FQRDEPQAEQKSSIDPNDEDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQRDYNN 194 ++ D+P+++ S+ P D + D L L+ ++ L P GS+ PAR+C D++ + Sbjct: 1123 YRADQPRSQP--SLRPKDYEVDA--TLKSLNNQIETLLTPEGSRKNPARTCRDLRLSHPE 1178 Query: 195 KSNGMYYIDPNGGHWKDAIYVFCNFEKLETCIEPE-VSNYKKTSYMKSHSS---WFSILA 362 + Y+IDPN G DAI V+C+F ETCI+ + V+ K +Y ++ ++ W Sbjct: 1179 WKSDYYWIDPNQGCTMDAIKVYCDFSTGETCIQAQPVNTPAKNAYSRAQANKHVWLGETI 1238 Query: 363 SLNKQ-------VSYKIPKEQLVFLQLSSESTSQNFTLSCDNIGLVSDNSVNQENKYNNS 521 + Q VS K QL F++L + SQN T C N S +++E N Sbjct: 1239 NGGSQFEYNAEGVSSKEMATQLAFMRLLANRASQNITYHCKN----SIAYLDEETGRLNK 1294 Query: 522 LQLLGDDNQILTVNNDDDLFQYQIIEDNCKGKSSS-GRVVIKVELDRPRRLPIRDFNFKS 698 +L N + V + F Y ++ D C K++ + VI+ + ++P RLP D Sbjct: 1295 AVILQGSNDVELVAEGNSRFTYTVLVDGCSKKTNEWDKTVIEYKTNKPSRLPFLDIAPLD 1354 Query: 699 LESSPQAKIGVEIGPVCF 752 + + Q + VE+GPVCF Sbjct: 1355 IGGTNQ-EFRVEVGPVCF 1371
>sp|P02467|CO1A2_CHICK Collagen alpha 2(I) chain precursor Length = 1362 Score = 110 bits (274), Expect = 6e-24 Identities = 78/264 (29%), Positives = 123/264 (46%), Gaps = 14/264 (5%) Frame = +3 Query: 3 EEEQFQRDEPQAEQKSSIDPNDEDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQR 182 + E ++ D+P S+ P D + D L L+ ++ L P GSK PAR+C D++ Sbjct: 1111 DAEYYRADQP------SLRPKDYEVDA--TLKTLNNQIETLLTPEGSKKNPARTCRDLRL 1162 Query: 183 DYNNKSNGMYYIDPNGGHWKDAIYVFCNFEKLETCIEPEVSNY-KKTSYMKSHSS----- 344 + S+G Y+IDPN G DAI +C+F ETCI + + KT Y+ + Sbjct: 1163 SHPEWSSGFYWIDPNQGCTADAIRAYCDFATGETCIHASLEDIPTKTWYVSKNPKDKKHI 1222 Query: 345 WFSILASLNKQVSY-------KIPKEQLVFLQLSSESTSQNFTLSCDNIGLVSDNSVNQE 503 WF + Q Y K QL F++L + SQN T C N S +++E Sbjct: 1223 WFGETINGGTQFEYNGEGVTTKDMATQLAFMRLLANHASQNITYHCKN----SIAYMDEE 1278 Query: 504 NKYNNSLQLLGDDNQILTVNNDDDLFQYQIIEDNCKGKSSS-GRVVIKVELDRPRRLPIR 680 +L N + + F + ++ D C K++ G+ +I+ ++P RLPI Sbjct: 1279 TGNLKKAVILQGSNDVELRAEGNSRFTFSVLVDGCSKKNNKWGKTIIEYRTNKPSRLPIL 1338 Query: 681 DFNFKSLESSPQAKIGVEIGPVCF 752 D + + Q + G+ IGPVCF Sbjct: 1339 DIAPLDIGGADQ-EFGLHIGPVCF 1361
>sp|P02457|CA11_CHICK Collagen alpha 1(I) chain precursor Length = 1453 Score = 110 bits (274), Expect = 6e-24 Identities = 76/243 (31%), Positives = 115/243 (47%), Gaps = 14/243 (5%) Frame = +3 Query: 66 DEDEDIFEALLRLSTIVDNLFKPPGSKTYPARSCADIQRDYNNKSNGMYYIDPNGGHWKD 245 D D ++ L LS ++N+ P G++ PAR+C D++ + + +G Y+IDPN G D Sbjct: 1215 DRDLEVDTTLKSLSQQIENIRSPEGTRKNPARTCRDLKMCHGDWKSGEYWIDPNQGCNLD 1274 Query: 246 AIYVFCNFEKLETCIEPEVSNYKKTSYMKSHSS------WFSILASLNKQVSY----KIP 395 AI V+CN E ETC+ P + + ++ S + WF S Q Y P Sbjct: 1275 AIKVYCNMETGETCVYPTQATIAQKNWYLSKNPKEKKHVWFGETMSDGFQFEYGGEGSNP 1334 Query: 396 KE---QLVFLQLSSESTSQNFTLSCDNIGLVSDNSVNQENKYNNSLQLLGDDNQILTVNN 566 + QL FL+L S +QN T C N D+ K LL N+I Sbjct: 1335 ADVAIQLTFLRLMSTEATQNVTYHCKNSVAYMDHDTGNLKK----ALLLQGANEIEIRAE 1390 Query: 567 DDDLFQYQIIEDNCKGKSSS-GRVVIKVELDRPRRLPIRDFNFKSLESSPQAKIGVEIGP 743 + F Y + ED C + + G+ VI+ + + RLPI D + +P + G++IGP Sbjct: 1391 GNSRFTYGVTEDGCTSHTGAWGKTVIEYKTTKTSRLPIIDLAPMDV-GAPDQEFGIDIGP 1449 Query: 744 VCF 752 VCF Sbjct: 1450 VCF 1452
Database: Non-redundant SwissProt sequences Posted date: Dec 6, 2005 7:40 AM Number of letters in database: 68,354,980 Number of sequences in database: 184,735 Database: swissprot.01 Posted date: Dec 6, 2005 8:18 AM Number of letters in database: 66,202,850 Number of sequences in database: 184,431 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 89,849,007 Number of Sequences: 369166 Number of extensions: 1863327 Number of successful extensions: 5998 Number of sequences better than 10.0: 10 Number of HSP's better than 10.0 without gapping: 5364 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 5888 length of database: 68,354,980 effective HSP length: 109 effective length of database: 48,218,865 effective search space used: 8293644780 frameshift window, decay const: 40, 0.1 T: 12 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits)