- 1、有哪些信誉好的足球投注网站(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。。
- 2、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 3、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
- 4、该文档为VIP文档,如果想要下载,成为VIP会员后,下载免费。
- 5、成为VIP后,下载本文档将扣除1次下载权益。下载后,不支持退款、换文档。如有疑问请联系我们。
- 6、成为VIP后,您将拥有八大权益,权益包括:VIP文档下载权益、阅读免打扰、文档格式转换、高级专利检索、专属身份标志、高级客服、多端互通、版权登记。
- 7、VIP文档为合作方或网友上传,每下载1次, 网站将根据用户上传文档的质量评分、类型等,对文档贡献者给予高额补贴、流量扶持。如果你也想贡献VIP文档。上传文档
查看更多
2 seq-comparison
Bio-sequence Comparison
Ying Xu (徐鹰)
Bio-molecules
? Three major types of bio-molecules in our cells
– nucleotides (DNA, RNA)
– proteins
– (poly)sugar
Bio-sequences
? The first two classes of bio-molecules have linear
structures so they can be represented as bio-sequences
– DNA sequences (consisting of four types of letters, A, C, G, T)
– RNA sequences (consisting of four types of letters, A, C, G, U)
– protein sequences (consisting of 20 types of letters)
ccgtacgtacgtagagtgctagtctagtcgtagcgccgtagtcgatcgtgtgggtagtagctgatatgatgcga
ggtaggggataggatagcaacagatgagcggatgctgagtgcagtggcatgcgatgtcgatgatagcggta
ggtagacttcgcgcataaagctgcgcgagatgattgcaaagragttagatgagctgatgctagaggtcagtg
actgatgatcgatgcatgcatggatgatgcagctgatcgatgtagatgcaataagtcgatgatcgatgatgatg
DNA sequence
SAANLEYLKNVLLQFIFLKPG-SERERLLPVINTMLQLSPEEKGKLAAV
NEKNMEYLKNVFVQFLKPESVPAERDQLVIVLQRVLHLSPKEVEILKAA
protein sequence
Bio-sequence Comparison
? Bio-sequence comparison is one of the most basic
problems in bioinformatics
? The basic computational problem is to determine if two
sequences are “similar”, partially similar and how similar
– AACGGTA versus ATCGGGT
DNA Sequence Comparison through
Sequence Alignment
? Defining DNA sequence (dis)similarity in terms of two parameters,
gaps and mismatches
? Example 1: AACG and AACG
? Example 2: AAGG and AACG
? Example 3: AACGGTATGC and ATCGGGTTGC
AACG
AACG
| | | |
AAGG
AACG
| | |
1 mismatch
AACG
ATCG
-
G
GT
GT
A
-
TGC
TGC
2 gaps
and 1
mismatch
DNA Sequence Alignment
? Best alignment: the alignment of two sequences with the
smallest possible number of mismatches and gaps
? Score: each aligned position: +2; each mismatch/ gap:
-1
AACG
AACG
| | | |
AAGG
AACG
| | |
AACG
ATCG
-
G
GT
GT
A
-
TGC
TGC
score = 8 score = 5 score = 13
Protein Sequence Alignment
? Protein sequence alignment: it is more complex to
measure protein sequence similarity than that of DNA
sequences
– protein sequence alignment: “degree” of similarity
? Each pair of amino acids
您可能关注的文档
- !=yTAx6=0,thenthematrixB=A!1AxyTAhasrankexactlyonelessthantherankofA. Abstract.LetA2Rmndeno.pdf
- $Q^2$ Dependence of the Bjorken Sum Rule.pdf
- (0,1)矩阵矩阵积和式的上下界.pdf
- !Prevention and treatment of protein energy wasting in chronic kidney disease patients.pdf
- (2003 OC) Frequency characteristics and dynamical behaviors of self-modulation in vertical-cavity su.pdf
- (1769-HSC Quick Refence)1769-in031_-en-p.pdf
- (2009-Science)Broadband ground-plane cloak.pdf
- (2011 M)Optimization of Multiple Traveling Salesmen Problem by a Novel Representation.pdf
- (2005-Paik)Comparison of Rifaximin and Lactulose for the Treatment of Hepatic EncephalopathyA Prosp.pdf
- (408分)2014年中央财经大学金融硕士(专业)考研经验分享.pdf
- 2-ModCell Architecture.ppt
- 2-RBCD_MD1CE100_Introduction.pdf
- 2. Benefits of a Storage Appliance..........................................................pdf
- 2.1 Setting Permanent Options................... 4.pdf
- 2.0 - Database security_C.pdf
- 2.9 Predicting gas–liquid flow in a mechanically stirred tank.pdf
- 2._Cas_Apple.pdf
- 2000,气管动力学.pdf
- 2001.The chicken Pdcd4 gene is regulated by v-Myb.pdf
- 2002 中国平脐蠕孢属的分类研究I.pdf
最近下载
- 2025至2030中国农产品批发行业发展趋势分析与未来投资战略咨询研究报告.docx
- 2011一汽马自达8车身维修手册(1).pdf VIP
- 2025大学生广西西部计划考试模拟试题题型(含答案).docx
- 《T/ZGZS 0308-2023废活性炭热处理再生技术规范》.pdf
- 中班数学活动《有趣的排序》ppt课件.pptx VIP
- 海尔BCD-218WDGS使用说明书.pdf
- 2025年海南省新高考生物试卷真题(附答案详解) .pdf VIP
- 《贸易单据审核与制作》课件.ppt VIP
- 《贸易单据制作与流转》课件.ppt VIP
- 2025年黑龙江省职业教育春季高考畜牧兽医类专业技能操作考试大纲.docx VIP
文档评论(0)