Education
A Quick Guide to Large-Scale Genomic Data Mining
Curtis Huttenhower*, Oliver Hofmann
Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, United States of America
Introduction

For the first several hundred years of research in cellular biology, the main bottleneck to scientific progress was data collection. Our newfound data-richness, however, has shifted this bottleneck from collection to analysis [1]. While a variety of options exists for examining any one experimental dataset, we are still discovering what new biological questions can be answered by mining thousands of ...

... coexpress in high-throughput data repositories? Under what experimental conditions, or in which tissues?

Bringing large quantities of genomic data to bear on such questions involves three main tasks: establishing methodology for efficiently querying large data collections; assembling data from appropriate repositories; and integrating information from a variety of experimental data types.

... as great or greater a concern as data processing: plain text or XML storage formats, while conveniently human-readable, can waste unsustainable amounts of space for large repositories.

Solutions to these technical issues include software and data access methodologies specifically tailored to large-scale data manipulation. Three broad categories of solutions exist: Web applications that aggregate information from multiple ...
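
The storage concern raised in the text (human-readable plain text taking more space than compact encodings) can be illustrated with a minimal sketch. It is not taken from the article: the matrix of simulated expression values, the helper names `text_size` and `binary_size`, and the choice of packed 32-bit floats as the compact format are all assumptions made for illustration. Note that 32-bit floats keep roughly seven significant digits, so the binary form trades some precision for space.

```python
import random
import struct


def text_size(matrix):
    """Bytes needed to store the matrix as tab-delimited plain text."""
    lines = ("\t".join(f"{v:.6f}" for v in row) for row in matrix)
    return len("\n".join(lines).encode("ascii"))


def binary_size(matrix):
    """Bytes needed to store the matrix as packed little-endian 32-bit floats."""
    return sum(len(struct.pack(f"<{len(row)}f", *row)) for row in matrix)


# Hypothetical data: 1000 genes x 100 conditions of simulated expression values.
random.seed(0)
matrix = [[random.gauss(0.0, 1.0) for _ in range(100)] for _ in range(1000)]

t = text_size(matrix)
b = binary_size(matrix)
print(f"text: {t} bytes, binary: {b} bytes, ratio: {t / b:.2f}")
```

Each value costs a fixed 4 bytes in the packed form, whereas the text form spends a byte per printed character plus a delimiter, so the gap widens as more digits are written out. XML markup would add tag overhead on top of the text cost.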