0


大量的数据,数字化科学和可重复性的结果

Massive data, the digitalization of science, and reproducibility of result
课程网址: http://videolectures.net/cerncolloquium_stodden_result/  
主讲教师: Victoria Stodden
开课单位: 耶鲁大学
开课时间: 2012-03-09
课程语种: 英语
中文简介:
随着科学企业越来越具有计算能力和数据驱动能力,所传达信息的性质必须改变。如果不将代码和数据与已公布的计算结果结合起来,我们将在科学界引发一场信誉危机。诸如Climategate,杜克大学正在研究的基于微阵列的药物敏感性临床试验,以及由于未经验证的代码而从著名期刊上撤回的争论表明,我们的计算科学需要更大的透明度。在这篇文章中,我认为科学方法可以恢复到:(1)把重点放在错误控制上,作为科学传播的中心;(2)完全传播产生结果的基本方法,即再现性。我根据最近的调查工作(Stodden 2010)概述了实现这些目标的障碍,并提出了一些解决方案,如“可复制的研究标准”(Stodden 2009),为科学家提供开放式许可选项,以创建符合长期科学规范的知识产权框架。
课程简介: As the scientific enterprise becomes increasingly computational and data-driven, the nature of the information communicated must change. Without inclusion of the code and data with published computational results, we are engendering a credibility crisis in science. Controversies such as ClimateGate, the microarray-based drug sensitivity clinical trials under investigation at Duke University, and retractions from prominent journals due to unverified code suggest the need for greater transparency in our computational science. In this talk I argue that the scientific method be restored to (1) a focus on error control as central to scientific communication and (2) complete communication of the underlying methodology producing the results, ie. reproducibility. I outline barriers to these goals based on recent survey work (Stodden 2010), and suggest solutions such as the “Reproducible Research Standard” (Stodden 2009), giving open licensing options designed to create an intellectual property framework for scientists consonant with longstanding scientific norms.
关 键 词: 科学研究; 统计; 重复性结果
课程来源: 视频讲座网
最后编审: 2021-01-27:chenxin
阅读次数: 43