首页自然科学
   首页数学
   首页物理学
0


论计算思维、推理思维与数据科学

On Computational Thinking, Inferential Thinking and Data Science
课程网址: http://videolectures.net/BIDSAconference2016_jordan_computational...  
主讲教师: Michael I. Jordan
开课单位: 加州大学
开课时间: 2016-11-28
课程语种: 英语
中文简介:
科学技术中数据集的规模和范围的迅速增长,使得人们对融合了推理科学和计算科学的数据分析的新的基础观点产生了需求。这些领域的经典观点不足以解决“大数据”中新出现的问题,这一点从它们在初级层次上急剧分化的性质来看是显而易见的——在计算机科学中,数据点数量的增长是“复杂性”的来源,必须通过算法或硬件加以控制,而在统计学中,增长是“复杂性”的来源数据点的数量是“简单性”的一个来源,因为推断通常更强,并且可以调用渐近结果。在形式层面上,由于核心统计理论中“运行时”等计算概念的作用缺失,以及核心计算理论中“风险”等统计概念的作用缺失,这一差距明显。我提出了几个旨在连接计算和统计的研究案例,包括隐私和通信约束下的推理问题,以及权衡推理速度和准确性的方法。
课程简介: The rapid growth in the size and scope of datasets in science and technology has created a need for novel foundational perspectives on data analysis that blend the inferential and computational sciences. That classical perspectives from these fields are not adequate to address emerging problems in "Big Data" is apparent from their sharply divergent nature at an elementary level---in computer science, the growth of the number of data points is a source of "complexity" that must be tamed via algorithms or hardware, whereas in statistics, the growth of the number of data points is a source of "simplicity" in that inferences are generally stronger and asymptotic results can be invoked. On a formal level, the gap is made evident by the lack of a role for computational concepts such as "runtime" in core statistical theory and the lack of a role for statistical concepts such as "risk" in core computational theory. I present several research vignettes aimed at bridging computation and statistics, including the problem of inference under privacy and communication constraints, and methods for trading off the speed and accuracy of inference.
关 键 词: 计算; 统计; 数学思维
课程来源: 视频讲座网
数据采集: 2020-11-09:yxd
最后编审: 2020-11-09:zyk
阅读次数: 41