0


数据科学与语言的交叉点

At the Intersection of Data Science and Language
课程网址: http://videolectures.net/iswc2016_mckeown_data_science/  
主讲教师: Kathleen McKeown
开课单位: 哥伦比亚大学
开课时间: 2016-11-10
课程语种: 英语
中文简介:
数据科学有望解决许多社会最紧迫的挑战,但许多必要的数据都被锁定在网络上的大量非结构化数据中,包括语言、语音和视频。在这篇演讲中,我将描述数据科学方法是如何在从事实到虚构的连续过程中从语言数据中提取数据的研究项目中使用的。我将根据最近发表的研究文章中提供的信息,提出关于预测以技术术语表示的科学概念未来影响的研究,关于从过去灾难的知识中学习的研究,通过媒体的视角,以及关于使用数据科学来理解主观、个人叙事的研究。在这些项目中,我们将看到从web和语义中提取的数据如何发挥作用。
课程简介: Data science holds the promise to solve many of society’s most pressing challenges, but much of the necessary data is locked within the volumes of unstructured data on the web including language, speech and video. In this talk, I will describe how data science approaches are being used in research projects that draw from language data along a continuum from fact to fiction. I will present research on predicting the future impact of a scientific concept—represented as a technical term—based on the information available in recently published research articles, research on learning from knowledge of past disasters, as seen through the lens of the media and on the use of data science in understanding subjective, personal narratives. In these projects we will see how data drawn from the web and semantics play a role.
关 键 词: 数据科学; 非结构化数据; 研究项目
课程来源: 视频讲座网
数据采集: 2022-12-04:chenjy
最后编审: 2022-12-04:chenjy
阅读次数: 27