0


处理社交媒体数据:我们能绕过巴别塔吗?

Processing social media data: can we circumvent the Tower of Babel?
课程网址: https://videolectures.net/videos/solomon_ljubesic_tower_of_babel  
主讲教师: Nikola Ljubešić
开课单位: 信息不详。欢迎您在右侧留言补充。
开课时间: 2014-08-14
课程语种: 英语
中文简介:
众所周知,社交媒体是各个研究领域的多样化和丰富的信息来源。然而,由于用户的语言和文化多样性,它们带来了一系列处理挑战。使用标准语言技术处理社交媒体文本的错误率远高于标准文本。此外,研究人员经常需要额外的用户数据,比如他们的社会人口信息。在演讲的第一部分,我将介绍一系列用于处理不同语言生成的技术调整,而在第二部分,我会概述一些与语言无关的用户分析实验,如用户类型识别和性别预测。
课程简介: Social media are known to be a diverse and rich source of information for various areas of research. However, they pose a series of processing challenges due to the linguistic and cultural diversity of their users. Processing social media texts with standard language technologies has an error rate much higher than that on standard texts. Furthermore, researchers are regularly in need of additional user data like their sociodemographic information. In the first part of my talk I will present a series of technology adaptations for processing varying language production, while in the second part I will overview some experiments on language-independent user profiling such as user type identification and gender prediction.
关 键 词: 社交媒体; 标准文本; 用户分析实验
课程来源: 视频讲座网
数据采集: 2025-04-23:yuhongrui
最后编审: 2025-04-23:yuhongrui
阅读次数: 4