老兄,卑鄙?推特语言的形式性令人惊讶Dude, srsly?: The Surprisingly Formal Nature of Twitter’s Language |
|
课程网址: | https://videolectures.net/videos/icwsm2013_hu_formal_nature |
主讲教师: | Yuheng Hu |
开课单位: | 信息不详。欢迎您在右侧留言补充。 |
开课时间: | 2014-04-03 |
课程语种: | 英语 |
中文简介: | 推特已经成为事实上的信息共享和交流平台。考虑到影响推特语言的因素——大小限制以及沟通和内容共享机制——关于推特语言在各种既定媒体语言中的地位仍存在持续的争论。这些包括短信和聊天(大小限制)和电子邮件(通信)、博客和报纸(内容共享)。为了提供一种确定这一点的方法,我们提出了一个计算框架,可以深入了解所有这些媒介的语言风格。我们的框架由两部分组成。第一部分基于一组语言特征来量化给定媒介的语言。第二部分介绍了一个灵活的因子分解框架soclin,它借助外部认知和情感知识库对给定的媒介进行心理语言学分析。将这一分析框架应用于来自几个主要媒体的各种语料库,我们收集统计数据,以便通过定量比较研究将推特的语言学与这些其他媒体进行比较。我们提出了几个关键见解:(1)推特的语言比短信和在线聊天更保守,也不那么随意;(2)推特用户似乎正在形成独特的语言风格;(3)推特对时间引用的使用类似于短信和聊天;(4)推特的影响力变化比其他更正式的媒介要小。因此,推特的语言可以被视为一个更正式的语域投影到一个大小受限的空间中。 |
课程简介: | Twitter has become the de facto information sharing and communication platform. Given the factors that influence language on Twitter - size limitation as well as communication and content-sharing mechanisms - there is a continuing debate about the position of Twitter's language in the spectrum of language on various established mediums. These include SMS and chat on the one hand (size limitations) and email (communication), blogs and newspapers (content sharing) on the other. To provide a way of determining this, we propose a computational framework that offers insights into the linguistic style of all these mediums. Our framework consists of two parts. The first part builds upon a set of linguistic features to quantify the language of a given medium. The second part introduces a flexible factorization framework, soclin, which conducts a psycholinguistic analysis of a given medium with the help of an external cognitive and affective knowledge base. Applying this analytical framework to various corpora from several major mediums, we gather statistics in order to compare the linguistics of Twitter with these other mediums via a quantitative comparative study. We present several key insights: (1) Twitter's language is surprisingly more conservative, and less informal than SMS and online chat; (2) Twitter users appear to be developing linguistically unique styles; (3) Twitter's usage of temporal references is similar to SMS and chat; and (4) Twitter has less variation of affect than other more formal mediums. The language of Twitter can thus be seen as a projection of a more formal register into a size-restricted space. |
关 键 词: | 信息共享; 推特语言; 语言风格 |
课程来源: | 视频讲座网 |
数据采集: | 2025-05-08:yuhongrui |
最后编审: | 2025-05-08:yuhongrui |
阅读次数: | 6 |