
Learning from Partially Annotated Sequences
Course URL: http://videolectures.net/ecmlpkdd2011_brefeld_annotated/  
Lecturer: Ulf Brefeld
Institution: Leuphana University
Date: 2011-11-30
Language: English
Course description: We study sequential prediction models in cases where only fragments of the sequences are annotated with the ground truth. The task does not match the standard semi-supervised setting and is highly relevant in areas such as natural language processing, where completely labeled instances are expensive and require editorial data. We propose to generalize the semi-supervised setting and devise a simple transductive loss-augmented perceptron to learn from inexpensive partially annotated sequences that could, for instance, be provided by laymen, the wisdom of the crowd, or even automatically. Experiments on mono- and crosslingual named entity recognition tasks with automatically generated partially annotated sentences from Wikipedia demonstrate the effectiveness of the proposed approach. Our results show that learning from partially labeled data is never worse than standard supervised and semi-supervised approaches trained on data with the same ratio of labeled and unlabeled tokens.
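Keywords: sequential prediction models; sequence fragments; editorial data; data processing; semi-supervised setting
Source: VideoLectures.NET open course
Last reviewed: 2019-05-26:cwx
Views: 58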
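
Illustrative sketch (not taken from the lecture): the course description mentions a transductive loss-augmented perceptron for partially annotated sequences. The Python below is one possible reading of that idea, under several assumptions that are ours rather than the lecturer's: a first-order chain model, one integer feature id per token, a Hamming cost of 1 per disagreeing token, and unit step size. The names viterbi and train_step are hypothetical helpers.

```python
def viterbi(emit, trans, fixed=None):
    """Best label path under emit[t][y] + trans[a][b] scores.

    fixed[t] is a label index where token t is annotated, or None where it
    is not; annotated positions are forced to their given label.
    """
    n, k = len(emit), len(trans)
    fixed = fixed or [None] * n
    neg_inf = float("-inf")

    def allowed(t, y):
        return fixed[t] is None or fixed[t] == y

    score = [[neg_inf] * k for _ in range(n)]
    back = [[0] * k for _ in range(n)]
    for y in range(k):
        if allowed(0, y):
            score[0][y] = emit[0][y]
    for t in range(1, n):
        for y in range(k):
            if not allowed(t, y):
                continue
            prev = max(range(k), key=lambda a: score[t - 1][a] + trans[a][y])
            score[t][y] = score[t - 1][prev] + trans[prev][y] + emit[t][y]
            back[t][y] = prev
    # Follow back-pointers from the best final label.
    y = max(range(k), key=lambda a: score[n - 1][a])
    path = [y]
    for t in range(n - 1, 0, -1):
        y = back[t][y]
        path.append(y)
    return path[::-1]


def train_step(x, fixed, W, trans):
    """One perceptron update on a partially annotated sequence (assumed form).

    x: feature id per token; W[y][f]: emission weight; trans[a][b]: transition.
    """
    n, k = len(x), len(trans)
    emit = [[W[y][f] for y in range(k)] for f in x]
    # Pseudo-reference: best path that agrees with the available annotations.
    target = viterbi(emit, trans, fixed)
    # Loss-augmented decoding: reward labels that disagree with the reference.
    aug = [[emit[t][y] + (y != target[t]) for y in range(k)] for t in range(n)]
    pred = viterbi(aug, trans)
    if pred != target:
        for t in range(n):
            W[target[t]][x[t]] += 1.0
            W[pred[t]][x[t]] -= 1.0
            if t > 0:
                trans[target[t - 1]][target[t]] += 1.0
                trans[pred[t - 1]][pred[t]] -= 1.0
    return target, pred
```

In this sketch, unannotated tokens are filled in by the current model itself (the transductive part), so a sentence with only a few labeled tokens can still drive an update.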