0


预测结构化输出的集合

Ensembles for predicting structured outputs
课程网址: http://videolectures.net/solomon_kocev_epso/  
主讲教师: Dragi Kocev
开课单位: 约瑟夫·斯特凡学院
开课时间: 2010-02-16
课程语种: 英语
中文简介:
在许多现实世界中,例如生物信息学(功能基因组学),文本分类和图像批注,目标是预测复杂的输出。例如,在功能基因组学中,目标通常是预测基因的功能,而功能集可以组织为树(FunCat)或图(GO本体)。树木。所提出的方法可扩展到大型数据集,不同类型的输出,并且适用于广泛的领域。首先,我们描述通常遇到的结构化输出的类型,然后解释基本分类器预测聚类树(PCT)。接下来,我们讨论了扩展的集成方法(袋装和随机森林)以处理结构化输出并相应地调整投票方案。然后,我们在广泛的现实领域中对该方法进行了实验评估。最后,我们介绍了所提出的方法在功能基因组学中的应用,并表明我们的方法与最新方法具有竞争性。
课程简介: In many real-world domains, such as bioinformatics (functional genomics), text classification and image annotation, the goal is to predict a complex output. For example, in functional genomics, the goal is to predict the function of a gene, while the set of functions can be organized as tree (FunCat) or graph (GO ontology). In this talk, we present an approach for predicting structured outputs using ensembles of trees. The proposed approach is scalable to large datasets, different types of outputs and it is applicable to wide range of domains. First, we describe the types of structured outputs that we typically encounter, and then we explain the base classifiers - predictive clustering trees (PCTs). Next, we discuss the ensemble methods that we extended (bagging and random forests) to deal with structured outputs and accordingly adapted the voting schemes. Afterwards, we present experimental evaluation of the proposed approach on wide range of real-world domains. At the end, we present an application of the proposed approach in functional genomics and show that our approach is competitive with state-of-the-art approaches.
关 键 词: 功能基因组学; 预测基因; 预测聚类树
课程来源: 视频讲座网
最后编审: 2019-09-22:cwx
阅读次数: 35