通过机器学习进行代谢物鉴定和分子指纹预测Metabolite identification and molecular fingerprint prediction via machine learning |
|
课程网址: | http://videolectures.net/mlsb2012_heinonen_metabolite/ |
主讲教师: | Markus Heinonen |
开课单位: | 赫尔辛基大学 |
开课时间: | 2012-10-23 |
课程语种: | 英语 |
中文简介: | **动机:**来自串联质谱的代谢物鉴定是代谢组学中的一个重要问题,是后续代谢建模和网络分析的基础。然而,目前该任务要求将观察到的光谱与来自类似设备的参考光谱数据库相匹配,并且紧密匹配操作参数,这是在公共存储库中完全满足的条件。此外,缺乏对参考数据库中不存在的分子识别的计算支持。最近组装大型公共质量光谱数据库(如MassBank)的努力为开发新的代谢物鉴定方法打开了大门。结果:**我们引入了一种新的框架,用于预测分子特征和从串联质量中鉴定代谢物。使用支持向量机(SVM)的机器学习的光谱。我们的方法是首先从显着串联质谱信号中预测未知代谢物的大量分子特性,并在第二步中使用预测特性与大分子数据库匹配,例如PubChem。我们证明了几种分子特性可以被预测为高精度,并且它们可用于从头代谢物鉴定,其中参考数据库不包含相同分子的分析物。\\ |
课程简介: | **Motivation:** Metabolite identification from tandem mass spectra is an important problem in metabolomics, underpinning subsequent metabolic modelling and network analysis. Yet, currently this task requires matching the observed spectrum against a database of reference spectra originating from similar equipment and closely matching operating parameters, a condition that is rarely satisfied in public repositories. Furthermore, the computational support for identification of molecules not present in reference databases is lacking. Recent efforts in assembling large public mass spectral databases such as MassBank have opened the door for the development of a new genre of metabolite identification methods.\\ **Results:** We introduce a novel framework for prediction of molecular characteristics and identification of metabolites from tandem mass spectra using machine learning with the support vector machine (SVM). Our approach is to first predict a large set of molecular properties of the unknown metabolite from salient tandem mass spectral signals, and in the second step to use the predicted properties for matching against large molecule databases, such as PubChem. We demonstrate that several molecular properties can be predicted to high accuracy, and that they are useful in de novo metabolite identification, where the reference database does not contain any spectra of the same molecule.\\ |
关 键 词: | 串联质谱; 代谢组学; 网络分析 |
课程来源: | 视频讲座网 |
最后编审: | 2021-12-22:liyy |
阅读次数: | 75 |