医学数据分析方法Methodology for data analysis in medical sciences |
|
课程网址: | http://videolectures.net/promo_analyzing_medical_data/ |
主讲教师: | Janez Stare |
开课单位: | 卢布尔雅那大学 |
开课时间: | 2011-09-23 |
课程语种: | 英语 |
中文简介: | 我们研究项目的主题是发现医学数据中实际或可能的模式、趋势和关联的方法。这些是发现新知识或产生可能导致知识的新假设的方法。我们正在分析的数据主要来自研究,但也来自医学的常规实践。近年来,我们特别关注从书目数据生成假设的方法。我们也在谨慎地扩大我们的研究范围,目前在平滑肌电刺激和相关的电生理学领域。简言之,我们的研究可分为三个子领域: 生物统计学 科学计量学 文献数据库中的数据挖掘 我们在生物统计学方面的研究重点是生存分析的回归模型,特别是Cox模型。除了解释变异、预后价值和脆弱性(到目前为止,这些一直是我们与Cox模型相关的主要主题)之外,我们将在未来五年内集中精力研究时变系数,并测试特定的替代假设,如交叉危险。目前可用的测试并没有将后来的情况与无效假设区分开来。除了考克斯模型之外,我们还将继续深入研究相对生存领域,我们最近开发了一种全新的方法。我们还将把研究范围扩大到生存分析之外。我们将研究评估logistic回归模型拟合优度的方法。目前的方法基于单元分组,这有几个缺点。我们的方法将基于应用随机过程理论的结果,特别是布朗运动。 一般而言,科学计量学研究相对较新,而斯洛文尼亚几乎不存在这方面的研究。如果没有足够的书目数据库,几乎不可能进行这样的研究,因此斯洛文尼亚生物医学对我们至关重要。另一个不可或缺的工具是我们开发的自动引文分析系统。第三个关键因素是选择适当的指标,这也是我们多年来的经验领域。只有将这三个组成部分结合起来,才有可能开展研究评估工作,尽管方法问题并没有就此结束。也就是说,所有不同的书目数据库都以一种防止使用标准数据分析方法的方式进行组织。因此,我们开发了一个基于OLAP(联机分析处理)方法的系统,该方法将书目数据库中的数据转换为多维正交结构,然后可以通过常用(统计)方法对其进行分析。我们的两名员工最近在该领域的领先杂志《科学计量学》上发表了一篇关于这一点的文章。在未来五年中,我们将主要关注斯洛文尼亚医学的研究趋势,以及作者数量、机构间合作、作者引用历史和其他因素对出版物影响的影响。 书目数据库中的数据挖掘是浏览书目数据库的一种新方法。到目前为止,我们已经开发了一个支持生物医学发现的系统。该系统帮助研究人员创建新的假设,然后可以使用已建立的研究方法对其进行测试。我们的方法将假设视为尚未在科学文献中发表的生物医学概念之间的关系。该系统的核心是Medline书目数据库,该数据库与当前版本中的LocusLink、HUGO、OMIM和UniGene遗传数据库相结合。这使得该系统对于发现遗传学领域的新关系特别有用,例如预测新疾病的候选基因。 |
课程简介: | The topic of our research program is methodology for discovering actual or possible patterns, trends and assotiations in medical data. These are the methods for discovering new knowledge or generating new hypotheses that may lead to knowledge. The data that we are analysing manly arise from research, but also from routine practice in medicine. In the recent years, we are paying special attention to methods for generating hypotheses from bibliographic data. We are also cautiously expanding the scope of our research, presently into the field of electric stimulation of smooth muscles and associated electromiography. In brief, our research can be divided into three sub-fields: Biostatistics Scientometrics Data mining in v bibliographic databases The focus of our research in biostatistics is on regression models for survival analysis, especially the Cox model. In addition to explained variation, prognostic value and frailties, which have so far been our main topics related to the Cox model, we will concentrate our efforts on time-varying coefficients and testing specific alternative hypotheses, such as crossing hazards, during the forthcomming five-year period. The presently available tests do not distinguish the later situations from the null hypothesis. Beside the Cox model, we will continue studying intensely the field of relative survival, where we have recently developed an entirely new method. We will also extend the scope of our research outside survival analysis. We will investigate the methods for assessing goodness-of-fit of the logistic regression model. The present approaches are based on unit grouping, which has several disadvantages. Our approach will be based on application of results from the theory of stochastic processes, especially Brownian motion. Research in scientometrics is relatively new in general, while it is virtually nonexistent in Slovenia. It is almost impossible to conduct such research without an adequate bibliographic database, so the Biomedicina Slovenica is of fundamental importance for us. Another indispensable tool is a system for automated citation analysis, which we have also developed. The third key factor is selection of appropriate indicators, which has also been a field of our experiseje for a number of years. It is only the combination of these three components that gives one with the possibility to work on research evaluation, even though the methodological problems do not end there. Namely, all the various bibliographic databases are organised in a way that prevents the usage of standard data-analytic approaches. Hence, we have developed a system based on OLAP (On Line Analytical Processing) methodology that transforms data from bibliographic databases into a multidimensional orthogonal structure, which can then be analysed by means of the usual (statistical) methods. Two of our staff recently published an article on this in Scientometrics, the leading journal in the field. During the next five years, we will be mainly interested in research trends in Slovene medicine, as well as the ifluence of the number of authors, inter-institutional co-operation, authors' citation history and other factors on the impact of publications. Data mining in bibliographic databases is a novel approach to browsing such databases. So far, we have developed a system for supporting biomedical discovery. The system aids researchers in creating new hypotheses, which can then be tested using the established research methods. Our approach treats hypotheses as relations between biomedical concepts that have not been published in the scientific literature yet.The core of the system is the Medline bibliographic database, which is joined with the LocusLink, HUGO, OMIM and UniGene genetic databases in the present version. This makes the system particularly useful for discovering new relations in the field of genetics, such as predicting candidate genes for a new disease. |
关 键 词: | 医学数据; 平滑肌电刺激; 电生理学领域 |
课程来源: | 视频讲座网 |
数据采集: | 2021-12-25:zkj |
最后编审: | 2021-12-25:zkj |
阅读次数: | 74 |