0


用于理解图像的概率模型

Probabilistic models for understanding images
课程网址: http://videolectures.net/icml08_winn_pmui/  
主讲教师: John Winn
开课单位: 剑桥大学
开课时间: 2008-07-24
课程语种: 英语
中文简介:
由于影响成像过程的众多可变性来源,让计算机理解图像是一项挑战。典型照片的像素取决于场景类型和几何体、场景中存在的对象的数量、形状和外观、它们的3D位置和方向,以及遮挡、着色和阴影等效果。好消息是,对物理学和计算机图形学的研究让我们详细了解了这些变量如何影响生成的图像。这种理解可以帮助我们将正确的先验知识构建到图像的概率模型中。理论上,建立一个包含所有这些知识的模型可以解决图像理解问题。在实践中,这样的模型对于当前的推理方法来说是难以处理的。机器学习和机器视觉研究人员面临的挑战是创建一个尽可能准确地捕捉成像过程的模型,同时保持可追踪性以进行准确推断。为了说明这一挑战,我将展示如何将成像过程的不同方面结合到对象检测和分割模型中,并讨论在这些模型中使推理易于处理的技术。
课程简介: Getting a computer to understand an image is challenging due to the numerous sources of variability that influence the imaging process. The pixels of a typical photograph will depend on the scene type and geometry, the number, shape and appearance of objects present in the scene, their 3D positions and orientations, as well as effects such as occlusion, shading and shadows. The good news is that research into physics and computer graphics has given us a detailed understanding of how these variables affect the resulting image. This understanding can help us to build the right prior knowledge into our probabilistic models of images. In theory, building a model containing all of this knowledge would solve the image understanding problem. In practice, such a model would be intractable for current inference methods. The open challenge for machine learning and machine vision researchers is to create a model which captures the imaging process as accurately as possible, whilst remaining tractable for accurate inference. To illustrate this challenge, I will show how different aspects of the imaging process can be incorporated into models for object detection and segmentation, and discuss techniques for making inference tractable in such models.
关 键 词: 成像过程; 可变性来源; 概率模型
课程来源: 视频讲座网
数据采集: 2022-12-12:chenjy
最后编审: 2022-12-12:chenjy
阅读次数: 16