

Recent Advances in Digital Processing of Images and Audio
Course URL: http://videolectures.net/mitworld_malvar_radp/
Lecturer: Henrique S. Malvar
Institution: Microsoft Research
Date: 2014-01-06
Language: English
Course description: Henrique Malvar hopes to impress a tough crowd with “tricks” hot from Microsoft’s research labs. His show-and-tell features the latest magic in sound and image digital signal processing, soon to appear on your PC or laptop. Malvar’s researchers have improved computer microphones so they can isolate a human voice within an acoustic din. They’ve also managed to reduce the echo often encountered on cell phone calls, and to clean up the “mooshy” sound of speakerphones. Another nifty advance from Malvar’s folks: eliminating transmission delays in voice over IP by speeding up or slowing down incoming data packets. Malvar has high expectations for breakthroughs from the image processing labs. “I hope these will affect everybody’s lives,” he says. Malvar unveiled “image stitching” software that can automatically create panoramas from a bunch of unaligned pictures of a common theme. New software can also merge different photographs of the same subject into a single shot by “taking the best part of each one.” Bye-bye red eye. There’s also Malvar’s Big Picture application, which can automatically meld hundreds of photos of some environment “to create a humongous panorama” that one can explore down to the finest resolution. He demonstrated this using a photograph of the entire Seattle cityscape, into which he zoomed to discover a pair of green work gloves lying atop a skyscraper work site. The pièce de résistance was “Photosynth,” recently released on the web, which enables disparate users to piece together “group images that share a context” into a single, three-dimensional image. Imagine hundreds of separate tourist photos of a famous site like St. Peter’s Square, assembled into an organized whole by a digital higher power. Users can navigate through the entire environment, piece by piece or as a unified space. It is “augmented reality,” says Malvar, “the collision of virtual and real worlds. People can go and visit to see things they’ve never seen before.”
Keywords: digital imaging; virtual worlds; three-dimensional images
Source: VideoLectures.NET
Data collected: 2022-12-14:chenjy
Last reviewed: 2022-12-14:chenjy