0


窥视 A/B 测试:为什么它很重要,以及如何应对它

Peeking at A/B Tests: Why it matters, and what to do about it
课程网址: http://videolectures.net/kdd2017_walsh_peeking_tests/  
主讲教师: David Walsh
开课单位: 斯坦福大学
开课时间: 2017-10-09
课程语种: 英语
中文简介:
本文报告了新颖的统​​计方法,该方法已由商业 A/B 测试平台 Optimizely 部署,用于向客户传达实验结果。我们的方法解决了传统 p 值和置信区间给出不可靠推断的问题。这是因为众所周知,A/B 测试软件的用户会在实验运行时持续监控这些测量值。我们提供始终有效的 p 值和置信区间,经证明对于这种效果是稳健的。这不仅使用户能够安全地持续监控,而且使她能够更有效地检测真实效果。本文对 Optimizely 的数据进行了模拟和数值研究,证明了检测性能相对于传统方法的改进。
课程简介: This paper reports on novel statistical methodology, which has been deployed by the commercial A/B testing platform Optimizely to communicate experimental results to their customers. Our methodology addresses the issue that traditional p-values and confidence intervals give unreliable inference. This is because users of A/B testing software are known to continuously monitor these measures as the experiment is running. We provide always valid p-values and confidence intervals that are provably robust to this effect. Not only does this make it safe for a user to continuously monitor, but it empowers her to detect true effects more efficiently. This paper provides simulations and numerical studies on Optimizely's data, demonstrating an improvement in detection performance over traditional methods.
关 键 词: 统​​计方法; A/B 测试软件; 数据科学
课程来源: 视频讲座网
数据采集: 2023-12-25:wujk
最后编审: 2023-12-25:wujk
阅读次数: 11