越简单越好:基于大规模在线预测出租车原始需求的统一方法The Simpler The Better: A Unified Approach to Predicting Original Taxi Demands based on Large-Scale Online |
|
课程网址: | http://videolectures.net/kdd2017_ye_online_platforms/ |
主讲教师: | Jieping Ye |
开课单位: | 密歇根大学 |
开课时间: | 2017-10-09 |
课程语种: | 英语 |
中文简介: | 出租车呼叫应用程序越来越受欢迎,因为它们能高效地将闲置出租车调度给有需要的乘客。为了准确平衡出租车的供需,在线出租车平台必须预测单位原始出租车需求(UOTD),即每单位时间(如每小时)和每单位地区(如每个POI)提交的出租车呼叫需求数量。UOTD的预测对于大型工业在线出租车平台来说是不平凡的,因为准确性和灵活性都至关重要。GBRT和深度学习等复杂的非线性模型通常是准确的,但在场景变化后(例如,由于新法规而产生的额外约束),劳动密集型模型的重新设计是必不可少的。为了准确预测UOTD,同时对场景变化保持灵活性,我们提出了LinUOTD,这是一个具有超过2亿个维度特征的统一线性回归模型。简单的模型结构消除了重复建模的需要。 |
课程简介: | Taxi-calling apps are gaining increasing popularity for their efficiency in dispatching idle taxis to passengers in need. To precisely balance the supply and the demand of taxis, it is essential for the online taxicab platforms to predict Unit Original Taxi Demands (UOTD), which refers to the number of taxi-calling requirements submitted per unit time (e.g. every hour) and per unit region (e.g. each POI). Prediction of UOTD is non-trivial for large-scale industrial online taxicab platforms because both accuracy and flexibility are essential. Complex non-linear models such as GBRT and deep learning are generally accurate, yet labor-intensive model redesign is indispensable after scenario changes (e.g. extra constraints due to new regulations). To accurately predict UOTD while remaining flexible to scenario changes, we propose LinUOTD, a unified linear regression model with more than 200 million dimensions of features. The simple model structure eliminates the need of repeatedly model redesign, while the high-dimensional features contribute to accurate UOTD prediction. Furthermore, we design a series of optimization techniques for efficient model training and updating. Evaluations on two large-scale datasets from an industrial online taxicab platform verify that LinUOTD outperforms popular non-linear models in accuracy. We envision our experiences to adopt simple linear models with high-dimensional features in UOTD prediction as a pilot study and can shed insights upon other industrial large-scale spatio-temporal prediction problems. |
关 键 词: | 在线平台; 非线性模型; 需求预测 |
课程来源: | 视频讲座网 |
数据采集: | 2023-12-25:wujk |
最后编审: | 2024-01-22:liyy |
阅读次数: | 24 |