A mixture of multinomial logits (MMNL) generalizes the single logit model, which is commonly used to predict the probabilities of different outcomes. While many algorithms have been developed in the literature to learn MMNL models, theoretical results are limited. Building on the Frank-Wolfe (FW) method, we propose a new algorithm that learns both the mixture weights and the component-specific logit parameters, with provable convergence guarantees for an arbitrary number of mixture components. Our algorithm uses historical choice data to generate a set of candidate choice probability vectors, each of which is close to the ground truth with high probability. We further provide a sample complexity analysis showing that only a polynomial number of samples is required to secure the performance guarantee of our algorithm. Finally, we conduct simulation studies to evaluate its performance and demonstrate how to apply our algorithm to real-world applications.
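To make the model class concrete, the following is a minimal sketch of how MMNL choice probabilities can be computed: each component is an ordinary multinomial logit (a softmax over linear utilities), and the mixture is their weighted average. It illustrates the model only, not the Frank-Wolfe learning algorithm proposed in the paper. The function names, the feature matrix, the weights, and the component parameters are all hypothetical values chosen for illustration.

```python
import numpy as np


def mnl_probs(features, beta):
    """Choice probabilities of a single multinomial logit.

    features: (n_alternatives, n_features) array, one row per alternative.
    beta:     (n_features,) logit parameter vector.
    """
    utilities = features @ beta
    utilities = utilities - utilities.max()  # shift for numerical stability
    exp_u = np.exp(utilities)
    return exp_u / exp_u.sum()


def mmnl_probs(features, weights, betas):
    """Choice probabilities of a mixture of multinomial logits.

    weights: mixture weights, nonnegative and summing to one.
    betas:   one logit parameter vector per mixture component.
    """
    weights = np.asarray(weights, dtype=float)
    components = np.stack([mnl_probs(features, b) for b in betas])
    return weights @ components  # weighted average of component probabilities


# Hypothetical example: three alternatives, two features, two components.
features = np.array([[1.0, 0.0],
                     [0.0, 1.0],
                     [1.0, 1.0]])
weights = [0.6, 0.4]
betas = [np.array([1.0, -1.0]), np.array([-0.5, 2.0])]

probs = mmnl_probs(features, weights, betas)
print(probs)
```

Because every component returns a valid probability vector and the weights sum to one, the mixture output is itself a probability vector over the alternatives. With a single component of weight one, the function reduces to the plain logit model.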
Author Information
Yiqun Hu (Massachusetts Institute of Technology / AWS AI Labs)
David Simchi-Levi (MIT)
Zhenzhen Yan (Nanyang Technological University)
More from the Same Authors
- 2021: Offline Reinforcement Learning: Fundamental Barriers for Value Function Approximation
  Dylan Foster · Akshay Krishnamurthy · David Simchi-Levi · Yunzong Xu
- 2023 Poster: Non-stationary Experimental Design under Linear Trends
  David Simchi-Levi · Chonghuan Wang · Zeyu Zheng
- 2023 Poster: Stochastic Multi-armed Bandits: Optimal Trade-off among Optimality, Consistency, and Tail Risk
  Feng Zhu · Zeyu Zheng · David Simchi-Levi
- 2022: Importance of Synthesizing High-quality Data for Text-to-SQL Parsing
  Yiyun Zhao · Jiarong Jiang · Yiqun Hu · Wuwei Lan · Henghui Zhu · Anuj Chauhan · Hanbo Li · Lin Pan · Jun Wang · Chung-Wei Hang · Sheng Zhang · Mingwen Dong · Joseph Lilien · Patrick Ng · Zhiguo Wang · Vittorio Castelli · Bing Xiang
- 2022 Poster: A Simple and Optimal Policy Design for Online Learning with Safety against Heavy-tailed Risk
  David Simchi-Levi · Zeyu Zheng · Feng Zhu
- 2022 Poster: Context-Based Dynamic Pricing with Partially Linear Demand Model
  Jinzhi Bu · David Simchi-Levi · Chonghuan Wang
- 2021: Contributed Talk 3: Offline Reinforcement Learning: Fundamental Barriers for Value Function Approximation
  Yunzong Xu · Akshay Krishnamurthy · David Simchi-Levi
- 2019 Poster: Phase Transitions and Cyclic Phenomena in Bandits with Switching Constraints
  David Simchi-Levi · Yunzong Xu
- 2018 Poster: The Lingering of Gradients: How to Reuse Gradients Over Time
  Zeyuan Allen-Zhu · David Simchi-Levi · Xinshang Wang