This paper presents the key algorithmic techniques behind CatBoost, a new gradient boosting toolkit. Their combination leads to CatBoost outperforming other publicly available boosting implementations in terms of quality on a variety of datasets. Two critical algorithmic advances introduced in CatBoost are the implementation of ordered boosting, a permutation-driven alternative to the classic algorithm, and an innovative algorithm for processing categorical features. Both techniques were created to fight a prediction shift caused by a special kind of target leakage present in all currently existing implementations of gradient boosting algorithms. In this paper, we provide a detailed analysis of this problem and demonstrate that the proposed algorithms solve it effectively, leading to excellent empirical results.
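The abstract does not spell out the categorical-feature algorithm, so the following is only a minimal sketch of the general idea of a permutation-driven (ordered) target statistic, not the toolkit's actual implementation. The function name `ordered_target_statistics`, the parameters `prior` and `prior_weight`, and the smoothing formula are assumptions chosen for illustration. The point it shows is that each example is encoded using only the targets of examples that precede it in a random permutation, so its own target never leaks into its encoding.

```python
import random


def ordered_target_statistics(categories, targets, prior, prior_weight=1.0, seed=0):
    """Encode a categorical column with a permutation-ordered target statistic.

    Hypothetical helper for illustration only. For each example, the encoding is
    a smoothed mean of the targets of same-category examples that come EARLIER
    in a random permutation, so the example's own target is never used.
    """
    n = len(categories)
    order = list(range(n))
    random.Random(seed).shuffle(order)  # random permutation of example indices

    sums = {}    # running sum of targets per category (earlier examples only)
    counts = {}  # running count per category (earlier examples only)
    encoded = [0.0] * n

    for idx in order:
        cat = categories[idx]
        s = sums.get(cat, 0.0)
        c = counts.get(cat, 0)
        # Smoothed mean over the "history" of this example; with no history
        # the value falls back to the prior.
        encoded[idx] = (s + prior_weight * prior) / (c + prior_weight)
        # Only now add this example's target, making it visible to later ones.
        sums[cat] = s + targets[idx]
        counts[cat] = c + 1

    return encoded, order


if __name__ == "__main__":
    cats = ["a", "a", "b", "a"]
    ys = [1.0, 0.0, 1.0, 1.0]
    enc, order = ordered_target_statistics(cats, ys, prior=0.5)
    print(order, enc)
```

A plain (non-ordered) target mean would compute each category's average over all examples, including the one being encoded, which is the kind of target leakage the abstract refers to.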
Author Information
Liudmila Prokhorenkova (Yandex)
Gleb Gusev (Yandex LLC)
Aleksandr Vorobev (Yandex LLC)
Anna Veronika Dorogush (Yandex)
Andrey Gulin (Yandex)
More from the Same Authors
- 2021: Shifts: A Dataset of Real Distributional Shift Across Multiple Large-Scale Tasks
  Andrey Malinin · Neil Band · Yarin Gal · Mark Gales · Alexander Ganshin · German Chesnokov · Alexey Noskov · Andrey Ploskonosov · Liudmila Prokhorenkova · Ivan Provilkov · Vatsal Raina · Vyas Raina · Denis Roginskiy · Mariya Shmatova · Panagiotis Tigas · Boris Yangel
- 2023 Poster: Neural Algorithmic Reasoning Without Intermediate Supervision
  Gleb Rodionov · Liudmila Prokhorenkova
- 2023 Poster: Evaluating Robustness and Uncertainty of Graph Models Under Structural Distributional Shifts
  Gleb Bazhenov · Denis Kuznedelev · Andrey Malinin · Artem Babenko · Liudmila Prokhorenkova
- 2023 Poster: Characterizing Graph Datasets for Node Classification: Homophily-Heterophily Dichotomy and Beyond
  Oleg Platonov · Denis Kuznedelev · Artem Babenko · Liudmila Prokhorenkova
- 2021 Poster: Good Classification Measures and How to Find Them
  Martijn Gösgens · Anton Zhiyanov · Aleksey Tikhonov · Liudmila Prokhorenkova
- 2021 Poster: Overlapping Spaces for Compact Graph Representations
  Kirill Shevkunov · Liudmila Prokhorenkova
- 2021: Shifts Challenge: Robustness and Uncertainty under Real-World Distributional Shift + Q&A
  Andrey Malinin · Neil Band · German Chesnokov · Yarin Gal · Alexander Ganshin · Mark Gales · Alexey Noskov · Liudmila Prokhorenkova · Mariya Shmatova · Vyas Raina · Vatsal Raina · Panagiotis Tigas · Boris Yangel