An ensemble learning approach for the Kaggle taxi travel time prediction challenge

Authors Thomas Hoch
Editors A. Martínez-Usó
J. Mendes-Moreira
L. Moreira-Matias
M. Kull
N. Lachiche
Title An ensemble learning approach for the Kaggle taxi travel time prediction challenge
Book title Proceedings of the ECML/PKDD 2015 Discovery Challenges
Type In conference proceedings
Publisher CEUR
Series CEUR Workshop Proceedings
Volume 1526
Month January
Year 2016
Pages http://ceur-ws.org/Vol-1526/paper22.pdf
SCCH ID# 1547
Abstract

This paper describes the winning solution to the Taxi Trip Time Prediction Challenge run by Kaggle.com. The goal of the competition was to build a predictive framework that can predict the final destination and the total traveling time of taxi rides based on their (initial) partial trajectories. The available data consist of all trips made by 442 taxis operating in the city of Porto over one year. The presented solution consists of an ensemble of expert models combined with a spatial clustering approach. The base classifiers consist of Random Forest Regressors, whereas the expert models for each test trip were based on a combination of gradient boosting and random forest. The paper shows how these models can be combined to generate accurate predictions of the remaining traveling time of a taxi.