Analysis & Prediction of New York City Taxi and Uber Demands

Diego Correa; Christian Moyano

doi:10.22201/icat.24486736e.2023.21.5.2074

PDF

Published: Oct 30, 2023

DOI: https://doi.org/10.22201/icat.24486736e.2023.21.5.2074

Keywords:

Large Scale Data Analysis, GPS-enabled Taxi Data, Machine Learning Algorithms, Taxi & Uber demand Prediction, Visual Analytics, New York City.

Diego Correa

University of Azuay

Christian Moyano

Faculty of Science and Technology, University of Azuay, Cuenca, 6 Ecuador

Abstract

Taxi and Uber are an imperative transportation mode in New York City (NYC). This paper investigates the spatiotemporal distribution of pickups of medallion taxi (Yellow), Street Hail Livery Service taxi (Green), and Uber services in NYC, within the five boroughs: Brooklyn, the Bronx, Manhattan, Queens, and Staten Island. Regression Models and Machine Learning algorithms such as XGboost and Random Forest are used to predict the ridership of taxis and Uber dataset combined in NYC, given a time window of one-hour and locations within zip-code areas. The dataset consisting of over 90 million trips within the period April-September 2014, being Yellow with 86% the most used in the city, followed by Green with 9% and Uber with 5%. In outer boroughs, the number of pickups is 12.9 million (14%), while 77.9 million (86%) were made in Manhattan only. Yellow is the predominant option in Manhattan and Queens, while Green is preferred in Brooklyn and Bronx. In Staten Island, the market is shared between the three services. However, Uber presents a highly rising trend of 81% in Manhattan and 145% in outer boroughs during the analysis period. The regression model XGboost performed best because of its exceptional capacity to catch complex feature dependencies. The XGboost model accomplished an estimation of 38.51 for RMSE and 0.97 for R^2. This model could present valuable insights to taxi companies, decision-makers, and city planners in responding to questions, e.g., how to situate taxis where they are generally required, understand how ridership shifts over time, and the total number of taxis needed to dispatch in order to meet de the demand.

How to Cite

Correa, D., & Moyano, C. (2023). Analysis & Prediction of New York City Taxi and Uber Demands. Journal of Applied Research and Technology, 21(5), 886–898. https://doi.org/10.22201/icat.24486736e.2023.21.5.2074

Issue

Vol. 21 No. 5 (2023)

Section

Articles

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Article Sidebar

Main Article Content

Abstract

Article Details