Analyzing rideshare data

栏目: IT技术 · 发布时间: 5年前

内容简介:Over the past decade, ride-hailing services such as Uber, Lyft, Gett, Juno, and Via have revolutionized the transportation space. However, profitability for drivers varies across all these platforms in complicated ways. Drivers are forced to make mental ca

The process of building the quantitative backbone for an app

Feb 9 ·6min read

Analyzing rideshare data

Original app description

Over the past decade, ride-hailing services such as Uber, Lyft, Gett, Juno, and Via have revolutionized the transportation space. However, profitability for drivers varies across all these platforms in complicated ways. Drivers are forced to make mental calculations based on the time of day, surge rate, wait time to pick up a rider, etc. for each driving service, and choose to ride for a given service at any point based on the optimization of such factors. As a result of this difficult optimization problem — given the number of potential factors as well as the many unknowns — the driver has no certainty that they have chosen the most profitable service with which to ride. Ride King eliminates such uncertainty by pooling historical and to-date data for each driving service based on the driver’s location as well as the time at which the driver is looking to drive. The ultimate goal with Ride King is to eventually extend this optimization service to riders, such that they may be able to select the cheapest rideshare company with which to ride at any given point. The convergence of the respective optimizations for riders and drivers will hopefully detract enough from each rideshare service to engender a better sense of transparency and/or greater payouts for drivers.

Ride King: Ride Intelligently

A look at the raw data

The data collection for this app idea was by no means straightforward. To avoid having their information used for competitive purposes, most rideshare companies will do everything in their power to protect their data. However, with some deliberation, we can actually discover a couple backdoor methods to access these data. All rideshare apps in NYC are under the Taxi and Limousine Commission (TLC) and are required by law to submit data on every ride that occurs. With some investigation, the keen observer will find that TLC is part of the NYC Open Data initiative, which was established to increase transparency in state and city affairs. While the data these rideshare companies do provide are incredibly shallow (minimal information per ride), we can still pull quite a bit of information from them. To start us off, however, I‘m using Uber data from April 2014 (provided generously by the company, fivethirtyeight ) that have a slightly different format to the TLC dataset.

Analyzing rideshare data

Example rows from fivethirtyeight Uber NYC April 2014 dataset

From a quick glance, we can see that each row in this set provides the date, time, location of pickup (in geographic coordinates), and ‘base’. We will get to the last column a bit later—for now, let’s just look at the raw data. Please note that, while we could just plot the raw geographic coordinates, I’ve actually overlaid NYC neighborhood boundary data that was generously made public by Zillow. This now has added another dimension to our data!

Analyzing rideshare data

All Uber pickups from April 2014 in mid/lower Manhattan

Okay, cool. So we can now see all the Uber pickups from April 2014 in Manhattan (zoomed, for readability) with neighborhood boundaries and all! Since the goal of this project is to try and predict the profitability of a rideshare service based on the time of day and location, we’ve only scratched the surface so far, but we can already see that rides are definitely not distributed evenly!


以上就是本文的全部内容,希望对大家的学习有所帮助,也希望大家多多支持 码农网

查看所有标签

猜你喜欢:

本站部分资源来源于网络,本站转载出于传递更多信息之目的,版权归原作者或者来源机构所有,如转载稿涉及版权问题,请联系我们

超简单!一学就懂的互联网金融

超简单!一学就懂的互联网金融

视觉图文 / 人民邮电出版社 / 2015-2-1 / 45.00元

零基础、全图解,通过130多个精辟的知识点、220多张通俗易懂的逻辑图表,让您一书在手,即可彻底看懂、玩转互联网金融从菜鸟成为达人,从新手成为互联网金融高手! 本书主要特色:最简洁的版式+最直观的图解+最实用的内容。 本书细节特色:10章专题内容详解+80多个特别提醒奉献+130多个知识点讲解+220多张图片全程图解,深度剖析互联网金融的精华之处,帮助读者在最短的时间内掌握互联网金融知......一起来看看 《超简单!一学就懂的互联网金融》 这本书的介绍吧!

CSS 压缩/解压工具
CSS 压缩/解压工具

在线压缩/解压 CSS 代码

在线进制转换器
在线进制转换器

各进制数互转换器

正则表达式在线测试
正则表达式在线测试

正则表达式在线测试