himanshunagdive / ny-taxi-trip-demand-predictor

New York taxi trip demand prediction using machine learning models trained with temporal features.

Overview

Dataset used

New York City Taxi Trip Data (2013). Source: NYC Taxi & Limousine Commission.

Data Dictionary

TRIP_DATA.csv

  • medallion: a permit to operate a yellow taxi cab in New York City; in this dataset it is effectively a (randomly assigned) car ID.
  • hack_license: a license to drive the vehicle; in this dataset it is effectively a (randomly assigned) driver ID.
  • vendor_id: e.g., Verifone Transportation Systems (VTS), or Mobile Knowledge Systems Inc (CMT), implemented as part of the Technology Passenger Enhancements Project.
  • rate_code: taximeter rate. Check http://www.nyc.gov/html/tlc/html/passenger/taxicab_rate.shtml.
  • pickup_datetime: start time of the trip, mm-dd-yyyy hh24:mm:ss EDT.
  • dropoff_datetime: end time of the trip, mm-dd-yyyy hh24:mm:ss EDT.
  • passenger_count: number of passengers on the trip, default value is one.
  • trip_time_in_secs: trip time measured by the taximeter in seconds.
  • trip_distance: trip distance measured by the taximeter in miles.
  • pickup_longitude and pickup_latitude: GPS coordinates at the start of the trip.
  • dropoff_longitude and dropoff_latitude: GPS coordinates at the end of the trip.
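Since the models are trained on temporal features, the pickup timestamp is the key column in this file. The following is a minimal sketch, not the repository's actual code, of how `pickup_datetime` could be parsed and turned into hourly pickup counts with pandas. It assumes the timestamp layout stated above (`mm-dd-yyyy hh24:mm:ss`); the toy rows and the helper names `add_temporal_features` and `hourly_demand` are hypothetical.

```python
import pandas as pd

# Toy rows shaped like TRIP_DATA.csv (values are made up for illustration).
trips = pd.DataFrame({
    "medallion": ["A1", "A1", "B2", "B2"],
    "pickup_datetime": [
        "01-01-2013 08:05:00",
        "01-01-2013 08:40:00",
        "01-01-2013 09:15:00",
        "01-02-2013 08:20:00",
    ],
})

def add_temporal_features(df, fmt="%m-%d-%Y %H:%M:%S"):
    """Parse pickup_datetime and add simple calendar features.

    The format string is an assumption taken from the data dictionary;
    adjust it if the raw file uses a different layout.
    """
    out = df.copy()
    out["pickup_datetime"] = pd.to_datetime(out["pickup_datetime"], format=fmt)
    out["hour"] = out["pickup_datetime"].dt.hour
    out["day_of_week"] = out["pickup_datetime"].dt.dayofweek  # Monday = 0
    out["is_weekend"] = out["day_of_week"] >= 5
    return out

def hourly_demand(df):
    """Count pickups per calendar date and hour of day."""
    keys = [df["pickup_datetime"].dt.date.rename("date"), df["hour"]]
    return df.groupby(keys).size().rename("pickups").reset_index()

feats = add_temporal_features(trips)
demand = hourly_demand(feats)
print(demand)
```

The resulting `pickups` column is one possible definition of demand (trips started per hour); other granularities, such as per zone or per 15 minutes, would follow the same pattern.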

FARE_DATA.csv

  • medallion: a permit to operate a yellow taxi cab in New York City; in this dataset it is effectively a (randomly assigned) car ID.
  • hack_license: a license to drive the vehicle; in this dataset it is effectively a (randomly assigned) driver ID.
  • vendor_id: e.g., Verifone Transportation Systems (VTS), or Mobile Knowledge Systems Inc (CMT), implemented as part of the Technology Passenger Enhancements Project.
  • pickup_datetime: start time of the trip, mm-dd-yyyy hh24:mm:ss EDT.
  • payment_type: cash or credit card.
  • fare_amount: the meter fare, in USD; it should include the Newark surcharge.
  • surcharge: extra fees, such as rush hour and overnight surcharges, in USD.
  • mta_tax: metropolitan commuter transportation mobility tax, in USD.
  • tip_amount: tip amount (for credit card transactions only), in USD.
  • tolls_amount: total price paid for tolls, summed across all tolls for the trip, in USD.
  • total_amount: all charges that are presented to the passenger at time of fare payment (includes tip for non-cash trips), in USD.
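The two files share the medallion, hack license, vendor, and pickup time columns, so a trip can plausibly be matched to its fare record on those four keys. The sketch below illustrates that join with pandas on made-up rows. It assumes the vendor column is named `vendor_id` in both files and that the four keys identify a trip uniquely. The check that `total_amount` equals the sum of the other charge columns is also an assumption, not something the data dictionary guarantees.

```python
import pandas as pd

# Columns present in both files; assumed to identify a trip uniquely.
KEYS = ["medallion", "hack_license", "vendor_id", "pickup_datetime"]

# Made-up rows shaped like TRIP_DATA.csv.
trip = pd.DataFrame({
    "medallion": ["A1", "B2"],
    "hack_license": ["H1", "H2"],
    "vendor_id": ["VTS", "CMT"],
    "pickup_datetime": ["01-01-2013 08:05:00", "01-01-2013 09:15:00"],
    "trip_distance": [2.4, 1.1],
})

# Made-up rows shaped like FARE_DATA.csv.
fare = pd.DataFrame({
    "medallion": ["A1", "B2"],
    "hack_license": ["H1", "H2"],
    "vendor_id": ["VTS", "CMT"],
    "pickup_datetime": ["01-01-2013 08:05:00", "01-01-2013 09:15:00"],
    "fare_amount": [10.0, 7.5],
    "surcharge": [0.5, 0.0],
    "mta_tax": [0.5, 0.5],
    "tip_amount": [2.0, 0.0],
    "tolls_amount": [0.0, 0.0],
    "total_amount": [13.0, 8.0],
})

# validate="one_to_one" raises if the keys are duplicated on either side.
merged = trip.merge(fare, on=KEYS, how="inner", validate="one_to_one")

# Sanity check (assumption): total_amount is the sum of the charge columns.
components = ["fare_amount", "surcharge", "mta_tax", "tip_amount", "tolls_amount"]
merged["components_sum"] = merged[components].sum(axis=1)
merged["total_matches"] = (merged["components_sum"] - merged["total_amount"]).abs() < 0.01
print(merged[["medallion", "total_amount", "components_sum", "total_matches"]])
```

On real data, rows where `total_matches` is false or where the one-to-one validation fails would be worth inspecting before modeling.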

Sampling

1,000 medallions were randomly selected from January, and all of their trips throughout the year were extracted.
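The sampling step can be sketched as follows. This is an illustrative reconstruction under stated assumptions, not the code used to build the sample: the function name `sample_medallion_trips`, the fixed `seed`, and the cap when fewer medallions are available than requested are all hypothetical, and the timestamp format is taken from the data dictionary.

```python
import pandas as pd

def sample_medallion_trips(trips, n_medallions=1000, month=1, seed=0,
                           fmt="%m-%d-%Y %H:%M:%S"):
    """Pick medallions seen in the given month, then keep all their trips."""
    pickup = pd.to_datetime(trips["pickup_datetime"], format=fmt)
    # Distinct medallions with at least one pickup in the chosen month.
    candidates = trips.loc[pickup.dt.month == month, "medallion"].drop_duplicates()
    # Cap the sample size so small inputs do not raise (an assumption).
    n = min(n_medallions, len(candidates))
    chosen = candidates.sample(n=n, random_state=seed)
    # Keep every trip of the chosen medallions, across the whole year.
    return trips[trips["medallion"].isin(chosen)]

# Toy data: A and B drive in January, C only appears in February.
toy = pd.DataFrame({
    "medallion": ["A", "A", "B", "B", "C"],
    "pickup_datetime": [
        "01-03-2013 10:00:00",
        "03-15-2013 18:30:00",
        "01-20-2013 07:45:00",
        "06-01-2013 23:10:00",
        "02-02-2013 12:00:00",
    ],
})

sample = sample_medallion_trips(toy, n_medallions=2)
print(sample)
```

Sampling by medallion rather than by trip keeps each vehicle's full yearly history intact, which matters when features are built from a vehicle's or the fleet's activity over time.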


Languages

Language:Jupyter Notebook 100.0%