mkhuthir / ai

AI/ML using Python3 and OpenCV

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

AI_python

AI/ML using Python3 and OpenCV

Following datasets can be used:

Data.gov

NOAA - https://www.ncdc.noaa.gov/cdo-web/ atmospheric, ocean

Bureau of Labor Statistics - https://www.bls.gov/data/ employment, inflation

US Census Data - https://www.census.gov/data.html demographics, income, geo, time series

Bureau of Economic Analysis - http://www.bea.gov/data/gdp/gross-domestic-product GDP, corporate profits, savings rates

Federal Reserve - https://fred.stlouisfed.org/ curency, interest rates, payroll

Quandl - https://www.quandl.com/ financial and economic

Data.gov.uk

UK Dataservice - https://www.ukdataservice.ac.uk Census data and much more

WorldBank - https://datacatalog.worldbank.org census, demographics, geographic, health, income, GDP

IMF - https://www.imf.org/en/Data economic, currency, finance, commodities, time series

OpenData.go.ke Kenya govt data on agriculture, education, water, health, finance, … https://data.world/

Open Data for Africa - http://dataportal.opendataforafrica.org/ agriculture, energy, environment, industry, …

Kaggle - https://www.kaggle.com/datasets A huge variety of different datasets

Amazon Reviews - https://snap.stanford.edu/data/web-Amazon.html 35M product reviews from 6.6M users

GroupLens - https://grouplens.org/datasets/movielens/ 20M movie ratings

Yelp Reviews - https://www.yelp.com/dataset 6.7M reviews, pictures, businesses

IMDB Reviews - http://ai.stanford.edu/~amaas/data/sentiment/ 25k Movie reviews

Twitter Sentiment 140 - http://help.sentiment140.com/for-students/ 160k Tweets

Airbnb - http://insideairbnb.com/get-the-data.html A TON of data by geo

UCI ML Datasets - http://mlr.cs.umass.edu/ml/ iris, wine, abalone, heart disease, poker hands, ….

Enron Email dataset - http://www.cs.cmu.edu/~enron/ 500k emails from 150 people From 2001 energy scandal. See the movie: The Smartest Guys in the Room.

Spambase - https://archive.ics.uci.edu/ml/datasets/Spambase Emails

Jeopardy Questions - https://www.reddit.com/r/datasets/comments/1uyd0t/200000_jeopardy_questions_in_a_json_file/ 200k Questions and answers in json

Gutenberg Ebooks - http://www.gutenberg.org/wiki/Gutenberg:Offline_Catalogs Large collection of books

IMAGES

ImageNet - http://image-net.org 14M images of objects

Google - https://ai.googleblog.com/2016/09/introducing-open-images-dataset.html 9M image URLs with labels

Microsoft Coco - http://cocodataset.org 330k images, most labeled

Labelled Faces in the Wild - http://vis-www.cs.umass.edu/lfw/ 13k face images with names

Stanford Dogs - http://vision.stanford.edu/aditya86/ImageNetDogs/ 120 dog breeds, 20k images

AUTONOMOUS CARS

Berkeley DeepDrive - https://bdd-data.berkeley.edu/ Massive dataset including 100k videos with 1100 hours of hd driving

Belgian Traffic Signs - http://www.vision.ee.ethz.ch/~timofter/traffic_signs/ 10k images

Bosch Small Traffic Signals - https://hci.iwr.uni-heidelberg.de/node/6132 5k training and 8k test images

WPI Traffic Light, Pedestrian, Lane-Keeping - http://computing.wpi.edu/dataset.html 30GB of training and test data from Worcester, Mass

UCSD Lisa - http://cvrr.ucsd.edu/LISA/datasets.html Vehicle detection, traffic signals

About

AI/ML using Python3 and OpenCV


Languages

Language:Python 98.3%Language:Shell 1.7%