cristina95138 / CS105_Stock_Market_News_Analysis

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

CS105: Stock Market News Analysis

Project Idea

For the project we plan to analyze stock price and stock news data to ultimately create a machine learning model to predict stock prices based on current events.

Data Sets We Plan to Use

Stock price dataset (Download): https://www.kaggle.com/borismarjanovic/price-volume-data-for-all-us-stocks-etfs
News headlines dataset (Download): https://www.kaggle.com/aaron7sun/stocknews
Additional headlines dataset (Web-Crawling): We will be scraped from crawling the reddit news page (r/news) with the pushshift.io Reddit API (https://github.com/pushshift/api).

How the Datasets are Correlated and What We’re Doing with Them

The stock market and news headlines datasets are correlated since national and international events have effects on economic outlook thus causing stock prices to fluctuate. We plan on finding if there are correlations between the type of news event (war, presidential election outcomes) and changes to stock prices of companies in certain industries. To show these correlations we will apply EDA to the data, comparing variables such as names, places, and other key words in the headlines and seeing if there are any strong connections to stock prices.

Information the Datasets Provide

Stock price dataset provides the low, high, open and close price for a stock, the date that the prices were recorded and the volume. They’re all stocks traded on the NYSE, NASDAQ, and NYSE MKT.

News headlines dataset provides the top 25 headlines from /r/worldnews, a Reddit community where people post articles relating to news outside of the US. The dataset contains headlines from 2008-06-08 to 2016-07-01.

Phases

Phase 1

https://github.com/CS-UCR/cs105-prj-phase3-fintech-bros/tree/master/cs105-prj-phase1-fintech-bros-master

Phase 2

https://github.com/CS-UCR/cs105-prj-phase3-fintech-bros/tree/master/cs105-prj-phase2-fintech-bros-master

Phase 3

https://github.com/CS-UCR/cs105-prj-phase3-fintech-bros/tree/master/cs105-prj-phase3-fintech-bros-master

About


Languages

Language:Jupyter Notebook 99.5%Language:Python 0.5%