Sapphirine / 202312-14-Financial-Analysis-on-Spatio-Temporal-Graph-Data

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

EECS E6893 Fall of 2023

Project: Financial Analysis on Spatio-Temporal Graph Data

Team 14: Shaokun Feng(sf3209), Qingyuan Liu(ql2505), Xinzi Yu(xy2590)

This project presents a comprehensive exploration into leveraging Big Data Analysis to predict stock prices using Spatio-Temporal Graph data. Focused on S&P 500 companies, the study delves into the complexities of financial markets, aiming to construct dynamic relationships between companies and develop predictive models. The report outlines meticulous data collection, preprocessing methodologies, and the development of predictive models—LSTM for temporal data and STGCN for static and dynamic Spatio-Temporal data. It discusses experimental outcomes, visualization strategies, and references recent works in the field. The endeavor seeks to contribute to revolutionizing financial forecasting and decision-making processes by enhancing predictive analytics in the financial domain.

Tree Diagram of Project Body

image-20231215135034651

Data Generation

Target data selection:

•Step 1: Identify companies from the S&P 500 list, analyzing the HTML structure of the Yahoo economic news website for each company to extract relevant news titles.

•Step 2: After crafting web crawlers to extract company-specific news titles, we detect and confirm company mentions within the extracted text.

image-20231215134632655

Stock data acquisition:

•Step 1: Obtain comprehensive stock data for each identified company using the Finnhub library.

•Step 2: Construct a spatio-temporal graph

Type Value
Nodes 503
Edges 4255
Features 5
Temporal Length 3650

LSTM&CNN Based STGNN&LSTM Based STGNN

image-20231215134913566

image-20231215134928065

Airflow Deployment

image-20231215135143114

About


Languages

Language:Jupyter Notebook 99.2%Language:Python 0.8%