Version: 2018 Feb 23
- Introductory R material:
intro.R
and related data sets from UCI data science course - Stock market data set from Kaggle
- R reference sheet
- R forecast documentation
data_files
- CSV files are for
intro.R
work - Three .txt files from ETF data (aadr, aaxj, acim)
- Three .txt files from Stock data (nvda, nvec, nvee)
- CSV files are for
intro.R
- Simple, commented script on getting acquainted with R.work.R
- Analysis work.
- There are two categories of data: exchange-traded funds (ETFs) and stocks.
- The information on each company or exchange is listed in its own text file, labeled as
[stock-symbol].us.txt
. - Each text file lists the following information:
- Date
- Open - opening price of the day
- High - highest price of the day
- Low - lowest price of the day
- Close - closing price of the day
- Volume - amount traded that day
- OpenInt - open interest (total number of trades that have not yet been liquidated) see Investopedia
- A representative text file is structured as comma-separated values in this order:
Date | Open | High | Low | Close | Volume | OpenInt |
---|---|---|---|---|---|---|
2011-02-01 | 24.092 | 24.226 | 24.092 | 24.226 | 827 | 0 |
2011-02-02 | 23.934 | 23.934 | 23.759 | 23.759 | 827 | 0 |
2011-02-04 | 23.739 | 23.739 | 23.724 | 23.724 | 22177 | 0 |
2011-02-08 | 24.064 | 24.074 | 24.064 | 24.074 | 1886 | 0 |
- Numerical details on the data set:
- 8539 total entries
- 1344 ETFs
- 7195 stocks
- Dates range from: XXXX to 2017-11-10.
- XXX-XXX lines per text file
- What is the average trend of the stock market? (should be upwards) (validate with online stock trends)
- What stock has the best forecast outlook? (maybe measured in magnitude of difference from Nov 2017 to forecast end)
- What are the best/worst performers? (most upwards/downwards trend)
- What sectors performs best/worst? (will need to obtain sector information)
- What day and stock had the highest close-minus-open difference? (is there related news coverage)
- What day and stock had the highest high-minus-low difference? (is there related news coverage)
- What is the correlation time in the market? A few days? A couple of weeks?
- Time Series Forecasting with Recurrent Neural Networks
- A Guide For Time Series Prediction Using Recurrent Neural Networks (LSTMs)
- How to apply Neural Network to time series forecasting?
- Candle charts
- LSTM networks
- Quantifying correlation between financial news and stocks
Although I have R and RStudio set up on my own computer, I wanted to be able to run RStudio
on my remote Linux server, which has a lot more computing power.
- On remote server: install R
- Note: This gave me an older version, 3.0 something, so I updated it after.
- On remote server: install RStudio Server
- Configure to work over SSH
- Login to the remote server and start RStudio Server:
rstudio-server start
You might not have to do this. Mine was already active, even after reboot. - From local server:
ssh -f <username>@<server> -L :8787:127.0.0.1:8787 -N
(Thank you StackExchange!) - Open an Internet browser and type in "http://127.0.0.1:8787/"
- Log in with your username/password for the remote server.
- Login to the remote server and start RStudio Server: