This will be our data science python labs
The labs here are jupyter notebooks. To start them,
Type in the following:
jupyter notebook
For labs that involve Pyspark, you have to do somehthing different.
export PYSPARK_DRIVER_PYTHON="jupyter"
export PYSPARK_DRIVER_PYTHON_OPTS="notebook"
~/pyspark
This will open up a browser window that will allow you to open the notebooks.
- 01-intro/01-LearningNotebooks.ipynb
- 01-intro/02-LearningPython.ipynb
- 01-intro/03-NumPy.ipynb
- 01-intro/04-Pandas.ipynb
- 01-intro/05-Exploring_Pandas.ipynb
- 02-stats/1c-scipy-stats-intro.ipynb
- 02-stats/lr.ipynb
- 02-stats/stats-basics.ipynb
- 03-visualization/1-viz-intro.ipynb
- 03-visualization/2-viz-more.ipynb
- 04-exploration/1-explore-prosper.ipynb
- 04-exploration/2-explore-walmart.ipynb
- 04-exploration/data-cleanup.ipynb
- 04-exploration/explore-house-sales.ipynb
- 04-exploration/visualize-house-sales.ipynb
- 05-sklearn/06-Sklearn_Introduction.ipynb
- 05-sklearn/06a-Sklearn_LRegression.ipynb
- 05-sklearn/07-Sklearn_Clustering.ipynb
- 05-sklearn/08-Sklearn_Classification.ipynb
- 06-text/1-nltk-intro.ipynb
- 06-text/2-analyzing-text-with-nltk.ipynb
- 06-text/3-ngrams.ipynb
- 06-text/4-textblob.ipynb
- 06-text/5-tf-idf-intro.ipynb
- 06-text/6-tf-idf-with-scikit-learn.ipynb
- 06-text/7-gensim-intro.ipynb
- 06-text/8-gensim-newsgroups.ipynb
- 07-spark/09-Introducing_PySpark.ipynb
- 07-spark/10-Learning_Spark.ipynb
- 07-spark/11-Pyspark_ML_Clustering.ipynb