CornellCAC / CVW_PyDataSci2

Supplementary material for Cornell CVW on "Python for Data Science - Part 2: Data Modeling and Machine Learning"

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

CVW Notebooks and Data

This repository provides supplementary material (in the form of Jupyter notebooks and data files) for use with the Cornell Virtual Workshop (CVW) tutorial on Python for Data Science - Part 2: Data Modeling and Machine Learning.

The code in this repository was run in a conda environment through a series of installs as follows:

  • conda install numpy pandas matplotlib seaborn scipy networkx bokeh jupyter ipython scikit-learn tensorflow=2.0.0
  • conda install -c conda-forge textblob
  • conda install xlrd
  • conda install statsmodels

A full YAML file describing versions for these packages as well as their dependencies are in the repository file PyDS2.yml .

Although tensorflow=2.0.0 was included in the environment, it is not in fact used in any of the code, so it can be excluded from the environment if desired.

Slides (notebook) for Webinar on "Python Tools for Data Science"

This repository also contains a Jupyter notebook used for presentation of an XSEDE Webinar on "Python Tools for Data Science" (7/28/2021). Specifically, this includes:

  • PyDSWebinar.ipynb: Jupyter notebook containing webinar presentation slides
  • PyDSWebinar.html: statically rendered version of the webinar notebook
  • images/*.png: image files that are included in the notebook

About

Supplementary material for Cornell CVW on "Python for Data Science - Part 2: Data Modeling and Machine Learning"


Languages

Language:Jupyter Notebook 63.3%Language:HTML 36.7%