roshan2004 / nano281

Data Science for Materials Science

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

NANO281 - Data Science in Materials Science

Welcome to the official Github repo for UCSD Course NANO281 - "Data Science in Materials Science". Here, you will find the materials for your lectures, labs, as well as other useful resources.

Lecture materials

All lecture materials are in the lectures folder.

  • slides contains the lecture slides in PDF format.
  • slides_tex contains the lecture slides in latex beamer format. This is useful if you want to extract an equation or figure.
  • notebooks contains the Jupyter notebooks that we will be using for in-lecture demos.

Labs

You will be assessed on three lab sessions. The instructions for all labs are in the labs folder.

Programming language

All lectures and labs will be conducted in Python 3.9+. Prior to the labs, please do the following:

  1. Install python on your laptop.
  2. If you are unfamiliar with Python, please go through the official Python tutorials to understand the basics of variable assignment, if and for loops, etc. While we are not expecting advanced Python knowledge, everything will go much quicker if you are already familiar with the basics.

Preparing for your lectures and labs

In preparation for your lectures and labs, please ensure that you have your Python environment setup properly. You only need to do this once. There are alternative approaches to setting up your machine, but the approach outlined here is guaranteed to work and is reproducible. Your instructors would be better able to assist if there are issues in the installation.

  1. Download the Python 3.9+ version of Miniconda (recommended) or Anaconda for your OS.
  2. Follow the official instructions and install Miniconda/Anaconda.
  3. Start the terminal (Mac/Linux) or Anaconda Prompt (Windows).
  4. Create a virtual environment for NANO281:
conda create --name nano281 python=3.9
  1. Activate the virtual environment.
conda activate nano281
  1. Install the necessary Python libraries.
conda install --yes numpy scipy matplotlib pandas jupyter seaborn scikit-learn
conda install --channel conda-forge --yes pymatgen
  1. An alternative to steps 4-6 is to download the nano281_env.yml file from the Google classroom or Github repo and create the environment directly.
conda env create -f nano281_env.yml
  1. Subsequently, always activate your nano281 environment prior to working on your lectures/labs using:
conda activate nano281

If for any reason you find that your conda environment is corrupted and you need to start over, please run:

conda env remove --name nano281

and simply start from step 4 above to redo your setup.

Course textbooks

The course is intended to be self-contained and all textbooks are optional. However, the following are useful to have around:

  1. The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition Amazon, or get the free online version.
  2. Python Data Science Handbook. Buy from Amazon or get the free online version.

About

Data Science for Materials Science

License:BSD 3-Clause "New" or "Revised" License


Languages

Language:Jupyter Notebook 59.8%Language:TeX 40.0%Language:Python 0.1%Language:Java 0.0%