sliwet / matplotlib-challenge

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Matplotlib Homework - The Power of Plots

Background

What good is data without a good plot to tell the story?

So, let's take what you've learned about Python Matplotlib and apply it to a real-world situation.

Pymaceuticals Instructions

While your data companions rushed off to jobs in finance and government, you remained adamant that science was the way for you. Staying true to your mission, you've since joined Pymaceuticals Inc., a burgeoning pharmaceutical company based out of San Diego, CA. Pymaceuticals specializes in drug-based, anti-cancer pharmaceuticals. In their most recent efforts, they've since begun screening for potential treatments to squamous cell carcinoma (SCC), a commonly occurring form of skin cancer.

As their Chief Data Analyst, you've been given access to the complete data from their most recent animal study. In this study, 250 mice were treated through a variety of drug regimes over the course of 45 days. Their physiological responses were then monitored over the course of that time. Your objective is to analyze the data to show how four treatments (Capomulin, Infubinol, Ketapril, and Placebo) compare.

To do this you are tasked with:

  • Creating a scatter plot that shows how the tumor volume changes over time for each treatment.
  • Creating a scatter plot that shows how the number of metastatic (cancer spreading) sites changes over time for each treatment.
  • Creating a scatter plot that shows the number of mice still alive through the course of treatment (Survival Rate)
  • Creating a bar graph that compares the total % tumor volume change for each drug across the full 45 days.
  • Include 3 observations about the results of the study. Use the visualizations you generated from the study data as the basis for your observations.

As final considerations:

  • You must use the Pandas Library and the Jupyter Notebook.
  • You must use the Matplotlib library.
  • You must include a written description of three observable trends based on the data.
  • You must use proper labeling of your plots, including aspects like: Plot Titles, Axes Labels, Legend Labels, X and Y Axis Limits, etc.
  • Your scatter plots must include error bars. This will allow the company to account for variability between mice. You may want to look into pandas.DataFrame.sem for ideas on how to calculate this.
  • Remember when making your plots to consider aesthetics!
    • Your legends should not be overlaid on top of any data.
    • Your bar graph should indicate tumor growth as red and tumor reduction as green. It should also include a label with the percentage change for each bar. You may want to consult this tutorial for relevant code snippets.
  • See Starter Workbook for a reference on expected format. (Note: For this example, you are not required to match the tables or data frames included. Your only goal is to build the scatter plots and bar graphs. Consider the tables to be potential clues, but feel free to approach this problem, however, you like.)

Copyright

Trilogy Education Services © 2019. All Rights Reserved.

About


Languages

Language:Jupyter Notebook 100.0%