haminse / Data-analysis

Practice Data analysis with Kaggle

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Data-analysis

Practice Data analysis with Kaggle, + learn data crawling

Choosing Plot Types and Custom Styles

image

Since it's not always easy to decide how to best tell the story behind your data, we've broken the chart types into three broad categories to help with this.

1. Trends - A trend is defined as a pattern of change

sns.lineplot

Line charts are best to show trends over a period of time, and multiple lines can be used to show trends in more than one group.

image

2. Relationship - There are many different chart types that you can use to understand relationships between variables in your data.

sns.barplot

Bar charts are useful for comparing quantities corresponding to different groups.

image

sns.heatmap

Heatmaps can be used to find color-coded patterns in tables of numbers.

image

sns.scatterplot

Scatter plots show the relationship between two continuous variables; if color-coded, we can also show the relationship with a third categorical variable.

image

sns.regplot

Including a regression line in the scatter plot makes it easier to see any linear relationship between two variables.

image

sns.lmplot

This command is useful for drawing multiple regression lines, if the scatter plot contains multiple, color-coded groups.

image

sns.swarmplot

Categorical scatter plots show the relationship between a continuous variable and a categorical variable.

image

3. Distribution - We visualize distributions to show the possible values that we can expect to see in a variable, along with how likely they are.

sns.distplot

Histograms show the distribution of a single numerical variable.

image

sns.kdeplot

KDE plots (or 2D KDE plots) show an estimated, smooth distribution of a single numerical variable (or two numerical variables).

image

sns.jointplot

This command is useful for simultaneously displaying a 2D KDE plot with the corresponding KDE plots for each individual variable.

image

About

Practice Data analysis with Kaggle


Languages

Language:Jupyter Notebook 100.0%