In this lab we will be working with files_for_lab/abTesting.csv
file. Please refer to the case_study_ab_test.md
file to understand more about the data collected. The file also has description of the different data fields given in the csv file.
Import the csv into Tableau and answer the following questions:
-
What is the distribution of number of participants at each stage for control variation and test variation? Please make sure that the process steps are in order. The final plot would look like this:
-
Now, we want to compare the total number of participants for each variation. Pay attention: can you take the sum of all the process steps or not?
-
Duplicate the previous plot onto a new sheet. Now we want to find the total number of participants by gender, ie. what is the number of males and females in each variation. You can filter out the other gender categories present in the data? The final plot would look like this:
-
Duplicate the previous sheet. Now the objective is to represent the number of participants by gender as percentage of totals. Use the appropriate quick table calculation for this. Round the percentages to one decimal point. The final plot would look like this:
-
We have already seen how to group the data by making a new calculated field (using
IF...ELSE
conditional statement). Now we will create groups using thecreate group
option in the Tableau. We will create different age groups and analyze the average balance for each group. Follow the steps:- Click the down arrow on
clnt_age
in the dimensions pane on the left hand side. Go to create and selectgroups
:
- Create the following groups: Age 17 to 30 , 31 to 40, 40 to 55, 55 to 70, and 70 and above
- Edit group names:
- Plot average balance for each group. Do you observe any trend?
- Click the down arrow on