An exploratory data analysis project to analyse datasets for users corresponding to their activity on an LMS (Learning Management System) Portal.
Table of Contents
- user_purchase_data: contains the user id for paid and unpaid users with their joining date and date of first purchase
- track_subscription_data: contains user id and courses users subscribed for
- user_video_activity_data: contains user id, activity id, video duration and watched duration along with the date the video was watched
- user_studycard_data: contains user id, activity id, and date when a studycard was accessed
- user_quiz_attempt_data: contains user id with details of quizzes users took
Datasets are available with in this repository to be downloaded
- numpy==1.18.5
- matplotlib==3.2.2
- pandas==1.0.5
- No. of paid and unpaid users
- Days paid users took before signing up
- Average no. of days before signing up
- Maximum and Minimum days users took to signup
- Top 10 delays in days before users signup
- Weekly and Monthly User Signups
- Percentage of users who signed up within 3 days
- Percentage of users who took more than 3 days to signup
- Total no. of courses
- Most popular course
- How many courses/subscriptions on an avg. users took?
- Total No. of videos
- Longest and Shortest Videos
- Count of videos watched by each user
- The average number of videos watched by each user
- Most Popular Videos
- Assigning scores to videos based on:
- Correlation between length of the videos and Watch percentage
- The average length of videos with a 100% watch percentage
- The average length of 100 most popular videos
- Percentage of CREATED, PAUSED, IN PROGRESS, COMPLETED quizzes by each user
- No. of quizzes completed by each user
- No. of studycard accessed by each user
- Paid and Unpaid users Comparision weekly watched duration, subscription, quizzes are taken, studycards