There are 122 repositories under exploratory-data-analysis topic.
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Always know what to expect from your data.
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
Visualize and compare datasets, target values and associations, with one line of code.
Beautiful visualizations of how language differs among document types.
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer
Automatically find issues in image datasets and practice data-centric computer vision.
Compilation of R and Python programming codes on the Data Professor YouTube channel.
Developer-first embedded analytics
Build 12 Data Apps in Python with Streamlit
Complete-Life-Cycle-of-a-Data-Science-Project
Ways of doing Data Science Engineering and Machine Learning in R and Python
skimpy is a light weight tool that provides summary statistics about variables in data frames within the console.
A list of software and papers related to automatic and fast Exploratory Data Analysis
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
dataโฐdescribe: Pythonic EDA Accelerator for Data Science
Data Science Feature Engineering and Selection Tutorials
this repository features assignments and projects from the iNeuron full stack data science course, providing valuable resources for learners to enhance their skills and apply their knowledge.
:full_moon_with_face: Lottery prediction besides of following "law of proability","Probability: Independent Events", there are still "Saying "a Tail is due", or "just one more go, my luck is due to change" is called The Gambler's Fallacy" existed.
Tutorials on visualizing data using python packages like bokeh, plotly, seaborn and igraph
Data Analysis Using Python: A Beginnerโs Guide Featuring NYC Open Data.
๐ ๏ธ ๐ Tools for Exploring and Comparing Data Frames
Classification of Breast Cancer diagnosis Using Support Vector Machines
Functionalities in Excel translated to Python
A day to day plan for this challenge. Covers both theoritical and practical aspects
Exploratory data analysis ๐using python ๐of used car ๐ database taken from โ๐๐๐๐๐
HandySpark - bringing pandas-like capabilities to Spark dataframes