There is 1 repository under the data-quality-assessment topic.
In this project, an RFM model is implemented to relate to customers in each segment. The data quality was assessed, EDA was performed using Python, and a dashboard was created using Tableau.
Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".
This repository contains solutions to the 3 different tasks that must be performed during the data analytics virtual internship provided by KPMG via Forage.
Step-by-step exploratory movement data analysis protocol in a Jupyter notebook
🔍Your Data Quality Detector / Gain insight into your data and get it ready for use before you start working with it 💡📊🛠💎
Data-IQ: Characterizing subgroups with heterogeneous outcomes in tabular data (NeurIPS 2022)
Health Data Metrics (HDM), a data quality assessment application.
🧼🔎 SelfClean revised versions of benchmark datasets for more reliable performance estimation.
A highly-configurable, real-time data quality monitoring tool designed for streaming data
SDQCPy is a comprehensive Python package designed for synthetic data management, quality control, and validation.
A function that automatically generates a Data Quality Report for your data
Collection of R scripts to test packages in conducting data quality assessments
A signal quality assessment pipeline and dashboard for ambulatory cardiovascular data
Addressing Data Quality Challenges in Ambulatory Wrist-worn Wearable Monitoring Through Analytical and Practical Approaches
Provides sales trend visibility on a monthly, quarterly, and yearly basis.
KGHeartBeat is a community-shared open-source knowledge graph quality assessment tool to perform quality analysis on a wide range of freely available knowledge graphs registered on the LOD cloud and DataHub. Web-App: http://www.isislab.it:12280/kgheartbeat/
This project involves analyzing Sprocket Central Pty Ltd data to help the marketing department uncover useful insights that could help them optimize resource allocation for targeted marketing.
This project involves analyzing customer data for Sprocket Central Pty Ltd. The goal is to optimize the company's marketing strategy. We will assess data quality, target high-value customers, and develop a data-driven marketing plan. By leveraging customer data, we aim to provide valuable insights and recommendations to drive business growth.
Data quality, maturity and utility labelling tool for the EHDS (HealthData@EU)
This repository consists of all the Jupyter notebooks, images, and .CSV files for the tasks that were assigned during the KPMG Data Analytics Course hosted on Forage.
Data quality made simple
Sprocket Central is a medium-sized bike company that requires analytical insights on its marketing strategy and on which current and prospective customers to target. A final visualisation had to be presented to obtain sign-off for further work.
Data Quality Assessment for Industrial Data
PySpark and Python ML and data science projects on a variety of topics
This repository contains solutions to the data analytics virtual internship provided by KPMG via Forge Academy
Tunable Query Optimizer for Web APIs and User Preferences
In this project, an RFM model is implemented to relate to customers in each segment. The data quality was assessed and EDA was performed using Python.
A service designed to efficiently analyze and assess the quality of high-frequency data collected from Industrial Internet of Things (IIoT) sensors. This app reads multiple sensor readings that monitor a machine from a LeanXcale database supporting energy-efficient and incremental analytics.
Generate valuable insights from customer and transactions data.
This repository specifies methods and procedures for data quality questions.
This project aims to show the capabilities of Alteryx software on the Kaggle dataset named "Superstore". The project includes the creation of an analytic application, spatial analysis, report visualisations, exploratory data analysis, and a unique Alteryx approach to the traveling salesman problem.
KPMG-Virtual-Internship: This repo contains all the solutions and resources for the data analytics virtual internship provided by KPMG via Forage