data-partitioning

There are 5 repositories under the data-partitioning topic.

SourabhSinghRana / zomato_data_warehouse_design
I recently tried designing a data warehouse for Zomato/Swiggy, two of the leading online food delivery platforms. This project gave me invaluable hands-on experience and let me apply my skills and knowledge in a real-world scenario.
data-modeling data-partition data-partitioning data-warehouse data-warehousing snowflake star data-warehouse-designing
2
Lefteris-Souflas / Business-Analytics-Case-Studies
Three business analytics case studies were undertaken, encompassing market basket analysis, customer segmentation, and campaign management. SAS Visual Data Mining and Machine Learning on SAS Viya was utilized to explore data and provide insights. A comprehensive report addressing both technical and business aspects was delivered.
customer-segmentation fraud-detection market-basket-analysis sas-visual-analytics sas-viya association-rules data-partitioning decision-tree logistic-regression neural-network rfm-segmentation stratified-sampling cut-off-point-calculation maximal-tree
1
osiastossou / ProjetTD-AC
This paper presents TD-AC, an effective algorithm for the truth discovery problem when the attributes of the data are structurally correlated. The procedure builds on an abstract representation of the truth in the data, the k-means clustering technique, and the silhouette measure to automatically find an optimal (or near-optimal) partitioning of the input data that maximizes the accuracy of any base truth discovery process. Intensive experiments on synthetic and real datasets show that TD-AC outperforms existing partitioning approaches with a more reasonable running time. On synthetic datasets it improves the accuracy of standard truth discovery algorithms by at least 6% and at most 16%; on the other types of datasets it also improves accuracy significantly when the data coverage rate is high.
truth-discovery attribute data-partitioning clustering attribute-truth-vector k-means silhouette-index
Language:Python 1
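
The TD-AC description above combines k-means with the silhouette measure to choose a partitioning of attributes. The following is a minimal sketch of that general idea only, not the repository's implementation. The `truth_vectors` data, the attribute names, and the functions `kmeans`, `silhouette`, and `best_partition` are all hypothetical, and the deterministic farthest-point initialization is a simplification chosen here to keep the example reproducible.

```python
import math

def dist(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def kmeans(points, k, iters=50):
    """Plain k-means with farthest-point initialization (an assumption, see lead-in)."""
    centers = [points[0]]
    while len(centers) < k:
        # Next center: the point farthest from its nearest existing center.
        centers.append(max(points, key=lambda p: min(dist(p, c) for c in centers)))
    labels = None
    for _ in range(iters):
        new_labels = [min(range(k), key=lambda c: dist(p, centers[c])) for p in points]
        if new_labels == labels:
            break  # assignments are stable
        labels = new_labels
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def silhouette(points, labels):
    """Mean silhouette coefficient; singleton clusters contribute 0."""
    clusters = {}
    for i, l in enumerate(labels):
        clusters.setdefault(l, []).append(i)
    if len(clusters) < 2:
        return -1.0
    total = 0.0
    for i, l in enumerate(labels):
        own = clusters[l]
        if len(own) == 1:
            continue
        a = sum(dist(points[i], points[j]) for j in own if j != i) / (len(own) - 1)
        b = min(
            sum(dist(points[i], points[j]) for j in other) / len(other)
            for m, other in clusters.items() if m != l
        )
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(points)

def best_partition(vectors):
    """Try k = 2 .. n-1 and keep the clustering with the highest silhouette.

    Assumes at least three attributes.
    """
    names = list(vectors)
    points = [[float(x) for x in vectors[n]] for n in names]
    best_score, best_labels = None, None
    for k in range(2, len(points)):
        labels = kmeans(points, k)
        score = silhouette(points, labels)
        if best_score is None or score > best_score:
            best_score, best_labels = score, labels
    groups = {}
    for name, label in zip(names, best_labels):
        groups.setdefault(label, []).append(name)
    return sorted(sorted(g) for g in groups.values())

# Hypothetical per-attribute vectors: similar vectors suggest correlated attributes.
truth_vectors = {
    "name":  [1, 1, 1, 0, 0, 0],
    "city":  [1, 1, 0.9, 0, 0, 0],
    "zip":   [1, 0.9, 1, 0, 0, 0],
    "price": [0, 0, 0, 1, 1, 1],
    "tax":   [0, 0, 0, 1, 1, 0.9],
    "total": [0, 0.1, 0, 1, 1, 1],
}
groups = best_partition(truth_vectors)
print(groups)
```

Each resulting group of attributes could then be passed separately to a base truth discovery algorithm, which is the role the description assigns to the partitioning step.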
sketch-imiss / Sketch
This repository provides streaming algorithms that can be used for monitoring large-scale data streams.
cardinality-estimation frequency-estimation streaming persistency-estimation data-partitioning sampling
Language:Python 1
sudheerkodali / system-Design
system-design-and-it-types
availability caching data-partitioning database load-balancing publisher-subscriber evaluate-your-design network-protocals proxy-server-load-balancers publish-subscriber-pattern rate-limiters steps-to-approach-any-system-design-problems system-design-fundamentals
