There are 2 repositories under public-dataset topic.
A comprehensive list of annotated training datasets classified by use case.
Preprocessed IXI brain MRI dataset with subcortical segmentation
The MAMA-MIA Dataset: A Multi-Center Breast Cancer DCE-MRI Public Dataset with Expert Segmentations
A structured dataset of emails sent at Atari from 1983 to 1992.
GitHub repository for the Kvasir-instrument dataset
Turkish, Hungarian and English handwritten offline character dataset.
Flask, Chart.js drilldown example project
Web app using babashka/apache + ETL pipeline
Orchid2024: A cultivar-level dataset and methodology for fine-grained classification of Chinese Cymbidium Orchids.
Scrapers used to acquire snapshots of raw data inputs for versioned archiving and replicable analysis.
I used a public database in order to create a logistic regression model for detecting suspicious credit card activities.
Dataset release in association with Gabriel 2024, Frontiers (in review)
Prever se um paciente tem diabetes utilizando algoritmos de classificação binárias.
BCCD (Blood Cell Count and Detection) Dataset is a small-scale dataset for blood cells detection.
A list of approved tourist mountain hiking trails from the Romanian "Ministerul Economiei, Antreprenoriatului și Turismului". Original copy and cleaned.
a ready-to-use-and-share graphing tool. Visualize your data in the blink of an eye!
Data Analysis - university team project.
A small project to test out whether is it the model or dataset causing miss prediection.
2014년부터 2023년까지 서울시 집값 데이터 분석 및 예측 프로그램입니다.
SPA and simple API to visualize and investigate New York State Quick Draw data
Raw data files from "SNIS - Série Histórica" web site
🟥 Contains time-series data from a 3-kilowatt micro gas turbine. It records electrical power output in relation to input control signals. Designed for regression analysis, it aims to predict energy output based on control signals. The dataset includes eight time series with varying durations and input signal patterns.
🟥🟩 Comprises 10,000 two-dimensional points organized into 100 distinct circles. Designed for evaluating clustering algorithms like k-means, it presents a well-defined clustering challenge. Each point is labeled with its corresponding circle, making it suitable for both classification and clustering tasks.
🟥 Provides a comprehensive overview of weekly hospital respiratory data and metrics aggregated to national and state/territory levels, reported to the Centers for Disease Control and Prevention's (CDC) National Health Safety Network (NHSN) from August 2020 through October 2024.
📊 List of Free Data Science Courses/Podcasts, Open/Public datasets repo , data visualization resources, and data science news. Contributions are welcome! 🤝
Learn deeplearning from public datasets & state-of-the-art papers.