There are 10 repositories under data-versioning topic.
🌀 𝗧𝗵𝗲 𝗙𝘂𝗹𝗹 𝗦𝘁𝗮𝗰𝗸 𝟳-𝗦𝘁𝗲𝗽𝘀 𝗠𝗟𝗢𝗽𝘀 𝗙𝗿𝗮𝗺𝗲𝘄𝗼𝗿𝗸 | 𝗟𝗲𝗮𝗿𝗻 𝗠𝗟𝗘 & 𝗠𝗟𝗢𝗽𝘀 for free by designing, building and deploying an end-to-end ML batch system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 2.5 𝘩𝘰𝘶𝘳𝘴 𝘰𝘧 𝘳𝘦𝘢𝘥𝘪𝘯𝘨 & 𝘷𝘪𝘥𝘦𝘰 𝘮𝘢𝘵𝘦𝘳𝘪𝘢𝘭𝘴
Curated list of open source tooling for data-centric AI on unstructured data.
Distributed version-control for geospatial and tabular data
A versioning data store for time-variant graph data.
Collecting thoughts about data versioning
A curated list to help you manage temporal data across many modalities 🚀.
Metadata store for Production ML
Data version control for reproducible analysis pipelines in R with {targets}.
"1 config, 1 command from Jupyter Notebook to serve Millions of users", Full-stack On-Premises MLOps system for Computer Vision from Data versioning to Model monitoring and drift detection.
This repo contains sample code and sample notebooks to illustrate how to work with Amazon FinSpace
Python framework for artificial text detection: NLP approaches to compare natural text against generated by neural networks.
Document versioning library for MongoDB using the mongoose package.
A demonstration of how DVC and MLFlow can be used in the task of data relabeling
A CKAN extension for data versioning.
Deprecated. See https://github.com/datopian/ckanext-versions. ⏰ CKAN extension providing data versioning (metadata and files) based on git and github.
Python Data as Code core implementation
The provided demo project demonstrates the practical implementation and advantages of using DVC. It showcases how DVC simplifies data versioning and model versioning while working in tandem with Git to create a cohesive version control system tailored for data science projects.
Newron is a data-centric ML platform to easily build, manage, deploy and continuously improve models through data driven development.
Deploying a Machine Learning Model on Heroku with FastAPI using CI/CD tools as GitHub Actions and Heroku Automatic Deployment.
Verta ai ModelDB on AWS Cloud with integration into Amazon SageMaker for ML training data versioning and experiment tracking
Repository for evaluating the different approaches to data versioning
Learning data and model versioning with ClearML while cleaning and modeling happiness by country with a Kaggle dataset
Obtain data versioning tag using ML models
Advanced Machine Learning Regression: Predicting Car Prices
In this course navigates through the LLMOps pipeline, enabling you to preprocess training data for supervised fine-tuning and deploy custom Large Language Models (LLMs).