Sato's repositories
mediapipe-edge
Cross-platform, customizable ML solutions for live and streaming media.
path-to-senior-engineer-handbook
All the resources you need to get to Senior Engineer and beyond
ambari
Mirror of Apache Ambari
atrocore-MDM
AtroCore is an open-source Data Platform, Data Management and Master Data Management (MDM) software, which can be used to quickly create any business application.
Awesome-Knowledge-Graph-Reasoning
AKGR: Awesome Knowledge Graph Reasoning is a collection of knowledge graph reasoning works, including papers, codes and datasets
awesome-notebooks
Ready to use data science templates, organized by tools to jumpstart your projects in minutes. 😎 published by the Naas community.
BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
best-of-ml-python
A ranked list of awesome machine learning Python libraries. Updated weekly.
causalml
Uplift modeling and causal inference with machine learning algorithms
confluent-examples
Apache Kafka and Confluent Platform examples and demos
cruise-control
Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides great value to Kafka users by simplifying the operation of Kafka clusters.
cs-video-courses
List of Computer Science courses with video lectures.
debezium-examples
Examples for running Debezium (Configuration, Docker Compose files etc.)
dolphinscheduler
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
dremio-cloud-tools
Dremio Container Tools
druid
Apache Druid: a high performance real-time analytics database.
FATE
An Industrial Grade Federated Learning Framework
h2o-3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
ignite
Apache Ignite
kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
models
Models and examples built with TensorFlow
neural_prophet
NeuralProphet: A simple forecasting package
OpenMLDB
OpenMLDB is an open-source machine learning database that provides a feature platform computing consistent features for training and inference.
seatunnel
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
seldon-core-MLOps
An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models
tidb
TiDB is an open-source, cloud-native, distributed, MySQL-Compatible database for elastic scale and real-time analytics. Try free: https://tidbcloud.com/signup
tsai
Time series Timeseries Deep Learning Machine Learning Pytorch fastai | State-of-the-art Deep Learning library for Time Series and Sequences in Pytorch / fastai
ultralytics
YOLOv8 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
vectordb-recipes
High quality resources & applications for LLMs, multi-modal models and VectorDBs