Sato's repositories
avatarify-python
Avatars for Zoom, Skype and other video-conferencing apps.
path-to-senior-engineer-handbook
All the resources you need to get to Senior Engineer and beyond
ambari
Mirror of Apache Ambari
ambari-infra
Apache Ambari Infra is a sub project of Apache Ambari.
kylin
Mirror of Apache Kylin
amb-clemlab
Fork of Apache Ambari maintained by Clemlab Company
ambari-metrics
Apache Ambari Metrics is a sub project of Apache Ambari.
arrow-ballista
Apache Arrow Ballista Distributed Query Engine
awesome-AutoML
Curating a list of AutoML-related research, tools, projects and other resources
awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
Awesome-Knowledge-Graph-Reasoning
AKGR: Awesome Knowledge Graph Reasoning is a collection of knowledge graph reasoning works, including papers, codes and datasets
best-of-ml-python
A ranked list of awesome machine learning Python libraries. Updated weekly.
causalml
Uplift modeling and causal inference with machine learning algorithms
cruise-control
Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides great value to Kafka users by simplifying the operation of Kafka clusters.
debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
debezium-examples
Examples for running Debezium (Configuration, Docker Compose files etc.)
dockerize-amb
Let's run Ambari using docker compose. (feat. ApacheDS)
dolphinscheduler
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
dremio-cloud-tools
Dremio Container Tools
dremio-oss
Dremio - the missing link in modern data
duckdb
DuckDB is an in-process SQL OLAP Database Management System
h2o-3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
linkis
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
ranger
Ranger-AAA
tsai
Time series Timeseries Deep Learning Machine Learning Pytorch fastai | State-of-the-art Deep Learning library for Time Series and Sequences in Pytorch / fastai
vectordb-recipes
High quality resources & applications for LLMs, multi-modal models and VectorDBs