stepbystep's repositories
atlas
Apache Atlas
ChatGPT-Prompt-Engineering-for-Developers-in-Chinese
《面向开发者的 ChatGPT 提示词工程》非官方版中英双语字幕 Unofficial subtitles of "ChatGPT Prompt Engineering for Developers"
ColossalAI
Colossal-AI: A Unified Deep Learning System for Big Model Era
cortex
A horizontally scalable, highly available, multi-tenant, long term Prometheus.
dagster
An orchestration platform for the development, production, and observation of data assets.
DataSphereStudio
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
DeepRec-1
DeepRec is a recommendation engine based on TensorFlow.
dlink
Dinky is an out of the box one-stop real-time computing platform dedicated to the construction and practice of Unified Batch & Streaming and Unified Data Lake & Data Warehouse. Based on Apache Flink, Dinky provides the ability to connect many big data frameworks including OLAP and Data Lake.
EasyNLP
EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit
feathub
FeatHub - A stream-batch unified feature store for real-time machine learning
featuretools
An open source python library for automated feature engineering
FiloDB
Distributed Prometheus time series database
flink-remote-shuffle
Remote Shuffle Service for Flink
flink-sql-cookbook
The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are completely self-contained and can be run in Ververica Platform as is.
gobblin
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
Hetu
A high-performance distributed deep learning system targeting large-scale and automated distributed training.
ignite
Apache Ignite
kairosdb
Fast scalable time series database
machine-learning-engineering-for-production-public
Public repo for DeepLearning.AI MLEP Specialization
metrictank
metrics2.0 based, multi-tenant timeseries store for Graphite and friends.
ML-Papers-Explained
Explanation to key concepts in ML
multi-cluster-app-dispatcher
Holistic job manager on Kubernetes
oneflow
OneFlow is a performance-centered and open-source deep learning framework.
pytorch-tutorial
PyTorch Tutorial for Deep Learning Researchers
recommenders-addons
Additional utils and helpers to extend TensorFlow when build recommendation systems, contributed and maintained by SIG Recommenders.
seata
:fire: Seata is an easy-to-use, high-performance, open source distributed transaction solution.
submarine
Submarine is Cloud Native Machine Learning Platform.
torchrec
Pytorch domain library for recommendation systems
Transformers4Rec
Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation, available for both PyTorch and Tensorflow.
TurboTransformers
a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.