shelbin yang's repositories
imageSearch
Reverse image search (search for images using an image)
flinksql-datahub-connector
flinksql 1.13-datahub-connector
ChatSelfData
Build a private knowledge base with OpenAI
cube-studio
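Cloud-native one-stop machine learning platform: multi-tenancy, online notebook development, drag-and-drop task pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, inference serving, multi-cluster scheduling, multiple project groups and resource groups, edge computing, and real-time training of large models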
datavines
Know your data better! Datavines is a next-gen data observability platform
dbt-core
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
ddia
Chinese translation of "Designing Data-Intensive Applications" (DDIA)
dolphinscheduler
Apache DolphinScheduler is the modern data orchestration platform. Agile creation of high-performance workflows with low code
fun-rec
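This recommendation algorithm tutorial is aimed at students who have a machine learning background and want to land a recommendation algorithm job. It consists of recommendation algorithm fundamentals, an introductory recommendation competition, a news recommendation project, and recommendation algorithm interview notes, forming a complete loop from fundamentals to hands-on practice to interviews.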
llama_index
LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLMs with external data.
LogiKM
One-stop platform for Apache Kafka cluster metrics monitoring and operations management
mysql2word
Fetches MySQL data and outputs a customized Word document
nebula-docker-compose
Docker compose for Nebula Graph
nebula-operator
Operation utilities for Nebula Graph
open-interpreter
OpenAI's Code Interpreter in your terminal, running locally
PaddleNLP
Easy-to-use and powerful NLP library with an awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications, including Neural Search, Question Answering, Information Extraction, and Sentiment Analysis end-to-end systems.
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)
pai-spark
Implements some of Alibaba PAI's algorithm components with Spark
rudder-server
Privacy and Security focused Segment-alternative, in Golang and React
SparrowRecSys
A Deep Learning Recommender System
Taier
Big data platform: a distributed task scheduling system
xwl_bi
铸龙BI is open-source software for user event analysis, built with Go and Vue