ahuachen's repositories
act-plus-plus
Imitation Learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
akcio
Akcio is a demonstration project for Retrieval Augmented Generation (RAG). It uses an LLM to generate responses and vector databases to fetch relevant documents, improving the quality and relevance of the output.
alldata
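🔥🔥 AllData is a definable data middle platform for big data. It uses a data platform as its foundation, the data middle platform as a bridge, and machine learning and GPT platforms as its framework to provide full-link digital solutions. WeChat group: https://docs.qq.com/doc/DVHlkSEtvVXVCdEFo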
BigDataGuide
Big data learning: learn big data from scratch, with study videos for each learning stage and interview materials.
common-data-model-and-service
Documentation for the Common Data Model and Service
conductor
Conductor is a microservices orchestration engine.
data-diff
Compare tables within or across databases
dataService
dataService is a low-code platform on which API services are developed by writing only SQL. It unifies data services, simplifies their governance, and standardizes metric definitions, improving business development efficiency and speeding up the response to business changes.
DataSphereStudio
DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
dbt-duckdb
dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)
dbt-metabase
Model synchronization from dbt to Metabase
dbt-utils
Utility functions for dbt projects.
dinky
Dinky is an out-of-the-box, one-stop real-time computing platform dedicated to the construction and practice of Unified Streaming & Batch and Unified Data Lake & Data Warehouse. Based on Apache Flink, Dinky can connect to many big data frameworks, including OLAP and Data Lake.
farmOS
farmOS: A web-based farm record keeping application.
God-Of-BigData
Focused on big data learning and interviews; start your journey to big data mastery. Flink/Spark/Hadoop/Hbase/Hive...
iztro
⭐ A lightweight, open-source JavaScript library with multi-language support for generating Purple Star Astrology (Zi Wei Dou Shu) astrolabe information.
LarkMidTable
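LarkMidTable is a one-stop, open-source data middle platform covering infrastructure, data governance, data development, monitoring and alerting, data services, and data visualization. It efficiently empowers data front ends and provides data services.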
llama-recipes
Examples and recipes for the Llama 2 model
meltano
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
mobile-aloha
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
ocap
🛰 Metad® Open Platform for Enterprise Data Analysis, Indicator Management and Reporting
open-source-modern-data-stack
This repo demonstrates a comprehensive modern data stack using popular open-source tools.
QAnything
Question and Answer based on Anything.
rudder-server
Privacy- and security-focused Segment alternative, written in Golang and React
rudder-transformer
Open-source, warehouse-first Customer Data Pipeline and Segment-alternative. Collects and routes clickstream data and builds your customer data lake on your data warehouse.
seatunnel-web
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
snowplow
The enterprise-grade behavioral data engine (web, mobile, server-side, webhooks), running cloud-natively on AWS and GCP
surya
OCR, layout analysis, reading order, line detection in 90+ languages
temporal
Temporal service
TypeChat
TypeChat is a library that makes it easy to build natural language interfaces using types.