zhihanz's starred repositories

bend-ingest-kafka

Ingest Kafka data into Databend

Language: Go | License: Apache-2.0 | Stargazers: 3 | Issues: 0

titan

Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool for the Snowflake data warehouse.

Language: Python | License: Apache-2.0 | Stargazers: 390 | Issues: 0

amazon-bedrock-serverless-prompt-chaining

Build complex, serverless, and highly scalable generative AI applications with prompt chaining.

Language: Python | License: MIT-0 | Stargazers: 177 | Issues: 0

cron

A cron library for Go, updated to support removable jobs

Language: Go | License: MIT | Stargazers: 1 | Issues: 0

zigrocks

Writing a SQL database, take two: Zig and RocksDB

Language: Zig | Stargazers: 137 | Issues: 0

Pandora

Pandora: Towards General World Model with Natural Language Actions and Video States

Language: Python | Stargazers: 466 | Issues: 0

unsloth

Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | License: Apache-2.0 | Stargazers: 16272 | Issues: 0

sqlframe

Turning PySpark Into a Universal DataFrame API

Language: Python | License: MIT | Stargazers: 289 | Issues: 0

CaaS-LSM

[SIGMOD '24] CaaS-LSM: Compaction-as-a-Service for LSM-based Key-Value Stores in Storage Disaggregated Infrastructure

Language: C++ | License: GPL-2.0 | Stargazers: 58 | Issues: 0

quary

Open-source BI for engineers

Language: Rust | License: Apache-2.0 | Stargazers: 2159 | Issues: 0

pyodide

Pyodide is a Python distribution for the browser and Node.js based on WebAssembly

Language: Python | License: MPL-2.0 | Stargazers: 12063 | Issues: 0

river

Fast and reliable background jobs in Go

Language: Go | License: MPL-2.0 | Stargazers: 3419 | Issues: 0

ratchet

A cross-platform browser ML framework.

Language: Rust | License: MIT | Stargazers: 580 | Issues: 0

gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Language: Python | License: BSD-3-Clause | Stargazers: 5561 | Issues: 0

nimble

New file format for storage of large columnar datasets.

Language: C++ | License: Apache-2.0 | Stargazers: 428 | Issues: 0

twenty

Building a modern alternative to Salesforce, powered by the community.

Language: TypeScript | License: AGPL-3.0 | Stargazers: 15994 | Issues: 0

paxml

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry leading model flop utilization rates.

Language: Python | License: Apache-2.0 | Stargazers: 451 | Issues: 0

recurrentgemma

Open weights language model from Google DeepMind, based on Griffin.

Language: Python | License: Apache-2.0 | Stargazers: 597 | Issues: 0

jaffle-shop

🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.

Stargazers: 93 | Issues: 0

generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language: Jupyter Notebook | License: MIT | Stargazers: 62255 | Issues: 0

opentelemetry-go-contrib

Collection of extensions for OpenTelemetry-Go.

Language: Go | License: Apache-2.0 | Stargazers: 1151 | Issues: 0

peerdb

A fast, simple, and cost-effective tool to replicate data from Postgres to data warehouses, queues, and storage

Language: Go | License: NOASSERTION | Stargazers: 2168 | Issues: 0

arroyo

Distributed stream processing engine in Rust

Language: Rust | License: Apache-2.0 | Stargazers: 3668 | Issues: 0

crabml

a fast cross platform AI inference engine 🤖 using Rust 🦀 and WebGPU 🎮

Language: Rust | License: Apache-2.0 | Stargazers: 395 | Issues: 0

Language: C++ | License: NOASSERTION | Stargazers: 644 | Issues: 0

databend-docs

Official repository for Databend documentation

Language: JavaScript | License: Apache-2.0 | Stargazers: 12 | Issues: 0

openhouse

Open Control Plane for Tables in Data Lakehouse

Language: Java | License: BSD-2-Clause | Stargazers: 294 | Issues: 0

JetStream

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

Language: Python | License: Apache-2.0 | Stargazers: 198 | Issues: 0

pingora

A library for building fast, reliable and evolvable network services.

Language: Rust | License: Apache-2.0 | Stargazers: 21398 | Issues: 0

matrixcalc

MIT IAP short course: Matrix Calculus for Machine Learning and Beyond

Language: Jupyter Notebook | Stargazers: 297 | Issues: 0