cj (chaojun-zhang)

chaojun-zhang

Geek Repo

Location: Shanghai

cj's repositories

JavaFamily

Notes from everyday learning and practical technical write-ups

arrow

Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

arrow2

Transmute-free Rust library to work with the Arrow format

Language: Rust | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

statd

A simple, lightweight data statistics API service component built on top of Apache Calcite

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

trino

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

CPlusPlusThings

All things C++

Language: C++ | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Shell | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 1 | Issues: 0

devika

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

dpkb

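A collection of big data material, covering distributed storage engines, distributed compute engines, data warehouse construction, and more. Keywords: Hadoop, HBase, ES, Kudu, Hive, Presto, Spark, Flink, Kylin, ClickHouse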

Stargazers: 0 | Issues: 0 | Issues: 0

duckdb

DuckDB is an in-process SQL OLAP Database Management System

Language: C++ | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

e2eAIOK

Intel® End-to-End AI Optimization Kit

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Scala | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

God-Of-BigData

Focused on big data study and interview preparation; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...

Stargazers: 0 | Issues: 0 | Issues: 0

Jlama

Jlama is a modern Java inference engine for LLMs

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

jvector

JVector: the most advanced embedded vector search engine

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

llm-examples

Streamlit LLM app examples for getting started

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

mlx-examples

Examples in the MLX framework

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

presto

The official home of the Presto distributed SQL query engine for big data

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pytorch-llama

LLaMA 2 implemented from scratch in PyTorch

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

pytorch_geometric

Graph Neural Network Library for PyTorch

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

substrait

A cross-platform way to express data transformation, relational algebra, standardized record expression, and plans.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Java | Stargazers: 0 | Issues: 1 | Issues: 0

velox

A C++ vectorized database acceleration library aimed at optimizing query engines and data processing systems.

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0