ipconfigme

Company: Baidu

ipconfigme's starred repositories

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language: TypeScript | License: NOASSERTION | Stargazers: 46509 | Issues: 348 | Issues: 3917

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 27405 | Issues: 223 | Issues: 4582

surrealdb

A scalable, distributed, collaborative, document-graph database, for the realtime web

Language: Rust | License: NOASSERTION | Stargazers: 26992 | Issues: 160 | Issues: 1684

Chat2DB

🔥🔥🔥 AI-driven database tool and SQL client. The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, SQLite, H2, ClickHouse, and more.

Language: Java | License: Apache-2.0 | Stargazers: 14648 | Issues: 103 | Issues: 995

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

quickwit

Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo.

Language: Rust | License: NOASSERTION | Stargazers: 7944 | Issues: 63 | Issues: 2289

LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Language: Python | License: GPL-3.0 | Stargazers: 5701 | Issues: 78 | Issues: 142

lance

Modern columnar data format for ML and LLMs implemented in Rust. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, and PyArrow, with more integrations coming.

Language: Rust | License: Apache-2.0 | Stargazers: 3809 | Issues: 42 | Issues: 969

fastllm

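A pure C++ cross-platform LLM acceleration library, callable from Python. ChatGLM-6B-class models can reach 10000+ tokens/s on a single card; supports GLM, LLaMA, and MOSS base models; runs smoothly on mobile devices.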

Language: C++ | License: Apache-2.0 | Stargazers: 3290 | Issues: 41 | Issues: 357

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language: Python | License: NOASSERTION | Stargazers: 2692 | Issues: 36 | Issues: 134

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language: Python | License: Apache-2.0 | Stargazers: 2311 | Issues: 22 | Issues: 178

DeepSpeed-MII

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Language: Python | License: Apache-2.0 | Stargazers: 1847 | Issues: 41 | Issues: 297

CUDA-Learn-Notes

🎉 Modern CUDA Learn Notes with PyTorch: fp32, fp16, bf16, fp8/int8, flash_attn, sgemm, sgemv, warp/block reduce, dot, elementwise, softmax, layernorm, rmsnorm.

Language: Cuda | License: GPL-3.0 | Stargazers: 1181 | Issues: 12 | Issues: 5

LookaheadDecoding

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Language: Python | License: Apache-2.0 | Stargazers: 1107 | Issues: 11 | Issues: 55

streaming

A Data Streaming Library for Efficient Neural Network Training

Language: Python | License: Apache-2.0 | Stargazers: 1081 | Issues: 21 | Issues: 166

DiskANN

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

Language: C++ | License: NOASSERTION | Stargazers: 1068 | Issues: 26 | Issues: 195

data-centric-AI

A curated, but incomplete, list of data-centric AI resources.

efficient-dl-systems

Efficient Deep Learning Systems course materials (HSE, YSDA)

Language: Jupyter Notebook | License: MIT | Stargazers: 648 | Issues: 14 | Issues: 4

MatrixSlow

A simple deep learning framework in pure Python, written for the purpose of learning DL

puck

Puck is a high-performance ANN search engine

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 331 | Issues: 13 | Issues: 13

DistServe

Disaggregated serving system for Large Language Models (LLMs).

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 286 | Issues: 4 | Issues: 37

BinaryVectorDB

Efficient vector database for hundreds of millions of embeddings.

Language: Python | License: Apache-2.0 | Stargazers: 197 | Issues: 9 | Issues: 1

bitalosdb

Bitalosdb is a high-performance KV storage engine.

Language: Go | License: Apache-2.0 | Stargazers: 160 | Issues: 16 | Issues: 0

erasure-coding-durability

Python code to calculate the durability of data stored with erasure coding, such as Reed-Solomon.

Language: Python | License: NOASSERTION | Stargazers: 42 | Issues: 13 | Issues: 2

accelerator-solution-zoo

Intel accelerator solution zoo: a collection of solutions, such as the Intel® Vector Data Streaming Library, built on the hardware accelerators (DSA, IAA, DLB, QAT, etc.) of 4th Gen Intel Xeon processors or later.

Language: C | Stargazers: 27 | Issues: 8 | Issues: 0

MINI-TORCH

Mini-PyTorch implemented from scratch in Python

Language: Python | Stargazers: 9 | Issues: 0 | Issues: 0