xincheng (xinchengxx)

xinchengxx

Geek Repo

Company:Previous Interned at ByteDance, Tencent, RisingWave Labs, Now In DeepSeek

Location:WuHan

Github PK Tool:Github PK Tool


Organizations
UniqueStudio

xincheng's starred repositories

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonLicense:MITStargazers:6362Issues:0Issues:0

expert_readed_books

2021年最新总结,推荐工程师合适读本,计算机科学,软件技术,创业,**类,数学类,人物传记书籍

Stargazers:6204Issues:0Issues:0

Scrapegraph-ai

Python scraper based on AI

Language:PythonLicense:MITStargazers:13072Issues:0Issues:0

stock

30天掌握量化交易 (持续更新)

Language:PythonLicense:BSD-3-ClauseStargazers:4941Issues:0Issues:0

aimoneyhunter

ai副业赚钱大集合,教你如何利用ai做一些副业项目,赚取更多额外收益。The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English version for more insights.

Stargazers:11971Issues:0Issues:0

lalrpop

LR(1) parser generator for Rust

Language:RustLicense:Apache-2.0Stargazers:2946Issues:0Issues:0

hana

Your standard library for metaprogramming

Language:C++License:BSL-1.0Stargazers:1660Issues:0Issues:0

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:21716Issues:0Issues:0

eventpp

Event Dispatcher and callback list for C++

Language:C++License:NOASSERTIONStargazers:1292Issues:0Issues:0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language:PythonLicense:Apache-2.0Stargazers:11413Issues:0Issues:0

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language:PythonLicense:Apache-2.0Stargazers:3261Issues:0Issues:0

HUST-typst-template

华科毕业论文(本科)的 typst 模板

Language:TypstLicense:MITStargazers:174Issues:0Issues:0

openai-cookbook

Examples and guides for using the OpenAI API

Language:MDXLicense:MITStargazers:57561Issues:0Issues:0

Atom

[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

Language:CudaStargazers:220Issues:0Issues:0

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language:PythonLicense:Apache-2.0Stargazers:2072Issues:0Issues:0

course

高性能并行编程与优化 - 课件

Language:C++License:NOASSERTIONStargazers:3464Issues:0Issues:0

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language:PythonLicense:NOASSERTIONStargazers:8013Issues:0Issues:0

rabbit_trading

🍍 Monorepo of rust based (stock/options) trading bot and its affiliated libs.

Language:RustStargazers:8Issues:0Issues:0

qdrant

Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Language:RustLicense:Apache-2.0Stargazers:18833Issues:0Issues:0

rust-blog

Educational blog posts for Rust beginners

Language:RustLicense:Apache-2.0Stargazers:6876Issues:0Issues:0

derive_more

Some more derive(Trait) options

Language:RustLicense:MITStargazers:1522Issues:0Issues:0

FlexFlow

FlexFlow Serve: Low-Latency, High-Performance LLM Serving

Language:C++License:Apache-2.0Stargazers:1587Issues:0Issues:0

zellij

A terminal workspace with batteries included

Language:RustLicense:MITStargazers:19503Issues:0Issues:0

Rust-CUDA

Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.

Language:RustLicense:Apache-2.0Stargazers:2957Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:61436Issues:0Issues:0

gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

Language:C++License:Apache-2.0Stargazers:5771Issues:0Issues:0

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language:C++License:MITStargazers:7647Issues:0Issues:0

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonLicense:Apache-2.0Stargazers:8363Issues:0Issues:0

monoio

Rust async runtime based on io-uring.

Language:RustLicense:Apache-2.0Stargazers:3752Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:35670Issues:0Issues:0