Victor's starred repositories

fiddler

Fast Inference of MoE Models with CPU-GPU Orchestration

Language: Python | License: Apache-2.0 | Stargazers: 125 | Issues: 0

cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Language: Python | License: AGPL-3.0 | Stargazers: 8593 | Issues: 0

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 8006 | Issues: 0

Language: C++ | License: BSL-1.0 | Stargazers: 5 | Issues: 0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 17919 | Issues: 0

zero_nlp

Chinese NLP solutions (large models, data, models, training, inference)

Language: Python | License: MIT | Stargazers: 2448 | Issues: 0

open-parse

Improved file parsing for LLMs

Language: Python | License: MIT | Stargazers: 1551 | Issues: 0

CUDA-Learn-Notes

🎉 CUDA notes / hand-written CUDA kernels for large models / C++ notes, updated occasionally: flash_attn, sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.

Language: Cuda | License: GPL-3.0 | Stargazers: 489 | Issues: 0

BERT-NER-Pytorch

Chinese NER (Named Entity Recognition) using BERT (Softmax, CRF, Span)

Language: Python | License: MIT | Stargazers: 1986 | Issues: 0

git-cliff

A highly customizable Changelog Generator that follows Conventional Commit specifications ⛰️

Language: Rust | License: Apache-2.0 | Stargazers: 7532 | Issues: 0

makeMoE

From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)

Language: Jupyter Notebook | License: MIT | Stargazers: 495 | Issues: 0

supersonic

SuperSonic is the next-generation BI platform that integrates Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.

Language: Java | License: NOASSERTION | Stargazers: 395 | Issues: 0

surya

OCR, layout analysis, and line detection in 90+ languages

Language: Python | License: GPL-3.0 | Stargazers: 5565 | Issues: 0

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.29% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run.

Language: Python | License: MIT | Stargazers: 9645 | Issues: 0

Language: Python | License: MIT | Stargazers: 1 | Issues: 0

iresearch

IResearch is a cross-platform, high-performance search analytics library written entirely in C++, with a focus on pluggability of different ranking/similarity models

Language: C++ | License: NOASSERTION | Stargazers: 180 | Issues: 0

RAG-Retrieval

Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, Cross Encoder

Language: Python | License: MIT | Stargazers: 99 | Issues: 0

deepdoctection

A Repo For Document AI

Language: Python | License: Apache-2.0 | Stargazers: 2157 | Issues: 0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | Stargazers: 5157 | Issues: 0

one-transformer

A tutorial for training a PyTorch transformer from scratch

Language: Python | License: MIT | Stargazers: 16 | Issues: 0

CPlusPlusThings

Things about C++

Language: C++ | Stargazers: 37113 | Issues: 0

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language: Python | License: Apache-2.0 | Stargazers: 16665 | Issues: 0

stella

text embedding

Language: Python | License: Apache-2.0 | Stargazers: 104 | Issues: 0

ChainForge

An open-source visual programming environment for battle-testing prompts to LLMs.

Language: TypeScript | License: MIT | Stargazers: 1946 | Issues: 0

llm-books

Practical notes on building applications with LLMs

Language: Python | Stargazers: 490 | Issues: 0

Linly

Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training and instruction fine-tuning datasets

Language: Python | Stargazers: 2976 | Issues: 0

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Language: Python | License: BSD-3-Clause | Stargazers: 25959 | Issues: 0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 32524 | Issues: 0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | Stargazers: 14995 | Issues: 0

quantum

Powerful multi-threaded coroutine dispatcher and parallel execution engine

Language: C++ | License: Apache-2.0 | Stargazers: 565 | Issues: 0