wm901115nwpu's starred repositories

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language: Python | License: Apache-2.0 | Stargazers: 31201 | Issues: 473 | Issues: 17415

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language: Python | License: Apache-2.0 | Stargazers: 28045 | Issues: 323 | Issues: 5132
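
The "composable transformations" idea can be illustrated with a toy pure-Python analogue. This is a conceptual sketch, not JAX's API or implementation: real JAX derives exact gradients by tracing programs, while this toy `grad` uses finite differences; the point is only that transformations are functions on functions and therefore compose.

```python
# Toy analogues of composable function transformations.
# NOT JAX: real JAX traces programs and differentiates exactly;
# this sketch approximates derivatives with central differences.

def grad(f, h=1e-5):
    """Transform f into a function returning its (approximate) derivative."""
    return lambda x: (f(x + h) - f(x - h)) / (2 * h)

def vmap(f):
    """Transform f into a function mapped over a list of inputs."""
    return lambda xs: [f(x) for x in xs]

square = lambda x: x * x

dsquare = grad(square)            # x -> ~2x
ddsquare = grad(grad(square))     # transformations compose: x -> ~2
batched = vmap(grad(square))      # and mix: [x1, x2] -> [~2*x1, ~2*x2]
```

In JAX itself the same composition reads `jax.vmap(jax.grad(f))`, with `jax.jit` wrappable around either.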

generative-models

Generative Models by Stability AI

Language: Python | License: MIT | Stargazers: 22375 | Issues: 236 | Issues: 259

mlx

MLX: An array framework for Apple silicon

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's text-to-video model); we hope the open-source community will contribute to it.

Language: Python | License: MIT | Stargazers: 10205 | Issues: 152 | Issues: 149

yolov9

Implementation of the paper "YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information"

Language: Python | License: GPL-3.0 | Stargazers: 7959 | Issues: 53 | Issues: 360

text-generation-inference

Large Language Model Text Generation Inference

Language: Python | License: Apache-2.0 | Stargazers: 7937 | Issues: 99 | Issues: 1050

gemma.cpp

A lightweight, standalone C++ inference engine for Google's Gemma models.

Language: C++ | License: Apache-2.0 | Stargazers: 5529 | Issues: 38 | Issues: 65

trlx

A repo for distributed training of language models with Reinforcement Learning from Human Feedback (RLHF)

Language: Python | License: MIT | Stargazers: 4331 | Issues: 49 | Issues: 282

DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Language: Python | License: MIT | Stargazers: 4114 | Issues: 41 | Issues: 98

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python | License: Apache-2.0 | Stargazers: 3842 | Issues: 110 | Issues: 111

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language: Python | License: NOASSERTION | Stargazers: 2516 | Issues: 36 | Issues: 125

Qwen-Agent

Agent framework and applications built upon Qwen1.5, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Language: Python | License: NOASSERTION | Stargazers: 1689 | Issues: 27 | Issues: 135

GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Language: Python | License: Apache-2.0 | Stargazers: 1126 | Issues: 19 | Issues: 34

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Language: Python | License: Apache-2.0 | Stargazers: 1006 | Issues: 20 | Issues: 51

LlamaEdge

The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge

Language: Rust | License: Apache-2.0 | Stargazers: 645 | Issues: 16 | Issues: 79

tensorrtllm_backend

The Triton TensorRT-LLM Backend

Language: Python | License: Apache-2.0 | Stargazers: 485 | Issues: 24 | Issues: 353

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Language: Python | License: MIT | Stargazers: 457 | Issues: 14 | Issues: 1
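
The O(n) trick behind linear attention, replacing softmax(QKᵀ)V with a running sum of feature-mapped key/value outer products, can be sketched in plain Python. This is a minimal causal example with a simple positive feature map; the repo's value is fused Triton kernels for this recurrence, which this sketch does not attempt.

```python
import math

def phi(x):
    """A simple positive feature map (elu(x) + 1); any positive map works here."""
    return [math.exp(v) if v < 0 else v + 1.0 for v in x]

def linear_attention(qs, ks, vs):
    """Causal linear attention via the O(n) recurrence.

    Maintains S = sum_j phi(k_j) v_j^T and z = sum_j phi(k_j), so each
    output is o_i = (phi(q_i)^T S_i) / (phi(q_i)^T z_i); no n x n
    attention matrix is ever materialized.
    """
    d, dv = len(ks[0]), len(vs[0])
    S = [[0.0] * dv for _ in range(d)]   # running sum of outer(phi(k), v)
    z = [0.0] * d                        # running sum of phi(k)
    outs = []
    for q, k, v in zip(qs, ks, vs):
        fq, fk = phi(q), phi(k)
        for a in range(d):
            z[a] += fk[a]
            for b in range(dv):
                S[a][b] += fk[a] * v[b]
        denom = sum(fq[a] * z[a] for a in range(d))
        outs.append([sum(fq[a] * S[a][b] for a in range(d)) / denom
                     for b in range(dv)])
    return outs
```

The recurrence is exactly equivalent to the quadratic form sum_{j<=i} (phi(q_i)·phi(k_j)) v_j normalized by the same weights, which is easy to check on small inputs.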

SoraReview

The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".

ring-flash-attention

Ring attention implementation with flash attention
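
Both ring and flash attention rest on the same online-softmax merge: attention over one block of keys/values can be combined with later blocks using only a running max, denominator, and weighted accumulator. A minimal single-query, scalar-value sketch in plain Python (the distributed ring itself, which circulates these partial states between devices, is not shown):

```python
import math

def attend_blocks(score_blocks, value_blocks):
    """Numerically stable blockwise softmax attention for one query.

    Processes (scores, values) one block at a time, keeping only the
    running max m, denominator l, and weighted accumulator acc; this
    triple is the partial state that ring attention passes around a
    ring of devices instead of materializing all scores at once.
    """
    m, l, acc = float("-inf"), 0.0, 0.0
    for scores, values in zip(score_blocks, value_blocks):
        m_new = max(m, max(scores))
        scale = math.exp(m - m_new)          # rescale old partial sums
        l = l * scale + sum(math.exp(s - m_new) for s in scores)
        acc = acc * scale + sum(math.exp(s - m_new) * v
                                for s, v in zip(scores, values))
        m = m_new
    return acc / l
```

However the scores are split into blocks, the result equals the full softmax-weighted average, which is what makes the blockwise (and hence distributed) computation exact rather than approximate.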

LLM_MultiAgents_Survey_Papers

Large Language Model based Multi-Agents: A Survey of Progress and Challenges

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 160 | Issues: 3 | Issues: 4

self-speculative-decoding

Code associated with the paper "Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding"

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 96 | Issues: 4 | Issues: 14
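
The draft-then-verify loop at the heart of (self-)speculative decoding can be sketched with toy deterministic models. The `target_model` and `draft_model` callables here are hypothetical stand-ins (in the paper, the draft is the same model with layers skipped), and this greedy version accepts on exact match, whereas sampling variants accept or reject probabilistically:

```python
def speculative_decode(target_model, draft_model, prompt, n_draft=4, n_tokens=8):
    """Greedy speculative decoding sketch.

    The cheap draft model proposes n_draft tokens; the target model
    checks what it would emit at each drafted position (conceptually in
    one batched pass); the longest agreeing prefix is accepted plus one
    corrected token, so every verify step yields at least one token and
    the output is identical to plain greedy decoding with the target.
    """
    seq = list(prompt)
    while len(seq) - len(prompt) < n_tokens:
        # 1. draft proposes a short continuation
        draft = []
        for _ in range(n_draft):
            draft.append(draft_model(seq + draft))
        # 2. target verifies the drafted positions
        accepted = 0
        for i in range(n_draft):
            t = target_model(seq + draft[:i])
            if t == draft[i]:
                accepted += 1
            else:
                seq.extend(draft[:accepted] + [t])  # keep prefix + correction
                break
        else:
            seq.extend(draft)                       # every drafted token accepted
    return seq[len(prompt):][:n_tokens]
```

The speedup comes from step 2 being one parallel pass over n_draft positions instead of n_draft sequential target calls; the better the draft agrees with the target, the more tokens each pass yields.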

fp6_llm

Efficient GPU support for LLM inference with 6-bit quantization (FP6).

Language: Cuda | License: Apache-2.0 | Stargazers: 80 | Issues: 0 | Issues: 0

flash-linear-rnn

Implementations of various linear RNN layers using PyTorch and Triton

QLLM

[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models"

Language: Python | License: Apache-2.0 | Stargazers: 21 | Issues: 7 | Issues: 0
Language: Python | License: BSD-3-Clause | Stargazers: 18 | Issues: 1 | Issues: 0