Alex Wu's repositories
AgentVerse
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
Anima
第一个开源的基于QLoRA的33B中文大语言模型First QLoRA based open source 33B Chinese LLM
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
camel
🐫 CAMEL: Communicative Agents for “Mind” Exploration of Large Language Model Society (NeruIPS'2023) https://www.camel-ai.org
cudf
cuDF - GPU DataFrame Library
cuml
cuML - RAPIDS Machine Learning Library
DeepKE
An Open Toolkit for Knowledge Graph Extraction and Construction published at EMNLP2022 System Demonstrations.
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
duckduckgo_search
Search for words, documents, images, videos, news, maps and text translation using the DuckDuckGo.com search engine. Downloading files and images to a local hard drive.
flash-attention
Fast and memory-efficient exact attention
Fuser
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
HeyGenClone
A simple and open-source analogue of the HeyGen system
lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
mathpix-markdown-it
Markdown rendering + Latex extras (equations, tables, ...), with conversion features, for the scientific community
Megatron-LM
Ongoing research training transformer models at scale
nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
NeMo-text-processing
NeMo text processing for ASR and TTS
raft
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.
rmm
RAPIDS Memory Manager
safetensors
Simple, safe way to store and distribute tensors
serverless-application-model
The AWS Serverless Application Model (AWS SAM) transform is a AWS CloudFormation macro that transforms SAM templates into CloudFormation templates.
terminal
The new Windows Terminal and the original Windows console host, all in the same place!
text-generation-inference
Large Language Model Text Generation Inference
ToolBench
An open platform for training, serving, and evaluating large language model for tool learning.
triton
Development repository for the Triton language and compiler
VisRTX
NVIDIA RTX based implementation of ANARI
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
yarn
YaRN: Efficient Context Window Extension of Large Language Models