蒲源's starred repositories
PromptAgent
This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgent is a novel automatic prompt optimization method that autonomously crafts prompts equivalent in quality to those handcrafted by experts, i.e., expert-level prompts.
nano-llama31
nanoGPT-style version of Llama 3.1
DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
lm-human-preferences
Code for the paper "Fine-Tuning Language Models from Human Preferences"
dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
mistral-inference
Official inference library for Mistral models
soft-moe-pytorch
Implementation of Soft MoE, proposed by Brain's Vision team, in PyTorch
LanguageAgentTreeSearch
[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
rl-learned-optimization
Official implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult?"
GenerativeRL
Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).
gemma_pytorch
The official PyTorch implementation of Google's Gemma models