yuanenming

Enming Yuan's starred repositories

mamba

Mamba SSM architecture

Language:PythonApache-2.01187700

BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Language:PythonMIT583800

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonApache-2.01104400

LLM101n

LLM101n: Let's build a Storyteller

2526000

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language:PythonMIT1210300

evalverse-IFEval

Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/master/instruction_following_eval)

Language:Python900

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language:PythonNOASSERTION8069400

HumanSystemOptimization

健康学习到150岁 - 人体系统调优不完全指南

1291600

LWM

Language:PythonApache-2.0702800

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonApache-2.0179800

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonApache-2.099400

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonNOASSERTION575700

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonMIT879000

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language:PythonApache-2.090200

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonNOASSERTION949000

search_with_lepton

Building a quick conversation-based search demo with Lepton AI.

Language:TypeScriptApache-2.0756500

self-rewarding-lm-pytorch

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI

Language:PythonMIT127300

MOSS-RLHF

Language:PythonApache-2.0123500

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language:PythonApache-2.0246400

NeMo-Aligner

Scalable toolkit for efficient model alignment

Language:PythonApache-2.045200

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonBSD-3-Clause538400

inshellisense

IDE style command line auto complete

Language:TypeScriptMIT824700

ChatGPT-AutoExpert

🚀🧠💬 Supercharged Custom Instructions for ChatGPT (non-coding) and ChatGPT Advanced Data Analysis (coding).

Language:JavaScriptNOASSERTION655100

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonApache-2.0338000