James (jamesliu)

Company: Bay Jarvis

Location: Palo Alto, CA

Home Page: jamesliuai.com

James's repositories

nanoPPO

An efficient implementation of the Proximal Policy Optimization (PPO) algorithm with linear and attention policies for reinforcement learning.

Language: Python · License: Apache-2.0 · Stargazers: 6 · Issues: 3 · Issues: 0
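
For orientation, here is a minimal sketch of the clipped PPO surrogate objective that an implementation like this optimizes (illustrative PyTorch only; the function and tensor names are assumptions, not nanoPPO's API):

```python
import torch

def ppo_clip_loss(new_logp, old_logp, advantages, clip_eps=0.2):
    """Clipped PPO surrogate loss (to be minimized).

    new_logp:   log pi_theta(a|s) under the current policy
    old_logp:   log pi_theta_old(a|s) recorded when the rollout was collected
    advantages: advantage estimates (e.g. from GAE), detached from the graph
    """
    ratio = torch.exp(new_logp - old_logp)                      # importance ratio
    unclipped = ratio * advantages
    clipped = torch.clamp(ratio, 1 - clip_eps, 1 + clip_eps) * advantages
    # Maximize the pessimistic (min) surrogate, i.e. minimize its negation.
    return -torch.min(unclipped, clipped).mean()
```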

nanoDPO

A nimble and innovative implementation of the Direct Preference Optimization (DPO) algorithm with Causal Transformer and LSTM models, inspired by the DPO paper on fine-tuning unsupervised language models from preference data.

Language: Python · License: Apache-2.0 · Stargazers: 5 · Issues: 3 · Issues: 0
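
For reference, a minimal sketch of the DPO loss described in the paper the repository cites (illustrative PyTorch only; the argument names are assumptions, not nanoDPO's API):

```python
import torch.nn.functional as F

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Direct Preference Optimization loss.

    Each argument is the summed log-probability of a full response (the
    preferred "chosen" one or the "rejected" one) under either the policy
    being trained or the frozen reference model.
    """
    policy_margin = policy_chosen_logp - policy_rejected_logp
    ref_margin = ref_chosen_logp - ref_rejected_logp
    # -log sigmoid(beta * (policy log-ratio minus reference log-ratio))
    return -F.logsigmoid(beta * (policy_margin - ref_margin)).mean()
```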

nanoTransformer

A PyTorch-based library featuring an efficiently implemented Transformer model. The core of the attention mechanism is powered by torch.einsum, ensuring clean, readable, and highly optimized tensor operations.

Language: Python · Stargazers: 2 · Issues: 2 · Issues: 0
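
A minimal example of einsum-based scaled dot-product attention in the spirit of that description (illustrative only; shapes and names are assumptions, not nanoTransformer's API):

```python
import math
import torch

def einsum_attention(q, k, v):
    """Scaled dot-product attention expressed with torch.einsum.

    q, k, v: tensors of shape (batch, heads, seq, head_dim)
    returns: tensor of shape (batch, heads, seq, head_dim)
    """
    scores = torch.einsum("bhqd,bhkd->bhqk", q, k) / math.sqrt(q.size(-1))
    weights = scores.softmax(dim=-1)                    # attention weights over keys
    return torch.einsum("bhqk,bhkd->bhqd", weights, v)  # weighted sum of values
```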

nChain

A flexible and efficient implementation for creating LLM bots over extensible datasets.

Language: Python · Stargazers: 2 · Issues: 0 · Issues: 0

amago

A simple and scalable agent for training adaptive policies with sequence-based RL.

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

anthropic-cookbook

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

autogen

Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ

Language: Jupyter Notebook · License: CC-BY-4.0 · Stargazers: 0 · Issues: 0 · Issues: 0

devika

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0

litgpt

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0
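
As a pointer to the LoRA fine-tuning mentioned above, here is a minimal low-rank adapter layer in PyTorch (a sketch of the general technique, not litgpt's implementation):

```python
import torch.nn as nn

class LoRALinear(nn.Module):
    """A frozen linear layer plus a trainable low-rank update (LoRA).

    Only the small matrices A (in -> r) and B (r -> out) are trained;
    the pretrained base weight stays frozen.
    """
    def __init__(self, in_features, out_features, r=8, alpha=16):
        super().__init__()
        self.base = nn.Linear(in_features, out_features)   # would hold pretrained weights
        for p in self.base.parameters():
            p.requires_grad_(False)                         # freeze the base layer
        self.lora_a = nn.Linear(in_features, r, bias=False)
        self.lora_b = nn.Linear(r, out_features, bias=False)
        nn.init.zeros_(self.lora_b.weight)                  # start as a zero (no-op) update
        self.scaling = alpha / r

    def forward(self, x):
        return self.base(x) + self.scaling * self.lora_b(self.lora_a(x))
```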

LLaMA-Factory

Unify Efficient Fine-tuning of 100+ LLMs

License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0

llm-foundry

LLM training code for MosaicML foundation models

License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0

llm.c

LLM training in simple, raw C/CUDA

Stargazers: 0 · Issues: 0 · Issues: 0

LLMLingua

To speed up LLM inference and enhance the model's perception of key information, LLMLingua compresses the prompt and KV-cache, achieving up to 20x compression with minimal performance loss.

License: MIT · Stargazers: 0 · Issues: 0 · Issues: 0
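
A hedged usage sketch, assuming the PromptCompressor interface described in the project's README (class, method, and parameter names may differ across versions):

```python
from llmlingua import PromptCompressor  # assumed import path per the README

# Illustrative placeholders.
instruction = "Answer the question using the context."
question = "What does LLMLingua do to the prompt?"
long_context = "(a long retrieved document would go here)"

compressor = PromptCompressor()  # loads the language model used to score token importance
result = compressor.compress_prompt(
    long_context,
    instruction=instruction,
    question=question,
    target_token=300,            # rough token budget for the compressed prompt
)
print(result["compressed_prompt"])
```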

long-range-arena

Long Range Arena for Benchmarking Efficient Transformers

License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0

magicoder

Magicoder: Source Code Is All You Need

Language: Python · License: MIT · Stargazers: 0 · Issues: 1 · Issues: 0

OpenMoE

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Stargazers: 0 · Issues: 0 · Issues: 0

routerbench

The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System

Stargazers: 0 · Issues: 0 · Issues: 0

ScaleLLM

A high-performance inference system for large language models, designed for production environments.

License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0

Time-LLM

[ICLR 2024] Official implementation of "Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"

License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

License: Apache-2.0 · Stargazers: 0 · Issues: 0 · Issues: 0

Transformers_Are_What_You_Dont_Need

The best repository showing why transformers don’t work in time series forecasting, and showcasing the best SOTA non-transformer models.

Stargazers: 0 · Issues: 0 · Issues: 0