CarperAI

CarperAI

Geek Repo

FOSS RLHF

Home Page:https://carper.ai

Twitter:@carperai

Github PK Tool:Github PK Tool

CarperAI's repositories

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonLicense:MITStargazers:4383Issues:49Issues:284

OpenELM

Evolution Through Large Models

Language:PythonLicense:MITStargazers:664Issues:25Issues:11

cheese

Used for adaptive human in the loop evaluation of language and embedding models.

Language:PythonLicense:MITStargazers:294Issues:10Issues:16

DRLX

Diffusion Reinforcement Learning Library

Language:PythonLicense:MITStargazers:166Issues:10Issues:12

Code-Pile

This repository contains all the code for collecting large scale amounts of code from GitHub.

Language:PythonLicense:MITStargazers:103Issues:8Issues:27

autocrit

A repository for transformer critique learning and generation

InstructGPT

For experiments involving instruct gpt. Currently used for documenting open research questions.

squeakily

A library for squeakily cleaning and filtering language datasets.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:45Issues:3Issues:4

decontamination

This repository contains code for cleaning your training data of benchmark data to help combat data snooping.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:25Issues:3Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:22Issues:4Issues:0

nmmo-environment

Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research

Language:PythonLicense:MITStargazers:15Issues:3Issues:0

CodeReviewSE

Stuff related to scraping the Code Review StackExchange

Language:PythonStargazers:11Issues:6Issues:0
Language:PythonLicense:MITStargazers:11Issues:2Issues:2

magicarp-v2

magiCARP is an API used for crossencoder training.

Language:PythonLicense:MITStargazers:8Issues:3Issues:1

Polygraph

RLHF Mechanistic Interpretability and Deception

License:MITStargazers:6Issues:4Issues:0

nmmo-baselines

Baselines for Neural MMO -- new users should treat this repo as a starter project

Language:PythonLicense:MITStargazers:5Issues:1Issues:0

FastChat

An open platform for training, serving, and evaluating large language model based chatbots.

Language:PythonLicense:Apache-2.0Stargazers:4Issues:1Issues:0

data-preparation

Code used for sourcing and cleaning the BigScience ROOTS corpus

Language:ELicense:Apache-2.0Stargazers:3Issues:1Issues:0

contriever

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Language:PythonLicense:NOASSERTIONStargazers:2Issues:2Issues:0
Language:Jupyter NotebookStargazers:2Issues:1Issues:0
Language:PythonStargazers:2Issues:1Issues:0

goosebox

sandboxed eval server for running code snippets

License:MITStargazers:1Issues:1Issues:0

maxtext

A simple, performant and scalable Jax LLM!

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:1
Language:PythonStargazers:0Issues:2Issues:0