Anton Schäfer's starred repositories

ml-4m

4M: Massively Multimodal Masked Modeling

Language:PythonLicense:Apache-2.0Stargazers:1494Issues:0Issues:0

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language:PythonLicense:Apache-2.0Stargazers:2774Issues:0Issues:0

axlearn

An Extensible Deep Learning Library

Language:PythonLicense:Apache-2.0Stargazers:1714Issues:0Issues:0

racer

Black-box, gradient-free optimization of car-racing policies.

Language:PythonLicense:MITStargazers:3Issues:0Issues:0

XPretrain

Multi-modality pre-training

Language:PythonLicense:NOASSERTIONStargazers:459Issues:0Issues:0

video2dataset

Easily create large video dataset from video urls

Language:PythonLicense:MITStargazers:521Issues:0Issues:0

dynamic-pooling

Efficient Transformers with Dynamic Token Pooling

Language:PythonStargazers:51Issues:0Issues:0

tokenmonster

Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript

Language:GoLicense:MITStargazers:533Issues:0Issues:0

languini-kitchen

The official Languini Kitchen repository

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:14Issues:0Issues:0

notion-backup

Simple command to backup a Notion workspace

Language:JavaScriptLicense:MITStargazers:406Issues:0Issues:0

notion-up

Use NotionUp (Notion Backup) + CircleCI to backup your notion data nightly.|自动备份 Notion 数据。|Notion データのバックアップを自動化する

Language:PythonStargazers:121Issues:0Issues:0

MEGABYTE-pytorch

Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch

Language:PythonLicense:MITStargazers:607Issues:0Issues:0

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:8959Issues:0Issues:0

mamba

The Fast Cross-Platform Package Manager

Language:C++License:BSD-3-ClauseStargazers:6632Issues:0Issues:0

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language:PythonLicense:Apache-2.0Stargazers:7480Issues:0Issues:0

axolotl

Go ahead and axolotl questions

Language:PythonLicense:Apache-2.0Stargazers:7213Issues:0Issues:0

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language:PythonLicense:MITStargazers:11437Issues:0Issues:0

LOMO

LOMO: LOw-Memory Optimization

Language:PythonLicense:MITStargazers:960Issues:0Issues:0

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonLicense:MITStargazers:6102Issues:0Issues:0

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonLicense:Apache-2.0Stargazers:1940Issues:0Issues:0

MAmmoTH

Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024)

Language:Jupyter NotebookStargazers:307Issues:0Issues:0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29252Issues:0Issues:0

Diffusion-LM

Diffusion-LM

Language:PythonLicense:Apache-2.0Stargazers:1020Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:36081Issues:0Issues:0

open-interpreter

A natural language interface for computers

Language:PythonLicense:AGPL-3.0Stargazers:51395Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:34243Issues:0Issues:0

BIG-bench

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Language:PythonLicense:Apache-2.0Stargazers:2784Issues:0Issues:0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonLicense:Apache-2.0Stargazers:7473Issues:0Issues:0

graph-of-thoughts

Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"

Language:PythonLicense:NOASSERTIONStargazers:2020Issues:0Issues:0

gorilla

Gorilla: An API store for LLMs

Language:PythonLicense:Apache-2.0Stargazers:11017Issues:0Issues:0