dgolchin

dgolchin's starred repositories

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python | License: Apache-2.0 | Stargazers: 36255 | Issues: 348 | Issues: 1752

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language: Python | License: MIT | Stargazers: 30189 | Issues: 281 | Issues: 1252

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 29551 | Issues: 192 | Issues: 4639

axolotl

Go ahead and axolotl questions

Language: Python | License: Apache-2.0 | Stargazers: 7361 | Issues: 48 | Issues: 632

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python | License: Apache-2.0 | Stargazers: 4389 | Issues: 110 | Issues: 133

mergekit

Tools for merging pretrained large language models.

Language: Python | License: LGPL-3.0 | Stargazers: 4383 | Issues: 49 | Issues: 288

whisper-asr-webservice

OpenAI Whisper ASR Webservice API

Language: Python | License: MIT | Stargazers: 1948 | Issues: 28 | Issues: 150

schedule_free

Schedule-Free Optimization in PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 1769 | Issues: 17 | Issues: 26

llm-datasets

High-quality datasets, tools, and concepts for LLM fine-tuning.

self-rewarding-lm-pytorch

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI

Language: Python | License: MIT | Stargazers: 1300 | Issues: 23 | Issues: 17

Gemini

The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google

Language: Python | License: MIT | Stargazers: 401 | Issues: 12 | Issues: 8

laserRMT

This is our own implementation of 'Layer Selective Rank Reduction'

Language: Python | License: Apache-2.0 | Stargazers: 228 | Issues: 10 | Issues: 7

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 63 | Issues: 3 | Issues: 1

AllTheWorldAPlay

All the world is a play, we are but actors in it.

Language: Python | Stargazers: 27 | Issues: 1 | Issues: 0

model-similarity

Simple Model Similarities Analysis

Language: HTML | License: Apache-2.0 | Stargazers: 18 | Issues: 1 | Issues: 0

extract-expert

Extract a single expert from a Mixture Of Experts model using slerp interpolation.

Language: Python | License: Apache-2.0 | Stargazers: 17 | Issues: 0 | Issues: 0

Vanguard_L3_70B

Vanguard is an Office 365 Add-in for advanced email scanning to prevent malicious attacks on an enterprise scale

Language: JavaScript | Stargazers: 8 | Issues: 1 | Issues: 0

synth

synth. is a framework designed for the generation of synthetic instructions to enhance LLM training.

Language: Python | License: Apache-2.0 | Stargazers: 7 | Issues: 1 | Issues: 0