Eugene Yan (eugeneyan)



Company: Amazon

Location: Seattle

Home Page: eugeneyan.com

Twitter: @eugeneyan


Eugene Yan's starred repositories

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python · License: Apache-2.0 · Stargazers: 32636 · Issues: 328 · Issues: 2504
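DeepSpeed is driven by a JSON config passed at launch; a minimal sketch with illustrative values (the full schema, including optimizer and scheduler sections, is in the repo's docs):

```json
{
  "train_batch_size": 32,
  "gradient_accumulation_steps": 1,
  "fp16": { "enabled": true },
  "zero_optimization": { "stage": 2 }
}
```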

dspy

DSPy: The framework for programming—not prompting—foundation models

Language: Python · License: MIT · Stargazers: 10415 · Issues: 101 · Issues: 396

semantic

Parsing, analyzing, and comparing source code across many languages

trl

Train transformer language models with reinforcement learning.

Language: Python · License: Apache-2.0 · Stargazers: 8088 · Issues: 74 · Issues: 871

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python · License: MIT · Stargazers: 7923 · Issues: 78 · Issues: 27
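The core BPE training loop is simple enough to sketch in a few lines: count adjacent token pairs, merge the most frequent pair into a new token, repeat. This is an illustrative stdlib-only sketch in the spirit of minbpe, not its actual API (the helper names here are our own):

```python
# One BPE training loop: count adjacent pairs, merge the most frequent
# pair into a new token id, repeat. (Illustrative sketch; not minbpe's API.)

from collections import Counter

def get_pair_counts(ids):
    """Count occurrences of each adjacent pair of token ids."""
    return Counter(zip(ids, ids[1:]))

def merge(ids, pair, new_id):
    """Replace every occurrence of `pair` in `ids` with `new_id`."""
    out, i = [], 0
    while i < len(ids):
        if i < len(ids) - 1 and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out

# Start from raw UTF-8 bytes and repeatedly merge the most frequent pair.
ids = list("aaabdaaabac".encode("utf-8"))
for step in range(3):
    pair = get_pair_counts(ids).most_common(1)[0][0]
    ids = merge(ids, pair, 256 + step)  # new ids start after the 256 byte values
```

After three merges the 11-byte input compresses to 5 tokens; in a real tokenizer the learned `(pair → new_id)` table is then reused to encode new text.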

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language: Python · License: Apache-2.0 · Stargazers: 6956 · Issues: 100 · Issues: 1352

axolotl

Go ahead and axolotl questions

Language: Python · License: Apache-2.0 · Stargazers: 5719 · Issues: 47 · Issues: 520

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language: Python · License: MIT · Stargazers: 5382 · Issues: 46 · Issues: 920
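The basic idea behind k-bit quantization can be shown with absmax scaling: map floats into a small signed-integer range, then rescale on the way back. This is a toy sketch of the concept, not the bitsandbytes API (the real library quantizes blockwise on GPU tensors):

```python
# Absmax quantization: scale floats so the largest magnitude maps to the
# integer range limit, round, and keep the scale for dequantization.
# (Concept sketch only; not the bitsandbytes API.)

def quantize_absmax(xs, bits=8):
    """Scale floats into the signed integer range and round."""
    qmax = 2 ** (bits - 1) - 1          # 127 for int8
    scale = max(abs(x) for x in xs) / qmax
    return [round(x / scale) for x in xs], scale

def dequantize(qs, scale):
    """Map the integers back to approximate floats."""
    return [q * scale for q in qs]

weights = [0.5, -1.0, 0.25, 0.75]
q, scale = quantize_absmax(weights)
approx = dequantize(q, scale)
```

Storage drops from 32 bits to 8 per weight, at the cost of a small rounding error bounded by half the scale.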

instructor

Structured outputs for LLMs

Language: Python · License: MIT · Stargazers: 4936 · Issues: 36 · Issues: 181
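The pattern such libraries implement: ask the model for JSON, validate it against a schema, and re-prompt on failure. A stdlib-only sketch of that loop, where `fake_llm` stands in for a real chat-completion call and none of the names are instructor's actual API:

```python
# Validate-and-retry loop for structured LLM output.
# (Concept sketch with a fake model; not instructor's API.)

import json
from dataclasses import dataclass

@dataclass
class UserInfo:
    name: str
    age: int

def extract(llm, prompt, schema=UserInfo, max_retries=2):
    for _ in range(max_retries + 1):
        raw = llm(prompt)
        try:
            data = json.loads(raw)
            obj = schema(**data)
            if not isinstance(obj.age, int):
                raise ValueError("age must be an int")
            return obj
        except (json.JSONDecodeError, TypeError, ValueError):
            prompt += "\nReturn ONLY valid JSON with keys: name, age."
    raise RuntimeError("model never produced valid output")

# A stand-in "model" that fails once, then complies.
replies = iter(['not json, sorry', '{"name": "Ada", "age": 36}'])
user = extract(lambda prompt: next(replies), "Extract: Ada is 36.")
```

Real libraries swap the dataclass for a Pydantic model, so the validation step also handles nested fields and type coercion.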

captum

Model interpretability and understanding for PyTorch

Language: Python · License: BSD-3-Clause · Stargazers: 4567 · Issues: 239 · Issues: 513

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python · License: MIT · Stargazers: 4317 · Issues: 49 · Issues: 282

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python · License: Apache-2.0 · Stargazers: 3763 · Issues: 110 · Issues: 109

higgsfield

Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 3252 · Issues: 79 · Issues: 1

picoGPT

An unnecessarily tiny implementation of GPT-2 in NumPy.

Language: Python · License: MIT · Stargazers: 3082 · Issues: 29 · Issues: 10
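The heart of any GPT implementation, however tiny, is scaled dot-product attention. A dependency-free single-head sketch of that op (picoGPT itself uses NumPy; this is our own illustrative version):

```python
# Scaled dot-product self-attention for one head, in plain Python.
# Each output row is a softmax-weighted mix of the value rows.

import math

def softmax(xs):
    m = max(xs)                      # subtract max for numerical stability
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def attention(Q, K, V):
    """Q, K, V are lists of vectors, one per token."""
    d = len(K[0])
    out = []
    for q in Q:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in K]
        weights = softmax(scores)
        out.append([sum(w * v[j] for w, v in zip(weights, V))
                    for j in range(len(V[0]))])
    return out

# Two tokens with 2-d embeddings: each token attends mostly to itself.
Q = K = [[1.0, 0.0], [0.0, 1.0]]
V = [[1.0, 2.0], [3.0, 4.0]]
out = attention(Q, K, V)
```

A full GPT block adds learned projections for Q/K/V, a causal mask, multiple heads, and an MLP, but every one of those wraps this same core computation.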

CTranslate2

Fast inference engine for Transformer models

Language: C++ · License: MIT · Stargazers: 2790 · Issues: 56 · Issues: 611

AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.

Language: Python · License: MIT · Stargazers: 1185 · Issues: 11 · Issues: 283

lm-human-preferences

Code for the paper Fine-Tuning Language Models from Human Preferences

Language: Python · License: MIT · Stargazers: 1107 · Issues: 23 · Issues: 15

hallucination-leaderboard

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

summarize-from-feedback

Code for "Learning to summarize from human feedback"

Language: Python · License: NOASSERTION · Stargazers: 948 · Issues: 145 · Issues: 21

obsidian-kindle-plugin

Sync your Kindle notes and highlights directly into your Obsidian vault

Language: TypeScript · License: MIT · Stargazers: 846 · Issues: 7 · Issues: 157

bigcode-evaluation-harness

A framework for the evaluation of autoregressive code generation language models.

Language: Python · License: Apache-2.0 · Stargazers: 627 · Issues: 13 · Issues: 111

Platypus

Code for fine-tuning the Platypus family of LLMs using LoRA

llm_distillation_playbook

Best practices for distilling large language models.

Language: Jupyter Notebook · Stargazers: 279 · Issues: 10 · Issues: 0

datagen

A pipeline for using API calls to agnostically convert unstructured data into structured training data

Language: Python · Stargazers: 26 · Issues: 0 · Issues: 0

get-lambda

Use Actions to acquire those precious lambda GPUs

Language: Python · Stargazers: 19 · Issues: 2 · Issues: 0

swe-study-group

Code for the SWE study group

Language: Python · License: Apache-2.0 · Stargazers: 6 · Issues: 2 · Issues: 0