joelrorseth

Joel Rorseth's starred repositories

llama.cpp

LLM inference in C/C++

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python · Apache-2.0 · 35112 344 1687

xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

Language: C++ · Apache-2.0 · 25697 913 5176

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language: Python · Apache-2.0 · 23245 191 3627

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Language: Python · MIT · 19161 255 70

guidance

A guidance language for controlling large language models.

Language: Jupyter Notebook · MIT · 17863 116 475

ml-stable-diffusion

Stable Diffusion with Core ML on Apple Silicon

Language: Python · MIT · 16335 140 232

litellm

Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)

Language: Python · NOASSERTION · 9502 62 2384

EdgeGPT

Reverse engineered API of Microsoft's Bing Chat AI

Language: Python · Unlicense · 8106 92 364

Notes for software engineers getting up to speed on new AI developments. Serves as a datastore for https://latent.space writing and product brainstorming, with cleaned-up canonical references under the /Resources folder.

Language: HTML · MIT · 4768 146 9

lit

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

Language: TypeScript · Apache-2.0 · 3416 69 131

eli5

A library for debugging/inspecting machine learning classifiers and explaining their predictions

Language: Jupyter Notebook · MIT · 2736 67 257

ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Language: Python · MIT · 2579 40 249

TransformerLens

A library for mechanistic interpretability of GPT-style language models

Language: Python · MIT · 920 13 192

automated-interpretability

Language: Python · 896 16 20

Treasure-of-Transformers

💁 Awesome Treasure of Transformers Models for Natural Language processing contains papers, videos, blogs, official repo along with colab Notebooks. 🛫☑️

Language: Jupyter Notebook · MIT · 857 28 1

bleurt

BLEURT is a metric for Natural Language Generation based on transfer learning.

Language: Python · Apache-2.0 · 659 13 50

rome

Locating and editing factual associations in GPT (NeurIPS 2022)

Language: Python · MIT · 508 7 24

memit

Mass-editing thousands of facts into a transformer memory (ICLR 2023)

Language: Python · MIT · 395 6 16

landmark-attention

Landmark Attention: Random-Access Infinite Context Length for Transformers

Language: Python · Apache-2.0 · 394 40 14

honest_llama

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

Language: Python · MIT · 371 9 31

tuned-lens

Tools for understanding how transformer predictions are built layer-by-layer

Language: Python · MIT · 369 6 52

rax

Rax is a Learning-to-Rank library written in JAX.

Language: Python · Apache-2.0 · 308 5 3

lost-in-the-middle

Code and data for "Lost in the Middle: How Language Models Use Long Contexts"

Language: Python · MIT · 273 5 14

ToolQA

ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels (easy/hard) across eight real-life scenarios.

Language: Jupyter Notebook · Apache-2.0 · 211 5 6

FlexNeuART

Flexible classic and NeurAl Retrieval Toolkit

Language: Java · Apache-2.0 · 211 12 7

baukit

Language: Python · 139 11 3

polyjuice

Language: Python · BSD-3-Clause · 93 4 11

belief-localization

This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Can Be Injected in Language Models."

Apache-2.0 · 51 3 5

CNN-Units-in-NLP

✂️ Repository for our ICLR 2019 paper: Discovery of Natural Language Concepts in Individual Units of CNNs

Language: Python · MIT · 27 30