Joel Rorseth (joelrorseth)

Location: Waterloo, Ontario

Home Page: joelrorseth.github.io

Twitter: @jerorset

Joel Rorseth's starred repositories

llama.cpp

LLM inference in C/C++

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python | License: Apache-2.0 | Stargazers: 35112 | Issues: 344 | Issues: 1687

xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

Language: C++ | License: Apache-2.0 | Stargazers: 25697 | Issues: 913 | Issues: 5176

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language: Python | License: Apache-2.0 | Stargazers: 23245 | Issues: 191 | Issues: 3627

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Language: Python | License: MIT | Stargazers: 19161 | Issues: 255 | Issues: 70

guidance

A guidance language for controlling large language models.

Language: Jupyter Notebook | License: MIT | Stargazers: 17863 | Issues: 116 | Issues: 475

ml-stable-diffusion

Stable Diffusion with Core ML on Apple Silicon

Language: Python | License: MIT | Stargazers: 16335 | Issues: 140 | Issues: 232

litellm

Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)

Language: Python | License: NOASSERTION | Stargazers: 9502 | Issues: 62 | Issues: 2384

EdgeGPT

Reverse engineered API of Microsoft's Bing Chat AI

Language: Python | License: Unlicense | Stargazers: 8106 | Issues: 92 | Issues: 364

ai-notes

notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references under the /Resources folder.

Language: HTML | License: MIT | Stargazers: 4768 | Issues: 146 | Issues: 9

lit

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

Language: TypeScript | License: Apache-2.0 | Stargazers: 3416 | Issues: 69 | Issues: 131

eli5

A library for debugging/inspecting machine learning classifiers and explaining their predictions

Language: Jupyter Notebook | License: MIT | Stargazers: 2736 | Issues: 67 | Issues: 257

ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Language: Python | License: MIT | Stargazers: 2579 | Issues: 40 | Issues: 249

TransformerLens

A library for mechanistic interpretability of GPT-style language models

Language: Python | License: MIT | Stargazers: 920 | Issues: 13 | Issues: 192

Treasure-of-Transformers

💁 Awesome Treasure of Transformers Models for Natural Language processing contains papers, videos, blogs, official repo along with colab Notebooks. 🛫☑️

Language: Jupyter Notebook | License: MIT | Stargazers: 857 | Issues: 28 | Issues: 1

bleurt

BLEURT is a metric for Natural Language Generation based on transfer learning.

Language: Python | License: Apache-2.0 | Stargazers: 659 | Issues: 13 | Issues: 50

rome

Locating and editing factual associations in GPT (NeurIPS 2022)

Language: Python | License: MIT | Stargazers: 508 | Issues: 7 | Issues: 24

memit

Mass-editing thousands of facts into a transformer memory (ICLR 2023)

Language: Python | License: MIT | Stargazers: 395 | Issues: 6 | Issues: 16

landmark-attention

Landmark Attention: Random-Access Infinite Context Length for Transformers

Language: Python | License: Apache-2.0 | Stargazers: 394 | Issues: 40 | Issues: 14

honest_llama

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

Language: Python | License: MIT | Stargazers: 371 | Issues: 9 | Issues: 31

tuned-lens

Tools for understanding how transformer predictions are built layer-by-layer

Language: Python | License: MIT | Stargazers: 369 | Issues: 6 | Issues: 52

rax

Rax is a Learning-to-Rank library written in JAX.

Language: Python | License: Apache-2.0 | Stargazers: 308 | Issues: 5 | Issues: 3

lost-in-the-middle

Code and data for "Lost in the Middle: How Language Models Use Long Contexts"

Language: Python | License: MIT | Stargazers: 273 | Issues: 5 | Issues: 14

ToolQA

ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels (easy/hard) across eight real-life scenarios.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 211 | Issues: 5 | Issues: 6

FlexNeuART

Flexible classic and NeurAl Retrieval Toolkit

Language: Java | License: Apache-2.0 | Stargazers: 211 | Issues: 12 | Issues: 7
Language: Python | License: BSD-3-Clause | Stargazers: 93 | Issues: 4 | Issues: 11

belief-localization

This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Can Be Injected in Language Models."

CNN-Units-in-NLP

✂️ Repository for our ICLR 2019 paper: Discovery of Natural Language Concepts in Individual Units of CNNs

Language: Python | License: MIT | Stargazers: 27 | Issues: 3 | Issues: 0