Harel Gal's starred repositories

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

License: MIT | Stargazers: 2263 | Issues: 0

outlines

Structured Text Generation

Language: Python | License: Apache-2.0 | Stargazers: 7266 | Issues: 0

trulens

Evaluation and Tracking for LLM Experiments

Language: Jupyter Notebook | License: MIT | Stargazers: 1879 | Issues: 0

text-generation-inference

Large Language Model Text Generation Inference

Language: Python | License: Apache-2.0 | Stargazers: 8425 | Issues: 0

PurpleLlama

Set of tools to assess and improve LLM security.

Language: Python | License: NOASSERTION | Stargazers: 2176 | Issues: 0

TruthfulQA

TruthfulQA: Measuring How Models Imitate Human Falsehoods

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 547 | Issues: 0

Language: TypeScript | License: NOASSERTION | Stargazers: 21 | Issues: 0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 23281 | Issues: 0

bias-mitigation-foundation-models

Bias mitigation in foundation models

Language: Jupyter Notebook | License: MIT-0 | Stargazers: 3 | Issues: 0

BentoML

The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!

Language: Python | License: Apache-2.0 | Stargazers: 6821 | Issues: 0

Yatai

Model Deployment at Scale on Kubernetes 🦄️

Language: TypeScript | License: NOASSERTION | Stargazers: 776 | Issues: 0

prefect

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Language: Python | License: Apache-2.0 | Stargazers: 15452 | Issues: 0

fmeval

Foundation Model Evaluations Library

Language: Python | License: Apache-2.0 | Stargazers: 162 | Issues: 0

jailbreak_llms

[CCS'24] A dataset consisting of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

Language: Jupyter Notebook | License: MIT | Stargazers: 1657 | Issues: 0

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | Stargazers: 5896 | Issues: 0

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 41 | Issues: 0

axolotl

Go ahead and axolotl questions

Language: Python | License: Apache-2.0 | Stargazers: 6969 | Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | Stargazers: 32130 | Issues: 0

detoxify

Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.

Language: Python | License: Apache-2.0 | Stargazers: 894 | Issues: 0

narrator

David Attenborough narrates your life

Language: Python | Stargazers: 4330 | Issues: 0

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

License: MIT | Stargazers: 1512 | Issues: 0

fairscale

PyTorch extensions for high performance and large scale training.

Language: Python | License: NOASSERTION | Stargazers: 2997 | Issues: 0

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions, such as HF TGI and VLLM, for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language: Jupyter Notebook | Stargazers: 10530 | Issues: 0

wandb

🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.

Language: Python | License: MIT | Stargazers: 8652 | Issues: 0

aws-cdk-examples

Example projects using the AWS CDK

Language: Python | License: Apache-2.0 | Stargazers: 4971 | Issues: 0

redpanda

Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!

Language: C++ | Stargazers: 9206 | Issues: 0

amazon-bedrock-workshop

This is a workshop designed for Amazon Bedrock, a foundational model service.

Language: Jupyter Notebook | License: MIT-0 | Stargazers: 2 | Issues: 0

DiskANN

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

Language: C++ | License: NOASSERTION | Stargazers: 978 | Issues: 0

eitans-sagemaker-examples

Amazon SageMaker Examples

Language: Jupyter Notebook | Stargazers: 4 | Issues: 0

dynibar

Implementation of DynIBaR: Neural Dynamic Image-Based Rendering (CVPR 2023)

Language: Python | License: Apache-2.0 | Stargazers: 841 | Issues: 0