Harel Gal's starred repositories

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | 32904 277 1089

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | 25310 219 4089

prefect

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Language: Python | License: Apache-2.0 | 15665 162 5374

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language: Jupyter Notebook | License: NOASSERTION | 11398 91 312

redpanda

Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!

Language: C++ | 9312 137 11235

wandb

🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.

Language: Python | License: MIT | 8801 57 3259

text-generation-inference

Large Language Model Text Generation Inference

Language: Python | License: Apache-2.0 | 8630 97 1267

outlines

Structured Text Generation

Language: Python | License: Apache-2.0 | 8002 47 537

axolotl

Go ahead and axolotl questions

Language: Python | License: Apache-2.0 | 7333 48 631

BentoML

The easiest way to serve AI apps and models - Build reliable Inference APIs, LLM apps, Multi-model chains, RAG service, and much more!

Language: Python | License: Apache-2.0 | 6905 77 1059

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | 6211 35 1019

aws-cdk-examples

Example projects using the AWS CDK

Language: Python | License: Apache-2.0 | 5030 80 339

narrator

David Attenborough narrates your life

Language: Python | 4336 28 35

fairscale

PyTorch extensions for high performance and large scale training.

Language: Python | License: NOASSERTION | 3121 45 358

PurpleLlama

Set of tools to assess and improve LLM security.

Language: Python | License: NOASSERTION | 2430 37 27

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

License: MIT | 2371 46 3

trulens

Evaluation and Tracking for LLM Experiments

Language: Python | License: MIT | 1995 17 269

jailbreak_llms

[CCS'24] A dataset consisting of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

Language: Jupyter Notebook | License: MIT | 1857 25 7

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

License: MIT | 1540 190

DiskANN

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

Language: C++ | License: NOASSERTION | 1017 25 191

detoxify

Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.

Language: Python | License: Apache-2.0 | 914 15 63

dynibar

Implementation of DynIBaR Neural Dynamic Image-Based Rendering (CVPR 2023)

Language: Python | License: Apache-2.0 | 845 32 47

Yatai

Model Deployment at Scale on Kubernetes 🦄️

Language: TypeScript | License: NOASSERTION | 782 19 116

TruthfulQA

TruthfulQA: Measuring How Models Imitate Human Falsehoods

Language: Jupyter Notebook | License: Apache-2.0 | 566 8 10

fmeval

Foundation Model Evaluations Library

Language: Python | License: Apache-2.0 | 179 9 20

llm-sagemaker-sample

Language: Jupyter Notebook | License: Apache-2.0 | 47 3 18

aws-rtb-intelligence-kit

Language: TypeScript | License: NOASSERTION | 21 11 2

eitans-sagemaker-examples

Amazon SageMaker Examples

Language: Jupyter Notebook | 4 10

bias-mitigation-foundation-models

Bias mitigation in foundation models

Language: Jupyter Notebook | License: MIT-0 | 3 200

amazon-bedrock-workshop

This is a workshop designed for Amazon Bedrock, a foundational model service.

Language: Jupyter Notebook | License: MIT-0 | 200

harelix