Beast code in Giters

ChTauchmann's starred repositories

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter NotebookNOASSERTION11092 91 301

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonMIT6292 38 900

Verba

Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

Language:PythonBSD-3-Clause5112 60 190

mergekit

Tools for merging pretrained large language models.

Language:PythonLGPL-3.04216 46 268

LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Language:PythonApache-2.02567 12 170

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language:Jupyter NotebookApache-2.02062 33 81

langserve

LangServe 🦜️🏓

Language:JavaScriptNOASSERTION1816 20 217

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language:PythonMIT1661 17 78

RAG-Survey

1609 29 15

pyserini

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Language:PythonApache-2.01576 19 536

long_llama

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

Language:PythonApache-2.01443 26 24

llama-lab

Language:Python1373 17 12

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language:Jupyter NotebookNOASSERTION1357 12 25

Awesome-LLM-Reasoning

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.

MIT1328 31 1

TransformerLens

A library for mechanistic interpretability of GPT-style language models

Language:PythonMIT920 13 192

gritlm

Generative Representational Instruction Tuning

Language:Jupyter NotebookMIT493 8 42

Causality4NLP_Papers

A reading list for papers on causality for natural language processing (NLP)

476 230

tevatron

Tevatron - A flexible toolkit for neural retrieval research and development.

Language:PythonApache-2.0445 10 89

landmark-attention

Landmark Attention: Random-Access Infinite Context Length for Transformers

Language:PythonApache-2.0401 40 15

task_vectors

Editing Models with Task Arithmetic

Language:Python387 11 12

distilling-step-by-step

Language:PythonApache-2.0370 4 9

dpr-scale

Scalable training for dense retrieval models.

Language:Python262 19 13

soft-moe-pytorch

Implementation of Soft MoE, proposed by Brain's Vision team, in Pytorch

Language:PythonMIT229 11 8

ACL2023-Retrieval-LM.github.io

https://acl2023-retrieval-lm.github.io/

Language:JavaScript149 5 1

OpenMatch

An Open-Source Package for Information Retrieval

Language:PythonMIT141 4 58

DallEval

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)

Language:Jupyter NotebookMIT136 7 2

RAG-query-rewriting

Language:Python92 2 6

fneval

Functional Benchmarks and the Reasoning Gap

Language:TeXGPL-3.073 1 7

icl_task_vectors

Language:Python58 1 5

belief-localization

This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Can Be Injected in Language Models."

Apache-2.051 3 5