Beast code in Giters

Byeongjoo Kim's starred repositories

grok-1

Grok open release

Language:PythonApache-2.049366 562 206

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Language:PythonNOASSERTION22151 636 264

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

Apache-2.016119 132 123

Mos

一个用于在 macOS 上平滑你的鼠标滚动效果或单独设置滚动方向的小工具, 让你的滚轮爽如触控板 | A lightweight tool used to smooth scrolling and set scroll direction independently for your mouse on macOS

Language:SwiftNOASSERTION14216 97 581

mistral-inference

Official inference library for Mistral models

Language:Jupyter NotebookApache-2.09443 120 135

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonMIT6554 39 953

LLMLingua

To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Language:PythonMIT4374 32 111

mergekit

Tools for merging pretrained large language models.

Language:PythonLGPL-3.04372 49 286

rant3

(Obsolete) Archive of Rant 3.x.

Language:C#MIT2964 810

finetune-transformer-lm

Code and model for the paper "Improving Language Understanding by Generative Pre-Training"

Language:PythonMIT2133 73 42

generative-ai-docs

Documentation for Google's Gen AI site - including the Gemini API and Gemma

Language:Jupyter NotebookApache-2.01539 49 114

deduplicate-text-datasets

Language:RustApache-2.01079 13 41

PyTorch

Deep Learning Zero to All - Pytorch

Language:Jupyter Notebook1000 30 21

booknlp

BookNLP, a natural language processing pipeline for books

Language:PythonMIT777 23 22

concordia

A library for generative social simulation

Language:PythonApache-2.0463 18 25

narrativeqa

This repository contains the NarrativeQA dataset. It includes the list of documents with Wikipedia summaries, links to full stories, and questions and answers.

Language:ShellApache-2.0455 250

NEFTune

Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning

Language:PythonMIT358 11 14

Persona-Dialogue-Generation

The code of ACL 2020 paper "You Impress Me: Dialogue Generation via Mutual Persona Perception"

Language:PythonMIT308 8 34

Selective_Context

Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.

Language:Python263 3 25

FActScore

A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"

Language:PythonMIT259 4 34

cartography

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics

Language:Jupyter NotebookApache-2.0188 7 9

Poly-Encoder

Language:PythonMIT165 5 12

doc-story-generation

Language:PythonMIT146 4 7

LLM-SLERP-Merge

Spherical Merge Pytorch/HF format Language Models with minimal feature loss.

Language:Python104 3 1

ff-layers

The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Levy. EMNLP, 2021.

Language:PythonMIT77 1 1

ICTC

This is a public repository for Image Clustering Conditioned on Text Criteria (IC|TC)

Language:PythonApache-2.072 3 8

long_tail_knowledge

Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"

Language:Python70 3 1

doc-storygen-v2

Codebase for LLM story generation; updated version of https//github.com/yangkevin2/doc-story-generation

Language:PythonNOASSERTION65 5 3

mambaformer-icl

MambaFormer in-context learning experiments and implementation for https://arxiv.org/abs/2402.04248

Language:PythonApache-2.030 10

fim-pretraining

Fill-in-the-Middle Pre-training of Large Language Models in pure PyTorch!

Language:PythonMIT4 20