Byeongjoo Kim's starred repositories

grok-1

Grok open release

Language: Python | License: Apache-2.0 | Stargazers: 49366 | Issues: 562 | Issues: 206

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Language: Python | License: NOASSERTION | Stargazers: 22151 | Issues: 636 | Issues: 264

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

Mos

A lightweight tool used to smooth scrolling and set scroll direction independently for your mouse on macOS, making your scroll wheel feel as smooth as a trackpad

Language: Swift | License: NOASSERTION | Stargazers: 14216 | Issues: 97 | Issues: 581

mistral-inference

Official inference library for Mistral models

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 9443 | Issues: 120 | Issues: 135

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language: Python | License: MIT | Stargazers: 6554 | Issues: 39 | Issues: 953

LLMLingua

To speed up LLM inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, achieving up to 20x compression with minimal performance loss.

Language: Python | License: MIT | Stargazers: 4374 | Issues: 32 | Issues: 111

mergekit

Tools for merging pretrained large language models.

Language: Python | License: LGPL-3.0 | Stargazers: 4372 | Issues: 49 | Issues: 286

rant3

(Obsolete) Archive of Rant 3.x.

Language: C# | License: MIT | Stargazers: 2964 | Issues: 81 | Issues: 0

finetune-transformer-lm

Code and model for the paper "Improving Language Understanding by Generative Pre-Training"

Language: Python | License: MIT | Stargazers: 2133 | Issues: 73 | Issues: 42

generative-ai-docs

Documentation for Google's Gen AI site - including the Gemini API and Gemma

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1539 | Issues: 49 | Issues: 114

PyTorch

Deep Learning Zero to All - Pytorch

Language: Jupyter Notebook | Stargazers: 1000 | Issues: 30 | Issues: 21

booknlp

BookNLP, a natural language processing pipeline for books

Language: Python | License: MIT | Stargazers: 777 | Issues: 23 | Issues: 22

concordia

A library for generative social simulation

Language: Python | License: Apache-2.0 | Stargazers: 463 | Issues: 18 | Issues: 25

narrativeqa

This repository contains the NarrativeQA dataset. It includes the list of documents with Wikipedia summaries, links to full stories, and questions and answers.

Language: Shell | License: Apache-2.0 | Stargazers: 455 | Issues: 25 | Issues: 0

NEFTune

Official repository of NEFTune: Noisy Embeddings Improve Instruction Finetuning

Language: Python | License: MIT | Stargazers: 358 | Issues: 11 | Issues: 14

Persona-Dialogue-Generation

The code of ACL 2020 paper "You Impress Me: Dialogue Generation via Mutual Persona Perception"

Language: Python | License: MIT | Stargazers: 308 | Issues: 8 | Issues: 34

Selective_Context

Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.

FActScore

A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"

Language: Python | License: MIT | Stargazers: 259 | Issues: 4 | Issues: 34

cartography

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 188 | Issues: 7 | Issues: 9

LLM-SLERP-Merge

Spherical merge of PyTorch/HF-format language models with minimal feature loss.

ff-layers

The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Levy. EMNLP, 2021.

Language: Python | License: MIT | Stargazers: 77 | Issues: 1 | Issues: 1

ICTC

This is a public repository for Image Clustering Conditioned on Text Criteria (IC|TC)

Language: Python | License: Apache-2.0 | Stargazers: 72 | Issues: 3 | Issues: 8

long_tail_knowledge

Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"

doc-storygen-v2

Codebase for LLM story generation; updated version of https://github.com/yangkevin2/doc-story-generation

Language: Python | License: NOASSERTION | Stargazers: 65 | Issues: 5 | Issues: 3

mambaformer-icl

MambaFormer in-context learning experiments and implementation for https://arxiv.org/abs/2402.04248

Language: Python | License: Apache-2.0 | Stargazers: 30 | Issues: 1 | Issues: 0

fim-pretraining

Fill-in-the-Middle Pre-training of Large Language Models in pure PyTorch!

Language: Python | License: MIT | Stargazers: 4 | Issues: 2 | Issues: 0