Seonghwan Kim (nawnoes)

nawnoes

Geek Repo

Location:Seoul, Korea

Home Page:https://velog.io/@nawnoes

Github PK Tool:Github PK Tool

Seonghwan Kim's starred repositories

ML-Papers-of-the-Week

🔥Highlighting the top ML papers every week.

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5230Issues:39Issues:37

langroid

Harness LLMs with Multi-Agent Programming

Language:PythonLicense:MITStargazers:2195Issues:17Issues:151

schedule_free

Schedule-Free Optimization in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:1770Issues:17Issues:26

penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Language:PythonLicense:Apache-2.0Stargazers:1615Issues:18Issues:15

awesome-llm-powered-agent

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonLicense:Apache-2.0Stargazers:1052Issues:42Issues:72

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language:PythonLicense:Apache-2.0Stargazers:928Issues:12Issues:30

DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

Language:PythonLicense:MITStargazers:771Issues:8Issues:26

LongBench

[ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Language:PythonLicense:MITStargazers:587Issues:6Issues:66

RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Language:PythonLicense:Apache-2.0Stargazers:574Issues:19Issues:23

ringattention

Transformers with Arbitrarily Large Context

Language:PythonLicense:Apache-2.0Stargazers:571Issues:5Issues:15

ring-flash-attention

Ring attention implementation with flash attention

textbook_quality

Generate textbook-quality synthetic LLM pretraining data

Language:PythonLicense:MITStargazers:473Issues:8Issues:6

large_language_model_training_playbook

An open collection of implementation tips, tricks and resources for training large language models

Language:PythonLicense:Apache-2.0Stargazers:451Issues:69Issues:0
Language:PythonLicense:Apache-2.0Stargazers:261Issues:8Issues:77

BlackMamba

Code repository for Black Mamba

terashuf

terashuf shuffles multi-terabyte text files using limited memory

Language:C++License:MITStargazers:201Issues:5Issues:9

LongAlign

LongAlign: A Recipe for Long Context Alignment Encompassing Data, Training, and Evaluation

Language:PythonLicense:Apache-2.0Stargazers:183Issues:8Issues:9

miracl

A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.

LogicKor

한국어 언어모델 다분야 사고력 벤치마크

scalax

A simple library for scaling up JAX programs

Language:PythonLicense:Apache-2.0Stargazers:114Issues:7Issues:0

RethinkTinyLM

[ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”

Inflection-Benchmarks

Public Inflection Benchmarks

ReST-MCTS

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search

prepacking

The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models"

Language:Jupyter NotebookStargazers:55Issues:2Issues:1
Language:PythonLicense:MITStargazers:28Issues:1Issues:0

ko-rm-judge

Reward Model을 이용하여 언어모델의 답변을 평가하기

Language:PythonLicense:MITStargazers:26Issues:2Issues:0