Seonghwan Kim (nawnoes)

nawnoes

Geek Repo

Location:Seoul, Korea

Home Page:https://velog.io/@nawnoes

Github PK Tool:Github PK Tool

Seonghwan Kim's starred repositories

ReST-MCTS

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search

Language:PythonStargazers:8Issues:0Issues:0

RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Language:PythonLicense:Apache-2.0Stargazers:288Issues:0Issues:0

prepacking

The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models"

Language:Jupyter NotebookStargazers:49Issues:0Issues:0

penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Language:PythonLicense:Apache-2.0Stargazers:1496Issues:0Issues:0
Language:PythonLicense:MITStargazers:1273Issues:0Issues:0

schedule_free

Schedule-Free Optimization in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:1462Issues:0Issues:0

LogicKor

한국어 언어모델 다분야 사고력 벤치마크

Language:PythonStargazers:109Issues:0Issues:0

terashuf

terashuf shuffles multi-terabyte text files using limited memory

Language:C++License:MITStargazers:197Issues:0Issues:0

RingAttention

Transformers with Arbitrarily Large Context

Language:PythonLicense:Apache-2.0Stargazers:552Issues:0Issues:0

ring-flash-attention

Ring attention implementation with flash attention

Language:PythonStargazers:389Issues:0Issues:0

Inflection-Benchmarks

Public Inflection Benchmarks

License:MITStargazers:66Issues:0Issues:0
Language:PythonLicense:MITStargazers:3928Issues:0Issues:0

ko-rm-judge

Reward Model을 이용하여 언어모델의 답변을 평가하기

Language:PythonLicense:MITStargazers:24Issues:0Issues:0

DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

Language:PythonLicense:MITStargazers:703Issues:0Issues:0

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5088Issues:0Issues:0

large_language_model_training_playbook

An open collection of implementation tips, tricks and resources for training large language models

Language:PythonLicense:Apache-2.0Stargazers:441Issues:0Issues:0

langroid

Harness LLMs with Multi-Agent Programming

Language:PythonLicense:MITStargazers:1801Issues:0Issues:0

awesome-llm-powered-agent

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

License:MITStargazers:1101Issues:0Issues:0

textbook_quality

Generate textbook-quality synthetic LLM pretraining data

Language:PythonLicense:MITStargazers:454Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:219Issues:0Issues:0

RethinkTinyLM

The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”

Language:PythonStargazers:105Issues:0Issues:0

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language:PythonLicense:Apache-2.0Stargazers:857Issues:0Issues:0

LongBench

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Language:PythonLicense:MITStargazers:510Issues:0Issues:0

LongAlign

LongAlign: A Recipe for Long Context Alignment Encompassing Data, Training, and Evaluation

Language:PythonLicense:Apache-2.0Stargazers:121Issues:0Issues:0

BlackMamba

Code repository for Black Mamba

Language:PythonStargazers:204Issues:0Issues:0
Language:PythonLicense:MITStargazers:28Issues:0Issues:0

miracl

A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.

License:Apache-2.0Stargazers:143Issues:0Issues:0

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonLicense:Apache-2.0Stargazers:897Issues:0Issues:0

ML-Papers-of-the-Week

🔥Highlighting the top ML papers every week.

Stargazers:9148Issues:0Issues:0

scalax

A simple library for scaling up JAX programs

Language:PythonLicense:Apache-2.0Stargazers:108Issues:0Issues:0