kykim0

kykim0

Geek Repo

Company:@google

Location:Bay Area / Seoul

Github PK Tool:Github PK Tool


Organizations
JuliaPOMDP
sisl
StanfordVL

kykim0's starred repositories

meta-pretraining

Code accompanying paper: Meta-Learning to Improve Pre-Training

Language:PythonLicense:Apache-2.0Stargazers:35Issues:0Issues:0

modpo

[Findings of ACL'2024] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization.

Language:PythonStargazers:29Issues:0Issues:0

uncertain_ground_truth

Dermatology ddx dataset, Jax implementations of Monte Carlo conformal prediction, plausibility regions and statistical annotation aggregation from our recent work on uncertain ground truth (TMLR'23 and ArXiv pre-print).

Language:PythonLicense:Apache-2.0Stargazers:333Issues:0Issues:0

llm-swarm

Manage scalable open LLM inference endpoints in Slurm clusters

Language:PythonLicense:MITStargazers:184Issues:0Issues:0

text-clustering

Easily embed, cluster and semantically label text datasets

Language:PythonLicense:Apache-2.0Stargazers:380Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:322Issues:0Issues:0

NeMo-Aligner

Scalable toolkit for efficient model alignment

Language:PythonLicense:Apache-2.0Stargazers:413Issues:0Issues:0

LLM101n

LLM101n: Let's build a Storyteller

Stargazers:12786Issues:0Issues:0

darts

Differentiable architecture search for convolutional and recurrent networks

Language:PythonLicense:Apache-2.0Stargazers:3887Issues:0Issues:0

oss-guideline

IITP_Guideline 1.0 For Open Source Software RnD Projects operated by yangsuplim at IITP.

Stargazers:23Issues:0Issues:0

rl4co

A PyTorch library for all things Reinforcement Learning (RL) for Combinatorial Optimization (CO)

Language:PythonLicense:MITStargazers:323Issues:0Issues:0

build-nanogpt

Video+code lecture on building nanoGPT from scratch

Language:PythonStargazers:2776Issues:0Issues:0

betty

Betty: an automatic differentiation library for generalized meta-learning and multilevel optimization

Language:PythonLicense:Apache-2.0Stargazers:327Issues:0Issues:0

dreamer

Dream to Control: Learning Behaviors by Latent Imagination

Language:PythonLicense:Apache-2.0Stargazers:595Issues:0Issues:0

nlp-bible-code

자연어처리 바이블의 실습 자료입니다.

Language:Jupyter NotebookStargazers:49Issues:0Issues:0

KULLM

☁️ 구름(KULLM): 고려대학교에서 개발한, 한국어에 특화된 LLM

License:Apache-2.0Stargazers:542Issues:0Issues:0

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language:PythonLicense:Apache-2.0Stargazers:879Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1076Issues:0Issues:0

data-selection-survey

A Survey on Data Selection for Language Models

License:CC0-1.0Stargazers:107Issues:0Issues:0

LESS

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Language:Jupyter NotebookLicense:MITStargazers:270Issues:0Issues:0

mamba

Mamba SSM architecture

Language:PythonLicense:Apache-2.0Stargazers:11413Issues:0Issues:0

calibration-framework

The net:cal calibration framework is a Python 3 library for measuring and mitigating miscalibration of uncertainty estimates, e.g., by a neural network.

Language:PythonLicense:Apache-2.0Stargazers:324Issues:0Issues:0

captum

Model interpretability and understanding for PyTorch

Language:PythonLicense:BSD-3-ClauseStargazers:4686Issues:0Issues:0
Language:PythonStargazers:431Issues:0Issues:0

OpenFE

OpenFE: automated feature generation with expert-level performance

Language:PythonLicense:MITStargazers:700Issues:0Issues:0

evolutionary-model-merge

Official repository of Evolutionary Optimization of Model Merging Recipes

Language:PythonLicense:Apache-2.0Stargazers:1071Issues:0Issues:0

reward-bench

RewardBench: the first evaluation tool for reward models.

Language:PythonLicense:Apache-2.0Stargazers:269Issues:0Issues:0
Language:PythonLicense:MITStargazers:3960Issues:0Issues:0

awesome-llm-human-preference-datasets

A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.

License:MITStargazers:274Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:283Issues:0Issues:0