kykim0

@google

Bay Area / Seoul

Organizations

JuliaPOMDP

sisl

StanfordVL

kykim0's starred repositories

HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Language: Python | License: Apache-2.0 | Stars: 634

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language: Python | License: Apache-2.0 | Stars: 1879

d3po

[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

Language: Python | License: MIT | Stars: 137

AlignLLMHumanSurvey

Aligning Large Language Models with Human: A Survey

voice-changer

Realtime Voice Changer

Language: Python | License: NOASSERTION | Stars: 15409

LLM-Agents-Papers

A repo that lists papers related to LLM-based agents

Language: Python | Stars: 850

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | Stars: 8808

babyagi

Language: Python | License: MIT | Stars: 19715

improved-aesthetic-predictor

CLIP+MLP Aesthetic Score Predictor

Language: Python | License: Apache-2.0 | Stars: 800

pytorch-GAT

My implementation of the original GAT paper (Veličković et al.). I've additionally included the playground.py file for visualizing the Cora dataset, GAT embeddings, an attention mechanism, and entropy histograms. I've supported both Cora (transductive) and PPI (inductive) examples!

Language: Jupyter Notebook | License: MIT | Stars: 2371

yt-dlp

A feature-rich command-line audio/video downloader

Language: Python | License: Unlicense | Stars: 77203

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language: Python | License: Apache-2.0 | Stars: 15088

Probabilistic-Programming-and-Bayesian-Methods-for-Hackers

Also known as "Bayesian Methods for Hackers": an introduction to Bayesian methods and probabilistic programming from a computation- and understanding-first, mathematics-second point of view. All in pure Python ;)

Language: Jupyter Notebook | License: MIT | Stars: 26523

MORL

Multi-Objective Reinforcement Learning

Language: Python | Stars: 242

stylegan3

Official PyTorch implementation of StyleGAN3

Language: Python | License: NOASSERTION | Stars: 6294

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook | License: BSD-3-Clause | Stars: 9250

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language: Jupyter Notebook | License: Apache-2.0 | Stars: 6818

rewardedsoups

Rewarded soups official implementation

Language: HTML | Stars: 41

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python | License: MIT | Stars: 4398

automl

Google Brain AutoML

Language: Jupyter Notebook | License: Apache-2.0 | Stars: 6185

llama

Inference code for Llama models

Language: Python | License: NOASSERTION | Stars: 54270

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Language: Python | License: MIT | Stars: 7646

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python | License: MIT | Stars: 15527

morl-baselines

Multi-Objective Reinforcement Learning algorithms implementations.

Language: Python | License: MIT | Stars: 255

deap

Distributed Evolutionary Algorithms in Python

Language: Python | License: LGPL-3.0 | Stars: 5681

FewShotLearning

PyTorch implementation of the paper "Optimization as a Model for Few-Shot Learning"

Language: Python | License: MIT | Stars: 254

clrs

Language: Jupyter Notebook | License: Apache-2.0 | Stars: 411

babyai

BabyAI platform. A testbed for training agents to understand and execute language commands.

Language: Python | License: BSD-3-Clause | Stars: 680

deepC

Vendor-independent TinyML deep learning library, compiler, and inference framework for microcomputers and microcontrollers

Language: C++ | License: Apache-2.0 | Stars: 542

dm-haiku

JAX-based neural network library

Language: Python | License: Apache-2.0 | Stars: 2842