kykim0

Company: @google

Location: Bay Area / Seoul


Organizations
JuliaPOMDP
sisl
StanfordVL

kykim0's starred repositories

HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Language: Python, License: Apache-2.0, Stargazers: 634, Issues: 0

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language: Python, License: Apache-2.0, Stargazers: 1879, Issues: 0

d3po

[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

Language: Python, License: MIT, Stargazers: 137, Issues: 0

AlignLLMHumanSurvey

Aligning Large Language Models with Human: A Survey

Stargazers: 642, Issues: 0

voice-changer

Realtime Voice Changer

Language: Python, License: NOASSERTION, Stargazers: 15409, Issues: 0

LLM-Agents-Papers

A repo that lists papers related to LLM-based agents

Language: Python, Stargazers: 850, Issues: 0

trl

Train transformer language models with reinforcement learning.

Language: Python, License: Apache-2.0, Stargazers: 8808, Issues: 0

Language: Python, License: MIT, Stargazers: 19715, Issues: 0

improved-aesthetic-predictor

CLIP+MLP Aesthetic Score Predictor

Language: Python, License: Apache-2.0, Stargazers: 800, Issues: 0

pytorch-GAT

My implementation of the original GAT paper (Veličković et al.). I've additionally included the playground.py file for visualizing the Cora dataset, GAT embeddings, an attention mechanism, and entropy histograms. I've supported both Cora (transductive) and PPI (inductive) examples!

Language: Jupyter Notebook, License: MIT, Stargazers: 2371, Issues: 0

yt-dlp

A feature-rich command-line audio/video downloader

Language: Python, License: Unlicense, Stargazers: 77203, Issues: 0

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language: Python, License: Apache-2.0, Stargazers: 15088, Issues: 0

Probabilistic-Programming-and-Bayesian-Methods-for-Hackers

aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)

Language: Jupyter Notebook, License: MIT, Stargazers: 26523, Issues: 0

MORL

Multi-Objective Reinforcement Learning

Language: Python, Stargazers: 242, Issues: 0

stylegan3

Official PyTorch implementation of StyleGAN3

Language: Python, License: NOASSERTION, Stargazers: 6294, Issues: 0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook, License: BSD-3-Clause, Stargazers: 9250, Issues: 0

lora

Using low-rank adaptation to quickly fine-tune diffusion models.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 6818, Issues: 0

rewardedsoups

Rewarded soups official implementation

Language: HTML, Stargazers: 41, Issues: 0

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python, License: MIT, Stargazers: 4398, Issues: 0

automl

Google Brain AutoML

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 6185, Issues: 0

llama

Inference code for Llama models

Language: Python, License: NOASSERTION, Stargazers: 54270, Issues: 0

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Language: Python, License: MIT, Stargazers: 7646, Issues: 0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python, License: MIT, Stargazers: 15527, Issues: 0

morl-baselines

Multi-Objective Reinforcement Learning algorithm implementations.

Language: Python, License: MIT, Stargazers: 255, Issues: 0

deap

Distributed Evolutionary Algorithms in Python

Language: Python, License: LGPL-3.0, Stargazers: 5681, Issues: 0

FewShotLearning

PyTorch implementation of the paper "Optimization as a Model for Few-Shot Learning"

Language: Python, License: MIT, Stargazers: 254, Issues: 0

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 411, Issues: 0

babyai

BabyAI platform. A testbed for training agents to understand and execute language commands.

Language: Python, License: BSD-3-Clause, Stargazers: 680, Issues: 0

deepC

Vendor-independent TinyML deep learning library, compiler, and inference framework for microcomputers and microcontrollers

Language: C++, License: Apache-2.0, Stargazers: 542, Issues: 0

dm-haiku

JAX-based neural network library

Language: Python, License: Apache-2.0, Stargazers: 2842, Issues: 0