kykim0's starred repositories
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
AlignLLMHumanSurvey
Aligning Large Language Models with Human: A Survey
voice-changer
リアルタイムボイスチェンジャー Realtime Voice Changer
LLM-Agents-Papers
A repo lists papers related to LLM based agent
improved-aesthetic-predictor
CLIP+MLP Aesthetic Score Predictor
pytorch-GAT
My implementation of the original GAT paper (Veličković et al.). I've additionally included the playground.py file for visualizing the Cora dataset, GAT embeddings, an attention mechanism, and entropy histograms. I've supported both Cora (transductive) and PPI (inductive) examples!
Probabilistic-Programming-and-Bayesian-Methods-for-Hackers
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
rewardedsoups
Rewarded soups official implementation
PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
morl-baselines
Multi-Objective Reinforcement Learning algorithms implementations.
FewShotLearning
Pytorch implementation of the paper "Optimization as a Model for Few-Shot Learning"