DokyoonYoon's repositories
coding-interview-university
A complete computer science study plan to become a software engineer.
acme
A library of reinforcement learning components and agents
agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Deep-Multi-Agent-Reinforcement-Learning
deep multi agent reinforcement learning tutorial book for intermediate
examples
TensorFlow examples
HALOs
A library with extensible implementations of DPO, KTO, PPO, and other human-centered loss functions (HALOs).
llm-course-ko
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
nn
🧠 Minimal implementations of neural network architectures and layers in PyTorch with side-by-side notes
octo
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
PracticeEnvrionment
practice envrionment for rl
Unity-Robotics-Hub
Central repository for tools, tutorials, resources, and documentation for robotic simulation in Unity.
Visual-Instruction-Tuning
SVIT: Scaling up Visual Instruction Tuning
xtuner-ko
An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM, Llama, Baichuan, Qwen, ChatGLM)