leeloolee

DokyoonYoon's repositories

coding-interview-university

A complete computer science study plan to become a software engineer.

CC-BY-SA-4.0100

acme

A library of reinforcement learning components and agents

Language:PythonApache-2.0000

agents

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

Language:PythonApache-2.0000

alpaca-lora-gooddoctor

010

Awesome-Preference-Optimization

000

backjoon

codingtest

Language:Python010

ChipEnv

Language:PythonMIT010

circuit_training

Language:PythonApache-2.0000

confusion-model

Language:Python000

Deep-Multi-Agent-Reinforcement-Learning

deep multi agent reinforcement learning tutorial book for intermediate

000

examples

TensorFlow examples

Language:Jupyter NotebookApache-2.0000

GinTutorial

010

gooddoctor

010

HALOs

A library with extensible implementations of DPO, KTO, PPO, and other human-centered loss functions (HALOs).

Language:PythonApache-2.0000

HMC-SNUH

Language:Jupyter NotebookMIT000

IDC-Tutorials-ko

번역

Language:Jupyter NotebookBSD-3-Clause000

leeloolee

010

LLaVA-ncgpt2

Language:PythonApache-2.0000

llm-course-ko

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookApache-2.0000

maic2023

test code

010

nc-gpt2

000

nn

🧠 Minimal implementations of neural network architectures and layers in PyTorch with side-by-side notes

MIT000

octo

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Language:PythonMIT000

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

MIT000

PracticeEnv

Language:Python010

PracticeEnvrionment

practice envrionment for rl

010

RL-tensorflow2.x

MIT010

Unity-Robotics-Hub

Central repository for tools, tutorials, resources, and documentation for robotic simulation in Unity.

Apache-2.0000

Visual-Instruction-Tuning

SVIT: Scaling up Visual Instruction Tuning

MIT000

xtuner-ko

An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM, Llama, Baichuan, Qwen, ChatGLM)

Language:PythonApache-2.0000