DokyoonYoon (leeloolee)

leeloolee

Geek Repo

Company:Korea

Location:Seoul

Home Page:leeloolee.github.io

Github PK Tool:Github PK Tool

DokyoonYoon's repositories

coding-interview-university

A complete computer science study plan to become a software engineer.

License:CC-BY-SA-4.0Stargazers:1Issues:0Issues:0

acme

A library of reinforcement learning components and agents

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

agents

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0

backjoon

codingtest

Language:PythonStargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

Deep-Multi-Agent-Reinforcement-Learning

deep multi agent reinforcement learning tutorial book for intermediate

Stargazers:0Issues:0Issues:0

examples

TensorFlow examples

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0
Stargazers:0Issues:1Issues:0

HALOs

A library with extensible implementations of DPO, KTO, PPO, and other human-centered loss functions (HALOs).

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

llm-course-ko

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

maic2023

test code

Stargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0

nn

🧠 Minimal implementations of neural network architectures and layers in PyTorch with side-by-side notes

License:MITStargazers:0Issues:0Issues:0

octo

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

License:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

PracticeEnvrionment

practice envrionment for rl

Stargazers:0Issues:1Issues:0
License:MITStargazers:0Issues:1Issues:0

Unity-Robotics-Hub

Central repository for tools, tutorials, resources, and documentation for robotic simulation in Unity.

License:Apache-2.0Stargazers:0Issues:0Issues:0

Visual-Instruction-Tuning

SVIT: Scaling up Visual Instruction Tuning

License:MITStargazers:0Issues:0Issues:0

xtuner-ko

An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM, Llama, Baichuan, Qwen, ChatGLM)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0