oriskunk's repositories
a0-jax
AlphaZero in JAX
AlphaZero_Gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
awesome-marketing-datascience
Curated list of useful LLM / Analytics / Datascience resources
awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
BPP-3D-Viewer
3D pattern viewer for cutting and packing problems
chatbot-ui
An open source ChatGPT UI.
chatllama
ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT
decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
gbr
Go board image recognition
gym-pcgrl
A package for "Procedural Content Generation via Reinforcement Learning" OpenAI Gym interface.
IR-BPP
Packing irregular objects with deep reinforcement learning.
K-G-OAT
IA3방식으로 KoAlpaca를 fine tuning한 한국어 LLM모델
KataGo
GTP engine and self-play learning in Go
langchain
⚡ Building applications with LLMs through composability ⚡
llama.cpp
Port of Facebook's LLaMA model in C/C++
LLM-As-Chatbot
Alpaca-LoRA as Chatbot service
match3
A web match-3 game in C++14 using SDL2 / MVC / Range-v3 / Meta State Machine / Dependency Injection
mctx
Monte Carlo tree search in JAX
ml-papers
My collection of machine learning papers
Online-3D-BPP-DRL
This repository contains the implementation of paper Online 3D Bin Packing with Constrained Deep Reinforcement Learning.
project_MYM
Combined computer vision techniques and convolutional neural networks to accurately classify chess pieces and identified their location on a chessboard. Tools: Python, Google Cloud, Keras, TensorFlow, OpenCV, Pillow, Scikit-learn, NumPy, Seaborn, and others
R-NaD
Experimentation with Regularized Nash Dynamics on a GPU accelerated game
reverb
Reverb is an efficient and easy-to-use data storage and transport system designed for machine learning research
Tetris-deep-Q-learning-pytorch
Deep Q-learning for playing tetris game
toolformer-pytorch
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
Travelling-Salesman-Visualiser
Algorithm visualiser for the Travelling Salesman Problem
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs