oriskunk

followers

following

stars

oriskunk's repositories

a0-jax

AlphaZero in JAX

Language:PythonMIT000

AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Language:PythonMIT000

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language:PythonMIT000

awesome-marketing-datascience

Curated list of useful LLM / Analytics / Datascience resources

MIT000

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

Apache-2.0000

BPP-3D-Viewer

3D pattern viewer for cutting and packing problems

Language:JavaScriptMIT000

chatbot-ui

An open source ChatGPT UI.

Language:TypeScriptMIT000

chatllama

ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT

Language:Python000

circuit_training

Apache-2.0000

decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

MIT000

gbr

Go board image recognition

MIT000

gym-pcgrl

A package for "Procedural Content Generation via Reinforcement Learning" OpenAI Gym interface.

MIT000

IR-BPP

Packing irregular objects with deep reinforcement learning.

000

K-G-OAT

IA3방식으로 KoAlpaca를 fine tuning한 한국어 LLM모델

000

KataGo

GTP engine and self-play learning in Go

NOASSERTION000

langchain

⚡ Building applications with LLMs through composability ⚡

MIT000

llama.cpp

Port of Facebook's LLaMA model in C/C++

MIT000

LLM-As-Chatbot

Alpaca-LoRA as Chatbot service

Apache-2.0000

match3

A web match-3 game in C++14 using SDL2 / MVC / Range-v3 / Meta State Machine / Dependency Injection

000

mctx

Monte Carlo tree search in JAX

Apache-2.0000

ml-papers

My collection of machine learning papers

MIT000

Online-3D-BPP-DRL

This repository contains the implementation of paper Online 3D Bin Packing with Constrained Deep Reinforcement Learning.

000

project_MYM

Combined computer vision techniques and convolutional neural networks to accurately classify chess pieces and identified their location on a chessboard. Tools: Python, Google Cloud, Keras, TensorFlow, OpenCV, Pillow, Scikit-learn, NumPy, Seaborn, and others

000

puzzleagent

000

R-NaD

Experimentation with Regularized Nash Dynamics on a GPU accelerated game

Apache-2.0000

reverb

Reverb is an efficient and easy-to-use data storage and transport system designed for machine learning research

Apache-2.0000

Tetris-deep-Q-learning-pytorch

Deep Q-learning for playing tetris game

MIT000

toolformer-pytorch

Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI

MIT000

Travelling-Salesman-Visualiser

Algorithm visualiser for the Travelling Salesman Problem

GPL-3.0000

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Apache-2.0000