oriskunk's repositories

a0-jax

AlphaZero in JAX

Language: Python · License: MIT · Stargazers: 0 · Issues: 0

AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Language: Python · License: MIT · Stargazers: 0 · Issues: 0

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language: Python · License: MIT · Stargazers: 0 · Issues: 0

awesome-marketing-datascience

Curated list of useful LLM / Analytics / Datascience resources

License: MIT · Stargazers: 0 · Issues: 0

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

License: Apache-2.0 · Stargazers: 0 · Issues: 0

BPP-3D-Viewer

3D pattern viewer for cutting and packing problems

Language: JavaScript · License: MIT · Stargazers: 0 · Issues: 0

chatbot-ui

An open source ChatGPT UI.

Language: TypeScript · License: MIT · Stargazers: 0 · Issues: 0

chatllama

ChatLLaMA 📢 Open-source implementation of a LLaMA-based ChatGPT, runnable on a single GPU. 15x faster training process than ChatGPT

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 0

decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

License: MIT · Stargazers: 0 · Issues: 0

gbr

Go board image recognition

License: MIT · Stargazers: 0 · Issues: 0

gym-pcgrl

A package for "Procedural Content Generation via Reinforcement Learning" OpenAI Gym interface.

License: MIT · Stargazers: 0 · Issues: 0

IR-BPP

Packing irregular objects with deep reinforcement learning.

Stargazers: 0 · Issues: 0

K-G-OAT

A Korean LLM fine-tuned from KoAlpaca using the IA3 method

Stargazers: 0 · Issues: 0

KataGo

GTP engine and self-play learning in Go

License: NOASSERTION · Stargazers: 0 · Issues: 0

langchain

⚡ Building applications with LLMs through composability ⚡

License: MIT · Stargazers: 0 · Issues: 0

llama.cpp

Port of Facebook's LLaMA model in C/C++

License: MIT · Stargazers: 0 · Issues: 0

LLM-As-Chatbot

Alpaca-LoRA as Chatbot service

License: Apache-2.0 · Stargazers: 0 · Issues: 0

match3

A web match-3 game in C++14 using SDL2 / MVC / Range-v3 / Meta State Machine / Dependency Injection

Stargazers: 0 · Issues: 0

mctx

Monte Carlo tree search in JAX

License: Apache-2.0 · Stargazers: 0 · Issues: 0

ml-papers

My collection of machine learning papers

License: MIT · Stargazers: 0 · Issues: 0

Online-3D-BPP-DRL

This repository contains the implementation of the paper "Online 3D Bin Packing with Constrained Deep Reinforcement Learning".

Stargazers: 0 · Issues: 0

project_MYM

Combines computer vision techniques and convolutional neural networks to classify chess pieces and identify their locations on a chessboard. Tools: Python, Google Cloud, Keras, TensorFlow, OpenCV, Pillow, Scikit-learn, NumPy, Seaborn, and others

Stargazers: 0 · Issues: 0

R-NaD

Experimentation with Regularized Nash Dynamics on a GPU accelerated game

License: Apache-2.0 · Stargazers: 0 · Issues: 0

reverb

Reverb is an efficient and easy-to-use data storage and transport system designed for machine learning research

License: Apache-2.0 · Stargazers: 0 · Issues: 0

Tetris-deep-Q-learning-pytorch

Deep Q-learning for playing Tetris

License: MIT · Stargazers: 0 · Issues: 0

toolformer-pytorch

Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI

License: MIT · Stargazers: 0 · Issues: 0

Travelling-Salesman-Visualiser

Algorithm visualiser for the Travelling Salesman Problem

License: GPL-3.0 · Stargazers: 0 · Issues: 0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

License: Apache-2.0 · Stargazers: 0 · Issues: 0