CookieYang's starred repositories

LLM101n

LLM101n: Let's build a Storyteller

Stargazers: 17101 | Issues: 0

Reinforcement-Learning-Papers

📚 List of Top-tier Conference Papers on Reinforcement Learning (RL), including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.

License: MIT | Stargazers: 271 | Issues: 0

Reinforcement-Learning-Papers

Related papers for reinforcement learning, including classic papers and latest papers in top conferences

License: MIT | Stargazers: 260 | Issues: 0

RSPapers

A Curated List of Must-read Papers on Recommender System.

License: MIT | Stargazers: 6016 | Issues: 0

pdf2htmlEX

Convert PDF to HTML without losing text or format.

Language: HTML | License: NOASSERTION | Stargazers: 3559 | Issues: 0

realikun

[NeurIPS 2022] 1st Place Solution for the 3rd Neural MMO Challenge

Language: Python | License: MIT | Stargazers: 27 | Issues: 0

gpt4all

GPT4All: Chat with Local LLMs on Any Device

Language: C++ | License: MIT | Stargazers: 67447 | Issues: 0

PerfectDou

[NeurIPS 2022] PerfectDou: Dominating DouDizhu with Perfect Information Distillation

Language: Python | License: Apache-2.0 | Stargazers: 141 | Issues: 0

WeChatExtension-ForMac

Feature extension, plugin, and assistant for WeChat on Mac (A plugin for Mac WeChat)

Language: Objective-C | License: MIT | Stargazers: 22201 | Issues: 0

AdversarialAutoencoder

An implementation of unsupervised cluster and semisupervised type of adversarial autoencoder

Language: Python | License: Apache-2.0 | Stargazers: 8 | Issues: 0

rebel

An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.

Language: C++ | License: Apache-2.0 | Stargazers: 639 | Issues: 0

acme

A library of reinforcement learning components and agents

Language: Python | License: Apache-2.0 | Stargazers: 3433 | Issues: 0

dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 10423 | Issues: 0

rlpyt

Reinforcement Learning in PyTorch

Language: Python | License: MIT | Stargazers: 2212 | Issues: 0

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language: Python | License: Apache-2.0 | Stargazers: 32116 | Issues: 0

RAdam

On the Variance of the Adaptive Learning Rate and Beyond

Language: Python | License: Apache-2.0 | Stargazers: 2533 | Issues: 0

tqdm

⚡ A Fast, Extensible Progress Bar for Python and CLI

Language: Python | License: NOASSERTION | Stargazers: 27980 | Issues: 0

PhoenixGo

Go AI program which implements the AlphaGo Zero paper

Language: C++ | License: NOASSERTION | Stargazers: 2869 | Issues: 0

interview_internal_reference

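Latest 2023 summary of technical interview questions from Alibaba, Tencent, Baidu, Meituan, Toutiao, and others, with answers and compiled analysis from expert question setters.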

Language: Python | Stargazers: 36389 | Issues: 0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python | License: MIT | Stargazers: 15521 | Issues: 0

Data-Competition-TopSolution

Data competition top solutions: a curated collection of open-source top solutions from data competitions

Stargazers: 3337 | Issues: 0

deeplearningbook-chinese

Deep Learning Book Chinese Translation

Language: TeX | Stargazers: 35401 | Issues: 0

cloudpickle

Extended pickling support for Python objects

Language: Python | License: NOASSERTION | Stargazers: 1607 | Issues: 0

PySnooper

Never use print for debugging again

Language: Python | License: MIT | Stargazers: 16298 | Issues: 0

AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Language: Python | License: MIT | Stargazers: 3239 | Issues: 0

Reinforcement-learning-with-tensorflow

Simple reinforcement learning tutorials from 莫烦Python (Chinese-language AI tutorials)

Language: Python | License: MIT | Stargazers: 8762 | Issues: 0

xg2xg

by ex-googlers, for ex-googlers - a lookup table of similar tech & services

Stargazers: 14389 | Issues: 0

shap

A game theoretic approach to explain the output of any machine learning model.

Language: Jupyter Notebook | License: MIT | Stargazers: 22162 | Issues: 0

nsfw_data_source_urls

Collection of NSFW images URLs for the purposes of training an NSFW Image Classifier

License: MIT | Stargazers: 3328 | Issues: 0

nsfw_data_scraper

Collection of scripts to aggregate image data for the purposes of training an NSFW Image Classifier

Language: Shell | License: MIT | Stargazers: 12200 | Issues: 0