blue-wx

blue-wx

Geek Repo

0

followers

0

following

Github PK Tool:Github PK Tool

blue-wx's starred repositories

styleguide

Style guides for Google-originated open-source projects

Language:HTMLLicense:Apache-2.0Stargazers:37014Issues:0Issues:0

TD3

Author's PyTorch implementation of TD3 for OpenAI gym tasks

Language:PythonLicense:MITStargazers:1655Issues:0Issues:0

PPO-Continuous-Pytorch

A clean and robust Pytorch implementation of PPO on continuous action space.

Language:PythonLicense:MITStargazers:112Issues:0Issues:0

gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。

Language:PythonLicense:GPL-3.0Stargazers:62608Issues:0Issues:0
Stargazers:14Issues:0Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:15535Issues:0Issues:0

OpenAI_Five_vs_Dota2_Explained

This is the code for "OpenAI Five vs DOTA 2 Explained" By Siraj Raval on Youtube

Language:PythonLicense:MITStargazers:152Issues:0Issues:0

on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Language:PythonLicense:MITStargazers:1213Issues:0Issues:0

PPOxFamily

PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )

Language:PythonLicense:Apache-2.0Stargazers:1852Issues:0Issues:0

orbitdeterminator

determination of satellite orbits and more

Language:Jupyter NotebookLicense:MITStargazers:179Issues:0Issues:0

Hands-On-Meta-Learning-With-Python

Learning to Learn using One-Shot Learning, MAML, Reptile, Meta-SGD and more with Tensorflow

Language:Jupyter NotebookStargazers:1156Issues:0Issues:0

Machine-Learning-is-ALL-You-Need

🔥🌟《Machine Learning 格物志》: ML + DL + RL basic codes and notes by sklearn, PyTorch, TensorFlow, Keras & the most important, from scratch!💪 This repository is ALL You Need!

Language:PythonStargazers:382Issues:0Issues:0

awesome_deep_learning_interpretability

深度学习近年来关于神经网络模型解释性的相关高引用/顶会论文(附带代码)

License:MITStargazers:729Issues:0Issues:0

interpretable-ml-book

Book about interpretable machine learning

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:4728Issues:0Issues:0

awesome-machine-learning-interpretability

A curated list of awesome responsible machine learning resources.

License:CC0-1.0Stargazers:3536Issues:0Issues:0

xinzhibei_rationale_competition_2022

2022 兴智杯 -- 深度学习模型可解释性 -- 第三名(二等奖)

Language:PythonLicense:Apache-2.0Stargazers:3Issues:0Issues:0

dissect

深度卷积神经网络分类器的可解释性研究

Language:PythonLicense:NOASSERTIONStargazers:4Issues:0Issues:0

Visual-analytics-and-Interpretability-in-Deep-Learning

本项目主要是通过可视分析的手段,对深度学习的可解释性做出讨论与探讨。并且记录小组成员的学习过程与工作

Language:HTMLLicense:GPL-3.0Stargazers:47Issues:0Issues:0

InterpretableMLBook

《可解释的机器学习--黑盒模型可解释性理解指南》,该书为《Interpretable Machine Learning》中文版

License:GPL-3.0Stargazers:4814Issues:0Issues:0

derl

Code for "Embodied Intelligence via Learning and Evolution", Gupta et al, Nature Communications

Language:PythonStargazers:157Issues:0Issues:0

Deep-CFR

Scalable Implementation of Deep CFR and Single Deep CFR

Language:PythonLicense:MITStargazers:271Issues:0Issues:0

PromptCraft-Robotics

Community for applying LLMs to robotics and a robot simulator with ChatGPT integration

Language:PythonLicense:MITStargazers:1791Issues:0Issues:0

Counterfactual_Regret_Minimization_Python

Counterfactual Regret Minimization (CFR) sample code in Python

Language:PythonStargazers:12Issues:0Issues:0

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language:PythonLicense:MITStargazers:52229Issues:0Issues:0

regret-matching

Simple implementation of regret matching algorithm for RPS nash equilibrium computation via self-play

Language:PythonStargazers:23Issues:0Issues:0

rebel

An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.

Language:C++License:Apache-2.0Stargazers:642Issues:0Issues:0

go-cfr

go-cfr implements several forms of Counter Factual Regret minimization in Golang

Language:GoLicense:GPL-3.0Stargazers:18Issues:0Issues:0

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language:C++License:Apache-2.0Stargazers:4112Issues:0Issues:0

game_theory

These are some MATLAB implementations of functions related to Game Thory such as MinMax, Nash and Backwards Induction, which are applied in some exercises.

Language:MATLABStargazers:8Issues:0Issues:0

TuGames

A Mathematica Package for Cooperative Game Theory

Language:MathematicaLicense:MITStargazers:11Issues:0Issues:0