seven8827's repositories
AA-AEGD
This repository is the implementation of Anderson acceleration for "adaptive gradient descent with energy" (AEGD).
Awesome-CV
:page_facing_up: Awesome CV is LaTeX template for your outstanding job application
awesome-rl-for-cybersecurity
A curated list of resources dedicated to reinforcement learning applied to cyber security.
BlenderProc
A procedural Blender pipeline for photorealistic training image generation
ChatPaper
Use ChatGPT to summary the Arxiv papers.
Deep-RL-Notes
A collection of comprehensive notes on Deep Reinforcement Learning, customized for UC Berkeley's CS 285 (prev. CS 294-112)
deep-symbolic-optimization
Source code for deep symbolic optimization.
envlogger
A tool for recording RL trajectories.
explainable-minichess
Chess environment for smaller chess variants, AlphaZero-like MCTS-learning, and Concept Detection
FGD-trading
An implementation of a `fictitious gradient descent' algorithm to find the mean field Nash equilibrium for a an example trading problem.
fiss_planner
[RA-L 2022] FISS: A Trajectory Planning Framework using Fast Iterative Search and Sampling Strategy for Autonomous Driving
godot_rl_agents
An Open Source package that allows video game creators, AI researchers and hobbyists the opportunity to learn complex behaviors for their Non Player Characters or agents
Griddly
A grid-world game engine for game AI research
leetcode
Provide all my solutions and explanations in Chinese for all the Leetcode coding problems.
leetcode-1
推荐刷题网站:https://www.lintcode.com/?utm_source=tf-github-lucifer2022 LeetCode Solutions: A Record of My Problem Solving Journey.( leetcode题解,记录自己的leetcode解题之路。)
leetcode-2
Python & JAVA Solutions for Leetcode
LeetCode-Py
⛽️「算法通关手册」,超详细的「算法与数据结构」基础讲解教程,700+ 道「LeetCode 题目」详细解析。通过「算法理论学习」和「编程实战练习」相结合的方式,从零基础到彻底掌握算法知识。
levenberg-marquardt-method
Python implementation of Levenberg-Marquardt algorithm built from scratch using NumPy.
memory-maze
Evaluating long-term memory of reinforcement learning algorithms
MinAtar-Faster
Optimized version of the MinAtar (testbed for AI agents) codebase along with benchmarks for standard Reinforcement Learning agents on various environments.
Mini-batch-SGD-large-dynamic-networks
large dynamic network latent space inference via mini-batch stochastic gradient descent. Variational approach for lower bound marginal maximization.
optuna
A hyperparameter optimization framework
PaS_CrowdNav
Occlusion-Aware Crowd Navigation Using People as Sensors: ICRA2023
PhySO
Physical Symbolic Optimization
QDax
Accelerated Quality-Diversity
relod
An efficient remote-onboard architecture for real-time Reinforcement Learning
Top-AI-Conferences-Paper-with-Code
MLNLP: This repository is a collection of AI top conferences papers (e.g. ACL, EMNLP, NAACL, COLING, AAAI, IJCAI, ICLR, NeurIPS, and ICML) with open resource code
torchimize
numerical optimization algorithms in pytorch
unscalable-heuristic-approximator
Deep learning/Reinforcement Learning methods for A*