imerdell-55's repositories
bi-att-flow
Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granularity and uses a bi-directional attention flow mechanism to achieve a query-aware context representation without early summarization.
curiosity-driven-exploration-pytorch
Curiosity-driven Exploration by Self-supervised Prediction
Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
farbox-template
Farbox 2 支持自动同步模板仓库
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
fucking-algorithm
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
Hierarchical-Meta-Reinforcement-Learning
This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning.
HRAC
PyTorch code accompanying the paper "Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning" (NeurIPS 2020).
interview
📚 C/C++ 技术面试基础知识总结,包括语言、程序库、数据结构、算法、系统、网络、链接装载库等知识及面试经验、招聘、内推等信息。This repository is a summary of the basic knowledge of recruiting job seekers and beginners in the direction of C/C++ technology, including language, program library, data structure, algorithm, system, network, link loading library, interview experience, recruitment, recommendation, etc.
notion_widgets
A set of HTML widgets that could be embedded into Notion.so https://www.notion.so/ pages. For more see https://blog.shorouk.dev/notion-widgets-gallery/
LLaMA-Efficient-Tuning
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)
overleaf-thesis-template
latex thesis template on overleaf
project-based-learning
Curated list of project-based tutorials
reinforcement_learning_ppo_rnd
Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some explanation
reinforcement_learning_robocup
Implementation of Correlated-Q Learning on RoboCup Game
Sparse-Reward-Algorithms
Implement many Sparse Reward algorithms in Gym Fetch environment
strategitica
Displays Habitica tasks in calendar format, along with some other helpful info and a sleep toggle.
superset
Apache Superset is a Data Visualization and Data Exploration Platform
unified-hrl
Unified Model-Free Hierarchical Reinforcement Learning Framework