imerdell-55's repositories

bi-att-flow

The Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granularity and uses a bi-directional attention flow mechanism to achieve a query-aware context representation without early summarization.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

curiosity-driven-exploration-pytorch

Curiosity-driven Exploration by Self-supervised Prediction

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Deep-Reinforcement-Learning-Algorithms-with-PyTorch

PyTorch implementations of deep reinforcement learning algorithms and environments

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

farbox-template

Template repository for Farbox 2, with support for automatic synchronization

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

fucking-algorithm

Solving algorithm problems is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.

Stargazers: 0 | Issues: 0 | Issues: 0
Language: Go | Stargazers: 0 | Issues: 0 | Issues: 0

Hierarchical-Meta-Reinforcement-Learning

This repository contains the implementation for the paper "Exploration via Hierarchical Meta Reinforcement Learning".

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

HRAC

PyTorch code accompanying the paper "Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning" (NeurIPS 2020).

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

interview

📚 A summary of basic knowledge for C/C++ technical interviews, aimed at job seekers and beginners. It covers the language, libraries, data structures, algorithms, systems, networking, and linking and loading, along with interview experience, recruitment, and referral information.

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

notion_widgets

A set of HTML widgets that can be embedded into Notion.so (https://www.notion.so/) pages. For more, see https://blog.shorouk.dev/notion-widgets-gallery/

Language: HTML | Stargazers: 0 | Issues: 0 | Issues: 0

LLaMA-Efficient-Tuning

Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

overleaf-thesis-template

LaTeX thesis template on Overleaf

Stargazers: 0 | Issues: 0 | Issues: 0

project-based-learning

Curated list of project-based tutorials

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

reinforcement_learning_ppo_rnd

Deep reinforcement learning using Proximal Policy Optimization and Random Network Distillation in TensorFlow 2 and PyTorch, with some explanation

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

reinforcement_learning_robocup

Implementation of Correlated-Q Learning on RoboCup Game

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0
Language: HTML | Stargazers: 0 | Issues: 1 | Issues: 0

Sparse-Reward-Algorithms

Implementations of many sparse-reward algorithms in the Gym Fetch environment

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

strategitica

Displays Habitica tasks in calendar format, along with some other helpful info and a sleep toggle.

Language: JavaScript | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

superset

Apache Superset is a Data Visualization and Data Exploration Platform

Language: TypeScript | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: wdl | Stargazers: 0 | Issues: 0 | Issues: 0

unified-hrl

Unified Model-Free Hierarchical Reinforcement Learning Framework

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0