zjq0717's starred repositories

Probabilistic_Contrastive_Learning

This repository contains the code for our paper "Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs".

Language:PythonStargazers:33Issues:0Issues:0

oyster

Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)

License:MITStargazers:1Issues:0Issues:0
Language:PythonStargazers:2Issues:0Issues:0

spark_mllib_demo_pro

专注大数据 Spark ML 机器学习:监督学习、无监督学习,主要有:分类算法、回归算法、聚类算法、推荐算法、频繁模式挖掘算法

Language:JavaStargazers:15Issues:0Issues:0

recommend-system

通过 Spark SQL, Spark MLlib, Spark Streaming 技术,基于隐语义模型(LFM),结合实际项目经验,搭建一套个性化电影推荐系统

Language:ScalaStargazers:11Issues:0Issues:0

News_recommend

基于Spark的新闻推荐系统,包含爬虫项目、web网站以及spark推荐系统

Language:ScalaStargazers:338Issues:0Issues:0

Sequoia

The Research Tree - A playground for research at the intersection of Continual, Reinforcement, and Self-Supervised Learning.

Language:PythonLicense:GPL-3.0Stargazers:190Issues:0Issues:0

replay-based-recurrent-rl

Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline"

Language:PythonLicense:Apache-2.0Stargazers:30Issues:0Issues:0

LTL2Action

This is the code repository accompanying the ICML 2021 paper LTL2Action: Generalizing LTL Instructions for Multi-Task RL (https://arxiv.org/abs/2102.06858).

Language:PythonStargazers:27Issues:0Issues:0

garage

A toolkit for reproducible reinforcement learning research.

Language:PythonLicense:MITStargazers:1834Issues:0Issues:0

offline_rl

Offline RL implementations for Unstable Baselines

Language:PythonStargazers:3Issues:0Issues:0

unstable_baselines

Re-implementations of SOTA RL algorithms.

Language:PythonStargazers:119Issues:0Issues:0

Metaworld

Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning

Language:PythonLicense:MITStargazers:1155Issues:0Issues:0

MetaGym

Collection of Reinforcement Learning / Meta Reinforcement Learning Environments.

Language:PythonLicense:Apache-2.0Stargazers:272Issues:0Issues:0

Deep-Reinforcement-Learning-Algorithms-with-PyTorch

PyTorch implementations of deep reinforcement learning algorithms and environments

Language:PythonLicense:MITStargazers:5483Issues:0Issues:0

maml-rl

元强化学习MAML实现, 修改了部分老旧而不能运行的代码, 并可以通过render直接查看训练的结果

Language:Jupyter NotebookStargazers:2Issues:0Issues:0

meta_rl

Meta RL codebase for Unstable Baselines

Language:PythonStargazers:20Issues:0Issues:0

Imagination-Augmented-Agents

Building Agents with Imagination: pytorch step-by-step implementation

Language:Jupyter NotebookStargazers:204Issues:0Issues:0

MARL-code-pytorch

Concise pytorch implements of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.

Language:PythonLicense:MITStargazers:371Issues:0Issues:0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonLicense:MITStargazers:3517Issues:0Issues:0

dqn-on-space-invaders

Deep Q-Network to play the Atari 2600 game of Space Invaders.

Language:PythonLicense:MITStargazers:3Issues:0Issues:0

Reinforcement-Learning

Reinforcement Learning (RL DQN) / Atari Acrobot, Breakout, and Space Invaders.

Language:Jupyter NotebookLicense:MITStargazers:4Issues:0Issues:0

DQN-DDQN-on-Space-Invaders

Implementation of Double Deep Q Networks and Dueling Q Networks using Keras on Space Invaders using OpenAI Gym. Code can be easily generalized to other Atari games.

Language:PythonStargazers:37Issues:0Issues:0

gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。

Language:PythonLicense:GPL-3.0Stargazers:61025Issues:0Issues:0

llm-cookbook

面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版

Language:Jupyter NotebookStargazers:10375Issues:0Issues:0

aliyun-chatgpt

aliyun-chatpt是基于最近较火的chatgpt开发的一个项目,本项目代码十分简单, 通过简单调用openai的接口来实现功能

Language:PythonLicense:MITStargazers:12Issues:0Issues:0

chatgpt-magic-plug

openai接口

Language:JavaScriptLicense:MITStargazers:20Issues:0Issues:0

DRL-code-pytorch

Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.

Language:PythonLicense:MITStargazers:947Issues:0Issues:0