zjq0717's starred repositories
Probabilistic_Contrastive_Learning
This repository contains the code for our paper "Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs".
spark_mllib_demo_pro
专注大数据 Spark ML 机器学习:监督学习、无监督学习,主要有:分类算法、回归算法、聚类算法、推荐算法、频繁模式挖掘算法
recommend-system
通过 Spark SQL, Spark MLlib, Spark Streaming 技术,基于隐语义模型(LFM),结合实际项目经验,搭建一套个性化电影推荐系统
News_recommend
基于Spark的新闻推荐系统,包含爬虫项目、web网站以及spark推荐系统
replay-based-recurrent-rl
Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline"
LTL2Action
This is the code repository accompanying the ICML 2021 paper LTL2Action: Generalizing LTL Instructions for Multi-Task RL (https://arxiv.org/abs/2102.06858).
offline_rl
Offline RL implementations for Unstable Baselines
unstable_baselines
Re-implementations of SOTA RL algorithms.
Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
Imagination-Augmented-Agents
Building Agents with Imagination: pytorch step-by-step implementation
MARL-code-pytorch
Concise pytorch implements of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
dqn-on-space-invaders
Deep Q-Network to play the Atari 2600 game of Space Invaders.
Reinforcement-Learning
Reinforcement Learning (RL DQN) / Atari Acrobot, Breakout, and Space Invaders.
DQN-DDQN-on-Space-Invaders
Implementation of Double Deep Q Networks and Dueling Q Networks using Keras on Space Invaders using OpenAI Gym. Code can be easily generalized to other Atari games.
gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
llm-cookbook
面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版
aliyun-chatgpt
aliyun-chatpt是基于最近较火的chatgpt开发的一个项目,本项目代码十分简单, 通过简单调用openai的接口来实现功能
chatgpt-magic-plug
openai接口
DRL-code-pytorch
Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.