QIONG's starred repositories
ai-for-grant-writing
A curated list of resources for using LLMs to develop more competitive grant applications.
project1-boptest
Building Optimization Performance Tests
public-apis
A collective list of free APIs
SimulationBasedInference.jl
A flexible toolkit for simulation based inference in Julia
LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Meta-Learning-Papers
Meta Learning / Learning to Learn / One Shot Learning / Few Shot Learning
smallville
Generative Agents for video games. Based on Generative Agents: Interactive Simulacra of Human Behavior
awesome-language-agents
List of language agents based on paper "Cognitive Architectures for Language Agents"
generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
Book-Mathematical-Foundation-of-Reinforcement-Learning
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
fccaa.github.io
Personal Homepage
MARL-Algorithms
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
LaTeX-template-phd-thesis
LaTeX Template for OIST Thesis
pytorch-tutorial
PyTorch Tutorial for Deep Learning Researchers
apis-dcdc_batt_comm
Device Driver Sample for Energy Sharing System
seq2seq-signal-prediction
Signal forecasting with a Sequence-to-Sequence (seq2seq) Recurrent Neural Network (RNN) model in TensorFlow - Guillaume Chevalier
NeurADP-for-Ride-Pooling
A simulator and learning agent to solve the ridesharing problem
maddpg-pytorch
PyTorch Implementation of MADDPG (Lowe et. al. 2017)
MADDPG_torch
The code for maddpg using pytorch