subaruclover

followers

following

stars

Earth

subaruclover.github.io/

QIONG's starred repositories

ai-for-grant-writing

A curated list of resources for using LLMs to develop more competitive grant applications.

Language:PythonCC-BY-4.0268600

project1-boptest

Building Optimization Performance Tests

Language:ModelicaNOASSERTION10900

public-apis

A collective list of free APIs

Language:PythonMIT31682400

SimulationBasedInference.jl

A flexible toolkit for simulation based inference in Julia

Language:JuliaMIT1800

LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookNOASSERTION3099100

Bachelor-Reinforcement-Learning

Language:Python100

Meta-Learning-Papers

Meta Learning / Learning to Learn / One Shot Learning / Few Shot Learning

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)

Language:PythonApache-2.0244000

Eureka

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)

Language:Jupyter NotebookMIT282500

smallville

Generative Agents for video games. Based on Generative Agents: Interactive Simulacra of Human Behavior

Language:JavaMIT63600

codellama

Inference code for CodeLlama models

Language:PythonNOASSERTION1601000

llama

Inference code for Llama models

Language:PythonNOASSERTION5629600

awesome-language-agents

List of language agents based on paper "Cognitive Architectures for Language Agents"

Language:TeX76100

numpy-ml

Machine learning, in numpy

Language:PythonGPL-3.01537100

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

Apache-2.01737400

gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。

Language:PythonGPL-3.06543300

Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

Language:MATLAB378800

hok_env

Honor of Kings AI Open Environment of Tencent

Language:PythonApache-2.064200

fccaa.github.io

Personal Homepage

Language:CSSNOASSERTION100

moderncv

A modern curriculum vitae class for LaTeX

Language:TeXLPPL-1.3c73900

Simple-CV

A minimalistic CV template with BibLaTeX support

Language:TeXMIT27600

MARL-Algorithms

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Language:Python145700

LaTeX-template-phd-thesis

LaTeX Template for OIST Thesis

Language:TeXMIT200

pytorch-tutorial

PyTorch Tutorial for Deep Learning Researchers

Language:PythonMIT3017100

apis-dcdc_batt_comm

Device Driver Sample for Energy Sharing System

Language:PythonApache-2.0300

seq2seq-signal-prediction

Signal forecasting with a Sequence-to-Sequence (seq2seq) Recurrent Neural Network (RNN) model in TensorFlow - Guillaume Chevalier

Language:Jupyter NotebookApache-2.0108300

NeurADP-for-Ride-Pooling

A simulator and learning agent to solve the ridesharing problem

Language:Python3100

maddpg-pytorch

PyTorch Implementation of MADDPG (Lowe et. al. 2017)

Language:PythonMIT57200

MADDPG_torch

The code for maddpg using pytorch

Language:Python16200

on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Language:PythonMIT130800