suzhidong's repositories

Adaptation-with-Noisy-OracLE

PyTorch implementation for our paper "Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation"

License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

Awesome-Incremental-Learning

Awesome Incremental Learning

Stargazers:0Issues:0Issues:0

chatadp-paper

This paper was accepted by ICRA2024

Stargazers:0Issues:1Issues:0

chatbot-deployment

Deployment of PyTorch chatbot with Flask

Stargazers:0Issues:0Issues:0

chatgpt-sql

Allows you to query an SQL database using natural language.

Stargazers:0Issues:0Issues:0

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

License:MITStargazers:0Issues:0Issues:0

django-locallibrary-tutorial

Local Library website written in Django; example for the MDN server-side development Django module: https://developer.mozilla.org/en-US/docs/Learn/Server-side/Django.

License:CC0-1.0Stargazers:0Issues:0Issues:0

FSL-Mate

FSL-Mate: A collection of resources for few-shot learning (FSL).

Stargazers:0Issues:0Issues:0

google-research

Google Research

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:C++Stargazers:0Issues:0Issues:0

learning-from-human-preferences

Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"

License:MITStargazers:0Issues:0Issues:0

lets-do-irl

Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)

License:MITStargazers:0Issues:0Issues:0

litgpt

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

License:Apache-2.0Stargazers:0Issues:0Issues:0

MetaLearning-Lab

The code and methods offered in Awesome-META+: https://wangjingyao07.github.io/Awesome-Meta-Learning-Platform/

Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

pytorch-chatbot

Simple chatbot implementation with PyTorch.

License:MITStargazers:0Issues:0Issues:0

reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

License:MITStargazers:0Issues:0Issues:0

REKCARC-TSC-UHT

清华大学计算机系课程攻略 Guidance for courses in Department of Computer Science and Technology, Tsinghua University

Language:HTMLLicense:CC-BY-SA-4.0Stargazers:0Issues:0Issues:0

SalesBot

Transitioning from Open-Domain Chit-Chat to Task-Oriented Dialogues

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

tatk

Task-oriented dialog system toolkits

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

ToD-BERT

Pre-Trained Models for ToD-BERT

License:BSD-2-ClauseStargazers:0Issues:0Issues:0

trade-dst

Source code for transferable dialogue state generator (TRADE, Wu et al., 2019). https://arxiv.org/abs/1905.08743

Stargazers:0Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

weekly

科技爱好者周刊,每周五发布

Stargazers:0Issues:0Issues:0
Language:CSSStargazers:0Issues:0Issues:0