Michael Hu's repositories
deep_rl_zoo
A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.
alpha_zero
A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
InstructLLaMA
Implements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF), to train and fine-tune the LLaMA2 model to follow human instructions, similar to InstructGPT or ChatGPT, but on a much smaller scale.
SAP-UI5-Development-Re-Introduction
This is the official source code for Udemy course SAP UI5 Development Re-Introduction
RAG-LLaMA
A clean and simple implementation of Retrieval Augmented Generation (RAG) to enhanced LLaMA chat model to answer questions from a private knowledge base. We use Tesla user manuals to build the knowledge base, and use open-source embedding and Cross-Encoders reranking models from Sentence Transformers in this project.
art-of-reinforcement-learning
Original source code The Art of Reinforcement Learning by Michael Hu
ReservoirComputing
Implementing Reservoir Computing Networks for Predicting Dynamic Systems
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
tensorrtllm_backend
The Triton TensorRT-LLM Backend