Devin Hua's repositories
AI-Writer
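AI novel writing: generates fantasy (xuanhuan), romance, and other web fiction. A Chinese pretrained generative model. Uses my RWKV model, which is similar to GPT-2. RWKV for Chinese novel generation.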
AirSim-Drone-Racing-VAE-Imitation
Code associated with our paper "Learning Visuomotor Policies for Aerial Navigation Using Cross-Modal Representations": https://arxiv.org/abs/1909.06993
alpaca-lora
Instruct-tune LLaMA on consumer hardware
botsim
BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots
camel
🐫 CAMEL: Communicative Agents for “Mind” Exploration of Large Language Model Society (NeurIPS 2023) https://www.camel-ai.org
ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
ColossalAI
Making large AI models cheaper, faster and more accessible
DRLwithTL
Python code for Deep Reinforcement Learning with Transfer Learning in a Simulated Environment
GPT-Bargaining
Code for arXiv 2023: Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback
gpt4all
gpt4all: a chatbot trained on a massive collection of clean assistant data including code, stories and dialogue
Instruction-Tuning-Papers
Reading list on instruction tuning, a trend that starts from Natural-Instructions (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).
lit-gpt
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Includes demo apps that showcase Meta Llama3 for WhatsApp & Messenger.
LOMO
LOMO: LOw-Memory Optimization
lora_bnb_int8
A demo of fine-tuning chatYuan-large-v2 using LoRA with bnb_int8
path-planning-airsim-scripts
Python scripts for path planning and for obstacle detection and avoidance using a LiDAR sensor in an AirSim (Unreal Engine) simulation.
PEDRA
Programmable Engine for Drone Reinforcement Learning Applications
peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
PPO-based-Autonomous-Navigation-for-Quadcopters
Deep reinforcement learning based autonomous navigation for quadcopters using the PPO algorithm.
python-websocket-server
A simple fully working websocket-server in Python with no external dependencies
PythonRobotics
Python sample code for robotics algorithms.
reasoning-on-graphs
Official Implementation of "Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning"
refiner
The corresponding code from our paper "REFINER: Reasoning Feedback on Intermediate Representations". Do not hesitate to open an issue if you run into any trouble!
RWKV-Runner
An RWKV management and startup tool, fully automated, only 8MB. It also provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.
SocialDial
SocialDial: A Benchmark for Socially-Aware Dialogue Systems (SIGIR'23)
t5_finetuning
CLUE ChatYuan fine-tuning
The-Python-Graph-Gallery
A website displaying hundreds of charts made with Python
UAV_Navigation_DRL_AirSim
This is a new repo used for training UAV navigation (local path planning) policy using DRL methods.
UltraChat
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)