Devin Hua's repositories

AI-Writer

AI 写小说,生成玄幻和言情网文等等。中文预训练生成模型。采用我的 RWKV 模型,类似 GPT-2 。AI写作。RWKV for Chinese novel generation.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

AirSim-Drone-Racing-VAE-Imitation

Code associated with our paper "Learning Visuomotor Policies for Aerial Navigation Using Cross-Modal Representations": https://arxiv.org/abs/1909.06993

License:MITStargazers:0Issues:0Issues:0

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

botsim

BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

camel

🐫 CAMEL: Communicative Agents for “Mind” Exploration of Large Language Model Society (NeruIPS'2023) https://www.camel-ai.org

License:Apache-2.0Stargazers:0Issues:0Issues:0

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

License:Apache-2.0Stargazers:0Issues:0Issues:0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DRLwithTL

Python code for Deep Reinforcement Learning with Transfer Learning in a Simulated Environment

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

GPT-Bargaining

Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

gpt4all

gpt4all: a chatbot trained on a massive collection of clean assistant data including code, stories and dialogue

Language:PythonStargazers:0Issues:0Issues:0

Instruction-Tuning-Papers

Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).

Stargazers:0Issues:0Issues:0

lit-gpt

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

License:Apache-2.0Stargazers:0Issues:0Issues:0

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Stargazers:0Issues:0Issues:0

LOMO

LOMO: LOw-Memory Optimization

License:MITStargazers:0Issues:0Issues:0

lora_bnb_int8

利用LoRA bnb_int8微调chatYuan-large-v2的demo

License:Apache-2.0Stargazers:0Issues:0Issues:0

path-planning-airsim-scripts

Python scripts for path planning and obstacle detection and avoidance using LiDAR sensor in an AirSim - Unreal Engine simulation.

Stargazers:0Issues:0Issues:0

PEDRA

Programmable Engine for Drone Reinforcement Learning Applications

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

License:Apache-2.0Stargazers:0Issues:0Issues:0

PPO-based-Autonomous-Navigation-for-Quadcopters

Deep Reinforcement Learning based autonomous navigation for quadcopters using PPO algorithm.

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

python-websocket-server

A simple fully working websocket-server in Python with no external dependencies

License:MITStargazers:0Issues:0Issues:0

PythonRobotics

Python sample codes for robotics algorithms.

License:NOASSERTIONStargazers:0Issues:0Issues:0

reasoning-on-graphs

Official Implementation of "Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning"

License:MITStargazers:0Issues:0Issues:0

refiner

About The corresponding code from our paper " REFINER: Reasoning Feedback on Intermediate Representations". Do not hesitate to open an issue if you run into any trouble!

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

RWKV-Runner

A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.

License:MITStargazers:0Issues:0Issues:0

SocialDial

SocialDial: A Benchmark for Socially-Aware Dialogue Systems (SIGIR'23)

Stargazers:0Issues:0Issues:0

t5_finetuning

clue chatyuan finetuning

Stargazers:0Issues:0Issues:0

The-Python-Graph-Gallery

A website displaying hundreds of charts made with Python

Language:Jupyter NotebookLicense:0BSDStargazers:0Issues:0Issues:0

UAV_Navigation_DRL_AirSim

This is a new repo used for training UAV navigation (local path planning) policy using DRL methods.

Stargazers:0Issues:0Issues:0

UltraChat

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

License:NOASSERTIONStargazers:0Issues:0Issues:0