Mo-Jiao

Mo-Jiao's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language: Python | License: AGPL-3.0 | 138952 1071 7621

Deep-Learning-Papers-Reading-Roadmap

Deep Learning papers reading roadmap for anyone who is eager to learn this amazing tech!

Language: Python | 37897 2100 53

TaskMatrix

Language: Python | License: NOASSERTION | 34518 300 351

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language: Python | License: Apache-2.0 | 29996 195 4695

Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment

Language: Python | License: Apache-2.0 | 18125 185 730

Awesome-LLM

Awesome-LLM: a curated list of Large Language Models

License: CC0-1.0 | 17108 362 24

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python | License: MIT | 15616 647 850

awesome-deep-vision

A curated list of deep learning resources for computer vision

dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Language: Jupyter Notebook | License: Apache-2.0 | 10442 426 165

llm-action

This project shares the technical principles behind large models, along with hands-on experience.

Language: HTML | License: Apache-2.0 | 8858 81 21

TensorLayer

Deep Learning and Reinforcement Learning Library for Scientists and Engineers

Language: Python | License: NOASSERTION | 7315 458 465

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language: Python | License: NOASSERTION | 4686 49 425

Chinese-Vicuna

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model. A low-resource Chinese LLaMA + LoRA approach, with a structure modeled on Alpaca.

Language: C | License: Apache-2.0 | 4140 58 244

stable-baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Language: Python | License: MIT | 4121 62 946

ChatGLM-Efficient-Tuning

Efficient fine-tuning of ChatGLM-6B with PEFT

Language: Python | License: Apache-2.0 | 3646 32 374

ChatGLM-Finetuning

Fine-tuning of the ChatGLM-6B, ChatGLM2-6B, and ChatGLM3-6B models on specific downstream tasks, covering Freeze, LoRA, P-tuning, full-parameter fine-tuning, and more

Language: Python | 2618 15 144

Inverse-Reinforcement-Learning

Implementations of selected inverse reinforcement learning algorithms.

Language: Python | License: MIT | 969 36 16

DeepRL-TensorFlow2

🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2

Language: Python | License: Apache-2.0 | 596 19 8

tf2rl

TensorFlow2 Reinforcement Learning

Language: Python | License: MIT | 464 18 120

mini-AlphaStar

(JAIR'2022) A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play StarCraft II. JAIR = Journal of Artificial Intelligence Research.

Language: Python | License: Apache-2.0 | 307 12 34

TF2-RL

Reinforcement learning algorithms implemented for TensorFlow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG]

Language: Python | License: MIT | 296 6 4

VirtualTaobao-Imp

Implementation of VirtualTaobao

Language: Python | License: GPL-3.0 | 11 3 2