Mo-Jiao

Mo-Jiao

Geek Repo

Github PK Tool:Github PK Tool

Mo-Jiao's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:138952Issues:1071Issues:7621

Deep-Learning-Papers-Reading-Roadmap

Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!

Language:PythonLicense:NOASSERTIONStargazers:34518Issues:300Issues:351

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:29996Issues:195Issues:4695

Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Language:PythonLicense:Apache-2.0Stargazers:18125Issues:185Issues:730

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:15616Issues:647Issues:850

awesome-deep-vision

A curated list of deep learning resources for computer vision

dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:10442Issues:426Issues:165

llm-action

本项目旨在分享大模型相关技术原理以及实战经验。

Language:HTMLLicense:Apache-2.0Stargazers:8858Issues:81Issues:21

TensorLayer

Deep Learning and Reinforcement Learning Library for Scientists and Engineers

Language:PythonLicense:NOASSERTIONStargazers:7315Issues:458Issues:465

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language:PythonLicense:NOASSERTIONStargazers:4686Issues:49Issues:425

Chinese-Vicuna

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca

Language:CLicense:Apache-2.0Stargazers:4140Issues:58Issues:244

stable-baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:4121Issues:62Issues:946

ChatGLM-Efficient-Tuning

Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调

Language:PythonLicense:Apache-2.0Stargazers:3646Issues:32Issues:374

ChatGLM-Finetuning

基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等

Inverse-Reinforcement-Learning

Implementations of selected inverse reinforcement learning algorithms.

Language:PythonLicense:MITStargazers:969Issues:36Issues:16

DeepRL-TensorFlow2

🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2

Language:PythonLicense:Apache-2.0Stargazers:596Issues:19Issues:8

tf2rl

TensorFlow2 Reinforcement Learning

Language:PythonLicense:MITStargazers:464Issues:18Issues:120

mini-AlphaStar

(JAIR'2022) A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play StarCraft II. JAIR = Journal of Artificial Intelligence Research.

Language:PythonLicense:Apache-2.0Stargazers:307Issues:12Issues:34

TF2-RL

Reinforcement learning algorithms implemented for Tensorflow 2.0+ [DQN, DDPG, AE-DDPG, SAC, PPO, Primal-Dual DDPG]

Language:PythonLicense:MITStargazers:296Issues:6Issues:4

VirtualTaobao-Imp

Implementation to VirtualTaobao

Language:PythonLicense:GPL-3.0Stargazers:11Issues:3Issues:2