Jiekai Jia (JiekaiJia)

JiekaiJia

Geek Repo

Company:Ecovacs Robotics

Location:suzhou, china

Github PK Tool:Github PK Tool

Jiekai Jia's starred repositories

baize-chatbot

Let ChatGPT teach your own chatbot in hours with a single GPU!

Language:PythonLicense:GPL-3.0Stargazers:3150Issues:0Issues:0

open_flamingo

An open-source framework for training large multimodal models.

Language:PythonLicense:MITStargazers:3626Issues:0Issues:0

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language:PythonLicense:Apache-2.0Stargazers:8194Issues:0Issues:0

Chinese-Vicuna

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca

Language:CLicense:Apache-2.0Stargazers:4140Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:36290Issues:0Issues:0

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Language:PythonLicense:Apache-2.0Stargazers:10807Issues:0Issues:0

Chinese-alpaca-lora

骆驼:A Chinese finetuned instruction LLaMA. Developed by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子昂 @ 商汤科技

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:710Issues:0Issues:0

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Language:PythonLicense:MITStargazers:7670Issues:0Issues:0

BELLE

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

Language:HTMLLicense:Apache-2.0Stargazers:7794Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:9008Issues:0Issues:0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29306Issues:0Issues:0

optimate

A collection of libraries to optimise AI model performances

Language:PythonLicense:Apache-2.0Stargazers:8369Issues:0Issues:0

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:40324Issues:0Issues:0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:38543Issues:0Issues:0

ChatYuan

ChatYuan: Large Language Model for Dialogue in Chinese and English

Language:PythonLicense:NOASSERTIONStargazers:1902Issues:0Issues:0

LaneGCN

[ECCV2020 Oral] Learning Lane Graph Representations for Motion Forecasting

Language:PythonLicense:NOASSERTIONStargazers:490Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:64Issues:0Issues:0

apollo

An open autonomous driving platform

Language:C++License:Apache-2.0Stargazers:24949Issues:0Issues:0

Argoverse2_Motion_Forecasting

MFTF: Motion Forecasting Using Transformers

Language:PythonLicense:NOASSERTIONStargazers:50Issues:0Issues:0

CPlusPlusThings

C++那些事

Language:C++Stargazers:38629Issues:0Issues:0

DCENet

Exploring Dynamic Context for Multi-path Trajectory Prediction

Language:PythonStargazers:23Issues:0Issues:0

LTNtorch

PyTorch implementation of Logic Tensor Networks, a Neural-Symbolic framework.

Language:PythonLicense:MITStargazers:71Issues:0Issues:0

logictensornetworks

Deep Learning and Logical Reasoning from Data and Knowledge

Language:Jupyter NotebookLicense:MITStargazers:268Issues:0Issues:0

modulardecision

[CoRL'20] Learning a Decision Module by Imitating Driver’s Control Behaviors

Language:PythonLicense:MITStargazers:29Issues:0Issues:0

NDQ

Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)

Language:PythonLicense:Apache-2.0Stargazers:82Issues:0Issues:0

MARL-Algorithms

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Language:PythonStargazers:1402Issues:0Issues:0

MAProj

Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment

Language:PythonStargazers:109Issues:0Issues:0

Deep-Learning-Papers-Reading-Roadmap

Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!

Language:PythonStargazers:37891Issues:0Issues:0

rllib_differentiable_comms

This is a minimal example to demonstrate how multi-agent reinforcement learning with differentiable communication channels and centralized critics can be realized in RLLib. This example serves as a reference implementation and starting point for making RLLib more compatible with such architectures.

Language:PythonStargazers:39Issues:0Issues:0

DVRL

Deep Variational Reinforcement Learning

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:133Issues:0Issues:0