Zian(Andy) Zheng (Orion-Zheng)

Orion-Zheng

Geek Repo

Location:Singapore

Home Page:zheng-zian-andy.com

Twitter:@zian_andy_zheng

Github PK Tool:Github PK Tool

Zian(Andy) Zheng's starred repositories

Language:PythonLicense:Apache-2.0Stargazers:97Issues:0Issues:0

long_llama

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

Language:PythonLicense:Apache-2.0Stargazers:1443Issues:0Issues:0

The-Art-of-Linear-Algebra

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

Language:PostScriptLicense:CC0-1.0Stargazers:16961Issues:0Issues:0

LongChat

Official repository for LongChat and LongEval

Language:PythonLicense:Apache-2.0Stargazers:502Issues:0Issues:0

OpenMoE

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Language:PythonStargazers:1327Issues:0Issues:0

ShareGPTQAExtractor-mnbvc

MNBVC项目-ShareGPT语料清洗

Language:PythonLicense:MITStargazers:12Issues:0Issues:0
Language:PythonLicense:MITStargazers:63Issues:0Issues:0

Awesome-Chinese-LLM

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

Stargazers:14179Issues:0Issues:0

TigerBot

TigerBot: A multi-language multi-task LLM

Language:PythonLicense:Apache-2.0Stargazers:2226Issues:0Issues:0

awesome-pretrained-chinese-nlp-models

Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合

Language:PythonLicense:MITStargazers:4632Issues:0Issues:0

Baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Language:PythonLicense:Apache-2.0Stargazers:5663Issues:0Issues:0

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

License:MITStargazers:3301Issues:0Issues:0

awesome-lm-system

Summary of system papers/frameworks/codes/tools on training or serving large model

License:Apache-2.0Stargazers:56Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:348Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:4Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:2608Issues:0Issues:0

Awesome-Mixture-of-Experts-Papers

A curated reading list of research in Mixture-of-Experts(MoE).

License:Apache-2.0Stargazers:508Issues:0Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

Stargazers:11149Issues:0Issues:0

PromptPapers

Must-read papers on prompt-based tuning for pre-trained language models.

Stargazers:4017Issues:0Issues:0

awesome-huge-models

A collection of AWESOME things about HUGE AI models.

Stargazers:291Issues:0Issues:0
Language:Jupyter NotebookStargazers:7Issues:0Issues:0

CS224n-Reading-Notes

CS224n Reading Notes in Chinese 中文阅读笔记

Stargazers:484Issues:0Issues:0

Bert_related

Data preparations for training Bert

Language:PythonLicense:MITStargazers:5Issues:0Issues:0

wikiextractor

A tool for extracting plain text from Wikipedia dumps

Language:PythonLicense:AGPL-3.0Stargazers:3711Issues:0Issues:0

datasets

TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...

Language:PythonLicense:Apache-2.0Stargazers:4254Issues:0Issues:0

Tensorflow-101

TensorFlow Tutorials

Language:Jupyter NotebookLicense:MITStargazers:2603Issues:0Issues:0

keras_bert_multi_label_cls

本项目采用Keras和Keras-bert实现文本多标签分类任务,对BERT进行微调。

Language:PythonStargazers:64Issues:0Issues:0

data-centric-AI

A curated, but incomplete, list of data-centric AI resources.

Stargazers:1017Issues:0Issues:0

LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

Stargazers:9156Issues:0Issues:0