Lucinao Mazzella's repositories
awesome-instruction-datasets
A collection of prompt and instruction datasets for training chat LLMs such as ChatGPT.
awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models.
Books-Free-Books
A collection of free books.
BUCTBASE
Course materials sharing project for Beijing University of Chemical Technology.
Chinese-instruction-datasets
Chinese instruction-tuning datasets.
cim
📲 cim (cross IM): a distributed instant messaging system for developers.
cnocr
CnOCR: a Python package for Chinese/English OCR, based on PyTorch/MXNet.
Concurnas
Concurnas is an open source JVM programming language designed for building reliable, scalable, high performance concurrent, distributed and parallel systems
cosmos
GPU-accelerated force graph layout and rendering
DeepIE
DeepIE: Deep Learning for Information Extraction
Goat-Math-Chinese
Goat: a Chinese large language model for arithmetic.
GPTs
leaked prompts of GPTs
groovy-parser
Yet another new parser for the Groovy programming language (project code: Parrot).
LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
long_llama
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.
mather
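zzllrr mather (小乐数学): an offline tool for math learning (self-study or teaching), education, and research. It plans to cover toolboxes for problem solving, plotting, demonstration, and exploration across all branches of mathematics. It is currently a demo version, offered as a starting point to invite better contributions, but it already supports editing and displaying math formulas, some plotting functions, and partial problem solving for some subjects such as linear algebra and discrete mathematics. The ultimate goal is to bring together professional mathematicians, programming experts, educators, and science communicators to build a more professional-grade Mather math tool.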
MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time.
pdfconv
A utility for converting Chinese PDFs to TXT.
PDFPatcher
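PDFPatcher (PDF补丁丁): a PDF toolbox that can edit bookmarks, crop and rotate pages, remove restrictions, extract or merge documents, inspect document structure, extract images, convert pages to images, and more.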
pycorrector
pycorrector is a toolkit for text error correction. It implements models including Kenlm, Seq2Seq_Attention, BERT, MacBERT, ELECTRA, ERNIE, and Transformer, and works out of the box.
pytorchexamples
A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.
QuestionAnsweringSystem
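QuestionAnsweringSystem is a human-machine question answering system implemented in Java that can automatically analyze questions and provide candidate answers.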
RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
ToolGood.Words
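A high-performance component for detecting and filtering sensitive words (illegal words/profanity), with Traditional/Simplified Chinese conversion, full-width/half-width conversion, Chinese character to Pinyin conversion, fuzzy search, and other features.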
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
vnpy
A Python-based open-source framework for developing quantitative trading platforms.
YaYi
YaYi large model: building a large model for every enterprise.