Fluxation's starred repositories
DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Awesome-GPT-Store
Custom GPT Store - A collection of major GPTS available in public
Awesome-GPT-Agents
A curated list of GPT agents for cybersecurity
aimoneyhunter
ai副业赚钱大集合,教你如何利用ai做一些副业项目,赚取更多额外收益。The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English version for more insights.
TaskWeaver
A code-first agent framework for seamlessly planning and executing data analytics tasks.
ChuanhuChatGPT
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
Skywork
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc. 天工系列模型在3.2TB高质量多语言和代码数据上进行预训练。我们开源了模型参数,训练数据,评估数据,评估方法。
CoT-Reasoning-Survey
[ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future
sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
awesome_LLMs_interview_notes
LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案
meachine_comment
自动评论 ,评论机器人
chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
sft_datasets
开源SFT数据集整理,随时补充
hai-platform
一种任务级GPU算力分时调度的高性能深度学习训练平台
Happy-ChatGPT
ChatGPT 国粹版,和 GPT 一起学习地道的**话吧
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
docker-llama2-chat
Play LLaMA2 (official / 中文版 / INT4 / llama2.cpp) Together! ONLY 3 STEPS! ( non GPU / 5GB vRAM / 8~14GB vRAM)