K's starred repositories
AlpacaDataCleaned
Alpaca dataset from Stanford, cleaned and curated
LLM-Science-Exam
Kaggle Competition | LLM | Falcon | Perplexity
Kaggle---LLM-Science-Exam
Use LLMs to answer difficult science questions
CCKS2021-Scheme-Sharing
CCKS2021答非所问竞赛冠军方案
2018-CCL-UIIMCS
CCL2018客服领域用户意图分类冠军1st方案
MetaLC-2nd-Round
Winning method (1st place) in Meta-learning from Learning Curves - 2ND ROUND competition.
Tencent2020_ad
2020腾讯广告算法大赛方案分享及代码(冠军)
lindorm-tsdb-contest-java
第五届天池数据库大赛赛道2冠军
ccir_cup_2023
ccir cup 2023 基于通用大模型的知识库问答 冠军方案
CCF-Algorithm-Competition
2021 CCF BDCI大数据与计算智能算法大赛 冠军方案
CVPR-2023-1st-foundation-model-challenge-Track-2-1th-solution
CVPR 2023 1st foundation model challenge-Track 2 第一名解决方案
dial-clean
中文对话数据清洗
data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
LLM-Knowledge-Boundary
Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"
ToolAlpaca
ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases
IncarnaMind
Connect and chat with your multiple documents (pdf and txt) through GPT 3.5, GPT-4 Turbo, Claude and Local Open-Source LLMs
Llama-2-Open-Source-LLM-CPU-Inference
Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A
LangChain_LLM_ChatBot
基于LLM和LangChain实现基于本地文档的QA chatbot