Qilong Zhang's starred repositories
awesome-public-datasets
A topic-centric list of high-quality open datasets.
nlp_chinese_corpus
Large Scale Chinese Corpus for NLP
LangChain-Chinese-Getting-Started-Guide
A Chinese-language getting-started tutorial for LangChain
FlagEmbedding
Retrieval and Retrieval-augmented LLMs
DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
ChineseNlpCorpus
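Collects, organizes, and publishes Chinese natural language processing corpora and datasets, working with like-minded contributors to advance Chinese NLP.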
TextRank4ZH
🌳 Automatically extracts keywords and summaries from Chinese text
Chinese-ELECTRA
Pre-trained Chinese ELECTRA models
DeepSeek-LLM
DeepSeek LLM: Let there be answers
chatgpt-comparison-detection
Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥
ProphetNet
A research project for natural language generation, containing the official implementations by MSRA NLC team.
self-refine
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.
opencc-python
OpenCC made with Python
llms_paper
Reading notes on top-conference papers relevant to LLM algorithm engineers (multimodal, PEFT, few-shot QA, RAG, LMM interpretability, Agents, CoT)
Awesome-Machine-Generated-Text
Continuously updated list of related resources for generative LLMs like GPT and their analysis and detection.
AIGC_text_detector
The official code for our work on AIGC detection: "Multiscale Positive-Unlabeled Detection of AI-Generated Texts" (ICLR'24 Spotlight)