Conghui He's starred repositories
chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
MediaCrawler
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫
Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
WanJuan1.0
万卷1.0多模态语料
ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
data-centric-AI
A curated, but incomplete, list of data-centric AI resources.
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
llm-foundry
LLM training code for Databricks foundation models
RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
ml-study-plan
The Ultimate FREE Machine Learning Study Plan
labelU-Kit
Data annotation component library --provided as NPM packages
opendatalab-datasets
datasets resource
opendatalab-python-sdk
SDK of OpenDataLab - https://opendatalab.org.cn
labelbee-client
Out-of-the-box Annotation Toolbox
Crawler_Illegal_Cases_In_China
Collection of China illegal cases about web crawler 本项目用来整理所有**大陆爬虫开发者涉诉与违规相关的新闻、资料与法律法规。致力于帮助在**大陆工作的爬虫行业从业者了解我国相关法律,避免触碰数据合规红线。 [AD]中文知识图谱门户
PixelAnnotationTool
Annotate quickly images.
lightline.vim
A light and configurable statusline/tabline plugin for Vim
Spark-2.3.1
Spark-2.3.1源码解读