Risan's starred repositories
MediaCrawler
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫
FlagEmbedding
Retrieval and Retrieval-augmented LLMs
VisualGLM-6B
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
pandarallel
A simple and efficient tool to parallelize Pandas operations on all available CPUs
img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。
chatglm.cpp
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4
NLP-Interview-Notes
该仓库主要记录 NLP 算法工程师相关的面试题
Chinese-Llama-2-7b
开源社区第一个能下载、能运行的中文 LLaMA2 模型!
ctransformers
Python bindings for the Transformer models implemented in C/C++ using GGML library.
LLMs_interview_notes
该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题
Med-ChatGLM
Repo for Chinese Medical ChatGLM 基于中文医学知识的ChatGLM指令微调
Python-Package-Template
A easy, reliable, fluid template for python packages complete with docs, testing suites, readme's, github workflows, linting and much much more
GraphXInAction
Book <Spark GraphX In Action> code and resources.