wang-benqiang's starred repositories
data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
text-embeddings-inference
A blazing fast inference solution for text embeddings models
InfiniTransformer
Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
open-parse
Improved file parsing for LLM’s
BCEmbedding
Netease Youdao's open-source embedding and reranker models for RAG products.
LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
LeetcodeTop
汇总各大互联网公司容易考察的高频leetcode题🔥