hyf's starred repositories
Token-level-Direct-Preference-Optimization
Reference implementation for Token-level Direct Preference Optimization(TDPO)
intro-llm-rag
LLM Models and RAG Hands-on guide
HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
rag-search
RAG Search API
MediaCrawler-new
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫
LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
deep_learning_curriculum
Language model alignment-focused deep learning curriculum
RLHF-Shakespeare
Finetune LLM with RLHF to generate positive tone message from Shakespeare Corpus.
LLM-RLHF-Tuning-with-PPO-and-DPO
Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various configurations for the Alpaca, LLaMA, and LLaMA2 models.
MediaCrawler
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫
llm-answer-engine
Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper
TheBigPromptLibrary
A collection of prompts, system prompts and LLM instructions
wonderful-prompts
🔥中文 prompt 精选🔥,ChatGPT 使用指南,提升 ChatGPT 可玩性和可用性!🚀
A-Guide-to-Retrieval-Augmented-LLM
an intro to retrieval augmented large language model
search_with_lepton
Building a quick conversation-based search demo with Lepton AI.