huang-xx

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.

Language:PythonApache-2.01789600

dpo-notes

Notes on Direct Preference Optimization

600

ai-edu

AI education materials for Chinese students, teachers and IT professionals.

Language:HTMLNOASSERTION1328700

RAGs

RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems

Language:Jupyter NotebookApache-2.06500

makeMoE

From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)

Language:Jupyter NotebookMIT54200

rag-search

RAG Search API

Language:PythonApache-2.071700

MediaCrawler-new

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频｜评论爬虫、微博帖子｜评论爬虫

Language:PythonApache-2.043200

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language:TypeScriptNOASSERTION3416200

RLHF

Collection of links, tutorials and best practices of how to collect the data and build end-to-end RLHF system to finetune Generative AI models

Language:Jupyter Notebook14800

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonApache-2.02444500

deep_learning_curriculum

Language model alignment-focused deep learning curriculum

114400

RLHF-Shakespeare

Finetune LLM with RLHF to generate positive tone message from Shakespeare Corpus.

Language:PythonApache-2.0300

LLM-RLHF-Tuning-with-PPO-and-DPO

Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various configurations for the Alpaca, LLaMA, and LLaMA2 models.

Language:Python9200

huang-xx

hyf's starred repositories

Token-level-Direct-Preference-Optimization

Omost

QAnything

intro-llm-rag

PowerPaint

HunyuanDiT

detextify

DeepSeek-V2

Fooocus

IOPaint

dpo-notes

ai-edu

RAGs

makeMoE

rag-search

MediaCrawler-new

dify

RLHF

LLaMA-Factory

deep_learning_curriculum

RLHF-Shakespeare

LLM-RLHF-Tuning-with-PPO-and-DPO

MediaCrawler

llm-answer-engine

TheBigPromptLibrary

GPTs

wonderful-prompts

A-Guide-to-Retrieval-Augmented-LLM

search_with_lepton

GPTFast