sharkwyf

Yuanfu Wang's repositories

cgdt

[AAAI'2024] Critic-Guided Decision Transformer for Offline Reinforcement Learning

Language:PythonMIT10 20

RepoAgent

An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.

Language:PythonApache-2.0100

safe-rlhf

Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language:PythonApache-2.0100

agenta

The LLMOps platform to build robust LLM apps. Easily experiment and evaluate different prompts, models, and workflows.

Language:TypeScriptMIT000

DeepSpeedExamples

Example models using DeepSpeed

Language:PythonApache-2.0000

An Open-Source Assistants API and GPTs alternative. Dify.AI is an LLM application development platform. It integrates the concepts of Backend as a Service and LLMOps, covering the core tech stack required for building generative AI-native applications, including a built-in RAG engine.

Language:TypeScriptNOASSERTION000

gpt-researcher

LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.

Apache-2.0000

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language:PythonMIT000

HarmBench

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

Language:Jupyter NotebookMIT000

IVR

Author's implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"

Language:PythonMIT000

langflow

⛓️ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity.

Language:PythonMIT000

latent-adversarial-training

MIT000

lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

MIT000

LLaMA-Factory

Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)

Language:PythonApache-2.0000

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonMIT000

lmm-r1

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Apache-2.0000

lmms-eval

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Language:PythonNOASSERTION000

ms-swift

Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).

Apache-2.0000

sharkwyf

Yuanfu Wang's repositories

cgdt

RepoAgent

safe-rlhf

agenta

Continuous-AdvTrain

DeepSpeedExamples

dify

gpt-researcher

graphrag

HarmBench

IVR

langflow

latent-adversarial-training

lighteval

LLaMA-Factory

lm-evaluation-harness

lmm-r1

lmms-eval

ms-swift

neuralmmo

notion-feeder

R1-V

ragflow

SimPO

Stable-Alignment

trl

verl

VITA

vllm

VLMEvalKit