sharkwyf's repositories
agenta
The LLMOps platform to build robust LLM apps. Easily experiment and evaluate different prompts, models, and workflows.
DeepSpeedExamples
Example models using DeepSpeed
DI-engine
OpenDILab Decision AI Engine
dify
An Open-Source Assistants API and GPTs alternative. Dify.AI is an LLM application development platform. It integrates the concepts of Backend as a Service and LLMOps, covering the core tech stack required for building generative AI-native applications, including a built-in RAG engine.
dreamerv3
Mastering Diverse Domains through World Models
FrozenBiLM
[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
MineDojo
Action space modified to MineRL style
minerl
MineRL Competition for Sample Efficient Reinforcement Learning - Python Package
graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
HarmBench
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
IVR
Author's implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"
langflow
⛓️ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity.
LLaMA-Factory
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
MotionCLIP
Official PyTorch implementation of the paper "MotionCLIP: Exposing Human Motion Generation to CLIP Space"
neuralmmo
Baselines for Neural MMO. New users should treat this repo as a starter project.
notion-feeder
🕸 A Node app for creating a Feed Reader in Notion.
online-dt
Online Decision Transformer
PDT
Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer
ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
SimPO
SimPO: Simple Preference Optimization with a Reference-Free Reward
Stable-Alignment
An efficient, effective, and stable alternative to RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".
trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
trl
Train transformer language models with reinforcement learning.
VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs