sharkwyf's repositories

cgdt

[AAAI'2024] Critic-Guided Decision Transformer for Offline Reinforcement Learning

Language: Python | License: MIT | Stargazers: 6 | Issues: 2 | Issues: 0

RepoAgent

An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

safe-rlhf

Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

agenta

The LLMOps platform to build robust LLM apps. Easily experiment and evaluate different prompts, models, and workflows.

Language: TypeScript | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

DeepSpeedExamples

Example models using DeepSpeed

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

DI-engine

OpenDILab Decision AI Engine

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

dify

An Open-Source Assistants API and GPTs alternative. Dify.AI is an LLM application development platform. It integrates the concepts of Backend as a Service and LLMOps, covering the core tech stack required for building generative AI-native applications, including a built-in RAG engine.

Language: TypeScript | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

dreamerv3

Mastering Diverse Domains through World Models

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

FrozenBiLM

[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

MineDojo

Modified action space to MineRL style

Language: Java | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

minerl

MineRL Competition for Sample Efficient Reinforcement Learning - Python Package

Language: Java | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

HarmBench

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

IVR

Author's implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

langflow

⛓️ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

LLaMA-Factory

Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

MotionCLIP

Official PyTorch implementation of the paper "MotionCLIP: Exposing Human Motion Generation to CLIP Space"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

neuralmmo

Baselines for Neural MMO -- new users should treat this repo as a starter project

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

notion-feeder

🕸 A Node app for creating a Feed Reader in Notion.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

online-dt

Online Decision Transformer

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

PDT

Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

Stable-Alignment

Efficient, effective, and stable alternative to RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

trajectory-transformer

Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

VITA

✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0