sharkwyf

sharkwyf's repositories

RepoAgent

An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.

Language:PythonApache-2.0100

safe-rlhf

Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language:PythonApache-2.0100

agenta

The LLMOps platform to build robust LLM apps. Easily experiment and evaluate different prompts, models, and workflows.

Language:TypeScriptMIT000

block-recurrent-transformer

Pytorch implementation of "Block Recurrent Transformers" (Hutchins & Schlag et al., 2022)

Language:PythonMIT000

decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Language:PythonMIT000

DeepSpeedExamples

Example models using DeepSpeed

Language:PythonApache-2.0000

DI-engine

OpenDILab Decision AI Engine

Language:PythonApache-2.0000

An Open-Source Assistants API and GPTs alternative. Dify.AI is an LLM application development platform. It integrates the concepts of Backend as a Service and LLMOps, covering the core tech stack required for building generative AI-native applications, including a built-in RAG engine.

Language:TypeScriptNOASSERTION000

dreamerv3

Mastering Diverse Domains through World Models

Language:PythonMIT000

FrozenBiLM

[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models

Language:PythonApache-2.0000

MineDojo

Modified actions space to MineRL style

Language:JavaMIT000

minerl

MineRL Competition for Sample Efficient Reinforcement Learning - Python Package

Language:JavaNOASSERTION000

SwinBERT

Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"

Language:PythonMIT000

IVR

Author's implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"

MIT000

langflow

⛓️ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity.

Language:PythonMIT000