Carro (robertalanm)

robertalanm

Geek Repo

Company:@manifold-inc

Location:Texas

Home Page:https://sybil.com

Twitter:@0xcarro

Github PK Tool:Github PK Tool

Carro's repositories

safe-rlhf

Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language:PythonLicense:Apache-2.0Stargazers:2Issues:1Issues:0

alpaca-weight

Train llama with lora on one 4090 and merge weight of lora to work as stanford alpaca.

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

langchain

⚡ Building applications with LLMs through composability ⚡

Language:PythonLicense:MITStargazers:1Issues:0Issues:0
Language:PythonStargazers:1Issues:0Issues:0

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

airoboros

Customizable implementation of the self-instruct paper.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

alpaca-lora

Code for reproducing the Stanford Alpaca InstructLLaMA result on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

autocrit

A repository for transformer critique learning and generation

Language:PythonStargazers:0Issues:0Issues:0

axolotl

Go ahead and axolotl questions

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0

gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

H3

Language Modeling with the H3 State Space Model

License:Apache-2.0Stargazers:0Issues:0Issues:0

langflow

⛓️ LangFlow is a UI for LangChain, designed with react-flow to provide an effortless way to experiment and prototype flows.

Language:TypeScriptLicense:MITStargazers:0Issues:0Issues:0

langfuse

open-source observability for LLM applications

Language:TypeScriptLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

llama-trl

LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

OpenLLaMA2

A Ray-based High-performance LLaMA2 RLHF framework

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

orca

Experiments into reproducing orca

Stargazers:0Issues:1Issues:0

pfrl

PFRL: a PyTorch-based deep reinforcement learning library

License:MITStargazers:0Issues:0Issues:0

raodottown

website for rao.town

Language:JavaScriptStargazers:0Issues:0Issues:0

substrate-indexer

indexer for substrate chain (bt)

Language:TypeScriptLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

validators

Repository for bittensor validators

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0