Argilla

Argilla's repositories

argilla

Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.

Language:PythonApache-2.03162 25 1921

distilabel

⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.

Language:PythonApache-2.01009 12 271

notus

Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach

Language:PythonMIT154 6 5

distilabel-spin-dibt

Repository containing the SPIN experiments on the DIBT 10k ranked prompts

Language:PythonApache-2.02100

argilla-llama-index

A public repo that contains integrations for Argilla and LlamaIndex.

Language:PythonApache-2.0900

argilla-server

A Python native FastAPI server for the Argilla backend.

Language:PythonApache-2.0900

argilla-python

The Argilla API python SDK

Language:PythonApache-2.0600

argilla-haystack

A public repo that contains integrations for Argilla and Haystack.

Language:PythonApache-2.04 4 5

distilabel-workbench

A working repository for experimental pipelines in distilabel

Language:Jupyter Notebook3 40

ray-clay

Ray Clay is a tool to train and deploy models from Argilla using the Ray framework.

Language:Python2 50

chat-ui

Open source codebase powering the HuggingChat app

Language:TypeScriptApache-2.01 10

:mag: Haystack is an open source NLP framework to interact with your data using Transformer models and LLMs (GPT-4, ChatGPT and alike). Haystack offers production-ready tools to quickly build complex decision making, question answering, semantic search, text generation applications, and more.

Language:PythonApache-2.01 10

roadmap

Argilla Public Roadmap

04 47

.github

✨ Argilla: the open-source feedback platform for LLMs

Apache-2.0040

argilla-docker-deploy

Language:Shell000

argilla-workshop

A repo with everything someone might need to give a nice workshop on NLP with Argilla.

Language:Jupyter NotebookApache-2.0030

awesome-argilla-datasets

The Argilla team periodically creates datasets and loves to share the process and data with the world.

Language:Jupyter NotebookApache-2.0000

cookbook

Language:Jupyter NotebookMIT000

data-is-better-together

Let's build better datasets, together!

Language:Jupyter Notebook000

dataset_examples

A public repo for holding dataset examples.

Apache-2.0020

dill

serialize all of Python

NOASSERTION000

distilabel-helm-instruct-adaptable-evaluation-criteria

A repo that implements Stanford CRFM their HELM Instruct with adaptable evaluation criteria

Language:Jupyter NotebookApache-2.0000

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonApache-2.0010

genai-stack

Langchain + Docker + Neo4j + Ollama + Argilla

Language:PythonCC0-1.0010

LLM-Blender

[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut the weaknesses through ranking and integrate the strengths through fusing generation to enhance the capability of LLMs.

Language:PythonApache-2.0000