Argilla's repositories
distilabel
Distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency
spacy-wordnet
spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface
adept-augmentations
A Python library aimed at dissecting and augmenting NER training data.
awesome-llm-datasets
👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)
distilabel-spin-dibt
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
argilla-streamlit
👑 Streamlit for extended UI functionalities for Argilla.
argilla-llama-index
A public repo that contains integrations for Argilla and LlamaIndex.
argilla-plugins
🔌 Open-source plugins for with practical features for Argilla using listeners.
argilla-server
A Python native FastAPI server for the Argilla backend.
argilla-haystack
A public repo that contains integrations for Argilla and Haystack.
distilabel-workbench
A working repository for experimental pipelines in distilabel
haystack
:mag: Haystack is an open source NLP framework to interact with your data using Transformer models and LLMs (GPT-4, ChatGPT and alike). Haystack offers production-ready tools to quickly build complex decision making, question answering, semantic search, text generation applications, and more.
teachable-machine-boilerplate
Boilerplate code for Teachable Machine
argilla-workshop
A repo with everything someone might need to give a nice workshop on NLP with Argilla.
awesome-argilla-datasets
The Argilla team periodically creates datasets and loves to share the process and data with the world.
dataset_examples
A public repo for holding dataset examples.
dill
serialize all of Python
distilabel-helm-instruct-adaptable-evaluation-criteria
A repo that implements Stanford CRFM their HELM Instruct with adaptable evaluation criteria
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
genai-stack
Langchain + Docker + Neo4j + Ollama + Argilla
prompt-collective-dashboard
A Gradio app to monitor a collective effort from the Open Source AI Community to understand and collect good quality and diverse prompts.
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.