There are 7 repositories under the data-annotation topic.
The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
A list of tools for annotating data, managing annotations, etc.
🍳 Recipes for Prodigy, our fully scriptable annotation tool
Enhances construction site safety using YOLO for object detection, identifying hazards like workers without helmets or safety vests, and proximity to machinery or vehicles. HDBSCAN clusters safety cone coordinates to create monitored zones. Post-processing algorithms improve detection accuracy.
🧬 A JupyterLab extension for annotating data with Prodigy
The Wearables Development Toolkit - a development environment for activity recognition applications with sensor signals
Social Media Mining Toolkit (SMMT) main repository
🔥 One of the most comprehensive open-source data annotation platforms.
Tornado is an open source Human-in-the-loop machine learning tool. It helps you label your dataset on the fly while training your model through a simple web user interface. It supports all data types: structured, text and image.
A system for prompted weak supervision. Alfred is a powerful tool that leverages large language models to accelerate data annotation.
Visualization and Annotation Tool for ROS
PersianDataAnnotations is ASP.NET Core MVC & ASP.NET MVC Custom Localization DataAnnotations (Localized MVC Errors) for the Persian (Farsi) language - Persian localization of the built-in MVC and Core MVC validation errors, for displaying client-side validation
Use Large Language Models like OpenAI's GPT-3.5 for data annotation and model enhancement. This framework combines human expertise with LLMs, employs Iterative Active Learning for continuous improvement, and integrates CleanLab (Confident Learning) to ensure high-quality datasets and better model performance
🧬 A VS Code extension for annotating data with Prodigy
This is a tool to annotate the focus plane of z-stacked images.
AnnoTheia is a data annotation toolkit that identifies when a person speaks in a scene and transcribes their speech, also offering flexibility to replace modules for different languages.
A tool for mapping free-text descriptions of entities to ontology terms
Jaehyung Kim et al.'s ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-information"
Converts brat standoff format to JSONL format
The entry point for adapting, training, evaluating, and leveraging various Large Language Models (LLMs) for a wide range of Ukrainian NLP tasks.
Curated list of Awesome Training Data! (Data Labeling, Annotation, Discovery, Workflow etc)
Simple Telegram bot to annotate and verify automatic speech recognition datasets
SuperAnnotate HTTP service for Generated Text Detection
A PointRCNN version of SAnE, which is a web-based semi-automatic annotation tool for point cloud data.
Convert your annotated data from one format to another format
An internationalized, highly customizable annotation and evaluation tool for Natural Language Processing (NLP) tasks
Structured test tasks and model tuning scripts for multiple subjects from ZNO - the Ukrainian External Independent Evaluation (ЗНО)
🧠 Multimodal Retrieval-Augmented Generation that "weaves" together text and images seamlessly. 🪡
This repository presents a project focused on image recognition of nuts and screws using object detection techniques. The objective is to develop a model capable of accurately detecting and classifying nuts and screws in images, enabling automation and quality control in industrial settings.