Vicomtech's repositories
hate-speech-dataset
Hate speech dataset from Stormfront forum manually labelled at sentence level.
DMD-Driver-Monitoring-Dataset
DMD - Driver Monitoring Dataset
video-content-description-VCD
Video Content Description (VCD) is a schema, API and set of tools to produce semantically rich labels from multi-sensorial data series.
STDG-evaluation-metrics
Standardised Metrics and Methods for Synthetic Tabular Data Evaluation
itzuli-api-lib
Itzuli® Machine Translation Engine API libraries
d-EVD_dual-Electric-Vehicle-Dataset
d-EVD-dual-electric-vehicle-dataset
NUBes-negation-uncertainty-biomedical-corpus
Repository of the NUBes corpus
serverless-mlperf
This repo aims to benchmark Amazon AWS DNN performance with Caffe, TensorFlow and OpenVINO models, using OpenCV and OpenVINO IE as inference backend engines.
CAPTAIN-Elderly-clustering-and-evolution-analysis
CAPTAIN - Elderly clustering and evolution analysis
RailSceneSet
RailSceneSet Dataset
ASVspoophone
The ASVspoophone corpus is the telephonic version of the ASV Spoof 2019 corpus found at https://www.asvspoof.org It contains the telephonic versions of the audios used for the countermeasure (CM) ASV Spoof 2019 challenge, which have been created by transferring each of them through real land-land, mobile-land and land-mobile telephonic channels. The results are the corresponding 8 kHz 8 bit A-Law versions of the originial audios, which can be used to train anti-spoofing systems that will be used on real telephonic scenarios such as call and contact centres.
BaSCo-Corpus
BaSCo Corpus
Dataset-of-2D-polygons-for-Additive-Manufacturing
Dataset of 2D polygons for Additive Manufacturing
esport-corpus
ES-Port Corpus. Spontaneous spoken human-human dialogue corpus consisting of transcribed dialogues from calls to the technical customer support service of a Spanish telecom operator for companies. The corpus has been anonymised and annotated at various linguistic and acoustic-related extralinguistic levels.
GRACE-Benchmark
GRACE-Benchmark
.github
Vicomtech Profile
dataset-machine-tool-wear
dataset_machine_tool_wear
IIOT-protocols-study-for-high-frequency-data-in-the-edge-and-cloud
IIOT protocols study for high frequency data in the edge and cloud dataset
mintzai-ST
Corpus para traducción del habla euskera-castellano
synthetic-neu-seg-images-via-stable-diffusion
This dataset accompanies the paper "State-of-the-art Diffusion Models to Improve the Robustness of Visual Defect Segmentation by Semantic Networks in Manufacturing Components".
transkit-api-lib
Transkit API libraries
voice-cloning
Vicomtech's voice-cloning capabilities and information repo
xhare
xhare