There are 0 repositories under the large-scale-dataset topic.
SemanticKITTI API for visualizing the dataset, processing data, and evaluating results.
LexicMap: efficient sequence alignment against millions of prokaryotic genomes
A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks
SOTA kilo-scale MIDI dataset for MIR and Music AI purposes
Maximize Efficiency, Elevate Accuracy: Slash GPU Hours by Half with Efficient Pre-training!
Official code release for BOLD5000 Release 2.0
Korean Movie Review Emotion (KMRE) Dataset
Web interface for querying the LAION-5B dataset using CLIP embeddings.
This repository contains a framework with a GPU implementation of generalized convolution operators. The framework is designed for large image data sets and can run in a distributed system.
A large-scale dataset for session-based recommendation and sequential recommendation
M3LS: Multi-lingual Multi-modal summarization dataset
Music recommender system based on collaborative filtering using the ListenBrainz listens dataset.
Densim is a library for efficient similarity search and clustering of dense vectors, which are numerical representations of data such as images, text, or audio.
GeoCARET - a command line Python tool for delineating and analysing catchments and reservoirs.
Analysis and prediction of flight data using Spark within the Databricks environment.