yisaienkov / tinysets

The project aims to collect various datasets for tasks such as classification, clustering, object detection... The purpose of this datasets is quick checking models and algorithms performance.

Home Page:https://github.com/yisaienkov/tinysets

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

TinySets

PyPI version

  • This project contains different datasets.
  • The main purpose of these sets is to quickly check the performance of models and algorithms.

Table of contents

How to use

You can see all available sets and their actual locations using the next code:

>>> from tinysets import get_all_sets
>>> get_all_sets()
{
    'LEGO Minifigures Classification': [
        'https://www.kaggle.com/ihelon/lego-minifigures-classification'
    ],
    'Ukrainian Descriptions Of Words': [
        'https://www.kaggle.com/ihelon/ukrainian-descriptions-of-words'
    ]
}

Installation

PyPI

You can use pip to install evaluations:

pip install tinysets

Or you can clone and install the latest version of the library from GitHub:

pip install -U git+https://github.com/yisaienkov/tinysets

Datasets

LEGO Minifigures Classification

This dataset contains pictures of various LEGO Minifigures. There are several images in different poses and with different environments for each Minifigure in the dataset.

Currently, it contains 28 figures (more than 300 images totally) from the LEGO sets Yoda's Hut, Spider Mech vs. Venom, General Grievous' Combat Speeder, Kylo Ren's Shuttle™ Microfighter, AT-ST™ Raider from The Mandalorian, Molten Man Battle, Aragog's Lair, Black Widow's Helicopter Chase, Captain America: Outriders Attack, Pteranodon Chase, Iron Man Hall of Armor.

This dataset is available in Kaggle Datasets

Ukrainian Descriptions Of Words

This dataset contains descriptions of various words in simple terms by different people. It looks like the Alias game when you need to describe the desired word using other words so that the other player guesses your word.

Currently, it contains descriptions from 8 different people for 15 unique words.

This dataset is available in Kaggle Datasets

About

The project aims to collect various datasets for tasks such as classification, clustering, object detection... The purpose of this datasets is quick checking models and algorithms performance.

https://github.com/yisaienkov/tinysets

License:MIT License


Languages

Language:Python 100.0%