alecsharpie / pysimulacra

A pandas wrapper for simulacra-aesthetic-captions dataset

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Dataset

download dataset and get info here: https://github.com/JD-P/simulacra-aesthetic-captions

Entity Relationship Diagram

Install

Clone the project and install it:

git clone git@github.com:alecsharpie/pysimulacra.git
cd pysimulacra
make install # or `pip install .`

Example

Download the data from the repository linked above and instantiate the class pointing to the sql db. Do it manually (eg with chrome) if curl isn't working

from pysimulacra.data import SimulacraData

ds = SimulacraData('/data/sac_public_2022_06_29.sqlite')

data = ds.fetch_all_data()

here data is a dictionary containing all tables from db, key = table_name, value = pd.DataFrame tables.

Table : col1, col2, col3, ...
----------
survey  :  id, qid, rating
generations  :  id, sid, method, prompt, verified
images  :  id, gid, idx
paths  :  iid, path
ratings  :  sid, iid, rating, verified
upscales  :  iid, method
ds.get_image_paths_and_prompts()

ds.get_prompts_and_ratings()

ds.get_image_paths_and_prompts_and_ratings()

About

A pandas wrapper for simulacra-aesthetic-captions dataset


Languages

Language:Python 79.1%Language:Makefile 20.9%