vinayprabhu / artcrawl

Documenting the nature & depth of art-websites scraped in CommonCrawl

The modern artist and the plumage paradox

Ansatz:

Artists have to make ends meet → Hence, they need to advertise their art on online art marketplaces (irrespective of what medium their art revels in) → They upload the digital verisimilitude of their works → Web-scraping projects simply scrape away their digital simulacra, feed them into LM training pipelines, and claim: "I am merely scraping culture."

LAION-400M

Histogram of scraped image counts
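
A histogram like this could be approximated by tallying image URLs per art-website domain. The following is only a minimal sketch, not the method used in this repository: it assumes the image URLs have already been extracted from the dataset metadata into a Python iterable, and the `ART_DOMAINS` watch-list and the function name `count_images_per_domain` are hypothetical placeholders.

```python
from collections import Counter
from urllib.parse import urlparse

# Hypothetical watch-list of art-marketplace domains (placeholders, not from the repo).
ART_DOMAINS = {"example-art-market.com", "example-gallery.org"}


def count_images_per_domain(urls, domains=ART_DOMAINS):
    """Count image URLs whose host is, or is a subdomain of, a listed domain."""
    counts = Counter()
    for url in urls:
        try:
            host = (urlparse(url).hostname or "").lower()
        except ValueError:
            # Malformed URLs (e.g. broken IPv6 literals) are skipped.
            continue
        for domain in domains:
            if host == domain or host.endswith("." + domain):
                counts[domain] += 1
                break
    return counts
```

Calling `count_images_per_domain(urls).most_common()` returns (domain, count) pairs sorted by count, which can be passed to any plotting library to draw the bars. Matching on the host suffix, rather than on a substring of the whole URL, avoids counting unrelated sites that merely mention a marketplace name in their path.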
