abheesht17 / pepethescraper

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Pepe The Scraper

Library for scraping memes off Know Your Meme and Reddit (along with the explanations and background provided for all the memes)

Setting Up Pepe The Scraper

git clone https://github.com/abheesht17/pepethescraper.git
cd pepethescraper
python setup.py install
pip install clean-text[gpl]==0.3.0
pip install praw==7.1.4

Examples:

  • Help Pepe scrape memes off KYM!
from pepethescraper.pepe_at_work import KYMScraper
scraper = KYMScraper(output_format="json", save_dir_path="kym_memes", save_img=True, clean_text=True)
scraper.scrape(search_query="political memes",number_of_memes=2)
  • Help Pepe scrape memes off Reddit!
from pepethescraper.pepe_at_work import RedditScraper
scraper = RedditScraper(output_format="json", save_dir_path="reddit_memes", save_img=True, clean_text=True)
scraper.scrape(search_query="PoliticalMemes",number_of_memes=2)

Note: The output files and images corresponding to the above pieces of code are given in the examples folder.

Pepe's Helpers

  • clean-text
  • praw

Upcoming Updates:

  • Pepe's learning how to scrape memes off Twitter and ImgFlip.
  • Pepe will also try to clean the text from KYM.

About


Languages

Language:Python 100.0%