agentidea / getLyrics

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

getLyrics

scripts to scrape lyrics from

for discovery/nlp etc

a) run scrapeArtistUrls.py ( putting in artist name in file ... eg 'pink_floyd' b) run scrapeLyrics.py ( again putting artist name in [] eg ['pink_floyd']

post processing

cd corpi/<<artist_name>>/raw cat * > ../<<artiest_name.corpus

About


Languages

Language:Jupyter Notebook 73.1%Language:Python 26.6%Language:HTML 0.3%