msaaksjarvi's repositories
scraping-comicbookrealm
Data scraping project for collecting Comic issues from comicbookrealm.com
album_sales
Album sales analysis
awesome-product-design
A collection of bookmarks, resources, articles for product designers.
billboard-hot-100
JSON files for every Billboard Hot 100 chart in history, updated daily.
billboard-json
🎧 Get json type billboard hot 100 chart
buzzfeed-news-trending-strip
Dataset: BuzzFeed News “Trending” Strip, 2018–2023
Capstone_Spotify-Sequential-Skip-Prediction
The model trained by the data of 1 day before can best predict the skipping behavior on Spotify.
chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
comics_text_plus
Official repository of the paper: "A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition"
data-1
Latest data on UK food banks from Give Food scraped from our API and republished in various formats.
Dr.Weather
Discover the perfect harmony of tunes and movies!
flexplot
flexplot: graphical data analysis
hollywood-age-gap
🎬 The age difference in years between movie love interests.
IMDB-Movie-Report
I began this project with three csv files on movie ratings, title info, and ratings. I then used the TMDB API to extract additional information on budget and revenue. Lastly, I compiled them all into a mySQL database to use for Hypothesis Testing
mobilephone-brands-and-models
A database includes mobilephone manufacturers and their models.
Moodify
This application utilizes the LGBM model to accurately classify the emotions of songs and provide tailored song recommendations based on mood and cosine similarity.
natbot
Drive a browser with GPT-3
ner-annotator
Named Entity Recognition (NER) Annotation tool for SpaCy. Generates Traning Data as a JSON which can be readily used.
Nike_Backend_Image_Scraper
Short Python script used to scrape official full resolution product images from Nike's API
Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
PushshiftDumps
Example scripts for the pushshift dump files
r-television-Weekly-Thread-Data
Getting the most mentioned shows in the weekly recommendation thread on r/television.
random-data
Just a place to put all my random data sources
spotify-pipeline-setup
This repository can be a hands on guide for aspiring Data Engineers to see what a simple pipeline looks like.
The-Sigma-Awards-projects-data
This is the repository of all projects data submitted to The Sigma Awards.
tree-of-thought-prompting
Using Tree-of-Thought Prompting to boost ChatGPT's reasoning