issaoreilly / data_science_solutions

This repository hosts a number of short data science solution with code snippets ready to be used in various data science applications

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Data Science Solutions

This repository contains different data science solutions that can be used in different data science projects with different applications.

Solution 1: Computing Rolling Cosine Similarity Over Time Series Data

This solution contains python code to calculate the rolling cosine similarity over time series data. The solution is implemented using Numpy and Numba. The notebook contains detailed explanation to the approach as well as the implementation using Numpy and Numba. This solution isn't new, there exist different approaches, however, I have added the rolling window capability as well as modifying the numpy code so that I'm able to use numba for faster computations. I have evaluated different solutions and showed that using numpy with Numba is the fastest among all. The link for the notebook can be found here Rolling Cosine Similarity

About

This repository hosts a number of short data science solution with code snippets ready to be used in various data science applications


Languages

Language:Jupyter Notebook 100.0%