Ekta Grover's repositories
Experiments-in-Data-mining
So how much is your Linkedin network worth ? Exploring useful data from Linkedin
Custom-Distance-function-for-typos-in-hand-generated-datasets-with-QWERY-Keyboard
Building a custom distance function for typographical errors - particularly with the QWERTY keyboard
bidding_ad_optimization_module
Module to predict prob. of a browser's conversion based on PVSVR & CPA
Creating-Custom-job-feeds-for-Linkedin
Building relevant job feeds based on TF-IDF(custom written on uni-grams) , lemmatization and other NLTK constructs
Building-Custom-lemmatizer
Building a custom Lemmatizer to plug in to the TF-IDF and other Information Retrieval problems
Scraping-with-scrapy-in-python
A baby problem in scraping in python
MapReduce-with-PySpark
Repo to host code for scaling up native python code with Map reduce in Python
redshift-udfs
SQL for many helpful Redshift UDFs, and the scripts for generating and testing those UDFs
Sampling-techniques
Initially developed for Kaggle's Expedia contest
scrapy-linkedin
Using Scrapy to get Linkedin's person public profile.
AB-testing-Framework
Performance Benchmarks for min sample size & tests of acceptance on OEC (overall evaluation criteria)
Data-mining-Pro
Creating scripts to automate all the near-boring part that comes between data preparation & start of modelling
Kaggle_yandex
Initially written for Yandex competition hosted at kaggle
mincemeatpy
Lightweight MapReduce in python
multipolyfit
A multivariate polynomial regression function in python
play_store_scraper
Scraps the play store for app information.