N Singh's repositories
AnalyticsVidya
Bigmart Sales Forecast
awesome-datascience-ideas
A list of awesome and proven data science use cases and applications
bad-data-guide
An exhaustive reference to problems seen in real-world data along with suggestions on how to resolve them.
Bayesian-Spam-Filter
An implementation of a Spam Filter in Python that uses the Naive Bayes Model to classify mails as spam or ham.
Classifiers_Evaluation
EvaluationParameters
Coursera-Machine-Learning
Coursera Machine Learning - Python code
Data-Analysis-and-Machine-Learning-Projects
Repository of teaching materials, code, and data for my data analysis and machine learning projects.
data-science-ipython-notebooks
Continually updated data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
DataSciencePython
common data analysis and machine learning tasks using python
DSE210_Probability_Statistics_Python
Probability and Statistics Using Python Data Science Masters Course at UCSD (DSE 210)
ExData_Plotting1
Plotting Assignment 1 for Exploratory Data Analysis
Exploratory-Data-Analysis-and-Prediction-on-Diabetes-Dataset-using-R
This project first conducts Exploratory Data Analysis (EDA) and data visualization on the diabetes dataset and then predict the disbetes using machine learning.
hdbscan
A high performance implementation of HDBSCAN clustering.
HouseSalesprice
House_prices_prediction version1
hypothesis-python
Advanced property-based (QuickCheck-like) testing for Python
janitor
simple tools for data cleaning in R
jupyter
Jupyter metapackage for installation, docs and chat
kaggle-for-fun
All my submissions for Kaggle contests that I have been, and going to be participating.
pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
pandas-cookbook
Recipes for using Python's pandas library
ProgrammingAssignment2
Repository for Programming Assignment 2 for R Programming on Coursera
pydata-sf-2016-arima-tutorial
PyData San Francisco 2016 - ARIMA Tutorial
python-machine-learning-book
The "Python Machine Learning" book code repository and info resource
PythonDataScienceHandbook
Jupyter Notebooks for the Python Data Science Handbook
TimeSeriesAnalysiswithPython
Time Series Analysis with Python
xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow