Vasi Rahman's repositories
Breast-Cancer-Diagnostic-Classification
In this project, I experimented with various machine learning algorithms on a dataset from Breast Cancer Wisconsin (Diagnostic) Data available on UCI repository and kaggle. This dataset provides various features along with a target variable of diagnosis. I learned how different machine learning techniques can be applied to find the patterns in the data and classify the tumor as benign or malignant. Through this project, I gained a lot of experience in data wrangling, feature engineering and machine learning algorithms, and compare the results obtained on different ML models based on accuracy and ROC-AUC.
AutoML-data-munging-assignment
In this assignment, I wrote a function in python to manipulate dates and time values in python, and 2nd question is to write a general function to remove variables in a dataset with pearson correlation >=0.85, so as to deal with multicollinearity effectively.
Cancer-Genes-Clustering
In this project, we have a dataset of Cancer genes found in different Cancer cell lines/tissues. I tried to clustered the similar cancer genes using hierarchical and k-means clustering in R
BigMart-Sales-Prediction
We have 2013 sales data for 1559 products across 10 stores in different cities at BigMart. Also, certain attributes of each product and store have been defined. In this project, I built predictive models using different machine learning algorithms and compare them based on RMSE and MAE. Using these models, we find out the sales of each product at a particular store, which will help in understanding the properties of products and stores which play a key role in increasing sales.
Movies-similarity-NLP
In this NLP project, I quantified the similarity of movies based on their plot summaries available on IMDb and Wikipedia, then separated them into groups aka clusters. Finally I have created a dendrogram to represent how closely the movies are related to each other.
awesome-deep-learning-papers
The most cited deep learning papers
awesome-public-datasets
A topic-centric list of HQ open datasets.
awesome-python
A curated list of awesome Python frameworks, libraries, software and resources
coding-interview-university
A complete computer science study plan to become a software engineer.
data-science
:bar_chart: Path to a free self-taught education in Data Science!
data-scientist-roadmap
Toturial coming with "data science roadmap" graphe.
Deep-Learning-Papers-Reading-Roadmap
Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!
deep_learning_2018-19
Официальный репозиторий курса Deep Learning (2018-2019) от Deep Learning School при ФПМИ МФТИ
ds-cheatsheets
List of Data Science Cheatsheets to rule the world
hello-world
A repo of Vasi's ML resources
manim
Animation engine for explanatory math videos
mathematics_dataset
This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.
NLP-10-basics-programs
In this repo, I have written 10 basic NLP programs for beginners
nlp-recipes
Natural Language Processing Best Practices & Examples
Principles-of-Machine-Learning-R
Principles of Machine Learning R
samples
A collection machine learning experiment that use LabML
StatApps
Small web apps that illustrate statistical concepts