rohan's repositories
CF-movie_recommendation_system
A movie recommendation system built on the MovieLens 100K dataset, based on collaborative filtering and SVD.
Practical_Machine_Learning_coursera
Human Activity Recognition (HAR) case study: "Wearable Computing: Accelerometers' Data Classification of Body Postures and Movements". Uses a random forest classifier to quantify and predict the best way to perform an exercise.
DREAM_challenge-COVID-19_EHR
DREAM Challenge 2021: understanding the risk factors that lead to a positive COVID-19 test, using electronic health record data mapped to the OMOP Common Data Model. UW Medicine clinical data sets were also used. Submission for challenge part Q2.
PB-Dataset-Recommender_Engine
Algorithms for recommending NCBI, SRA, and EBI datasets, and for comparing them against your own datasets. Recommender systems | Bio-NLP
Breast_Cancer_benign_or_malignant
Predicting whether a breast cancer is benign or malignant using the UCI breast cancer dataset.
charles_book_club-Case_Study
MIT_open-courseware (P1): This data set represents information associated with individuals who are members of a book club. Using customer data, predict if customers will purchase a book.
compgenomics2021
Using unassembled genome sequence data from the Centers for Disease Control and Prevention (CDC), proceed through five distinct stages of analysis and interpretation of that data: (1) genome assembly, (2) gene prediction, (3) functional annotation, (4) comparative genomics, and (5) production of a predictive web server.
compgenomics-2021-Team2_WebServer
RApid Subtyping and Prediction for E. coli. The web server developed by the team for analysis of E. coli.
Human_Protein_Atlas_challenge-kaggle-2021
Submission for the Human Protein Atlas - Single Cell Classification competition on Kaggle, 2021. A weakly supervised multi-label classification problem.
AML-Classification_using_flow_cytometry
The goal of the project is to predict AML or normal status of patients from flow cytometry data (single-cell). The samples were studied with flow cytometry to quantitate the expression of different protein markers. The challenge is to determine the state of health of the other half, based only on the provided flow cytometry data.
Applied_linear_regression_mv
Linear regression case study.
bioinformatics_courses
A repository for assignments and projects from courses in bioinformatics and computational biology (CB).
BMED-applied_AI
Biomedical (BMED) applied health AI course | assignments, mid-term and final-term projects covering deep learning topics
deep_learning_specialization_7
Courses and projects from the Deep Learning Specialization on Coursera.
gatk
Official code repository for GATK versions 4 and up
grocery-calculator-app
Grocery calculator app that reminds you of items on your list when they are running low.
kaggle_projects
Submissions for competitions on Kaggle.
Machine-Learning
ML course on Coursera by Andrew Ng.
Neural_network
Building neural networks from scratch in Python.
Prog_in-Biological_and_Health_Science
Solutions for day-to-day computational problems in biological and health sciences.
single-cell_rna-seq
Algorithms and functions for working with single cell RNA-seq datasets. (in progress)
SNP_calling_workflow
A simple pipeline written in Bash for SNP calling, using the well-known samtools.
Time-Series-Forecast-Case_Study
Sales analysis of French champagne. Time series forecast to predict sales figures.
Travel-Reservations-Service
Design and development of a system to handle travel reservations. The system will support a "third-party" travel reservations agency. Customers will make flight reservations and hotel/Airbnb bookings for their trips. After the bookings are confirmed, the customer will pay the agency electronically.
vg_snakemake
Snakemake workflow for the vg toolkit.