sabyasachidatta

Sabyasachi Datta's repositories

Breast-Cancer-Wisconsin-Dataset

Visualize the data in 2-D scatter plot and write the inferences, Make a boxplot for each feature and highlight the outlier, if any, then remove the outlier, make again box plot to show the outlier effect and write the inferences.

Language:Jupyter Notebook010

DieTanics-Dataset

Perform 1. Data input 2. Basic head view of the data 3. Data description 4. Data summary 5. Univariate analysis

Language:Jupyter Notebook000

Hbos-Technique

Hbos (Histogram based Outlier Score) Technique

Language:Jupyter Notebook000

IBM-Bigdata

000

Linear-Regression-Decision-Tree-Gaussian-Naive-Bayes

Perform Linear Regression on the given dataset. Also perform K-Fold cross-validation

Language:Jupyter Notebook010

Preprocessing-Hidden-Markov-Model

Language:Jupyter Notebook010

Stroke-Prediction-Dataset

You need to download ‘Stroke Prediction Dataset’ data using the library Scikit learn; ref is given below. [5] 2. Divide the data randomly in training and testing with a 7:3 ratio 100 times, perform the following tasks with training data and test the performance on testing data. Testing data should remain unseen for all steps.

Language:Jupyter Notebook000

Wine-Data-Clusterring

1. You need to download “Wine” data from the kaggle Perform at least 5 Clustering methods with varying cluster sizes. Find correct cluster numbers for each method and show with line plot, how you finalized this cluster number.

Language:Jupyter Notebook010