There are 0 repository under chi-square-test topic.
Machine learning for beginner(Data Science enthusiast)
Case Studies and Projects in Machine Learning/EDA/DL
This is an optional model development project on a real dataset related to predicting the different progressive levels of Alzheimer’s disease (AD).
A 30+ node flowchart for selecting the right statistical test for evaluating experimental data.
This is an optional model development project on a real dataset related to predicting the different progressive levels of Alzheimer’s disease (AD) with MRI data.
A friendly, automated chi-square test function which takes care of post-hoc tests and multiple comparisons.
Chi-Squared For Feature Selection using SelectKBest
about statistical techniques for Data Science
"A set of Jupyter Notebooks on feature selection methods in Python for machine learning. It covers techniques like constant feature removal, correlation analysis, information gain, chi-square testing, univariate selection, and feature importance, with datasets included for practical application.
The code build a correlation like heat but using chi-square test for catagorical variables. Python and R have built in libraries for producing heatmap for correlation test but there is no such library to produce heat map for chi-square test of association.
Wesleyan University
Shiny interface for growth model fit
Adapted chi-squared and CMH test to evolve and resequenced data. Includes drift and pool sequencing variance in the tests
Examples of using the test statistics to test hypotheses
Lean Six Sigma with Python — Chi-Squared Test
Polycystic Ovary Syndrome (PCOS) is a widespread pathology that affects many aspects of women's health, with long-term consequences beyond the reproductive age. The wide variety of clinical referrals, as well as the lack of internationally accepted diagnostic procedures, have had a significant impact on making it difficult to determine the exact etiology of the disease. The exact histology of PCOS is not yet clear. It is therefore a multifaceted study, which shares genetic and environmental factors. The aim of this project is to analyse simple factors (height, weight, lifestyle changes, etc.) and complex (imbalances of bio hormones and chemicals such as insulin, vitamin D, etc.) factors that contribute to the development of the disease. The data we used for our project was published in Kaggle, written by Prasoon Kottarathil, called Polycystic ovary syndrome (PCOS) in 2020. This database contains records of 543 PCOS patients tested on the basis of 40 parameters. For this, we have used Machine Learning techniques such as Logistic Regression, Decision Trees, SVMs, Random Forests, etc, A detailed analysis of all the items made using graphs and programs and prediction using Machine Learning Models helped us to identify the most important indicators for the same.
All type of calculators like Cuboid (4D), Binning, Chi-square test, Red-black tree, Binary search tree, Longest Common Sub Sequence, Master Theorm, Heap Sort, Decision Theory at one place ✨
[Survival Analytics] Working with Dr. Surbhi Grover MD, MPH evaluating survival outcomes, treatment, and survivorship for a cervical cancer cohort with or without HIV co-infection in Botswana, Africa using methods rooted in statistics.
a simple shiny app that perform statistical tests such as normal test, t-test, and chi-square test
Notes on statistical inference made for learning statistics for data scientists
Here I did an analysis of a survey taken on cell based meat and processed it using R to find correlations and make some assumptions on the data.
Customer churn analysis using EDA and confirmed by Chi-square hypothesis testing.
Implementing Unif(0,1) Pseudo Random Generators & performing Statistical Tests on them
More practical differentially private publication of key statistics in GWAS
This project analyzes a Portuguese bank's term deposit marketing campaigns, uncovering key factors and client profiles to optimize marketing strategies and enhance the bank's lending pool and revenues.
HHA507 / Data Science / Assignment 5 / Inferential Statistics
This repository contains statistical analyses conducted on various datasets related to medical and health factors. The analyses include Spearman correlation, Chi-square test, and Linear Regression to explore relationships and predictive models related to heart attacks and other medical conditions.
chi-square test within the Montana Library case study.
This project focuses on predicting whether a customer will default on their credit card payment in the upcoming month. Utilizing historical transaction data and customer demographics, the project employs various machine learning algorithms to distinguish between risky and non-risky customers for better credit risk management.
Stats HW chat bot that solves chi-square problems involving goodness of fit, homogeneity, or independence
Hypothesis-Testing-Chi2-Test-Athletes-and-Smokers. Assume Null Hypothesis as Ho: Independence of categorical variables (Athlete and Smoking not related). Thus Alternate Hypothesis as Ha: Dependence of categorical variables (Athlete and Smoking is somewhat/significantly related). As (p_value = 0.00038) < (α = 0.05); Reject Null Hypothesis i.e. Dependence among categorical variables Thus Athlete and Smoking is somewhat/significantly related.
Embark on a journey of data-driven insights with our diabetes research project. Leveraging Python's pandas, matplotlib, and scikit-learn, we preprocess, visualize, and analyze 330 health features. Employing logistic regression, decision trees, KNN, and SVM, we predict diabetes with precision.
The project aims to build a Species Distribution Model for the frog species - "Litoria Fallax" across Australia using TerraClimate variables.
This repository is a collection of basic code templates for A/B Testing. All codes I am sharing are from the practical exercises I did from the Data Science Infinity Program.