There are 2 repositories under correlation-analysis topic.
:link: Methods for Correlation Analysis
NFC signal and protocol analyzer using SDR receiver
:bar_chart: 数据挖掘常用算法:关联分析Apriori算法,数据分类决策树算法,数据聚类K-means算法
A detialed analysis on the customers, products, orders and shipments of the Brazilian E-commerce giant Olist.
correlationMatrix is a Python powered library for the statistical analysis and visualization of correlations
Assignment-04-Simple-Linear-Regression-2. Q2) Salary_hike -> Build a prediction model for Salary_hike Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization. Correlation Analysis. Model Building. Model Testing. Model Predictions.
Open-source statistical package in Python based on Pandas
Twitter is an online social networking service with over 300 million monthly active users. This enormous amount of data available on social media platforms can be extracted and analyzed for various purposes. In this paper, we aim to investigate the relationship between sentiment analysis of Twitter data and stock market prices for five companies (Walmart, ExxonMobil, Apple, Berkshire Hathaway Inc., and Amazon) by scraping the Tweets extracted from Twitter based on company hashtags and using the twitter intelligence tool – twint. Sentiment analysis is applied to the extracted tweets and a correlation is analyzed between stock market movements of a company and sentiments in tweets. Elaborately, news and tweets in social media about a company would encourage decision of people to invest or not in the stocks of that company and as a result, the stock price of that company would increase or fall. At the end of the paper, it is shown that a none or very weak correlation exists between the rise and fall in stock prices with the public sentiments in tweets
St. Nicolas House Algorithm implementation in R - predicting correlation networks using association chains
Assignment-11-Text-Mining-01-Elon-Musk, Perform sentimental analysis on the Elon-musk tweets (Exlon-musk.csv), Text Preprocessing: remove both the leading and the trailing characters, removes empty strings, because they are considered in Python as False, Joining the list into one string/text, Remove Twitter username handles from a given twitter text. (Removes @usernames), Again Joining the list into one string/text, Remove Punctuation, Remove https or url within text, Converting into Text Tokens, Tokenization, Remove Stopwords, Normalize the data, Stemming (Optional), Lemmatization, Feature Extraction, Using BoW CountVectorizer, CountVectorizer with N-grams (Bigrams & Trigrams), TF-IDF Vectorizer, Generate Word Cloud, Named Entity Recognition (NER), Emotion Mining - Sentiment Analysis.
Protecting portfolios using Correlation Diversification
Scalable code to solve SUMCOR Generalized CCA problem with missing views.
Computational protemics analysis of cancer cell-lines at the level of single-cells
Capstone project for The Data Incubator ('18). Plots SCOTUS vs. public opinion polarity over time given keywords.
Blazing fast Gene/GEM Correlation Analysis for Rust and Python
Exploring Google Play Store apps dataset to identify key factors for app engagement and success, revealing correlations between reviews, installs, categories, ratings, and user preferences.
This repo is an attempt to diagnose Parkinson's disease using voice measurements of patients using machine learning algorithms.
A code tutorial to accompany https://link.springer.com/article/10.3758/s13428-023-02098-1
In this repository, four famous correlation algorithms have been implemented. Pearson, spearman, Chatterjee, and MIC correlation algorithm implemented
A code that populates dark matter halos and computes correlation functions
Supervised Machine Learning project with KNN, decision tree, random forest and adaboost algorithms
Assignment-04-Simple-Linear-Regression-1. Q1) Delivery_time -> Predict delivery time using sorting time. Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization, Feature Engineering, Correlation Analysis, Model Building, Model Testing and Model Predictions using simple linear regression.
Multiple-Linear-Regression-1. Consider only the below columns and prepare a prediction model for predicting Price of Toyota Corolla.
Assignment-05-Multiple-Linear-Regression-2. Prepare a prediction model for profit of 50_startups data. Do transformations for getting better predictions of profit and make a table containing R^2 value for each prepared model. R&D Spend -- Research and devolop spend in the past few years Administration -- spend on administration in the past few years Marketing Spend -- spend on Marketing in the past few years State -- states from which data is collected Profit -- profit of each state in the past few years.
EDA (Exploratory Data Analysis) -1: Loading the Datasets, Data type conversions,Removing duplicate entries, Dropping the column, Renaming the column, Outlier Detection, Missing Values and Imputation (Numerical and Categorical), Scatter plot and Correlation analysis, Transformations, Automatic EDA Methods (Pandas Profiling and Sweetviz).
Supervised-ML---Simple-Linear-Regression---Newspaper-data. EDA and Visualization, Correlation Analysis, Model Building, Model Testing, Model predictions.
Supervised-ML---Multiple-Linear-Regression---Toyota-Cars. EDA, Correlation Analysis, Model Building, Model Testing, Model Validation Techniques, Collinearity Problem Check, Residual Analysis, Model Deletion Diagnostics (checking Outliers or Influencers) Two Techniques : 1. Cook's Distance & 2. Leverage value, Improving the Model, Model - Re-build, Re-check and Re-improve - 2, Model - Re-build, Re-check and Re-improve - 3, Final Model, Model Predictions.
Time-Series Modeling of Bitcoin for Equities, Commodities & Forex Markets
This project demonstrates appropriate use of statistical techniques to analyse data, and extract meaningful insights.
Predicting house price
Codebase: Novel scaling law governing stock price dynamics
Instead of making comparisons between two Excel documents reading line-by-line, this program utilizes data manipulation techniques to make correlations between Excel documents and creates a new Excel document based on these correlations.
The project deals with determining and predicting the type of accident taking place in the city of Austin. The data would help in understanding what possible factors are leading to the accidents based on the severity of the incident that has occurred.