Kateryna Drogaieva's repositories
Boto3-Demo
Examples of dynamic creation and use of VPC, EC2, Load Balancer, Auto Scaling group, Launch Configuration, Redshift cluster, S3, SQS, SNS
TweetsAutorshipAttributionModelsEvaluation
In this notebook I work on the question whether the author of a tweet (very short text) can be successfully identified. I try to choose the best classification method its parameters set and features
Auto-Insurance-Risk-Classification-and-Claim-Prediction
XGB model and feature importance to predict At Fault Auto Claims
Mini-ETL-Tool
Mini ETL Tool is a Python module. It allows to run SQL and CLI commands in parallel or sequential mode, set up preconditions, dependencies and notifications
2016-US-President-Election-Primary-Results-Analysis
Correlation analysis between candidates and county facts based on 2016 US President Election Primary Results by county
AWS-Sage-Maker-Machine-Learning-Experiments-Automation
Machine Learning experiments automation with the help of AWS Sage Maker using XGBoost Classification and Insurance Property data
aws_data_pipeline_samples
Few AWS Data Pipeline samples to demo export from MS SQL to a file in S3 bucket, load a DynamoDB table to Redshift, multiple dependencies in the flow
BartScraper
The application collects real time train departures from Bart API
Data-Feeds
Advanced SQL
data-pipeline-samples
This repository hosts sample pipelines
eva.ru
What Russian women talk about - Natural Language Processing (NLP) research of Russian women eva.ru forum
Insurance-Data-Pipelines
Pentaho Data Integration ETL and Matillion ELT
Insurance-Data-Warehouse
Data Warehouse Modeling
KaterynaD
Config files for my GitHub profile.
KaterynaD.github.io
Personal site
TechcrunchPostsMulticlassPostsClassification
In this notebook I search the best classifier and its parameters for posts multi-class classifications based on authorship attributes
TweetsListener
Collects tweets and performs sentiment analysis based on emoticons and NLP (TextBlob)