balakhonoff / kaggle-fraud-detection

Solution of the Kaggle IEEE-CIS Fraud Detection Competition

IEEE-CIS Fraud Detection

https://www.kaggle.com/c/ieee-fraud-detection

Team: [ods.ai] No Fosters for Fraudsters

Private results: 76/6381, top 2%, silver medal

Kudos to my teammates:

Model description

Throughout the competition, we built many models that differed in data preprocessing, hyperparameters, and random seeds. For the final solution, we selected 20 LightGBM models and 5 CatBoost models and stacked them with a second-layer LightGBM model. Finally, we postprocessed the stacked predictions.
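To make the stacking step concrete, here is a minimal sketch of how second-layer features could be assembled from first-layer models. It is an illustration under assumptions, not the team's actual pipeline: the out-of-fold scheme, the fold count, and every name in it (`ThresholdRateModel`, `make_folds`, `out_of_fold_column`, `build_stack_features`) are hypothetical. The toy model is a standard-library stand-in so the example runs anywhere; a wrapper around a LightGBM or CatBoost model with the same `fit`/`predict` shape would take its place.

```python
import random
from statistics import mean


class ThresholdRateModel:
    """Hypothetical stand-in for a first-layer model (not LightGBM/CatBoost).

    It learns the fraud rate on each side of a threshold on one feature.
    """

    def __init__(self, feature, threshold):
        self.feature = feature
        self.threshold = threshold
        self.rates = {False: 0.0, True: 0.0}

    def fit(self, X, y):
        for side in (False, True):
            labels = [t for row, t in zip(X, y)
                      if (row[self.feature] > self.threshold) == side]
            # Fall back to the overall rate if one side has no rows.
            self.rates[side] = mean(labels) if labels else mean(y)
        return self

    def predict(self, X):
        return [self.rates[row[self.feature] > self.threshold] for row in X]


def make_folds(n_rows, n_splits, seed):
    """Shuffle row indices and deal them into n_splits validation folds."""
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)
    return [indices[i::n_splits] for i in range(n_splits)]


def out_of_fold_column(make_model, X, y, X_test, folds):
    """Return (train column, test column) of predictions for one model."""
    oof = [0.0] * len(X)
    test_preds = []
    for valid_idx in folds:
        valid = set(valid_idx)
        train_idx = [i for i in range(len(X)) if i not in valid]
        model = make_model().fit([X[i] for i in train_idx],
                                 [y[i] for i in train_idx])
        # Each training row is scored only by a model that never saw it.
        for i, p in zip(valid_idx, model.predict([X[i] for i in valid_idx])):
            oof[i] = p
        test_preds.append(model.predict(X_test))
    # Test predictions are averaged over the per-fold models.
    return oof, [mean(col) for col in zip(*test_preds)]


def build_stack_features(model_factories, X, y, X_test, n_splits=5, seed=0):
    """Assemble second-layer matrices, one column per first-layer model."""
    folds = make_folds(len(X), n_splits, seed)
    columns = [out_of_fold_column(f, X, y, X_test, folds)
               for f in model_factories]
    train_meta = [list(row) for row in zip(*(c[0] for c in columns))]
    test_meta = [list(row) for row in zip(*(c[1] for c in columns))]
    return train_meta, test_meta


# Synthetic toy data (not the competition dataset).
rng = random.Random(42)
X = [[rng.random(), rng.random()] for _ in range(200)]
y = [1 if row[0] > 0.7 else 0 for row in X]
X_test = [[rng.random(), rng.random()] for _ in range(50)]

factories = [lambda: ThresholdRateModel(0, 0.7),
             lambda: ThresholdRateModel(1, 0.5)]
train_meta, test_meta = build_stack_features(factories, X, y, X_test)
print(len(train_meta), len(train_meta[0]), len(test_meta))
```

In a setup like the one described above, `train_meta` would have one column per selected first-layer model (25 in total) and would be used, together with the labels, to fit the second-layer LightGBM model, which then scores `test_meta`. Scoring each training row only with a model that did not see it keeps the second layer from learning from leaked labels.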

Model scheme:

https://drive.google.com/file/d/1EQWsDyP7V6K2XOeDGuOhdNIidHqrHC35/view?usp=sharing

Links

First-layer models:

Preprocessing:

Task description

Imagine standing at the check-out counter at the grocery store with a long line behind you and the cashier not-so-quietly announces that your card has been declined. In this moment, you probably aren’t thinking about the data science that determined your fate.

Embarrassed, and certain you have the funds to cover everything needed for an epic nacho party for 50 of your closest friends, you try your card again. Same result. As you step aside and allow the cashier to tend to the next customer, you receive a text message from your bank. “Press 1 if you really tried to spend $500 on cheddar cheese.”

While perhaps cumbersome (and often embarrassing) in the moment, this fraud prevention system is actually saving consumers millions of dollars per year. Researchers from the IEEE Computational Intelligence Society (IEEE-CIS) want to improve this figure, while also improving the customer experience. With higher-accuracy fraud detection, you can get on with your chips without the hassle.

IEEE-CIS works across a variety of AI and machine learning areas, including deep neural networks, fuzzy systems, evolutionary computation, and swarm intelligence. Today they’re partnering with the world’s leading payment service company, Vesta Corporation, seeking the best solutions for the fraud prevention industry, and now you are invited to join the challenge.

In this competition, you’ll benchmark machine learning models on a challenging large-scale dataset. The data comes from Vesta's real-world e-commerce transactions and contains a wide range of features from device type to product features. You also have the opportunity to create new features to improve your results.

If successful, you’ll improve the efficacy of fraudulent transaction alerts for millions of people around the world, helping hundreds of thousands of businesses reduce their fraud loss and increase their revenue. And of course, you will save party people just like you the hassle of false positives.
