dongkyuk / Kaggle-Foursquare

Simple Silver Medal solution for Kaggle Foursquare Location Matching

Kaggle-Foursquare

Here is my solution for Kaggle Foursquare Location Matching.

This solution received a Silver Medal without any ensembling or complicated feature engineering.

The pipeline is simple:

  1. Train XLM-RoBERTa with ArcFace loss.
  2. Use the cosine similarities of the XLM-RoBERTa embeddings, together with coordinate distance, to extract match candidates.
  3. Add features for each candidate pair (cosine similarity, distance, LCS, TF-IDF, etc.).
  4. Train a LightGBM model (with FLAML hyperparameter optimization) to select the correct candidates, framed as a binary classification task.
  5. Repeat steps 2-3 on the test data and run inference with the trained LightGBM model.
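
Steps 2 and 3 can be sketched roughly as follows. This is a minimal, standard-library-only illustration and not the code from this repository: the function names, the `top_k` and `max_km` parameters, and the rule of filtering by distance first and then ranking by cosine similarity are all assumptions, since the pipeline description does not say how the two signals are combined. The short `emb` vectors stand in for XLM-RoBERTa embeddings, and "LCS" is interpreted here as longest common subsequence.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two (lat, lon) points in degrees."""
    earth_radius_km = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    d_phi = phi2 - phi1
    d_lambda = math.radians(lon2 - lon1)
    a = math.sin(d_phi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(d_lambda / 2) ** 2
    return 2 * earth_radius_km * math.asin(math.sqrt(a))


def lcs_length(a, b):
    """Length of the longest common subsequence of two strings (row-by-row DP)."""
    prev = [0] * (len(b) + 1)
    for ch_a in a:
        cur = [0]
        for j, ch_b in enumerate(b, 1):
            cur.append(prev[j - 1] + 1 if ch_a == ch_b else max(prev[j], cur[j - 1]))
        prev = cur
    return prev[-1]


def candidate_pairs(places, top_k=2, max_km=1.0):
    """Build candidate pairs with simple features.

    `places` is a list of dicts with keys id, name, lat, lon, emb (hypothetical schema).
    Assumed rule: keep neighbours within `max_km`, then take the `top_k` by cosine similarity.
    """
    rows = []
    for p in places:
        scored = []
        for q in places:
            if q["id"] == p["id"]:
                continue
            dist = haversine_km(p["lat"], p["lon"], q["lat"], q["lon"])
            if dist > max_km:
                continue
            scored.append((cosine_similarity(p["emb"], q["emb"]), dist, q))
        scored.sort(key=lambda item: item[0], reverse=True)
        for sim, dist, q in scored[:top_k]:
            name_p, name_q = p["name"].lower(), q["name"].lower()
            longest = max(len(name_p), len(name_q), 1)
            rows.append({
                "id_1": p["id"],
                "id_2": q["id"],
                "cos_sim": sim,
                "dist_km": dist,
                "lcs_ratio": lcs_length(name_p, name_q) / longest,
            })
    return rows


if __name__ == "__main__":
    # Toy data; the 2-d vectors are placeholders for real embeddings.
    demo = [
        {"id": "a", "name": "Starbucks Coffee", "lat": 40.0, "lon": -74.0, "emb": [1.0, 0.0]},
        {"id": "b", "name": "Starbucks", "lat": 40.0001, "lon": -74.0001, "emb": [0.9, 0.1]},
        {"id": "c", "name": "Pizza Hut", "lat": 41.0, "lon": -74.0, "emb": [0.0, 1.0]},
    ]
    for row in candidate_pairs(demo):
        print(row)
```

Each row produced this way would become one training example for the binary classifier in step 4, labeled by whether the two ids refer to the same place. The all-pairs loop is quadratic and only suitable for illustration; a real dataset would need a nearest-neighbour index or spatial blocking.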

Languages

Language:Python 100.0%