anirudhs123 / Indic-Swipe

IndicSwipe is a collection of datasets and neural model architectures for decoding swipe gesture inputs on touch-based Indic language keyboards across 7 languages.

Home Page: https://iitmnlp.github.io/indic-swipe


Project website: https://iitmnlp.github.io/indic-swipe

IndicSwipe

IndicSwipe aims to build a keyboard that supports gesture typing in Indic languages on mobile devices. It provides a novel deep learning architecture that jointly uses Transformers and LSTMs to decode noisy swipe inputs accurately, and it has been tested on 7 languages. To further research in this field, we release two datasets generated by simulations that model human motor control using the principle of jerk minimization. For detailed information about this work, please visit our official website.
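To give a feel for what a jerk-minimizing simulation produces, here is a minimal sketch that joins key centers with the standard minimum-jerk position profile, s(τ) = 10τ³ − 15τ⁴ + 6τ⁵. It is not the simulator used to build the released datasets: the key coordinates, the function names and the number of points per segment are all assumptions made for illustration, and no noise model is included.

```python
def min_jerk_segment(start, end, n_points):
    """Sample n_points (endpoints included) along a minimum-jerk path."""
    if n_points < 2:
        raise ValueError("n_points must be at least 2")
    points = []
    for i in range(n_points):
        tau = i / (n_points - 1)  # normalized time in [0, 1]
        # Minimum-jerk position profile: zero velocity and acceleration
        # at both ends of the movement.
        s = 10 * tau**3 - 15 * tau**4 + 6 * tau**5
        x = start[0] + (end[0] - start[0]) * s
        y = start[1] + (end[1] - start[1]) * s
        points.append((x, y))
    return points


def simulate_trace(word, key_centers, points_per_segment=20):
    """Chain minimum-jerk segments through the key centers of each character."""
    centers = [key_centers[ch] for ch in word]
    trace = [centers[0]]
    for a, b in zip(centers, centers[1:]):
        # Skip the first sample of each segment; it repeats the previous end.
        trace.extend(min_jerk_segment(a, b, points_per_segment)[1:])
    return trace


# Hypothetical key positions, not taken from any real keyboard layout.
KEY_CENTERS = {"क": (0.0, 0.0), "म": (3.0, 1.0), "ल": (6.0, 0.0)}
trace = simulate_trace("कमल", KEY_CENTERS)
print(len(trace))  # 39 points: 1 start + 2 segments x 19 new samples
```

Samples bunch up near each key and spread out mid-segment, which mirrors how a finger slows down around its targets. A realistic simulator would also perturb the path and timing.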

The models and datasets have been developed to cater to two closely related tasks:

  • Indic-to-Indic Decoding: To support users who prefer to type in the native Indic script (Devanagari, Bengali, etc.).
  • English-to-Indic Decoding: To support users who prefer to type on an English script keyboard but want the output in the native script.

IndicSwipe achieves decoding accuracies ranging from 70% to 95% across the two tasks and the 7 languages.

Key Contributions

  1. A Gesture Path Decoding model that uses a multi-headed Transformer along with LSTM layers for coordinate sequence encoding and a character-level LSTM model for character sequence decoding.
  2. A Contrastive Transliteration correction model that uses position-aware character embeddings to measure word proximities and correct spellings of transliterated words.
  3. Two datasets of simulated word traces to support work on gesture typing for Indic language keyboards, including low-resource languages like Telugu and Kannada.
  4. Decoding accuracies of 70-89% for English-to-Indic decoding and 86-95% for Indic-to-Indic decoding across the 7 languages used in the study.
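
The idea behind contribution 2, comparing words through position-aware character representations, can be illustrated with a small non-learned stand-in. The sketch below is not the contrastive model from the paper: it replaces trained embeddings with counts over (character, relative-position bucket) pairs and uses cosine similarity as the proximity measure. The function names, the bucket count and the toy vocabulary are assumptions chosen for illustration.

```python
import math
from collections import Counter


def position_aware_vector(word, n_buckets=4):
    """Count characters keyed by (character, relative-position bucket)."""
    vec = Counter()
    for i, ch in enumerate(word):
        # Map the character index to one of n_buckets relative positions.
        bucket = min(i * n_buckets // len(word), n_buckets - 1)
        vec[(ch, bucket)] += 1
    return vec


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def correct(word, vocabulary):
    """Return the vocabulary word closest to the (possibly misspelled) input."""
    query = position_aware_vector(word)
    return max(vocabulary, key=lambda w: cosine(query, position_aware_vector(w)))


# Toy vocabulary, assumed for illustration only.
vocabulary = ["नमस्ते", "नमक", "कमल"]
print(correct("नमस्त", vocabulary))  # नमस्ते
```

Keying each character by where it occurs means two words that share letters in different orders score lower than words that agree position by position, which is the property a position-aware representation is meant to capture.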

Link to paper: https://www.aclweb.org/anthology/2020.coling-main.87

People

This work has been developed by Anirudh Sriram, Emil Biju, Prof. Mitesh Khapra and Prof. Pratyush Kumar from the Indian Institute of Technology, Madras. Ask us your questions at anirudhsriram30799@gmail.com or emilbiju7@gmail.com.


Languages

Python 100.0%