There is 1 repository under the code-mixing topic.
A curated list of research papers and resources on code-switching
This tool helps automatically generate grammatically valid synthetic code-mixed data using linguistic theories such as Equivalence Constraint Theory and Matrix Language Theory.
A pipeline for transliteration, spell correction, POS tagging, and word sense disambiguation of Hinglish code-mixed data into Hindi Devanagari script.
Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021)
Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.
Repository containing code for abusive tweet detection, location detection, and gender detection
This repo contains the source code of HIT: A Hierarchically Fused Deep Attention Network for Robust Code-mixed Language Representation (accepted at ACL 2021)
Jopara (Guarani-dominant mixed with Spanish) sentiment analysis corpus
Indonesian-English code-mixed Twitter dataset
A word-level language identification (LID) tool for Tagalog-English (Taglish) text.
Psycholinguistic analysis of code-mixing: term project for Speech and Natural Language Processing (CS60057), Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur
A language detection model for code-switched texts in es/en/zh
Handling out-of-vocabulary (OOV) words in Bahasa Rojak (Malaysian code-mixed language) and performing sentiment analysis with XLM-R fine-tuned for the downstream task
Tweet ids for code-mixed Russian-German and Russian-Hebrew tweets
A depression detection system for Sinhala-English code-mixed text posted by users on social media. The frontend was developed with Bootstrap, HTML, and jQuery; the backend was developed with Flask.
The official code for the "True Bilingual NMT" paper
300-Person-Mandarin-Chinese-and-English-Bilingual-Spontaneous-Monologue-smartphone
A Centralized Frenglish Benchmark from Naturally Occurring Code-Switching and Code-Mixing
This is a machine learning project focused on analysing and classifying sentiments in code-switched and code-mixed text, specifically targeting the unique linguistic characteristics found in Malaysian conversations.