Department of Linguistics, K.M. Institute of Hindi and Linguistics's repositories
vardial2018
This repository contains the dataset used for Indo-Aryan Language identitifcation Shared Task as part of the Evaluation Campaign in the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial) at COLING 2018. It has 15k sentences each in Awadhi, Bhojpuri, Braj, Magahi and Hindi
indianlr.github.io
A repository for listing the non-scheduled and endangered Indian language resources and technologies. The website could be accessed here
kmi-linguistics.github.io
Research and Development at the Department of Linguistics in K.M. Institute of Hindi and Linguistics at Dr. Bhim Rao Ambedkar University, Agra
propaganda
Repository of the data and models generated by Mr. Shyam Ratan as part of his MPhil dissrtation titled 'Automatic Detection Of Propaganda In Hindi On Social Media'
sigtyp2020
This repository contains code and details of the KMI-Panlingua-IITKGP system submitted to the SigTyp 2020 Shared Task on Prediction of Linguistic Features. It could be used for training and prediction on any new dataset in the same format with similar information.
speech-aggression
Repository of data and scripts of UGC-UKIERI Project on "Automatic Detection of Verbal Threat in HIndi and English Aggressive Speech"
text-aggression
This is the repository of the aggression project carried out as part of the The Aggression Project at the Microsoft Research India Summer Workshop on Artificial Social Intelligence in June 2017. The repository contains all codes and datasets generated during the school.
trac-2
Repository hosting dataset for the Shared Task on Aggression and Misogyny Identification during Second Workshop on Trolling, Aggression and Cyberbullying (TRAC - 2) as LREC-2020. Please visit the workshop website - https://sites.google.com/view/trac2/shared-task - for more details
western-hindi
Repository for all data and resources on Western Hindi that is being developed at the Institute. Currently, it contains all the data generated as part of the M.Phil. dissertation of Ms. Saba Parween.