Parameswari Krishnamurthy's repositories
parameshkrishnaa.github.io
My git hub page(resume)
LLM101n
LLM101n: Let's build a Storyteller
Telugu-Morph-lttoolbox
apertium lttoolbox telugu morph object file
CRF-train-test
CRF Sample training and testing
machinetranslate.org
Open information and community for machine translation
sacrebleu
Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons
COMET
A Neural Framework for MT Evaluation
PhrasalVerbs-Idioms-Eng-Identify
Verb phrase identification
Telugu-Morph-Dataset
Telugu morph dataset with setence id, pos tag and morph features
SaTeMr-MT
Sanskrit to Telugu and Sanskrit to Marathi anusaaraka based MT package.
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
IndicTrans2
Translation models for 22 scheduled languages of India
utilities
Some utiltities
Tokenizer_for_Indian_Languages
Tokenizer For Indian Languages
linkedin-skill-assessments-quizzes
Full reference of LinkedIn answers 2022 for skill assessments (aws-lambda, rest-api, javascript, react, git, html, jquery, mongodb, java, Go, python, machine-learning, power-point) linkedin excel test lösungen, linkedin machine learning test LinkedIn test questions and answers
trankit
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Indian_ParallelCorpus
Curated list of publicly available parallel corpus for Indian Languages
Medium
Scripts for Medium articles
stanza
Official Stanford NLP Python Library for Many Human Languages
UD_Telugu-MTG
Telugu data.
gt-nlp-class
Course materials for Georgia Tech CS 4650 and 7650, "Natural Language"
MLP_Word_Segementation
This a code written for training and evaluating Sandhi Splitter or Word segmentor in Dravidian languages. The dataset provided here is for Malayalam, however any langauge dataset can be used. This system tries to classify each and every character in a word to split point and non split point based on the character contexts.
generated-english-phrasal-verbs
[public][generated-english-phrasal-verbs]
scl_2018
Sanskrit Computational Linguistics Tools
dependency-parser
Neural graph-based dependency parser
thirukkural-dataset
A dataset containing all the verses of Thirukkural in Unicode format