Maharaj Brahma (maharajbrahma)

maharajbrahma

Geek Repo

Company:@D-OMA @bodonlp

Location:Kokrajhar, India

Home Page:maharajbrahma.github.io

Twitter:@mrajbrahma

Github PK Tool:Github PK Tool


Organizations
bodonlp
stihub-cit

Maharaj Brahma's starred repositories

WhisperCppAndroidDemo

A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.

Language:CLicense:MITStargazers:62Issues:0Issues:0

audio.whisper

Transcribe audio files using the "Whisper" Automatic Speech Recognition model from R

Language:CLicense:NOASSERTIONStargazers:113Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:13625Issues:0Issues:0

indonlu

The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models, and a starter code! (AACL-IJCNLP 2020)

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:536Issues:0Issues:0

DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Language:C++License:MPL-2.0Stargazers:25155Issues:0Issues:0

kbd-audio

🎤⌨️ Acoustic keyboard eavesdropping

Language:C++License:MITStargazers:8478Issues:0Issues:0

ggwords

Generate language n-gram statistics

Language:C++License:GPL-3.0Stargazers:17Issues:0Issues:0

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:CLicense:MITStargazers:34809Issues:0Issues:0

kscp

Kashmiri Speech Corpus Processing

Language:CStargazers:3Issues:0Issues:0

tweetnlp

TweetNLP for all the NLP enthusiasts working on Twitter! The Python library tweetnlp provides a collection of useful tools to analyze/understand tweets such as sentiment analysis, emoji prediction, and named entity recognition, powered by state-of-the-art language models specialised on Twitter.

Language:PythonLicense:MITStargazers:306Issues:0Issues:0

MorphyNet

MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)

Stargazers:36Issues:0Issues:0

K-MHaS

This repository contains Korean Hate Speech dataset for paper, "K-MHaS: A Multi-label Hate Speech Detection Dataset in Korean Online News Comment", accepted by COLING2022.

Language:Jupyter NotebookStargazers:40Issues:0Issues:0

StrokeNet

The official code for our EMNLP 2022 long paper [Breaking the Representation Bottleneck of Chinese Characters: Neural Machine Translation with Stroke Sequence Modeling]

Language:PythonStargazers:22Issues:0Issues:0
Language:PythonLicense:MITStargazers:56Issues:0Issues:0

gector

Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)

Language:PythonLicense:Apache-2.0Stargazers:898Issues:0Issues:0

Contrastive-Learning-NLP-Papers

Paper List for Contrastive Learning for Natural Language Processing

Stargazers:536Issues:0Issues:0

python-bpe

Byte Pair Encoding for Python!

Language:PythonLicense:MITStargazers:223Issues:0Issues:0

SelfTraining4UNMT

[ACL 2022] Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation

Language:PythonStargazers:31Issues:0Issues:0

bitswap

Bit-Swap: Recursive Bits-Back Coding for Lossless Compression with Hierarchical Latent Variables

Language:PythonStargazers:265Issues:0Issues:0

cog

Cog is a tool for comparing languages using lexicostatistics and comparative linguistics techniques.

Language:C#License:MITStargazers:23Issues:0Issues:0

learning-chess-blindfolded

AAAI 2022 Paper: Bet even Beth Harmon couldn't learn chess like that :)

Language:Jupyter NotebookStargazers:35Issues:0Issues:0

bart_ls

Long-context pretrained encoder-decoder models

Language:PythonLicense:NOASSERTIONStargazers:95Issues:0Issues:0
Language:PythonLicense:MITStargazers:93Issues:0Issues:0
Language:PythonStargazers:186Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:133Issues:0Issues:0

turkish-bert

Turkish BERT/DistilBERT, ELECTRA and ConvBERT models

Language:PythonStargazers:493Issues:0Issues:0
Language:TypeScriptLicense:MITStargazers:1Issues:0Issues:0

self-debiasing

This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".

Language:PythonLicense:Apache-2.0Stargazers:86Issues:0Issues:0

nlg-text-generation

This repository contains Natural Language Generation (NLG) models aimed on generating text of fairy tales (Markov Chain, LSTM neural network, GPT-2 Transformers).

Language:Jupyter NotebookStargazers:55Issues:0Issues:0

algorithm-whiteboard-resources

this is where we share notebooks/projects used in your youtube channel

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:146Issues:0Issues:0