mbert

There are 0 repository under mbert topic.

csebuetnlp / banglabert
This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accpeted in Findings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics: NAACL-2022.
bangla-nlp bangla-language-processing bangla-natural-language-processing sentiment-classification document-classification emotion-classification named-entity-recognition natural-language-inference textual-entailment bert bert-fine-tuning xlm-roberta mbert bengali-nlp bengali-language-processing bengali-natural-language-processing banglabert multilingual-models
Language:Python 230
cambridgeltl / ContrastiveBLI
Improving Word Translation via Two-Stage Contrastive Learning (ACL 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.
bilingual-lexicon-induction word-translation contrastive-learning self-learning cross-lingual-word-embeddings mbert pytorch word-alignment cross-lingual-embeddings bilingual-lexicon-extraction bilingual-word-embedding word-embeddings fasttext-embeddings bilingual-dictionary-induction cross-lingual-word-embedding low-resource-machine-translation information-retrieval machine-translation
Language:Python 32
lirondos / lazaro
An observatory of anglicism usage in the Spanish press
anglicisms bilstm-crf borrowings corpus crf-model linguistics mbert spanish spanish-newswire
Language:Python 10
ishan00 / meta-learning-for-multi-task-multilingual
Official Repository for the paper titled "Meta-Learning for Effective Multi-task and Multilingual Modelling" accepted at EACL 2021
meta-learning reptile multi-task-learning transformers multilingual-models mbert question-answering natural-language-inference part-of-speech-tagging named-entity-recognition paraphrase-identification
Language:Python 8
negar-foroutan / multiLMs-lang-neutral-subnets
[EMNLP 2022] Discovering Language-neutral Sub-networks in Multilingual Language Models.
lottery-ticket-hypothesis multilingual-language-models multilingual-nlp mbert mt5 cross-lingual-transfer
Language:Python 8
Mukaffi28 / Vashantor-A-Large-scale-Multilingual-Benchmark-Dataset
A Large-scale Multilingual Benchmark Dataset for Automated Translation of Bangla Regional Dialects to Bangla Language
banglat5 machine-translation mbert mt5 neural-machine-translation bangla-bert-base regional-dialects
Language:Jupyter Notebook 7
fatemafaria142 / MultiBanFakeDetect-An-Extensive-Benchmark-Dataset-for-Multimodal-Bangla-Fake-News-Detection
This study introduces MultiBanFakeDetect, a novel multimodal dataset for Bangla fake news detection, combining textual and visual information. It features TextFakeNet for text analysis and MultiFusionFake for integrating multimodal data.
early-fusion fake-news-detection late-fusion mbert multimodal-dataset resnet-101 under-resourced-language xlm-roberta densenet-169 fusion-techniques intermediate-fusion benchmark dataset
Language:Jupyter Notebook 4
juletx / multilingual-question-answering
Zero-shot and Translation Experiments on XQuAD, MLQA and TyDiQA
bert mbert multilingual-bert squad xquad zero-shot translate-train mlqa question-answering roberta machine-translation translation tydiqa translate-test xlm-r
Language:Jupyter Notebook 4
BassaniRiccardo / ICEBERT
ICEBERT: Interlingual-Clusters Enhanced BERT. A BERT-like model trained on clusters of monolingual subwords.
clustering mbert subword-segmentation tokenization
Language:Python 3
fatemafaria142 / Large-Language-Models-Over-Transformer-Models-for-Bangla-NLI
This research examines Large Language Models in Bengali Natural Language Inference, comparing them with state-of-the-art models using the XNLI dataset.
bengali large-language-models low-resource-languages natural-language-inference pretrained-language-models banglabert distilbert mbert
Language:Jupyter Notebook 3
DiFronzo / Multilingual-Models
mBERT and XLM-R for encodeing of Scandinavian languages
language mbert multilingual python python3 pytorch transformers xlm-r xlm-roberta scandinavian
Language:Python 2
elsheikh21 / cross-natural-language-inference
ZeroShot XNLI
torch xnli mbert xlm-roberta xlm transformers
Language:Python 2
AditiBagora / Hasoc2021CodeMix
HASOC2021: Subtask 2 a) Codemix Challenge; Contains baselines and hierarchical approach that extracts the relevant context useful for classification of hostile tweets on English-Hindi code-mix data obtained from twitter.
transformers mlp fine-tuning mbert xlm-roberta nlp-machine-learning torch tensorflow feature-extraction
Language:Jupyter Notebook 1
Elijas / lithuanian-text-summarization-model
Deployed model which can summarize Lithuanian language text by leveraging Artificial Neural Networks, Transformers, mBERT.
ann bert language-model mbert nlp pytorch streamlit summarization
Language:Python 1
michaelpeterhoffmann / masterthesis
Multilingual hate speech detection for German, Italian and Spanish Social Media Posts #machine learning #classifier
bert mbert svm-classifier transfer-learning transformer xlmroberta
Language:Jupyter Notebook 1
peterzee-tsien / LING484-COMP599-Final-Projects
By using the hypothesis of historical linguistics, we found a way to improve the performance of multilingual transformers with limited amount of data
fine-tuning multilingual-bert ner pos-tagger swahili wolof yoruba mbert
Language:Jupyter Notebook 1
Koharu24 / mBERT-Unaligned-fine-tuning-for-a-cross-lingual-RD-of-untranslatable-terms
This is a project proposal to implement Yan et al.'s (2020) mBERT-Unaligned for cross-lingual RDs with Japanese, German and Italian untranslatable terms
cross-linguistic-data mbert nlp-machine-learning reverse-dictionary unaligned
0
Koharu24 / mBERT_crosslingual_rd
This is a project proposal to implement Yan et al.'s (2020) mBERT-Unaligned for cross-lingual RDs with Japanese, German and Italian untranslatable terms
cross-lingual-transfer mbert reverse-dictionary untranslatability
Language:Python 0
mbruton0426 / GalicianSRL
Collection of scripts used to create SRL datasets for Galician and Spanish using a verbal indexing method, as well as fine-tuned BERT and XLM-R models for SRL on each language
mbert semantic-role-labeling xlm-r galician semantic-parsing spanish srl-parser
Language:Python 0
MusfiqDehan / Multilingual-Sentence-Alignments-Demo
Align Parallel Sentence of 104 Languages with the help of mBERT and LaBSE
labse mbert multilingual-alignment multilingual-bert
Language:Python 0
NasserMohamedEid / Text-AI-Detection
arabert bert llm mbert mt5 nlp roberta-model streamlit
Language:Jupyter Notebook 0
Revanth-Reddy-Pingala / Abusive_Comment_Detector_BERT
Fine tuned BERT, mBERT and XLMRoBERTa for Abusive Comments Detection in Telugu, Code-Mixed Telugu and Telugu-English.
abusive-comment-detection bert-fine-tuning mbert natural-language-processing nlp pretrained-language-models xlm-roberta abusive-comment-detector
Language:Jupyter Notebook 0
RobinSmits / GPT-3.5-FineTuning
GPT 3.5 FineTuning
dutch-language fine-tuning gpt-35-turbo large-language-models mbert mdistilbert prompt-engineering transformers openai-api deberta-v3 gpt-3-5-turbo
Language:Jupyter Notebook 0
ShafakatArnob / Bengali-Misogyny-Identification-Deep-Learning-LIME
Bengali Misogyny Identification with Deep Learning and LIME.
accuracy banglabert bengali-nlp bert crosslingual deep-learning deep-neural-networks explainable-ai f1-score fine-tuning lime mbert misogyny-detection multilingual bengali-misogyny sexism-detection bengali-sexism
Language:Jupyter Notebook 0
fatemafaria142 / BanglaCalamityMMD-A-Comprehensive-Benchmark-Dataset-for-Multimodal-Disaster-Identification
This study presents a novel multimodal fusion technique for disaster identification in Bangla, combining text and image data using the "BanglaCalamityMMD" dataset. Employing DisasterTextNet, DisasterImageNet, and DisasterMultFusionNet, the approach addresses a key gap in Bangla disaster research.
bangla-dataset benchmark disaster-identification mbert multimodal-fusion swin-transformer swintransformer vision-transformer xlm-roberta banglacalamitymmd benchmarking dataset
Language:Jupyter Notebook
fatemafaria142 / Vashantor-A-Large-scale-Multilingual-Benchmark-Dataset
This study addresses the gap in translating Bangla regional dialects into standard Bangla by creating a large-scale multilingual benchmark dataset of 32,500 sentences in Bangla, Banglish, and English, representing five regional Bangla dialects such as Sylheti, Chittagong, Mymensingh, Noakhali, and Barishal.
bangla-bert-base banglat5 benchmark bleu-score dataset mbert mt5 region-detection word-error-rate bangla-regional-dialects dialect-translation standard-bangla
Language:Jupyter Notebook
honghanhh / definition_extraction
Slovenian Definition Extraction
language-models rule-based-classifier binary-classifier mbert slovenian transformers xlmr mdistilbert python pytorch
Language:Python

mbert

csebuetnlp / banglabert

cambridgeltl / ContrastiveBLI

lirondos / lazaro

ishan00 / meta-learning-for-multi-task-multilingual

negar-foroutan / multiLMs-lang-neutral-subnets

Mukaffi28 / Vashantor-A-Large-scale-Multilingual-Benchmark-Dataset

fatemafaria142 / MultiBanFakeDetect-An-Extensive-Benchmark-Dataset-for-Multimodal-Bangla-Fake-News-Detection

juletx / multilingual-question-answering

BassaniRiccardo / ICEBERT

fatemafaria142 / Large-Language-Models-Over-Transformer-Models-for-Bangla-NLI

DiFronzo / Multilingual-Models

elsheikh21 / cross-natural-language-inference

AditiBagora / Hasoc2021CodeMix

Elijas / lithuanian-text-summarization-model

michaelpeterhoffmann / masterthesis

peterzee-tsien / LING484-COMP599-Final-Projects

Koharu24 / mBERT-Unaligned-fine-tuning-for-a-cross-lingual-RD-of-untranslatable-terms

Koharu24 / mBERT_crosslingual_rd

mbruton0426 / GalicianSRL

MusfiqDehan / Multilingual-Sentence-Alignments-Demo

NasserMohamedEid / Text-AI-Detection

Revanth-Reddy-Pingala / Abusive_Comment_Detector_BERT

RobinSmits / GPT-3.5-FineTuning

ShafakatArnob / Bengali-Misogyny-Identification-Deep-Learning-LIME

fatemafaria142 / BanglaCalamityMMD-A-Comprehensive-Benchmark-Dataset-for-Multimodal-Disaster-Identification

fatemafaria142 / Vashantor-A-Large-scale-Multilingual-Benchmark-Dataset

honghanhh / definition_extraction