Lazarus NLP's repositories
indonesian-sentence-embeddings
Embedding Representation for Indonesian Sentences!
machine-translation
Many-to-Many Multilingual Translation Model for Languages of Indonesia
ConGen
Implementation of ConGen: Unsupervised Control and Generalization Distillation For Sentence Representation (Finding of EMNLP 2022).
EasyDeL
EasyDeL is an OpenSource Library to make your training faster and more Optimized With cool Options for training and serving Both in Python And Mojoš„
indobenchmark-toolkit
Toolkit for Indobenchmark
lazarusnlp.github.io
Lazarus NLP is a collective initiative to revive the dying languages of Indonesia through speech and language technology.
minilmv2.bb
Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)
nanoT5
Fast & Simple repository for pre-training and fine-tuning T5-style models
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
FJFormer
Embark on a journey of paralleled/unparalleled computational prowess with FJFormer - an arsenal of custom Jax Flax Functions and Utils that elevate your AI endeavors to new heights!
mteb
MTEB: Massive Text Embedding Benchmark