Bruno Vilar's starred repositories
modern-unix
A collection of modern/faster/saner alternatives to common unix commands.
applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
recommenders
Best Practices on Recommendation Systems
awesome-nlp
:book: A curated list of resources dedicated to Natural Language Processing (NLP)
ciencia-da-computacao
🎓 Um caminho para a educação autodidata em Ciência da Computação!
data-engineer-roadmap
Roadmap to becoming a data engineer in 2021
ann-benchmarks
Benchmarks of approximate nearest neighbor libraries in Python
deepchecks
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.
Fast_Sentence_Embeddings
Compute Sentence Embeddings Fast!
lowresource-nlp-bootcamp-2020
The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020
BERT-Relation-Extraction
PyTorch implementation for "Matching the Blanks: Distributional Similarity for Relation Learning" paper
labelbox-python
A data-centric AI Platform for Building & Using AI
WordMoversEmbeddings
WordMoversEmbeddings(WME) is a simple code for generating the vector representation of sentence/document for text classification and clustering.
mlp-regression-template
Example repo to kickstart integration with mlflow pipelines.
tabular-dl-pretrain-objectives
Revisiting Pretrarining Objectives for Tabular Deep Learning
UD_Portuguese-Bosque
This Universal Dependencies (UD) Portuguese treebank.
phd-thesis
My PhD thesis with all its source files, including all .tex files and images created, as well as the slides of my defense.
portuguese-clinical-pos-tagger
A portuguese clinical POS-Tagger model trained with Flair.