Bruno Vilar (brunovilar)

brunovilar

Geek Repo

Location:SP, SP

Github PK Tool:Github PK Tool

Bruno Vilar's starred repositories

modern-unix

A collection of modern/faster/saner alternatives to common unix commands.

applied-ml

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

recommenders

Best Practices on Recommendation Systems

Language:PythonLicense:MITStargazers:18186Issues:270Issues:838

awesome-nlp

:book: A curated list of resources dedicated to Natural Language Processing (NLP)

ciencia-da-computacao

🎓 Um caminho para a educação autodidata em Ciência da Computação!

data-engineer-roadmap

Roadmap to becoming a data engineer in 2021

ann-benchmarks

Benchmarks of approximate nearest neighbor libraries in Python

Language:PythonLicense:MITStargazers:4668Issues:117Issues:198

deepchecks

Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.

Language:PythonLicense:NOASSERTIONStargazers:3428Issues:19Issues:973

checklist

Beyond Accuracy: Behavioral Testing of NLP models with CheckList

Language:Jupyter NotebookLicense:MITStargazers:1987Issues:29Issues:89

SIF

sentence embedding by Smooth Inverse Frequency weighting scheme

Language:PythonLicense:MITStargazers:1087Issues:34Issues:42
Language:TypeScriptLicense:Apache-2.0Stargazers:756Issues:9Issues:55

Fast_Sentence_Embeddings

Compute Sentence Embeddings Fast!

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:612Issues:12Issues:55

lowresource-nlp-bootcamp-2020

The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:596Issues:22Issues:1

BERT-Relation-Extraction

PyTorch implementation for "Matching the Blanks: Distributional Similarity for Relation Learning" paper

Language:PythonLicense:Apache-2.0Stargazers:560Issues:12Issues:45

demo

JupyterLite demo deployed to GitHub Pages 🚀

Language:Jupyter NotebookStargazers:323Issues:10Issues:35
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:294Issues:15Issues:6

papers

Curated repository of notes from papers I'm reading, mostly NLP related. Updated regularly.

dict2vec

Dict2vec is a framework to learn word embeddings using lexical dictionaries.

Language:PythonLicense:GPL-3.0Stargazers:115Issues:5Issues:4

labelbox-python

A data-centric AI Platform for Building & Using AI

Language:PythonLicense:Apache-2.0Stargazers:94Issues:11Issues:72
Language:PythonLicense:MITStargazers:90Issues:3Issues:12

WordMoversEmbeddings

WordMoversEmbeddings(WME) is a simple code for generating the vector representation of sentence/document for text classification and clustering.

Language:CLicense:Apache-2.0Stargazers:81Issues:13Issues:5

mlp-regression-template

Example repo to kickstart integration with mlflow pipelines.

Language:PythonLicense:Apache-2.0Stargazers:75Issues:9Issues:11

tabular-dl-pretrain-objectives

Revisiting Pretrarining Objectives for Tabular Deep Learning

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:47Issues:4Issues:2

UD_Portuguese-Bosque

This Universal Dependencies (UD) Portuguese treebank.

Language:Common LispLicense:NOASSERTIONStargazers:46Issues:121Issues:357

Euphemism

Self-Supervised Euphemism Detection and Identification for Content Moderation, IEEE S&P (Oakland) 2021

Language:PythonLicense:MITStargazers:29Issues:3Issues:2

SemClinBr

SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks

PyWME

A pure python implementation of the Word Mover‘s Embedding Algorithm

Language:PythonLicense:Apache-2.0Stargazers:6Issues:2Issues:0

phd-thesis

My PhD thesis with all its source files, including all .tex files and images created, as well as the slides of my defense.

Language:TeXStargazers:3Issues:1Issues:0

portuguese-clinical-pos-tagger

A portuguese clinical POS-Tagger model trained with Flair.

Stargazers:2Issues:0Issues:0