Breno Matos (brenomatos)

brenomatos

Geek Repo

Location:Belo Horizonte - Brazil

Home Page:brenomatos.github.io

Github PK Tool:Github PK Tool


Organizations
LatinUFMG
MPMG-DCC-UFMG

Breno Matos's starred repositories

yt-dlp

A feature-rich command-line audio/video downloader

Language:PythonLicense:UnlicenseStargazers:83026Issues:498Issues:7740

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:67837Issues:570Issues:0

LLM101n

LLM101n: Let's build a Storyteller

newspaper

newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:

Language:PythonLicense:MITStargazers:14076Issues:387Issues:675

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:11640Issues:93Issues:331

al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

Language:HTMLLicense:MITStargazers:10598Issues:25Issues:548

ML-Papers-of-the-Week

🔥Highlighting the top ML papers every week.

bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Language:PythonLicense:Apache-2.0Stargazers:6798Issues:72Issues:123

Surprise

A Python scikit for building and analyzing recommender systems

Language:PythonLicense:BSD-3-ClauseStargazers:6362Issues:145Issues:383

handcalcs

Python library for converting Python calculations into rendered latex.

Language:CSSLicense:Apache-2.0Stargazers:5572Issues:88Issues:185

whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4360Issues:43Issues:178

scikit-llm

Seamlessly integrate LLMs into scikit-learn.

Language:PythonLicense:MITStargazers:3258Issues:35Issues:56

detoxify

Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.

Language:PythonLicense:Apache-2.0Stargazers:929Issues:15Issues:63

nlp-phd-global-equality

A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP

soccerdata

⛏⚽ Scrape soccer data from Club Elo, ESPN, FBref, FiveThirtyEight, Football-Data.co.uk, FotMob, Sofascore, SoFIFA, Understat and WhoScored.

Language:PythonLicense:NOASSERTIONStargazers:587Issues:14Issues:153

cabrita

Finetuning InstructLLaMA with portuguese data

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:554Issues:10Issues:14
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:324Issues:29Issues:12

innertube

Python Client for Google's Private InnerTube API. Works with YouTube, YouTube Music and more!

Language:PythonLicense:MITStargazers:280Issues:7Issues:33

maritalk-api

Code and documentation for the MariTalk API

Language:PythonLicense:MITStargazers:246Issues:16Issues:18

HateXplain

Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.

Language:PythonLicense:MITStargazers:187Issues:7Issues:18

exploring-T5

A repo to explore different NLP tasks which can be solved using T5

Language:Jupyter NotebookStargazers:168Issues:3Issues:15

charformer-pytorch

Implementation of the GBST block from the Charformer paper, in Pytorch

Language:PythonLicense:MITStargazers:117Issues:5Issues:7

UnknownPleasures

Python script that makes the Unknown Pleasures album art

Language:PythonStargazers:95Issues:4Issues:0

ppgccufmg

Uma classe LaTeX para dissertações, teses e propostas do Programa de Pós-Graduação em Ciência da Computação (PPGCC) da Universidade Federal de Minas Gerais (UFMG).

Language:TeXLicense:MITStargazers:50Issues:2Issues:2

Tutorial-Resources

Resources and tools for the Tutorial - "Hate speech detection, mitigation and beyond" presented at ICWSM 2021

Language:PythonLicense:MITStargazers:36Issues:5Issues:1

gap-text2sql

GAP-text2SQL: Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training

Language:PythonLicense:Apache-2.0Stargazers:26Issues:1Issues:0

That-is-a-Known-Lie

Main GIt repo for the ACL paper "That is a Known Lie: Detecting Previously Fact-Checked Claims"

C01

Coleta de Dados Públicos

Language:PythonLicense:GPL-3.0Stargazers:18Issues:5Issues:11255
Language:Jupyter NotebookStargazers:13Issues:13Issues:0

forkkit

Web crawler to mine album review scores and metadata from pitchfork.com

Language:PythonLicense:MITStargazers:5Issues:0Issues:10