Rennan Cordeiro's starred repositories
crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
xv6-public
xv6 OS
long_llama
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.
summarize-from-feedback
Code for "Learning to summarize from human feedback"
GoogleSearchCrawler
a tool for crawl Google search results
LongDocSum
Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"
pesquisas-presidenciais-2022
Pesquisas para presidente - 2022
QueCos_code
The code repo for <Enriching Query Semantics for Code Search with Reinforcement Learning>
KorKeyBLD_NLP
Effective Retrieval Model implementing "KeyBLD : Selecting Key Blocks with Pre-ranking for Long Document Information Retrieval" with Korean dataset.
abusive-language
Code for the paper "On the Presence of Abusive Language in Mis/Disinformation", accepted at SocInfo 2022.