JR Oakes's starred repositories
blackhat-python3
Source code for the book "Black Hat Python" by Justin Seitz. The code has been fully converted to Python 3, reformatted to comply with PEP 8, and refactored to eliminate dependency issues caused by deprecated libraries.
TransformerSum
Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.
finetune-gpt2xl
Guide: Fine-tune GPT2-XL (1.5 billion parameters) and GPT-Neo (2.7 billion parameters) on a single GPU with Hugging Face Transformers using DeepSpeed
Clustering-with-LLM
A customer segmentation project can be approached in multiple ways. In this repository, we will explore advanced techniques for defining clusters and analyzing the results.
transformer-lm
Transformer language model (GPT-2) with SentencePiece tokenizer
tech-seo-crawler
Build a small, three-domain internet using GitHub Pages and Wikipedia, and construct a crawler to crawl, render, and index it.
HMNet-End-to-End-Abstractive-Summarization-for-Meetings
"End-to-End Abstractive Summarization for Meetings" paper - Unofficial PyTorch Implementation
bert2bert-summarization
Abstractive summarization using Bert2Bert framework.
transformers-trainers
Tools for training PyTorch language models
data-pipeline
Build a data pipeline using Google BigQuery, dbt, Google Sheets, and Supermetrics. It helps you create a monthly reporting toolkit that pulls in data from a variety of marketing channels.
gmb_monitor
Script that takes a list of brand queries and checks daily which image appears in the Knowledge Graph (KG) for each one.
Electra_with_tensorflow
An implementation of ELECTRA following the paper "ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators"
elixir-ticketmaster
Elixir library for Ticketmaster's API