Farley Knight's starred repositories
Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and can retrieve information dynamically to do so.
stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
DeepSpeedExamples
Example models using DeepSpeed
PromptPapers
Must-read papers on prompt-based tuning for pre-trained language models.
DUP-ocropy
Python-based tools for document analysis and OCR
trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
pytorch-cpp
C++ Implementation of PyTorch Tutorials for Everyone
lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
CoCa-pytorch
Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch
Hands-On-Reinforcement-Learning-With-Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Instruction-Tuning-Papers
Reading list on instruction tuning, a trend that started with Natural-Instruction (ACL 2022), FLAN (ICLR 2022), and T0 (ICLR 2022).
pytorch-MNIST-CelebA-cGAN-cDCGAN
Pytorch implementation of conditional Generative Adversarial Networks (cGAN) and conditional Deep Convolutional Generative Adversarial Networks (cDCGAN) for MNIST dataset
synthtiger
Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021
nlg-yongzhuo
Text summarization toolkit for Chinese natural language generation (NLG), with corpus data. Extractive summarization via Lead3, keyword, TextRank, TextTeaser, word significance, LDA, LSI, and NMF. (graph, feature, topic model, summarization tool or toolkit)
academic-budget-bert
Repository containing code for "How to Train BERT with an Academic Budget" paper
extreme-bert
ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT”.
mo-sql-parsing
Let's make a SQL parser so we can provide a familiar interface to non-sql datastores!
chatGPT-python-elm
A repository fully generated by ChatGPT, by making it believe it had checked out this repository, which I described with the first line of the README.
reddit-nlp
Perform basic NLP of popular subreddits to understand trending topics
classifying_reddit_posts
Leveraging NLP and supervised learning methods to classify posts scraped via Reddit's API