Niall Taylor's repositories
BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
clin-summ
Clinical text summarization by adapting large language models
DeCLUTR
The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!
emb-gam
An interpretable and efficient predictor using pre-trained language models. Scikit-learn compatible.
ferret
A Python package for benchmarking interpretability techniques.
HealthLLM
Health-LLM: Personalized Retrieval-Augmented Disease Prediction Model
intro_to_llm_agents
Simple introduction to LLM Agents
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
localLLM_langchain
Local LLM Agent with Langchain
MentalLLaMA
This repository introduces MentaLLaMA, the first open-source instruction-following large language model for interpretable mental health analysis.
mergekit
Tools for merging pretrained large language models.
random_nn_tutorials
Paper implementations from scratch and machine learning tutorials
RoBERT_Recurrence_over_BERT
PyTorch implementation of Recurrence over BERT (RoBERT), based on the paper https://arxiv.org/abs/1910.10781, with a comparison against PyTorch implementations of other hierarchical methods (mean pooling and max pooling) and truncation methods (head only and tail only) presented in the paper https://arxiv.org/abs/1905.05583
setfit
Efficient few-shot learning with Sentence Transformers
synthetic-data-blog
This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data
targetedSummarization
TextReducer - A Tool for Summarization and Information Extraction
Text-To-Image-Generator
Python GUI application that generates images from user prompts using the StableDiffusionPipeline model from the diffusers module. Users enter a prompt, click a button to generate an image, and view the result in the GUI window.
topicx_is_neural_tm_better_than_clustering
Model zoo for topic models, neural topic models, contextual embeddings for topic models ...
whatlies
Toolkit to help understand "what lies" in word embeddings. Also benchmarking!