Ali Safaya's repositories
neurocache
Neurocache: A library for augmenting language models with external caching mechanisms
char-rnn.pytorch
PyTorch implementation of char-rnn (character-level language model)
txt-from-pdf
Extracting clean text from PDFs using pdfminer.six and pypdf.
adaptive-span
Transformer training code for sequential tasks
BERT-NER-Pytorch
Chinese NER (Named Entity Recognition) using BERT (Softmax, CRF, Span)
BiBERT
This is the repository of the EMNLP 2021 paper "BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation".
BooookScore
A package to generate summaries of long-form text and evaluate the coherence of these summaries. Official package for our ICLR 2024 paper, "BooookScore: A systematic exploration of book-length summarization in the era of LLMs".
datasets
TFDS is a collection of datasets ready to use with TensorFlow, JAX, ...
fsl-trendy
Trendy: Few Shot Learning for Topic Classification on Social Media
human-eval
Code for the paper "Evaluating Large Language Models Trained on Code"
ios-demo-app
PyTorch iOS examples
lm-evaluation-harness
A framework for few-shot evaluation of language models.
LongBench
LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
multimodal_seq2seq_gSCAN
The multi-modal sequence to sequence baseline neural models used in the Grounded SCAN paper.
ntm-reader
Neural Turing Machine Reader: Entity State Modeling
pytorch-sentiment-analysis
Tutorials on getting started with PyTorch and TorchText for sentiment analysis.
speechbrain
A PyTorch-based Speech Toolkit
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.