Arian Askari's starred repositories
arxiv-latex-cleaner
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
BERT-related-papers
BERT-related papers
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
awesome-twitter-data
A list of Twitter datasets and related resources.
attention_sinks
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
awesome-pretrained-models-for-information-retrieval
A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).
Knowledge-Grounded-Conversation
A Knowledge Grounded Conversation (KGC) Paper Reading List Maintained by Chuan Meng.
Twitter-Follower-Count
Display the number of followers of Twitter users
reddit_collector
Reddit Collector and Text Processor
LLM-Misinfo-QA
This repository contains data and code used for On the Risk of Misinformation Pollution with Large Language Models (EMNLP 2023 Findings).
Wikipedia_TF_IDF_Dataset
Pre-computed IDF stats over all EN Wiki articles
transformer-vs-bm25
ECIR'22 - How Different are Pre-trained Transformers for Text Ranking? D.Rau et al.
bem_score_pytorch
Answer Equivalence BEM score example in PyTorch using Huggingface Tokenizer