Arian Askari (arian-askari)

arian-askari

Geek Repo

Company:Leiden University (LIACS)

Location:Netherlands

Home Page:arian-askari.github.io

Twitter:@arian_ask

Github PK Tool:Github PK Tool

Arian Askari's starred repositories

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language:Jupyter NotebookLicense:MITStargazers:9762Issues:84Issues:247

arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Language:PythonLicense:Apache-2.0Stargazers:5085Issues:31Issues:52

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonLicense:Apache-2.0Stargazers:2141Issues:26Issues:54

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonLicense:Apache-2.0Stargazers:1914Issues:19Issues:77

awesome-twitter-data

A list of Twitter datasets and related resources.

attention_sinks

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Language:PythonLicense:Apache-2.0Stargazers:651Issues:12Issues:29

awesome-pretrained-models-for-information-retrieval

A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).

GraphGPT

[SIGIR'2024] "GraphGPT: Graph Instruction Tuning for Large Language Models"

Language:PythonLicense:Apache-2.0Stargazers:493Issues:4Issues:74
Language:PythonLicense:Apache-2.0Stargazers:375Issues:11Issues:6

QuIP

Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees"

GenRead

Code and Checkpoints for "Generate rather than Retrieve: Large Language Models are Strong Context Generators" in ICLR 2023.

Knowledge-Grounded-Conversation

A Knowledge Grounded Conversation (KGC) Paper Reading List Maintained by Chuan Meng.

DSI-QG

The official repository for "Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation", Shengyao Zhuang, Houxing Ren, Linjun Shou, Jian Pei, Ming Gong, Guido Zuccon and Daxin Jiang.

Language:PythonLicense:MITStargazers:105Issues:1Issues:16

RAGElo

RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker

Language:PythonLicense:Apache-2.0Stargazers:93Issues:7Issues:11

Twitter-Follower-Count

Display the number of followers of Twitter users

Language:JavaScriptLicense:GPL-3.0Stargazers:62Issues:2Issues:4

DukeNet

Code for SIGIR-2020 full paper: DukeNet: A Dual Knowledge Interaction Network for Knowledge-Grounded Conversation

hagrid

A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution

RefNet

Code for AAAI-2020 oral paper: RefNet: A Reference-aware Network for Background Based Conversation

reddit_collector

Reddit Collector and Text Processor

Language:PythonStargazers:20Issues:2Issues:0
Language:PythonLicense:MITStargazers:20Issues:1Issues:1

MANtIS

MANtIS - a multi-domain information seeking dialogues dataset

ranger

Ranger helps you see the forest among the trees - Ranger is an effect-size meta analysis library creating beautiful forest plots!

Language:PythonLicense:Apache-2.0Stargazers:11Issues:1Issues:0

LLM-Misinfo-QA

This repository contains data and code used for On the Risk of Misinformation Pollution with Large Language Models (EMNLP 2023 Findings).

Wikipedia_TF_IDF_Dataset

Pre-computed IDF stats over all EN Wiki articles

License:MITStargazers:9Issues:13Issues:0

transformer-vs-bm25

ECIR'22 - How Different are Pre-trained Transformers for Text Ranking? D.Rau et al.

SIP

Code for the CIKM 2023 long paper: System Initiative Prediction for Multi-turn Conversational Information Seeking

Language:PythonStargazers:2Issues:2Issues:0

bem_score_pytorch

Answer Equivalence BEM score example in PyTorch using Huggingface Tokenizer

Language:PythonStargazers:1Issues:1Issues:0