Arian Askari's repositories
ChatGPT-RetrievalQA
A dataset for training and evaluating question answering retrieval models on ChatGPT responses, with the option of training and evaluating on real human responses.
AnswerRetrieval-Legal
The official repository for Answer Retrieval in Legal Community Question Answering - accepted at ECIR 2024
anonymous-comment
On Anonymous Commenting: A Greedy Approach to Balance Utilization and Anonymity for Instagram Users - Accepted at SIGIR 2019
arian-askari.github.io
My online CV
aug-pe
Differentially Private Synthetic Data via Foundation Model APIs 2: Text
bem_score_pytorch
Answer Equivalence BEM score example in PyTorch using Huggingface Tokenizer
bitsandbytes
8-bit CUDA functions for PyTorch
CHESS
Contextual Harnessing for Efficient SQL Synthesis
Claude_API_Contest
Claude API Test Project
cohere-terrarium
A simple Python sandbox for helpful LLM data agents
ColBERT
ColBERT25
datablations
Scaling Data-Constrained Language Models
dspy
DSPy: The framework for programming—not prompting—foundation models
llm-notebooks
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
lsr-multimodal
ECIR 2024: Sparse lexical representation for image-text retrieval
multitask_text_and_chemistry_t5
Code for "Unifying Molecular and Textual Representations via Multi-task Language Modelling" @ ICML 2023
RIPOR
The official repo for paper: Scalable and Effective Generative Information Retrieval
SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
unsloth
5X faster, 60% less memory QLoRA finetuning