Nandan Thakur's repositories
beir-ColBERT
Evaluation of BEIR Datasets using ColBERT retrieval model
topic-modeling
This repository contains as intuitive example on topic-modeling using regular LDA, and how GuidedLDA is better than regular LDA
Imagesearch
CS 679 Project Repository: Learning Efficient Autoencoders for Image Search
personal-website
Personal Website | Nandan Thakur | Copyright © nandan-thakur.com, 2021
poison-texts
CS 886 Project on Adversarial Attacks on NLP models
compute-canada
CC Information provided to easy run slurm scripts on CC Wiki
anserini
A Lucene toolkit for replicable information retrieval research
BatteryDEV
Our Official Code Repositorty for QS-EIS-Challenge BatteryDEV 2022
beir-leaderboard
BEIR Leaderboard
citadel-repro
A reproduction of CITADEL and CITADEL+ checkpoints using dpr-scale repository
CQADupStack
A Benchmark Data Set for Community Question-Answering Research
datasets
🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools
DeepCT
DeepCT and HDCT uses BERT to generate novel, context-aware bag-of-words term weights for documents and queries.
mteb
MTEB: Massive Text Embedding Benchmark
orpo
Official repository for ORPO
peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
qra_code
Question similarity with domain adaptation.
sentence-transformers
Sentence Embeddings with BERT & XLNet
tevatron
Tevatron - A flexible toolkit for neural retrieval research and development.
thakur-nandan.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
video-insights
video insights created and using open-sourced packages