Kexin Wang's repositories
easy-elasticsearch
Using business-level retrieval system (BM25) with Python in just a few lines.
crash-ipdb
Debug Python crashes conveniently: Whenever a Python code crashes, the ipdb debugger will be triggered.
benchmarking-ann
Benchmarking Approximate Nearest Neighbor (ANN) algorithms for dense text retrieval.
dist_tuto.pth
Official code for "Writing Distributed Applications with PyTorch", PyTorch Tutorial
AdaSent
This repository contains the code for the EMNLP'23 paper "AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classification"
beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
CQADupStack
Python3 and docker for preprocessing CQADupStack
DPR
This repo aims at reproducing DPR (single-nq) retrieval evaluation with just a few commands.
long-coref
Coreference resolution (e2e-coref) for long documents
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.