SiSun's repositories
PaperReading
Some interesting papers and ideas
ACLPUB
The official tool for creating proceedings for conferences of the Association for Computational Linguistics (ACL).
awesome-phd-advice
Collection of advice for prospective and current PhD students
beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
convertvec
Convert word2vec vectors between binary and plain text format
EntityQuestions
EMNLP'2021: Simple Entity-centric Questions Challenge Dense Retrievers
FewRel
A Large-Scale Few-Shot Relation Extraction Dataset
MSMARCO
Utilities, Baselines, Statistics and Descriptions Related to the MSMARCO DATASET
MSMARCO-Passage-Ranking
MS MARCO (Microsoft Machine Reading Comprehension) is a large-scale dataset focused on machine reading comprehension, question answering, and passage ranking. A variant of this task will be part of TREC and AFIRM 2019. For updates about TREC 2019, please follow this repository. Passage reranking task: given a query q and the 1000 most relevant passages P = p1, p2, p3, ..., p1000, as retrieved by BM25, a successful system is expected to rerank the most relevant passage as high as possible. For this task, not every set of 1000 retrieved passages contains a human-labeled relevant passage. Evaluation will be done using MRR (mean reciprocal rank).
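As a rough illustration of the MRR evaluation mentioned above, here is a minimal sketch, not the repository's official evaluation script. The function name, the dictionary-based input layout, and the optional `cutoff` parameter are all assumptions made for this example. Queries whose candidate list contains no labeled relevant passage contribute a reciprocal rank of 0.

```python
from typing import Dict, List, Optional, Set


def mean_reciprocal_rank(
    rankings: Dict[str, List[str]],
    relevant: Dict[str, Set[str]],
    cutoff: Optional[int] = None,
) -> float:
    """Average of 1/rank of the first relevant passage per query.

    rankings: query id -> passage ids, best first (hypothetical layout).
    relevant: query id -> set of human-labeled relevant passage ids.
    cutoff:   if given, only the top `cutoff` passages are considered.
    """
    if not rankings:
        return 0.0
    total = 0.0
    for qid, ranked in rankings.items():
        candidates = ranked if cutoff is None else ranked[:cutoff]
        labeled = relevant.get(qid, set())
        for rank, pid in enumerate(candidates, start=1):
            if pid in labeled:
                total += 1.0 / rank  # only the first relevant hit counts
                break
    return total / len(rankings)


# Toy data: q1's relevant passage is at rank 2, q2's is at rank 1.
toy_rankings = {"q1": ["p3", "p1", "p2"], "q2": ["p5", "p4"]}
toy_relevant = {"q1": {"p1"}, "q2": {"p5"}}
print(mean_reciprocal_rank(toy_rankings, toy_relevant))
```

Passing a small `cutoff` makes the score stricter, since a relevant passage ranked below the cutoff is treated as not found.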
nmt
TensorFlow Neural Machine Translation Tutorial
PEVL
Source code for EMNLP 2022 paper “PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models”
rebiber
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
SunSiShining.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
tevatron-1
Tevatron - A flexible toolkit for dense retrieval research and development.
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.