JM's repositories
ALBEF
Code for ALBEF: a new vision-language pre-training method
albert_zh
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型
ANCE
A novel embedding training algorithm leveraging ANN search and achieved SOTA retrieval on Trec DL 2019 and OpenQA benchmarks
AutoPhrase
AutoPhrase: Automated Phrase Mining from Massive Text Corpora
DeepCTR-Torch
【PyTorch】Easy-to-use,Modular and Extendible package of deep-learning based CTR models.
ESIM
Implementation of the ESIM model for natural language inference with PyTorch
GLM
GLM (General Language Model)
IRNet
An algorithm for cross-domain NL2SQL
Mengzi
Mengzi Pretrained Models
NeuroNLP2
Deep neural models for core NLP tasks (Pytorch version)
pyserini
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
rat-sql
A relation-aware semantic parsing model from English to SQL
roberta_zh
RoBERTa中文预训练模型: RoBERTa for Chinese
SPTAG
A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search scenario.
stable-diffusion
A latent text-to-image diffusion model
stog
AMR Parsing as Sequence-to-Graph Transduction
TaBERT
This repository contains source code for the TaBERT model, a pre-trained language model for learning joint representations of natural language utterances and (semi-)structured tables for semantic parsing. TaBERT is pre-trained on a massive corpus of 26M Web tables and their associated natural language context, and could be used as a drop-in replacement of a semantic parsers original encoder to compute representations for utterances and table schemas (columns).
tapas
End-to-end neural table-text understanding models.
tas-balanced-dense-retrieval
SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling
tevatron
Tevatron - A flexible toolkit for dense retrieval research and development.
tianchi_nl2sql
追一科技首届中文NL2SQL挑战赛决赛第3名方案+代码
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
tranX
A general-purpose neural semantic parser for mapping natural language queries into machine executable code
unilm
UniLM - Unified Language Model Pre-training
visual-chatgpt
Official repo for the paper: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models
WikiSQL
A large annotated semantic parsing corpus for developing natural language interfaces.