JM's repositories
qlib
Qlib is an AI-oriented quantitative investment platform that aims to use AI technology to empower quant research, from exploring ideas to deploying them in production. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate the R&D process.
swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by the OpenAI Solution team.
visual-chatgpt
Official repo for the paper: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models
Mengzi
Mengzi Pretrained Models
stable-diffusion
A latent text-to-image diffusion model
pyserini
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
tevatron
Tevatron - A flexible toolkit for dense retrieval research and development.
ALBEF
Code for ALBEF: a new vision-language pre-training method
GLM
GLM (General Language Model)
SPTAG
A distributed approximate nearest neighbor (ANN) search library that provides high-quality vector index building, search, and distributed online serving toolkits for large-scale vector search scenarios.
AutoPhrase
AutoPhrase: Automated Phrase Mining from Massive Text Corpora
ANCE
A novel embedding training algorithm that leverages ANN search and achieves SOTA retrieval on the TREC DL 2019 and OpenQA benchmarks
DeepCTR-Torch
[PyTorch] Easy-to-use, modular, and extensible package of deep-learning-based CTR models.
tas-balanced-dense-retrieval
SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling
tapas
End-to-end neural table-text understanding models.
TaBERT
This repository contains source code for the TaBERT model, a pre-trained language model for learning joint representations of natural language utterances and (semi-)structured tables for semantic parsing. TaBERT is pre-trained on a massive corpus of 26M Web tables and their associated natural language context, and can be used as a drop-in replacement for a semantic parser's original encoder to compute representations for utterances and table schemas (columns).
rat-sql
A relation-aware semantic parsing model from English to SQL
ESIM
Implementation of the ESIM model for natural language inference with PyTorch
unilm
UniLM - Unified Language Model Pre-training
IRNet
An algorithm for cross-domain NL2SQL
tranX
A general-purpose neural semantic parser for mapping natural language queries into machine executable code
NeuroNLP2
Deep neural models for core NLP tasks (PyTorch version)
tianchi_nl2sql
Third-place solution and code from the finals of Zhuiyi Technology's first Chinese NL2SQL Challenge
stog
AMR Parsing as Sequence-to-Graph Transduction
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
WikiSQL
A large annotated semantic parsing corpus for developing natural language interfaces.
albert_zh
A Lite BERT for Self-Supervised Learning of Language Representations; large-scale Chinese pre-trained ALBERT models