22842219's repositories
Awesome-Dataset-Distillation
Awesome Dataset Distillation Papers
data-models
A joint collaboration program to support the adoption of a reference architecture and compatible common data models underpinning a digital market of interoperable and replicable smart solutions.
dgl
Python package built to ease deep learning on graphs, on top of existing DL frameworks.
elasticsearch-py
Official Elasticsearch client library for Python
emoji-cheat-sheet
A Markdown version of the emoji cheat sheet
entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
ERNIE
Source code and dataset for the ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"
gakg
GAKG is a multimodal Geoscience Academic Knowledge Graph framework built by fusing papers' illustrations, text, and bibliometric data.
gensim
Topic Modelling for Humans
gradio
Build and share delightful machine learning apps, all in Python.
llm-fmc
Experiments on using ChatGPT for failure mode classification
luke
LUKE -- Language Understanding with Knowledge-based Embeddings
nmt
TensorFlow Neural Machine Translation Tutorial
notebooks
Notebooks using the Hugging Face libraries 🤗
rse-course
Materials for Turing's Research Software Engineering course
sacrebleu
Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons
Table-Pretraining
ICLR 2022 Paper, SOTA Table Pre-training Model, TAPEX: Table Pre-training via Learning a Neural SQL Executor
TabularSemanticParsing
Translating natural language questions to a structured query language
TAT-QA
TAT-QA (Tabular And Textual dataset for Question Answering) contains 16,552 questions associated with 2,757 hybrid contexts from real-world financial reports.
text-to-text-transfer-transformer
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
trainbenchmark
The Train Benchmark framework for evaluating incremental model validation performance
UnifiedSKG
[EMNLP 2022] A Unified Framework and Analysis for Structured Knowledge Grounding with Text-to-Text Language Models
unify-parameter-efficient-tuning
Implementation of the paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
wikibase-cli
Read and edit a Wikibase instance from the command line
x-transformers
A simple but complete full-attention transformer with a set of promising experimental features from various papers
xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on a single machine, Hadoop, Spark, Dask, Flink and DataFlow