yuhongqian

yhq's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonApache-2.0130607 1118 15480

bert

TensorFlow code and pre-trained models for BERT

Language:PythonApache-2.037659 997 1142

NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Language:PythonMIT22471 1266 100

gpt-3

GPT-3: Language Models are Few-Shot Learners

15654 896 3

clip-as-service

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Language:PythonNOASSERTION12332 221 606

fullstack-course4

Example code for HTML, CSS, and Javascript for Web Developers Coursera Course

Language:JavaScript10695 1321 131

nerdcommenter

Vim plugin for intensely nerdy commenting powers

Language:Vim ScriptCC0-1.04960 62 249

BERT-BiLSTM-CRF-NER

Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning And private Server services

Language:Python4665 93 386

mediawiki

🌻 The collaborative editing software that runs Wikipedia. Mirror from https://gerrit.wikimedia.org/g/mediawiki/core. See https://mediawiki.org/wiki/Developer_access for contributing.

Language:PHPNOASSERTION4099 1860

wikiextractor

A tool for extracting plain text from Wikipedia dumps

Language:PythonAGPL-3.03711 74 242

XLM

PyTorch original implementation of Cross-lingual Language Model Pretraining.

Language:PythonNOASSERTION2873 57 334

Wikipedia

A Pythonic wrapper for the Wikipedia API

Language:PythonMIT2859 82 235

gluon-nlp

NLP made easy

Language:PythonApache-2.02552 95 534

Kashgari

Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

Language:PythonApache-2.02385 64 377

rdflib

RDFLib is a Python library for working with RDF, a simple yet powerful language for representing information.

Language:PythonBSD-3-Clause2130 84 1219

sqlite

Unofficial git mirror of SQLite sources (see link for build instructions)

Language:CNOASSERTION1966 930

noise2noise

Noise2Noise: Learning Image Restoration without Clean Data - Official TensorFlow implementation of the ICML 2018 paper

Language:PythonNOASSERTION1395 440

BERT-NER

Use Google's BERT for named entity recognition （CoNLL-2003 as the dataset）.

Language:PythonMIT1233 36 90

tagger

Named Entity Recognition Tool

Language:PythonApache-2.01157 63 84

pytorch_memlab

Profiling and inspecting memory in pytorch

Language:PythonMIT1009 13 35

anserini

Anserini is a Lucene toolkit for reproducible information retrieval research

Language:JavaApache-2.01008 40 603

mwclient

Python client library to interface with the MediaWiki API

Language:PythonMIT314 18 174

DeepCT

DeepCT and HDCT uses BERT to generate novel, context-aware bag-of-words term weights for documents and queries.

Language:PythonBSD-3-Clause312 8 19

MSMARCO

Utilities, Baselines, Statistics and Descriptions Related to the MSMARCO DATASET

Language:PythonMIT188 15 28

neleval

Entity disambiguation evaluation and error analysis tool

Language:PythonApache-2.0116 15 25

Extending-Google-BERT-as-Question-and-Answering-model-and-Chatbot

BERT Question and Answer system meant and works well for only limited number of words summary like 1 to 2 paragraphs only. It can’t be able to answer well from understanding more than 10 pages of data. We can extend the BERT question and answer model to work as chatbot on large text. To accomplish the understanding of more than 10 pages of data, here we have used a specific approach of picking the data.

Language:PythonApache-2.0110 5 9