yhq (yuhongqian)

yuhongqian

Geek Repo

Company:Carnegie Mellon University

Location:Pittsburgh, PA

Github PK Tool:Github PK Tool

yhq's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:130607Issues:1118Issues:15480

bert

TensorFlow code and pre-trained models for BERT

Language:PythonLicense:Apache-2.0Stargazers:37659Issues:997Issues:1142

NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Language:PythonLicense:MITStargazers:22471Issues:1266Issues:100

gpt-3

GPT-3: Language Models are Few-Shot Learners

clip-as-service

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Language:PythonLicense:NOASSERTIONStargazers:12332Issues:221Issues:606

fullstack-course4

Example code for HTML, CSS, and Javascript for Web Developers Coursera Course

nerdcommenter

Vim plugin for intensely nerdy commenting powers

Language:Vim ScriptLicense:CC0-1.0Stargazers:4960Issues:62Issues:249

BERT-BiLSTM-CRF-NER

Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning And private Server services

mediawiki

🌻 The collaborative editing software that runs Wikipedia. Mirror from https://gerrit.wikimedia.org/g/mediawiki/core. See https://mediawiki.org/wiki/Developer_access for contributing.

Language:PHPLicense:NOASSERTIONStargazers:4099Issues:186Issues:0

wikiextractor

A tool for extracting plain text from Wikipedia dumps

Language:PythonLicense:AGPL-3.0Stargazers:3711Issues:74Issues:242

XLM

PyTorch original implementation of Cross-lingual Language Model Pretraining.

Language:PythonLicense:NOASSERTIONStargazers:2873Issues:57Issues:334

Wikipedia

A Pythonic wrapper for the Wikipedia API

Language:PythonLicense:MITStargazers:2859Issues:82Issues:235

gluon-nlp

NLP made easy

Language:PythonLicense:Apache-2.0Stargazers:2552Issues:95Issues:534

Kashgari

Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

Language:PythonLicense:Apache-2.0Stargazers:2385Issues:64Issues:377

rdflib

RDFLib is a Python library for working with RDF, a simple yet powerful language for representing information.

Language:PythonLicense:BSD-3-ClauseStargazers:2130Issues:84Issues:1219

sqlite

Unofficial git mirror of SQLite sources (see link for build instructions)

Language:CLicense:NOASSERTIONStargazers:1966Issues:93Issues:0

noise2noise

Noise2Noise: Learning Image Restoration without Clean Data - Official TensorFlow implementation of the ICML 2018 paper

Language:PythonLicense:NOASSERTIONStargazers:1395Issues:44Issues:0

BERT-NER

Use Google's BERT for named entity recognition (CoNLL-2003 as the dataset).

Language:PythonLicense:MITStargazers:1233Issues:36Issues:90

tagger

Named Entity Recognition Tool

Language:PythonLicense:Apache-2.0Stargazers:1157Issues:63Issues:84

pytorch_memlab

Profiling and inspecting memory in pytorch

Language:PythonLicense:MITStargazers:1009Issues:13Issues:35

anserini

Anserini is a Lucene toolkit for reproducible information retrieval research

Language:JavaLicense:Apache-2.0Stargazers:1008Issues:40Issues:603
Language:PythonLicense:BSD-3-ClauseStargazers:474Issues:14Issues:48

mwclient

Python client library to interface with the MediaWiki API

Language:PythonLicense:MITStargazers:314Issues:18Issues:174

DeepCT

DeepCT and HDCT uses BERT to generate novel, context-aware bag-of-words term weights for documents and queries.

Language:PythonLicense:BSD-3-ClauseStargazers:312Issues:8Issues:19

MSMARCO

Utilities, Baselines, Statistics and Descriptions Related to the MSMARCO DATASET

Language:PythonLicense:MITStargazers:188Issues:15Issues:28

neleval

Entity disambiguation evaluation and error analysis tool

Language:PythonLicense:Apache-2.0Stargazers:116Issues:15Issues:25

Extending-Google-BERT-as-Question-and-Answering-model-and-Chatbot

BERT Question and Answer system meant and works well for only limited number of words summary like 1 to 2 paragraphs only. It can’t be able to answer well from understanding more than 10 pages of data. We can extend the BERT question and answer model to work as chatbot on large text. To accomplish the understanding of more than 10 pages of data, here we have used a specific approach of picking the data.

Language:PythonLicense:Apache-2.0Stargazers:110Issues:5Issues:9

wikitools

Python package for working with MediaWiki wikis

yago4

Yago 4 - the next version of Yago

Language:RustLicense:GPL-3.0Stargazers:90Issues:6Issues:18

os-turboparser-entitytagger

Open source Priberam's TurboParser-based Entity Tagging module

Language:PythonStargazers:2Issues:4Issues:0