jmw999

jmw999

Geek Repo

Github PK Tool:Github PK Tool

jmw999's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:130136Issues:1119Issues:15379

lime

Lime: Explaining the predictions of any machine learning classifier

Language:JavaScriptLicense:BSD-2-ClauseStargazers:11444Issues:263Issues:633

GloVe

Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings

Language:CLicense:Apache-2.0Stargazers:6800Issues:228Issues:162

xlnet

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Language:PythonLicense:Apache-2.0Stargazers:6166Issues:170Issues:263

statrethinking_winter2019

Statistical Rethinking course at MPI-EVA from Dec 2018 through Feb 2019

awesome-google-colab

Google Colaboratory Notebooks and Repositories (by @firmai)

Language:Jupyter NotebookStargazers:1379Issues:54Issues:2

projects

🪐 End-to-end NLP workflows from prototype to production

Language:PythonLicense:MITStargazers:1280Issues:31Issues:0

nlp-in-practice

Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.

Language:Jupyter NotebookStargazers:1139Issues:51Issues:8

concrete_NLP_tutorial

An NLP workshop about concrete solutions to real problems

Language:Jupyter NotebookStargazers:1080Issues:48Issues:10

datacamp-python-data-science-track

All the slides, accompanying code and exercises all stored in this repo. 🎈

Language:PythonLicense:MITStargazers:778Issues:20Issues:5

mat2vec

Supplementary Materials for Tshitoyan et al. "Unsupervised word embeddings capture latent knowledge from materials science literature", Nature (2019).

Language:PythonLicense:MITStargazers:615Issues:40Issues:24

Practical_NLP_in_PyTorch

A repository containing tutorials for practical NLP using PyTorch

Language:Jupyter NotebookStargazers:531Issues:31Issues:16

stat479-deep-learning-ss19

Course material for STAT 479: Deep Learning (SS 2019) at University Wisconsin-Madison

Language:Jupyter NotebookStargazers:510Issues:34Issues:0

mexican-government-report

Text Mining on the 2019 Mexican Government Report, covering from extracting text from a PDF file to plotting the results.

Language:PythonLicense:MITStargazers:482Issues:26Issues:0

prodigy-recipes

🍳 Recipes for the Prodigy, our fully scriptable annotation tool

Language:Jupyter NotebookStargazers:476Issues:26Issues:0

nbsvm

Naive Bayes SVM for Sentiment Analysis

CX_DB8

a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sentence Encoder, Flair)

Language:PythonLicense:GPL-3.0Stargazers:226Issues:11Issues:7

recogito2

Semantic Annotation Without the Pointy Brackets

Language:ScalaLicense:Apache-2.0Stargazers:150Issues:23Issues:671

CogStack-SemEHR

Surfacing Semantic Data from Clinical Notes in Electronic Health Records for Tailored Care, Trial Recruitment and Clinical Research

Language:JavaScriptLicense:Apache-2.0Stargazers:87Issues:25Issues:3

HES_pipeline

R pipeline to clean and process Hospital Episode Statistics (HES) data

Language:RLicense:MITStargazers:35Issues:7Issues:3

mtas

Multi Tier Annotation Search

Language:JavaLicense:Apache-2.0Stargazers:26Issues:9Issues:13

redcoat

A lightweight web-based annotation tool for labelling entity recognition data.

Language:JavaScriptLicense:Apache-2.0Stargazers:23Issues:3Issues:6

external-recommender-spacy

External recommender example for the INCEpTION annotation platform using spacy

Language:PythonLicense:Apache-2.0Stargazers:21Issues:3Issues:6

Towards-reliable-BioNER

This repository contains the corpora and supplementary data, along with instructions for recreating the experiments, for our paper: "Towards reliable named entity recognition in the biomedical domain".

permanent-colandr-back

DataCorps - Colandr Tool resources

Language:Jupyter NotebookStargazers:14Issues:7Issues:35
Language:PythonLicense:MITStargazers:13Issues:4Issues:0

NLP

NLP Projects & Learning

Language:Jupyter NotebookStargazers:9Issues:2Issues:0

qstep-socialmedia

Files for the social media masterclass delivered March 2019

Language:RLicense:GPL-3.0Stargazers:6Issues:4Issues:0

qstep-sql

Repository containing materials for the QSTEP SQL Masterclass delivered by James on the 1st of December 2021.

Language:PLpgSQLLicense:GPL-3.0Stargazers:6Issues:2Issues:0
Language:Jupyter NotebookStargazers:2Issues:0Issues:0