Jason Timm (jaytimm)

jaytimm

Geek Repo

Company:University of New Mexico

Location:New Mexico

Home Page:jtimm.net/

Twitter:@DrJayTimm

Github PK Tool:Github PK Tool

Jason Timm's starred repositories

dragnet

Just the facts -- web page content extraction

Language:PythonLicense:MITStargazers:1239Issues:132Issues:50

pySBD

🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.

Language:PythonLicense:MITStargazers:776Issues:12Issues:74

practical_cheminformatics_tutorials

Practical Cheminformatics Tutorials

Language:Jupyter NotebookLicense:MITStargazers:770Issues:45Issues:10

UpSetR

An R implementation of the UpSet set visualization technique published by Lex, Gehlenborg, et al..

Language:RLicense:NOASSERTIONStargazers:749Issues:38Issues:232

medspacy

Library for clinical NLP with spaCy.

Language:Jupyter NotebookLicense:MITStargazers:507Issues:16Issues:132

wordset-dictionary

The Open Source Dictionary

Language:CLicense:NOASSERTIONStargazers:507Issues:21Issues:8

blueprints-text

Jupyter notebooks for our O'Reilly book "Blueprints for Text Analysis Using Python"

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:246Issues:11Issues:28

spacy-universal-sentence-encoder

Google USE (Universal Sentence Encoder) for spaCy

Language:PythonLicense:MITStargazers:176Issues:8Issues:28

kindred

A Python biomedical relation extraction package that uses a supervised approach (i.e. needs training data).

Language:PythonLicense:MITStargazers:154Issues:8Issues:25

spacyfishing

A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata

Language:PythonLicense:MITStargazers:150Issues:7Issues:17

gutenberg

Pipeline to generate the Standardized Project Gutenberg Corpus

Language:PythonLicense:GPL-3.0Stargazers:149Issues:10Issues:32

huggingfaceR

Hugging Face state-of-the-art models in R

Language:RLicense:NOASSERTIONStargazers:132Issues:8Issues:14

pyrfume-data

Provenance and tracking for Pyrfume data sources

Language:Jupyter NotebookLicense:MITStargazers:48Issues:5Issues:102

mesh-tree

Utility functions for traversing the Medical Subject Heading (MeSH) ontology tree

Language:JavaScriptLicense:Apache-2.0Stargazers:39Issues:8Issues:17
Language:PythonStargazers:21Issues:1Issues:0
Language:C++License:BSD-2-ClauseStargazers:20Issues:3Issues:1

incivility-sage-open

Incivility classifier used in Theocharis et al (2020, Sage Open)

Language:RLicense:GPL-3.0Stargazers:20Issues:3Issues:0

kamila

An R package for clustering mixed-type data

Language:RLicense:GPL-3.0Stargazers:16Issues:0Issues:0

mgsub

A safe, multiple, simultaneous string substitution

Language:RLicense:NOASSERTIONStargazers:14Issues:1Issues:2

mesh

User-friendly extensions to MeSH

Language:Jupyter NotebookStargazers:9Issues:2Issues:0

SpacySpanBERT

Using spaCy & SpanBERT for relation extraction from web documents.

Language:PythonLicense:NOASSERTIONStargazers:8Issues:1Issues:0

sentspace

a module to obtain diverse real-world-grounded features for sentences for large-scale benchmarking

Language:PythonStargazers:4Issues:3Issues:0

BERT-CRel-Embeddings

BMET: Improved Biomedical Word Embeddings

Language:C++Stargazers:4Issues:0Issues:0

spacy-nlp

spaCy & scispacy wrappers

Language:PythonStargazers:3Issues:1Issues:0

uspols

A collection of US political data: (1) federal election returns, (2) Twitter details for US lawmakers, and (3) some political geometries.

Language:RStargazers:3Issues:0Issues:0

wnomadds

Some additional functions for working with VoteView's wnominate package

Language:RStargazers:2Issues:1Issues:0

adjorder

Predicting adjective order using mutual information and subjectivity

Language:JavaScriptStargazers:2Issues:0Issues:0

dailypotus

A simple set of functions for scraping Wikipedia-based timelines/daily happenings for the Trump & Biden Presidencies.

Language:RStargazers:1Issues:1Issues:0

gbr

User friendly extensions for working with and searching Project Gutenberg

Language:RStargazers:1Issues:1Issues:0