The ContentMine (ContentMine)

The ContentMine

ContentMine

Geek Repo

The ContentMine is extracting 100 million facts from the academic literature

Location:UK

Home Page:http://contentmine.org

Github PK Tool:Github PK Tool

The ContentMine's repositories

getpapers

Get metadata, fulltexts or fulltext URLs of papers matching a search query

Language:JavaScriptLicense:MITStargazers:197Issues:16Issues:156

journal-scrapers

Journal scraper definitions for the ContentMine framework

norma

Convert XML/SVG/PDF into normalised, sectioned, scholarly HTML

Language:HTMLLicense:Apache-2.0Stargazers:36Issues:12Issues:50

canary

Canary is a UI to the contentmine tools getpapers, quickscrape, norma, and ami.

Language:HTMLLicense:MITStargazers:5Issues:11Issues:29
Language:GoLicense:Apache-2.0Stargazers:5Issues:3Issues:0

canary-perch

ES Academic paper fact extraction - backend for canary

Language:JavaScriptLicense:Apache-2.0Stargazers:4Issues:2Issues:3

vms

ContentMine virtual machines

sciencesource-wikibase-docker

🐳 Docker images and compose file for Wikibase and the query service

Language:ShellStargazers:2Issues:7Issues:0

wikibase

Simple golang library for interfacing with wikibase.

Language:GoLicense:Apache-2.0Stargazers:2Issues:3Issues:1

cephis

Document processing including support libraries and PDFBox2

cm-uclii

Data and progress tracking for table extraction and semantically guided content enhancement

Language:HTMLLicense:Apache-2.0Stargazers:1Issues:6Issues:1

CMServices

Web services layer for ContentMine text and data mining tools and utilities

Language:JavaScriptLicense:Apache-2.0Stargazers:1Issues:5Issues:0

contentmine-gui

GUI for executing ContentMine commands - browser SPA for running locally on user's machine.

Language:JavaScriptStargazers:1Issues:6Issues:3

dictionaries

Dictionaries for use with `ami` , including some management software

Language:HTMLLicense:Apache-2.0Stargazers:1Issues:6Issues:6

imageanalysis

ContentMine Fork of the WWMM imageanalysis Package

Language:HTMLStargazers:1Issues:5Issues:0

pdf2svg

ContentMine Fork of the WWMM pdf2svg Package

Language:GoLicense:Apache-2.0Stargazers:1Issues:2Issues:0

ahocorasick

A Golang implementation of the Aho-Corasick string matching algorithm

Language:GoLicense:BSD-3-ClauseStargazers:0Issues:2Issues:0

cm-pom

Parent POM for ContentMine Java/MVN stack

Language:ShellLicense:Apache-2.0Stargazers:0Issues:4Issues:0

CMForestPlots

Things for managing the ContentMine forest plot functionality in normal

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

cproject

ArgProcessor and files for basic CMDirectories. Often subclassed. Needs to be separate from euclid and norma

Language:HTMLLicense:Apache-2.0Stargazers:0Issues:4Issues:7

euclid

ContentMine Fork of the WWMM Euclid Package

Language:JavaStargazers:0Issues:4Issues:5

go-europmc

Simple Go library for working with openXML papers form EuroPMC

Language:GoLicense:Apache-2.0Stargazers:0Issues:7Issues:0

junk

analysis of documents containing forest plots in Stata format

License:Apache-2.0Stargazers:0Issues:2Issues:0

normami

A tool to convert a variety of inputs into normalized, tagged, XHTML (with embedded/linked SVG and PNG where appropriate).

Stargazers:0Issues:7Issues:14

ScienceSourceIngest

Tool for importing openXML format papers into ScienceSource

Language:GoLicense:Apache-2.0Stargazers:0Issues:2Issues:11

stataforestplots

documents and tests relating to ForestPlots in Stata format

License:Apache-2.0Stargazers:0Issues:3Issues:2

svg2xml

ContentMine Fork of the WWMM svg2xml Package

Language:HTMLLicense:Apache-2.0Stargazers:0Issues:5Issues:3

svghtml

Combined SVG and HTML repos and building functionality

Language:JavaLicense:Apache-2.0Stargazers:0Issues:6Issues:0
Stargazers:0Issues:6Issues:0