The ContentMine (ContentMine)

The ContentMine

ContentMine

Geek Repo

The ContentMine is extracting 100 million facts from the academic literature

Location:UK

Home Page:http://contentmine.org

Github PK Tool:Github PK Tool

The ContentMine's repositories

norma

Convert XML/SVG/PDF into normalised, sectioned, scholarly HTML

Language:HTMLLicense:Apache-2.0Stargazers:36Issues:0Issues:0

getpapers

Get metadata, fulltexts or fulltext URLs of papers matching a search query

Language:JavaScriptLicense:MITStargazers:197Issues:0Issues:0

contentmine-gui

GUI for executing ContentMine commands - browser SPA for running locally on user's machine.

Language:JavaScriptStargazers:1Issues:0Issues:0

CMForestPlots

Things for managing the ContentMine forest plot functionality in normal

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

sciencesource-wikibase-docker

🐳 Docker images and compose file for Wikibase and the query service

Language:ShellStargazers:2Issues:0Issues:0

vms

ContentMine virtual machines

License:CC0-1.0Stargazers:3Issues:0Issues:0

cephis

Document processing including support libraries and PDFBox2

Stargazers:1Issues:0Issues:0

stataforestplots

documents and tests relating to ForestPlots in Stata format

License:Apache-2.0Stargazers:0Issues:0Issues:0

junk

analysis of documents containing forest plots in Stata format

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:GoLicense:Apache-2.0Stargazers:1Issues:0Issues:0
Language:GoLicense:Apache-2.0Stargazers:5Issues:0Issues:0

ScienceSourceIngest

Tool for importing openXML format papers into ScienceSource

Language:GoLicense:Apache-2.0Stargazers:0Issues:0Issues:0

wikibase

Simple golang library for interfacing with wikibase.

Language:GoLicense:Apache-2.0Stargazers:2Issues:0Issues:0

dictionaries

Dictionaries for use with `ami` , including some management software

Language:HTMLLicense:Apache-2.0Stargazers:1Issues:0Issues:0

go-europmc

Simple Go library for working with openXML papers form EuroPMC

Language:GoLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

ahocorasick

A Golang implementation of the Aho-Corasick string matching algorithm

Language:GoLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

normami

A tool to convert a variety of inputs into normalized, tagged, XHTML (with embedded/linked SVG and PNG where appropriate).

Stargazers:0Issues:0Issues:0

journal-scrapers

Journal scraper definitions for the ContentMine framework

Language:RubyStargazers:66Issues:0Issues:0

CMServices

Web services layer for ContentMine text and data mining tools and utilities

Language:JavaScriptLicense:Apache-2.0Stargazers:0Issues:0Issues:0

canary

Canary is a UI to the contentmine tools getpapers, quickscrape, norma, and ami.

Language:HTMLLicense:MITStargazers:5Issues:0Issues:0

canary-perch

ES Academic paper fact extraction - backend for canary

Language:JavaScriptLicense:Apache-2.0Stargazers:4Issues:0Issues:0

imageanalysis

ContentMine Fork of the WWMM imageanalysis Package

Language:HTMLStargazers:1Issues:0Issues:0

pdf2svg

ContentMine Fork of the WWMM pdf2svg Package

Language:JavaStargazers:1Issues:0Issues:0

svg2xml

ContentMine Fork of the WWMM svg2xml Package

Language:HTMLLicense:Apache-2.0Stargazers:0Issues:0Issues:0

svghtml

Combined SVG and HTML repos and building functionality

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

cproject

ArgProcessor and files for basic CMDirectories. Often subclassed. Needs to be separate from euclid and norma

Language:HTMLLicense:Apache-2.0Stargazers:0Issues:0Issues:0

euclid

ContentMine Fork of the WWMM Euclid Package

Language:JavaStargazers:0Issues:0Issues:0

cm-pom

Parent POM for ContentMine Java/MVN stack

Language:ShellLicense:Apache-2.0Stargazers:0Issues:0Issues:0

cm-uclii

Data and progress tracking for table extraction and semantically guided content enhancement

Language:HTMLLicense:Apache-2.0Stargazers:1Issues:0Issues:0