TokenMill (tokenmill)

TokenMill

tokenmill

Geek Repo

We can help you with your natural language generation and processing projects

Location:Vilnius, Lithuania

Home Page:https://www.tokenmill.ai/

Github PK Tool:Github PK Tool

TokenMill's repositories

beagle

Beagle helps you identify keywords, phrases, regexes, and complex search queries of interest in streams of text documents.

Language:ClojureLicense:NOASSERTIONStargazers:51Issues:4Issues:42

clojure-graalvm-aws-lambda-template

Leiningen template for AWS Lambda custom runtime with GraalVM native image compiled Clojure projects.

Language:ClojureLicense:NOASSERTIONStargazers:43Issues:5Issues:8

timewords

Multilingual library to easily parse date strings to java.util.Date objects.

Language:ClojureLicense:NOASSERTIONStargazers:29Issues:5Issues:12

crawling-framework

Easily crawl news portals or blog sites using Storm Crawler.

Language:JavaLicense:NOASSERTIONStargazers:21Issues:6Issues:33

ltlangpack

Tools for Lithuanian language processing

Language:ShellStargazers:15Issues:12Issues:0

fast-url-access-checker

Easily run HTTP GET requests against a list of URLs to check their HTTP status.

Language:ClojureLicense:NOASSERTIONStargazers:12Issues:3Issues:5

docx-utils

Easily work with .docx files from Clojure (a wrapper on Apache POI library).

Language:ClojureLicense:MITStargazers:11Issues:3Issues:10

dictionary-annotator

Fast and configurable UIMA dictionary annotator.

Language:JavaLicense:NOASSERTIONStargazers:7Issues:5Issues:3

snowball

Snowball version of the Porter stemmer for the Lithuanian language.

License:NOASSERTIONStargazers:7Issues:3Issues:0

common-crawl-utils

Various Common Crawl utilities in Clojure.

Language:ClojureLicense:NOASSERTIONStargazers:6Issues:3Issues:3

docker-images

Docker configurations, images, and examples of Dockerfiles for various TokenMill products and projects.Official source for Docker configurations, images, and examples of Dockerfiles for TokenMill products and projects

Language:DockerfileLicense:NOASSERTIONStargazers:5Issues:3Issues:4

crawling-framework-example

Demonstration on how to use the Crawling Framework to setup a simple science news crawler and store results in ElasticSearch. Use this configuration to set up your own crawler.

Language:JavaLicense:NOASSERTIONStargazers:3Issues:4Issues:0

beagle-performance-benchmarks

Performance benchmarks for the Beagle library, and comparisons with other stored-query solutions.

Language:ClojureLicense:NOASSERTIONStargazers:1Issues:4Issues:1

es-utils

Clojure helper functions for Elasticsearch.

Language:ClojureLicense:NOASSERTIONStargazers:1Issues:3Issues:4

metadata-detector

Library to detect metadata from html files.

Language:HTMLLicense:Apache-2.0Stargazers:1Issues:5Issues:1

openccg

OpenCCG library for parsing and realization with CCG

Language:JavaLicense:NOASSERTIONStargazers:1Issues:5Issues:0

doccano

Open source text annotation tool for machine learning practitioner.

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

faraday

DynamoDB client for Clojure

Language:ClojureLicense:EPL-1.0Stargazers:0Issues:2Issues:0

gf-wordnet

A WordNet in GF

Language:Grammatical FrameworkStargazers:0Issues:0Issues:0

spaCy

đź’« Industrial-strength Natural Language Processing (NLP) with Python and Cython

Language:PythonLicense:MITStargazers:0Issues:3Issues:13

unsupervised-keyphrase-extraction

EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0