ASchopenhauer

ASchopenhauer

Geek Repo

Github PK Tool:Github PK Tool

ASchopenhauer's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:133003Issues:1119Issues:15868

tesseract

Tesseract Open Source OCR Engine (main repository)

Language:C++License:Apache-2.0Stargazers:61401Issues:1685Issues:2640

bert

TensorFlow code and pre-trained models for BERT

Language:PythonLicense:Apache-2.0Stargazers:37956Issues:998Issues:1143

CoreNLP

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

Language:JavaLicense:GPL-3.0Stargazers:9659Issues:489Issues:1123

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language:PythonLicense:Apache-2.0Stargazers:7777Issues:98Issues:1585

umap

Uniform Manifold Approximation and Projection

Language:PythonLicense:BSD-3-ClauseStargazers:7394Issues:127Issues:786

Mathematics-for-ML

🧮 A collection of resources to learn mathematics for machine learning

grobid

A machine learning software for extracting information from scholarly documents

Language:JavaLicense:Apache-2.0Stargazers:3464Issues:96Issues:868

spacy-course

👩‍🏫 Advanced NLP with spaCy: A free online course

Language:PythonLicense:MITStargazers:2306Issues:63Issues:41

SimpleHTR

Handwritten Text Recognition (HTR) system implemented with TensorFlow.

Language:PythonLicense:MITStargazers:1970Issues:51Issues:150

leptonica

Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The official github repository for Leptonica is: danbloomberg/leptonica. See leptonica.org for more documentation.

Language:CLicense:NOASSERTIONStargazers:1769Issues:79Issues:442

pdf2image

A python module that wraps the pdftoppm utility to convert PDF to PIL Image object

Language:PythonLicense:MITStargazers:1603Issues:19Issues:196

word2vec

Automatically exported from code.google.com/p/word2vec

Language:CLicense:Apache-2.0Stargazers:1517Issues:52Issues:45

JSON2YOLO

Convert JSON annotations into YOLO format.

Language:PythonLicense:AGPL-3.0Stargazers:828Issues:8Issues:54

LDAvis

R package for web-based interactive topic model visualization.

Language:JavaScriptLicense:NOASSERTIONStargazers:556Issues:32Issues:79

German-NLP

Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German

acl-anthology

Data and software for building the ACL Anthology.

Language:PythonLicense:Apache-2.0Stargazers:415Issues:20Issues:2281

odfpy

API for OpenDocument in Python

Language:PythonLicense:GPL-2.0Stargazers:311Issues:65Issues:101

grobid_client_python

Python client for GROBID Web services

Language:PythonLicense:Apache-2.0Stargazers:280Issues:6Issues:54

Stylesheets

TEI XSL Stylesheets

awesome-digital-humanities

Software for humanities scholars using quantitative or computational methods.

Language:HTMLLicense:CC0-1.0Stargazers:221Issues:38Issues:4

german-nouns

A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the data and parse compound words.

Language:PythonLicense:CC-BY-SA-4.0Stargazers:143Issues:2Issues:9

nlp-resources

Natural language processing resources for multiple languages, with an eye towards use for digital humanities.

License:GPL-3.0Stargazers:124Issues:12Issues:0

uni-dep-tb

A set of treebanks for multiple languages annotated in basic Stanford-style dependencies.

dlcl204

Digital Humanities Across Borders

Language:Jupyter NotebookStargazers:46Issues:12Issues:2

TEI-Facsimile-Plugin

A plugin that provides support for working with Digital Facsimiles in Text Encoding Initiative (TEI) vocabulary. The plugin contributes a new View in which the user can load an image and draw shapes. These shapes are then converted into TEI "zone" elements. All the existing "zone" elements from the document are also rendered over the image.

Language:JavaLicense:NOASSERTIONStargazers:25Issues:34Issues:9

nlp-german

natural language processing on german texts

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:16Issues:4Issues:0

MVA_2023_SL

Course materials for the MVA course "algorithms for speech and language processing"

License:GPL-3.0Stargazers:10Issues:5Issues:0

MultilayerParis

Paris multilayer transport network

Language:Jupyter NotebookStargazers:8Issues:5Issues:0

dimlex

A Lexicon of German discourse markers

Language:XSLTLicense:NOASSERTIONStargazers:6Issues:11Issues:1