andrealenzi11's repositories
py-poppleract
Python library and Web service based on Poppler Pdftotext utility and Tesseract OCR for extracting text from PDF documents
gen-text-compr-aggl-clust-sum
A library for topic modeling based on the algorithm: Generative Text Compression with Agglomerative Clustering Summarization (GTCACS)
knowledge-graph-pruning-2022
Experiments on Knowledge Graph Embeddings models for link prediction and deletion
gctm
Generative Cooperative Topic Modeling (GCTM)
Language:PythonMIT000
py-web-miner
Extensible Web Miner to extract information from web pages. It is based on HTTP Requests library, Beautiful Soup parser, and Selenium WebDriver.
Language:PythonApache-2.0000