There are 13 repositories under extraction topic.
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
extract internal monitoring data from application logs for collection in a timeseries database
The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).
Provides functions to read and write from/to an object or array using a simple string notation
Extract files from any kind of container formats
node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
🦜⛏️ Did you say you like data?
A C++ static library offering a clean and simple interface to the 7-zip shared libraries.
Stanford Open Information Extraction made simple!
北京航空航天大学大数据高精尖中心自然语言处理研究团队对信息抽取领域的调研。包括实体识别,关系抽取,属性抽取等子任务,每类子任务分别对学术界和工业界进行调研。
File Injector is a script that allows you to store any file in an image using steganography
PHP URI Template (RFC 6570) supports both URI expansion & extraction
DataTool is a program that lets you extract models, maps, and files from Overwatch.
Extracts OTP tokens from rooted Android devices
SEO Macroscope is a website scanning tool, to check your website for broken links; including some technical SEO functionality, site scraping, Excel reporting, and more.
An actual, updated, surviv.io cheat. Works great and we reply fast.
Java library to extract links (URLs, email addresses) from plain text; fast, small and smart
A simple archiving and compression library for Java
Open source Emoticons and Emoji detection library: emot
The main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de una o varias Url
Extract tables from PDF files (port of tabula-java)
Full-Text RSS can transform partial feeds to deliver the full content stripped of clutter and ads
Detect hidden files and text in images
A ground segmentation algorithm for 3D point clouds based on the work described in “Fast segmentation of 3D point clouds: a paradigm on LIDAR data for Autonomous Vehicle Applications”, D. Zermas, I. Izzat and N. Papanikolopoulos, 2017. Distinguish between road and non-road points. Road surface extraction. Plane fit ground filter