There are 6 repositories under url-extractor topic.
Extract and decompose (fuzzy) URLs (including emails, which are conceptually a part of URLs) in texts with Area-Pattern-based modularity
A fast tool to fetch URLs from HTML attributes by crawl-in.
A Minimal Yet Powerful Crawler for Extracting all The Internal/External/Fuzz-able Links from a website
Recursively extract urls from a web page for reconnaissance.
Tika based link (URL) extractor for httpreserve
The eBay Listing Matcher is a Python script designed to compare and match eBay listings with parts from an Inventree instance. This script utilizes the eBay Trading API and the Inventree API to gather and process data.
An Apache Drill UDF for working with Twitter tweet text via the twitter-text Java library (https://github.com/twitter/twitter-text/tree/master/java)
🍊🔗 Squeeze some juice from URLs: A URL crawler/extraction library.
URL Extractor | Designed By YogSec is a powerful Bash script that helps you extract URLs from a single file or all files inside a folder
A simple browser extension for managing tabs and URLs: batch creation, URL extraction, domain blocking, redirects, and profile-based configuration management. Built with React + Vite (Manifest V3) for optimal performance and simplicity.
Tool for extracting and sorting links from a text file.
A python script to extract URL from the text or paragraph.
Website URL Scanner is a simple command-line tool that allows you to scan a website and extract all URLs. It can be useful for various purposes, such as link analysis or checking for broken links.
Curated Bookmarks for OSINT & Investigations
Extract article title, description, images, keywords and authors from any URL
This repository demonstrates how to use OpenAI's Response API (with GPT-4.1 and tool calling) to extract the main product image URL from an e-commerce product page. It provides both Python and TypeScript implementations, returning a structured output for easy integration.
Extract URLs,endpoints,paths and word-lists form source files
Extact all URLs from anchor and image tags within a html/xhtml page and its children.
Extract urls from your a file or web address
Any blob of text → Extract the images → Preview & download = Fast
A Go (Golang) package for working with URLs. It provides robust tools for extracting URLs from text and parsing them into granular components.
Python tool to extract PID, Method, VHost, and Request from Apache server-status pages. Supports single and bulk URL checks for security analysis
Extract http/https URLs from any kind of text content.
URL Extractor is a simple Python code designed to extract the domain name from a list of URLs stored in a text file. This application provides a convenient way to extract and process URLs efficiently.
A small tool for extracting all urls from a blob of binary data (ex. PDFs).
File attachment and URL extractor for EML & MSG files using Python
This project contains a Python script to extract all unique absolute URLs from a webpage and write them into a text file. This can be useful for indexing purposes.
LinkLifter is a Python script that searches for URLs in a given text file or recursively in a directory and its subdirectories. The found URLs, along with the file they are located in, are saved to a CSV file.
A high-performance URL extractor that finds & normalizes HTTP(S) URLs in text with Git remote support. Ideal for logs, terminal output, and text processing pipelines.
Bootcamp Laboratoria - Produto final do sprint 4. Biblioteca no npm para extracao de links em documento markdown.
🖼️ Extract images from any text quickly, preview them in a gallery, and download your selections easily as individual files or a ZIP.
URL Title Extractor is a Python program that extracts the titles of web pages from a file containing URLs. It uses the requests and BeautifulSoup libraries to extract the title and decode any HTML entities.