There are 2 repositories under the url-parsing topic.
JavaScript library to extract domains, subdomains and public suffixes from complex URIs.
Extract and decompose (fuzzy) URLs (including emails, which are conceptually a part of URLs) in texts with Area-Pattern-based modularity
Simple Scala library for building and parsing URIs
An Express.js-style router for the front end
Type-safe URL pattern matching based on parser combinators, without regular expressions or argument type mismatches.
Extracts the top-level domain (TLD) from a given URL.
galimatias is a URL parsing and normalization library written in Java.
Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters
URL de-duplication tool for better recon.
Fast and simple URL parsing for Java, with UTF-8 and path resolving support
The this.url class fetches and parses URL data, returning an object with structured information that can be kept in a database or other storage and used by machine learning algorithms.
NEXT-EVAL: From Web URLs to Structured Tables – Extraction and Evaluation
RFC 3986-compliant URL parsing library with a PSR-7 Uri component
Build an absolute URL from a base URL and a relative URL (RFC 1808).
A simple DuckDuckGo URL scraper.
urlyzer is a URL parsing analysis tool.
Go package to easily convert a URL's query parameters/values into usable struct values of the correct types.
A WHATWG URL spec-compliant URL parser for working with URLs and their query strings.
A library function for joining a base URL and a target URL into an absolute URL.
DistillNET is a library for matching and filtering HTTP requests and HTML response content using the Adblock Plus Filter format.
Extract all internal and external links from a URL in Python.
C# port of the popular LinkedIn Java library to detect and normalize URLs in text
Simple URL parsing, building and manipulation without dependencies
A query string encoding and decoding library for Python. Ported from qs for JavaScript.
Typesafe representations of network concepts in Scala
A querystring parser with nesting support
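Several of the entries above revolve around the same few operations: resolving a relative URL against a base, splitting a URL into components, and decoding its query string. As a rough illustration only, here is a minimal sketch using Python's standard urllib.parse module rather than any of the listed libraries. The example.com URLs and the variable names are assumptions made up for this example, and urljoin follows RFC 3986 resolution semantics, which may differ in edge cases from RFC 1808 or the WHATWG URL spec.

```python
from urllib.parse import parse_qs, urljoin, urlsplit

# Hypothetical base and relative URLs, chosen only for illustration.
base = "http://example.com/docs/guide/index.html"
relative = "../api/search?q=url+parsing&page=2"

# Resolve the relative reference against the base (RFC 3986 semantics).
absolute = urljoin(base, relative)

# Split the absolute URL into scheme, host, path, query and fragment.
parts = urlsplit(absolute)

# Decode the flat query string into a dict mapping each key to a list of values.
query = parse_qs(parts.query)

print(absolute)
print(parts.hostname, parts.path)
print(query)
```

Note that parse_qs only handles flat key/value pairs. Nested keys such as a[b]=c, which some of the query string libraries above support, would be returned as plain string keys.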