stevenvachon / minurl

Reduce and normalize the components of a WHATWG URL.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

minurl NPM Version File Size Build Status Coverage Status Dependency Monitor

Reduce and normalize the components of a URL.

Installation

Node.js >= 14 is required. To install, type this at the command line:

npm install minurl

Usage

const minURL = require('minurl');

const url = new URL('http://www.domain.com/index.html?param1=va%20lue&param2=');

minURL(url, options);
//-> http://domain.com?param1=va+lue&param2

Note: "www" subdomains and "index.html" indexes are not part of any specification. They are merely a common configuration on many HTTP servers. Consider this when deciding on which options to use.

Options

It is simplest to use an option profile, but custom configurations are still possible.

clone

Type: Boolean
Default value: true
When set to true, the input URL will first be cloned before any changes are made. When set to false, the input URL will be mutated.

defaultPorts

Type: Object
Default value: {'ftps:':990, 'git:':9418, 'scp:':22, 'sftp:':22, 'ssh:':22}
A map of protocol default ports for removeDefaultPort. Be sure to include the suffixed ":" in the key. Common protocols already have their ports removed.

indexFilenames

Type: Array<RegExp|string>
Default value: ['index.html']
A list of file names for removeIndexFilename.

plusQueries

Type: Boolean or Function
Default value: true
When set to true or a function that returns true, a URL will use "+" instead of "%20" to encode spaces in query parameter names and values.

queryNames

Type: Array<RegExp|string>
Default value: []
A list of query parameters for removeQueryNames.

removeAuth

Type: Boolean or Function
Default value: false
When set to true or a function that returns true, a URL's username and password will be removed.

removeDefaultPort

Type: Boolean or Function
Default value: true
When set to true or a function that returns true, a URL's port that matches any found in defaultPorts will be removed.

removeEmptyHash

Type: Boolean or Function
Default value: true
When set to true or a function that returns true, a URL hash value of "#" will be removed.

removeEmptyQueries

Type: Boolean or Function
Default value: Function
When set to true or a function that returns true, a URL's empty query parameters (such as "?=") will be removed.

removeEmptyQueryNames

Type: Boolean or Function
Default value: Function
When set to true or a function that returns true, a URL's query parameters that contain a value with no name (such as "?=value") will be removed.

removeEmptyQueryValues

Type: Boolean or Function
Default value: Function
When set to true or a function that returns true, a URL's query parameters that contain no value (such as "?var=" and "?var") will be removed.

removeEmptySegmentNames

Type: Boolean or Function
Default value: false
When set to true or a function that returns true, empty segment names within a URL's path will be removed. For example, the "//" in "/path//to/" will become "/path/to/". Protocol-relative URLs will not be affected.

removeHash

Type: Boolean or Function
Default value: false
When set to true or a function that returns true, a URL's hash will be removed.

removeIndexFilename

Type: Boolean or Function
Default value: Function
When set to true or a function that returns true, a URL's file name that matches any found in indexFilenames will be removed.

removeQueryNames

Type: Boolean or Function
Default value: false
When set to true or a function that returns true, a URL's query parameters matching queryNames will be removed.

removeQueryOddities

Type: Boolean or Function
Default value: true
When set to true or a function that returns true, a URL's unnecessary occurrences of "?", "=" and "&" characters will be removed.

removeRootTrailingSlash

Type: Boolean or Function
Default value: true
When set to true or a function that returns true, a URL's root trailing slash (such as http://domain.com/?var) will be removed.

removeTrailingSlash

Type: Boolean or Function
Default value: false
When set to true or a function that returns true, any trailing slash in a URL (such as http://domain.com/dir/) will be removed.

removeWWW

Type: Boolean or Function
Default value: Function
When set to true or a function that returns true, a URL's "www" subdomain will be removed.

sortQueries

Type: Boolean or Function
Default value: Function
When set to true or a function that returns true, a URL's query parameters will be sorted alphanumerically.

stringify

Type: Boolean
Default value: true
When set to true, a string will be returned. When set to false, a URL will be returned. Beware that the removeRootTrailingSlash and removeTrailingSlash options can only be applied when this option is set to true.

Option as a Function

When an option is defined as a Function, it must return true to be included in the custom filter:

const options = {
  removeIndexFilename: url => {
    // Only URLs with these protocols will have their index filename removed
    return url.protocol === 'http:' && url.protocol === 'https:';
  }
};

Option Profiles

CAREFUL_PROFILE is useful for a URL to an unknown or third-party server that could be incorrectly configured according to specifications and common best practices.

COMMON_PROFILE, the default profile, is useful for a URL to a known server that you trust and expect to be correctly configured according to specifications and common best practices.

An example of checking for a trusted hostname:

const profile = url => {
  const trustedHosts = ['domain.com'];

  const isTrusted = trustedHosts.some(trustedHost => {
    return url.hostname === trustedHost || url.hostname.endsWith(`.${trustedHost}`);
  });

  return minURL[`${isTrusted ? 'COMMON' : 'CAREFUL'}_PROFILE`];
};

const url1 = new URL('http://www.domain.com/index.html?param1=va%20lue&param2=');
const url2 = new URL('http://www.fake-domain.com/index.html?param1=va%20lue&param2=');

minURL(url1, profile(url1));  // will use "common" profile
minURL(url2, profile(url2));  // will use "careful" profile

Customizing Profiles

const custom = {
  ...minURL.COMMON_PROFILE,
  indexFilenames: ['index.html', 'index.php']
};

Or:

const extend = require('extend');

const custom = extend(true, {}, minURL.COMMON_PROFILE, { indexFilenames:['index.php'] });

About

Reduce and normalize the components of a WHATWG URL.

License:MIT License


Languages

Language:JavaScript 100.0%