xurlunpack3r is a command-line utility designed to extract specific parts from URLs.
- Multiple Extraction Modes
- Custom Formats
- Cross-Platform (Windows, Linux, and macOS)
Visit the releases page and find the appropriate archive for your operating system and architecture. Download the archive from your browser, or copy its URL and retrieve it with wget or curl:
- ...with wget:
wget https://github.com/hueristiq/xurlunpack3r/releases/download/v<version>/xurlunpack3r-<version>-linux-amd64.tar.gz
- ...or, with curl:
curl -OL https://github.com/hueristiq/xurlunpack3r/releases/download/v<version>/xurlunpack3r-<version>-linux-amd64.tar.gz
...then, extract the binary:
tar xf xurlunpack3r-<version>-linux-amd64.tar.gz
Tip
The download and extract steps above can be combined into a single step with this one-liner:
curl -sL https://github.com/hueristiq/xurlunpack3r/releases/download/v<version>/xurlunpack3r-<version>-linux-amd64.tar.gz | tar -xzv
Note
On Windows systems, you should be able to double-click the zip archive to extract the xurlunpack3r executable.
...move the xurlunpack3r binary to somewhere in your PATH. For example, on GNU/Linux and macOS systems:
sudo mv xurlunpack3r /usr/local/bin/
Note
Windows users can follow How to: Add Tool Locations to the PATH Environment Variable in order to add xurlunpack3r to their PATH.
Installing from source requires Go. If it is not already on your system, follow the official installation instructions for your operating system; the steps below assume Go is installed.
go install -v github.com/hueristiq/xurlunpack3r/cmd/xurlunpack3r@latest
- Clone the repository
git clone https://github.com/hueristiq/xurlunpack3r.git
- Build the utility
cd xurlunpack3r/cmd/xurlunpack3r && go build .
- Move the xurlunpack3r binary to somewhere in your PATH. For example, on GNU/Linux and macOS systems:
sudo mv xurlunpack3r /usr/local/bin/
Windows users can follow How to: Add Tool Locations to the PATH Environment Variable in order to add xurlunpack3r to their PATH.
Caution
While the development version is a good way to take a peek at xurlunpack3r's latest features before they get released, be aware that it may have bugs. Officially released versions will generally be more stable.
To display the help message for xurlunpack3r, use the -h flag:
xurlunpack3r -h
help message:
_ _ _____
__ ___ _ _ __| |_ _ _ __ _ __ __ _ ___| | _|___ / _ __
\ \/ / | | | '__| | | | | '_ \| '_ \ / _` |/ __| |/ / |_ \| '__|
> <| |_| | | | | |_| | | | | |_) | (_| | (__| < ___) | |
/_/\_\\__,_|_| |_|\__,_|_| |_| .__/ \__,_|\___|_|\_\____/|_|
|_| v0.1.0
USAGE:
xurlunpack3r [MODE] [FORMATSTRING] [OPTIONS]
MODES:
domains the hostname (e.g. sub.example.com)
apexes the apex domain (e.g. example.com from sub.example.com)
paths the request path (e.g. /users)
query `key=value` pairs from the query string (one per line)
params keys from the query string (one per line)
values query string values (one per line)
format custom format (see below)
FORMAT DIRECTIVES:
%% a literal percent character
%s the request scheme (e.g. https)
%u the user info (e.g. user:pass)
%d the domain (e.g. sub.example.com)
%S the subdomain (e.g. sub)
%r the root of domain (e.g. example)
%t the TLD (e.g. com)
%P the port (e.g. 8080)
%p the path (e.g. /users)
%e the path's file extension (e.g. jpg, html)
%q the raw query string (e.g. a=1&b=2)
%f the page fragment (e.g. page-section)
%@ inserts an @ if user info is specified
%: inserts a colon if a port is specified
%? inserts a question mark if a query string exists
%# inserts a hash if a fragment exists
%a authority (alias for %u%@%d%:%P)
INPUT:
-u, --url string[] target URL
-l, --list string target URLs list file path
TIP: For multiple input URLs use comma(,) separated value with `-u`,
specify multiple `-u`, load from file with `-l` or load from stdin.
OUTPUT:
--unique bool output unique values
--monochrome bool display no color output
-s, --silent bool stdout values only output
-v, --verbose bool stdout verbose output
$ cat urls.txt
https://sub.example.com/users?id=123&name=Sam
https://sub.example.com/orgs?org=ExCo#about
http://example.net/about#contact
You can extract the domains from the URLs with the domains mode:
$ cat urls.txt | xurlunpack3r domains -i -
sub.example.com
sub.example.com
example.net
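As a rough illustration of what the domains mode reports, the hostname can be pulled out of each URL with a standard URL parser. The sketch below uses Python's urllib.parse rather than xurlunpack3r's own Go code, and the function name `domains` is made up for this example:

```python
from urllib.parse import urlsplit


def domains(urls):
    """Return the hostname of each URL, skipping URLs that have none."""
    hosts = []
    for url in urls:
        # .hostname is the host without user info or port (None if absent).
        host = urlsplit(url).hostname
        if host:
            hosts.append(host)
    return hosts


urls = [
    "https://sub.example.com/users?id=123&name=Sam",
    "https://sub.example.com/orgs?org=ExCo#about",
    "http://example.net/about#contact",
]
print("\n".join(domains(urls)))
```

Run against the sample urls.txt contents, this prints the same three hostnames as the command above.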
If you don't want to output duplicate values, you can use the --unique flag:
```
$ cat urls.txt | xurlunpack3r domains -i - --unique
sub.example.com
example.net
```
The --unique flag works for all modes.
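Conceptually, de-duplication only needs to remember which values have already been printed. A minimal Python sketch follows; it assumes the first occurrence is kept and input order is preserved, which matches the output shown above but is not confirmed for xurlunpack3r in general. The name `unique` is hypothetical:

```python
def unique(values):
    """Drop repeated values, keeping the first occurrence of each."""
    # dict keys are unique and remember insertion order.
    return list(dict.fromkeys(values))


print(unique(["sub.example.com", "sub.example.com", "example.net"]))
```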
You can extract the apex part of the domain (e.g. the example.com in http://sub.example.com) using the apexes mode:
$ cat urls.txt | xurlunpack3r apexes -i - --unique
example.com
example.net
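Finding the apex domain means knowing where the public suffix (such as com or co.uk) ends, which usually requires the Public Suffix List. The Python sketch below substitutes a tiny hard-coded suffix set for that list, so it is an illustration only. `SUFFIXES` and `apex` are hypothetical names, and whether xurlunpack3r resolves suffixes this way is an assumption:

```python
from urllib.parse import urlsplit

# Tiny stand-in for the Public Suffix List (assumption: a real tool loads the full list).
SUFFIXES = {"com", "net", "co.uk"}


def apex(url, suffixes=SUFFIXES):
    """Return the longest known suffix plus one label, or None if no suffix matches."""
    host = urlsplit(url).hostname
    if not host:
        return None
    labels = host.split(".")
    # Check longer candidate suffixes first so "co.uk" wins over a shorter match.
    for i in range(1, len(labels)):
        if ".".join(labels[i:]) in suffixes:
            return ".".join(labels[i - 1:])
    return None


print(apex("https://sub.example.com/users"))
print(apex("https://shop.example.co.uk/"))
```

A naive "last two labels" rule would give co.uk for the second URL, which is why the suffix lookup matters.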
$ cat urls.txt | xurlunpack3r paths -i -
/users
/orgs
/about
$ cat urls.txt | xurlunpack3r query -i -
id=123
name=Sam
org=ExCo
$ cat urls.txt | xurlunpack3r params -i -
id
name
org
$ cat urls.txt | xurlunpack3r values -i -
123
Sam
ExCo
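The query, params and values modes are three views of the same parsed query string. The following Python sketch shows the relationship using urllib.parse; the helper names are invented for this example. One assumed difference: `parse_qsl` percent-decodes keys and values, and xurlunpack3r may or may not do the same.

```python
from urllib.parse import parse_qsl, urlsplit


def pairs(urls):
    """Return (key, value) tuples from every URL's query string, in order."""
    found = []
    for url in urls:
        found.extend(parse_qsl(urlsplit(url).query, keep_blank_values=True))
    return found


def query(urls):
    return [f"{key}={value}" for key, value in pairs(urls)]


def params(urls):
    return [key for key, _ in pairs(urls)]


def values(urls):
    return [value for _, value in pairs(urls)]


urls = [
    "https://sub.example.com/users?id=123&name=Sam",
    "https://sub.example.com/orgs?org=ExCo#about",
    "http://example.net/about#contact",  # no query string, so it contributes nothing
]
print(query(urls), params(urls), values(urls))
```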
You can use the format mode to specify a custom output format:
$ cat urls.txt | xurlunpack3r format %d%p -i -
sub.example.com/users
sub.example.com/orgs
example.net/about
The available format directives are:
%% A literal percent character
%s The request scheme (e.g. https)
%u The user info (e.g. user:pass)
%d The domain (e.g. sub.example.com)
%S The subdomain (e.g. sub)
%r The root of domain (e.g. example)
%t The TLD (e.g. com)
%P The port (e.g. 8080)
%p The path (e.g. /users)
%e The path's file extension (e.g. jpg, html)
%q The raw query string (e.g. a=1&b=2)
%f The page fragment (e.g. page-section)
%@ Inserts an @ if user info is specified
%: Inserts a colon if a port is specified
%? Inserts a question mark if a query string exists
%# Inserts a hash if a fragment exists
%a Authority (alias for %u%@%d%:%P)
For the full list of format directives, check out the Format Directives section of the help message (xurlunpack3r -h).
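To make the directive semantics concrete, here is a minimal Python sketch of a format expander. It is not xurlunpack3r's implementation. The function name `expand` is hypothetical, and %S, %r and %t are deliberately left out because they need public-suffix knowledge; in this sketch they are copied through unchanged, as is any other unrecognised sequence.

```python
import posixpath
from urllib.parse import urlsplit


def expand(fmt, url):
    """Expand a subset of the format directives against one URL."""
    parts = urlsplit(url)
    userinfo = parts.netloc.rpartition("@")[0]  # "" when there is no "@"
    port = "" if parts.port is None else str(parts.port)
    table = {
        "%": "%",
        "s": parts.scheme,
        "u": userinfo,
        "d": parts.hostname or "",
        "P": port,
        "p": parts.path,
        "e": posixpath.splitext(parts.path)[1].lstrip("."),
        "q": parts.query,
        "f": parts.fragment,
        # Conditional separators: emitted only when the related part exists.
        "@": "@" if userinfo else "",
        ":": ":" if port else "",
        "?": "?" if parts.query else "",
        "#": "#" if parts.fragment else "",
    }
    table["a"] = table["u"] + table["@"] + table["d"] + table[":"] + table["P"]

    out = []
    i = 0
    while i < len(fmt):
        if fmt[i] == "%" and i + 1 < len(fmt) and fmt[i + 1] in table:
            out.append(table[fmt[i + 1]])
            i += 2
        else:
            out.append(fmt[i])  # not a directive: copy the character as-is
            i += 1
    return "".join(out)


print(expand("%d%p", "https://sub.example.com/users?id=123&name=Sam"))
print(expand("%s://%a%p%?%q%#%f", "https://user:pass@example.com:8080/a.html?x=1#top"))
```

The second call rebuilds the original URL, which shows why the conditional directives (%@, %:, %?, %#) exist: they let one format string cover URLs with and without the optional parts.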
Any characters that don't match a format directive remain untouched:
$ cat urls.txt | xurlunpack3r format "%d (%s)" -i - --unique
sub.example.com (https)
example.net (http)
Note that if a URL does not include the data requested, there will be no output for that URL:
$ echo http://example.com | xurlunpack3r format "%P" -i -
$ echo http://example.com:8080 | xurlunpack3r format "%P" -i -
8080
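The same skip-when-empty behaviour can be sketched in Python for the %P case: a URL without an explicit port yields nothing instead of a blank line. The function name `ports` is hypothetical, and this only mirrors the two commands above, not the tool's internals:

```python
from urllib.parse import urlsplit


def ports(urls):
    """Return explicit ports only; URLs without one produce no entry."""
    found = []
    for url in urls:
        port = urlsplit(url).port  # None when the URL has no explicit port
        if port is not None:
            found.append(str(port))
    return found


print(ports(["http://example.com", "http://example.com:8080"]))
```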
Feel free to submit Pull Requests or report Issues. For more details, check out the contribution guidelines.
Huge thanks to the contributors thus far!
This package is licensed under the MIT license. You are free to use, modify, and distribute it, as long as you follow the terms of the license. You can find the full MIT license text in the repository.