This tool takes a list of URLs and extracts the content of each page's `<title>` tag.
titlextractor needs the golang.org/x/net/html package to parse HTML.
go get -u golang.org/x/net/html
go get -u github.com/dellalibera/titlextractor/
Parameter | Description |
---|---|
-t | Request timeout in seconds (default 20) |
-f | Boolean value that controls whether redirects are followed (default false) |
-n | Number of concurrent workers (default 20) |
-c | Boolean value used to color the output (default false) |
cat urls.txt | titlextractor -f -c
I was inspired by: