sfeed_web - finds URLs to feeds from a HTML webpage


SFEED_WEB(1) General Commands Manual SFEED_WEB(1)

NAME

sfeed_web — finds URLs to feeds from a HTML webpage

SYNOPSIS

sfeed_web [baseurl]

DESCRIPTION

sfeed_web reads the HTML data of the webpage from stdin and writes the found URLs to stdout.

Such a link reference in HTML code looks like:

<link rel="alternate" href="atom.xml" type="application/atom+xml" />

OPTIONS
baseurl

Optional base URL to use for found feed URLs that are relative.

OUTPUT FORMAT

url<TAB>content-type<newline>

URL

Found relative or absolute URL.

For relative URLs if a <base href="..." /> tag is found it will be used, otherwise if the baseurl option is specified then that is used, if neither are set then the relative URL is printed.

content-type

Usually application/atom+xml or application/rss+xml.

EXIT STATUS

The sfeed_web utility exits 0 on success, and >0 if an error occurs.

EXAMPLES

Get URLs from a website:

curl -s -L ’https://codemadness.org/’ | sfeed_web ’https://codemadness.org/’

SEE ALSO

sfeed_update(1), sh(1)

AUTHORS

Hiltjo Posthuma <[email protected]> GNU July 27, 2021 SFEED_WEB(1)


Updated 2024-01-29 - jenkler.se | uex.se