-
Notifications
You must be signed in to change notification settings - Fork 174
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RSS feature request: sanitize/filter out HTML from $description
#1539
Comments
I assume the example you submitted to matrix-org/matrix-hookshot#732 is from https://bodhi.fedoraproject.org/rss/updates/ and the one you have here is from https://blog.torproject.org/feed.xml In both of these feeds, the <item><title>libphidget22-1.15.20230526-1.fc39</title><link>https://bodhi.fedoraproject.org/updates/FEDORA-2023-ffb20eb9af</link><description><h1>FEDORA-2023-ffb20eb9af</h1>
<h2>Packages in this update:</h2>
<ul>
<li>libphidget22-1.15.20230526-1.fc39</li>
</ul>
<h2>Update description:</h2>
<p>Automatic update for libphidget22-1.15.20230526-1.fc39.</p>
<h5><strong>Changelog</strong></h5>
<pre><code>* Mon May 29 2023 Richard Shaw &lt;<a href="mailto:[email protected]">[email protected]</a>&gt; - 1.15.20230526-1
- Update to 1.15.20230526.
</code></pre></description><pubDate>Mon, 29 May 2023 12:06:21 +0000</pubDate></item> and <entry><title>New Alpha Release: Tor Browser 12.5a6 (Android, Windows, macOS, Linux)</title><link href="https://blog.torproject.org/new-alpha-release-tor-browser-125a6/" rel="alternate"></link><updated>2023-05-24T00:00:00Z</updated><author><name>richard</name></author><id>urn:uuid:3d4a5097-1fc1-35ce-960d-7c29c6d28676</id><content type="html"><article class="blog-post">
<picture>
<source media="(min-width:415px)" srcset="https://blog.torproject.org/new-alpha-release-tor-browser-125a6/lead.webp" type="image/webp">
<source srcset="https://blog.torproject.org/new-alpha-release-tor-browser-125a6/lead_small.webp" type="image/webp">
<img class="lead" referrerpolicy="no-referrer" loading="lazy" src="https://blog.torproject.org/new-alpha-release-tor-browser-125a6/lead.png">
</picture>
<div class="body"><p>Tor Browser 12.5a6 is now available from the <a href="https://www.torproject.org/download/alpha/">Tor Browser download page</a> and also from our <a href="https://www.torproject.org/dist/torbrowser/12.5a6/">distribution directory</a>.</p>
<p>This release updates Firefox 102.11.0esr, including bug fixes, stability improvements and important <a href="https://www.mozilla.org/en-US/security/advisories/mfsa2023-17/">security updates</a>. There were no Android-specific security updates to backport from the Firefox 113 release.</p>
<h2>Build-Signing Infrastructure Updates</h2>
<p>We are in the process of updating our build signing infrastructure, and unfortunately are unable to ship code-signed 12.5a6 installers for Windows systems currently. Therefore we will not be providing full Window installers for this release. However, automatic build-to-build upgrades from 12.5a4 and 12.5a5 should continue to work as expected.</p>
<h2>Full changelog</h2>
<p>The full changelog since <a href="https://gitlab.torproject.org/tpo/applications/tor-browser-build/-/raw/main/projects/browser/Bundle-Data/Docs/ChangeLog.txt">Tor Browser 12.5a5</a> is:</p>
<ul>
<li>All Platforms<ul>
<li>Updated Translations</li>
<li>Updated Go to 11.9.9</li>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser-build/-/issues/40860">Bug tor-browser-build#40860</a>: Improve the transition from the old fontconfig file to the new one</li>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser/-/issues/41728">Bug tor-browser#41728</a>: Pin bridges.torproject.org domains to Let's Encrypt's root cert public key</li>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser/-/issues/41738">Bug tor-browser#41738</a>: Replace the patch to disable live reload with its preference</li>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser/-/issues/41757">Bug tor-browser#41757</a>: Rebase Tor Browser Alpha to 102.11.0esr</li>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser/-/issues/41763">Bug tor-browser#41763</a>: TTP-02-003 WP1: Data URI allows JS execution despite safest security level (Low)</li>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser/-/issues/41764">Bug tor-browser#41764</a>: TTP-02-004 OOS: No user-activation required to download files (Low)</li>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser/-/issues/41775">Bug tor-browser#41775</a>: Avoid re-defining some macros in nsUpdateDriver.cpp</li>
</ul>
</li>
<li>Windows + macOS + Linux<ul>
<li>Updated Firefox to 102.11esr</li>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser/-/issues/41607">Bug tor-browser#41607</a>: Update "New Circuit" icon</li>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser/-/issues/41736">Bug tor-browser#41736</a>: Customize the default CustomizableUI toolbar using CustomizableUI.jsm</li>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser/-/issues/41770">Bug tor-browser#41770</a>: Keyboard navigation broken leaving the toolbar tor circuit button</li>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser/-/issues/41777">Bug tor-browser#41777</a>: Internally shippped manual does not adapt to RTL languages (it always align to the left)</li>
</ul>
</li>
<li>Windows + Linux<ul>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser/-/issues/41654">Bug tor-browser#41654</a>: UpdateInfo jumped into Data</li>
</ul>
</li>
<li>Linux<ul>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser/-/issues/41732">Bug tor-browser#41732</a>: implement linux font whitelist as defense-in-depth</li>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser/-/issues/41776">Bug tor-browser#41776</a>: System fonts are temporarily leaked on Linux after the browser is updated from 12.5a4 or earlier</li>
</ul>
</li>
<li>Android<ul>
<li>Updated GeckoView to 102.11esr</li>
</ul>
</li>
<li>Build System<ul>
<li>All Platforms<ul>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser-build/-/issues/33953">Bug tor-browser-build#33953</a>: Provide a way for easily updating Go dependencies of projects</li>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser-build/-/issues/40673">Bug tor-browser-build#40673</a>: Avoid building each go module separately</li>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser-build/-/issues/40818">Bug tor-browser-build#40818</a>: Enable wasm target for rust compiler</li>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser-build/-/issues/40841">Bug tor-browser-build#40841</a>: Adapt signing scripts to new signing machines</li>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser-build/-/issues/40849">Bug tor-browser-build#40849</a>: Move Go dependencies to the projects dependent on them, not as a standalone projects</li>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser-build/-/issues/40856">Bug tor-browser-build#40856</a>: Unblock nightly builds</li>
</ul>
</li>
<li>Windows<ul>
<li><a href="https://gitlab.torproject.org/tpo/applications/tor-browser-build/-/issues/40846">Bug tor-browser-build#40846</a>: Temporarily disable Windows signing</li>
</ul>
</li>
</ul>
</li>
</ul>
</div>
<div class="categories">
<ul><li>
<a href="https://blog.torproject.org/../category/applications">
applications
</a>
</li><li>
<a href="https://blog.torproject.org/../category/releases">
releases
</a>
</li></ul>
</div>
</article>
</content></entry> you can see there are lots of Hookshot's PR decided to handle these buggy feeds as their authors intended, but instead it is now broken on correct feeds. For example, if a feed contained this: Therefore, I won't change Limnoria's behavior to accomodate buggy feeds while breaking correct feeds. The correct solution is to make the feeds' authors fix their feeds. |
Hmm actually it seems that feedparser (the library Limnoria uses to parse RSS and Atom feeds) has a heuristic to auto-fix such feeds (it detects if a description contains What version of feedparser do you have installed? |
Thank you, I reported this issue to GitHub so far. My |
The
$description
of many RSS feeds (e.g. GitHub, GitLab, crt.sh, Tor blog) contain HTML tags making them messy to read.I think Limnoria cleaning them up and just sending the user visible text would improve readability and thus usability of the plugin a lot.
While it's a different protocol and different capabilities, the Matrix bot Hookshot has this ability, matrix-org/matrix-hookshot#738
Possibly related:
The text was updated successfully, but these errors were encountered: