You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
What goes wrong here is that the page you link is using h1 tags for the section titles, and rdrview expects that to be used only for the main title, so they get removed. It seems that firefox used to have this issue too, but it got fixed a few years ago: mozilla/readability@11093f011f57fa528a0. So I need to port that patch for rdrview, but it's not trivial because it uses a unicode regex.
Hi ! Thanks for rdrview.
I found that, on some websites, it does not extract titles.
An example:
this article looks normal in firefox reader view :
but with rdrview, there are no titles, only paragraphs:
On other websites, it sometimes displays subtitles normally but not the main title.
I use rdrview build from latest commit with gcc on alpine linux x86_64.
If you have an idea on why this happens, I would be happy to know.
The text was updated successfully, but these errors were encountered: