Title extraction fails when page title starts with #nn #4
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
On sites like readallcomics.com, the HTML
<title>tag sometimes contains only the issue number (e.g.#018 (2026)) rather than the full comic name. The current fallback checks<h1>first, then derives a title from the URL slug.Problem
The slug-derived title is always title-cased from hyphenated segments (e.g.
absolute-batman-018-2026→Absolute Batman 018), which works for most cases but loses any capitalisation nuance (e.g. acronyms, proper nouns likeDC).Steps to reproduce
Expected
Title derived from the actual comic name on the page, not just the URL slug.
Notes
<h1>on comic pagesog:titlemeta tag)