
: Re: Clean up converted ebook I have some ebooks which are converted from other formats to ePub. Some are ill-formatted: there are hard line breaks in words and orphaned page numbers between paragraphs.
Somehow I missed that … thanks to Donald.McLean’s answer, I found it.
In the ePub editor, I hit Ctrl+F, and a search & replace tool showed up at the bottom:
I have used the regex mode and the following patterns:
-</p>n<p[^>]*> replaced with nothing to remove paragraph breaks within words. I hit Replace all here.
s</p>n<p[^>]*> replaced with a single space to remove paragraph breaks within sentences. This had to be done manually, unfortunately, because there were some spaces at the end of lines which should stay separated.
n<p[^>]*>d+</p> replaced with nothing to remove orphaned page numbers. Replace all again.
I have also installed, but not yet tested, the plugin Modify ePub by Grant Drake. It offers some automated tasks:
Free books android app tbrJar TBR JAR Read Free books online gutenberg
More posts by @Phylliss

: Clean up converted ebook I have some ebooks which are converted from other formats to ePub. Some are ill-formatted: there are hard line breaks in words and orphaned page numbers between paragraphs.