bell notificationshomepageloginedit profileclubsdmBox

10.01% popularity   0 Reactions

I recently bought an ebook online (DRM-free). It was supposed to be in epub format, which it technically is, but as it turns out they just packaged an image of each individual page rather than the actual text. As result the whole thing is around 100MB and does not work on any of my readers. The only way to read it at the moment is to extract it and use an image viewer.

How can I convert this to actual text to make it usable?


Free books android app tbrJar TBR JAR Read Free books online gutenberg


Load Full (1)

Login to follow story

More posts by @Nuzhat

1 Comments

Sorted by latest first Latest Oldest Best

10% popularity   0 Reactions

What you need to do is run OCR (Optical Character Recognition) on the images of the book.

If you cannot find a program that does the OCR directly on the ePub file, you can try to convert it to PDF first, or extract the images. Make sure that if you do a conversion, that the images do not loose quality, as that would negatively influence the results of the conversion. Programs like Calibre allow you to do such conversions.

There are some online OCR services, mainly for conversion of PDF, make sure that you understand the implications of using such a service, as there is no guarantee that your book information is removed after conversion (and if you images are watermarked, or contain metadata pointing to you, that could get you in trouble).


Free books android app tbrJar TBR JAR Read Free books online gutenberg


Load Full (0)

 

Back to top