bell notificationshomepageloginedit profileclubsdmBox

10.01% popularity   0 Reactions

I've started exploring some of the texts on Project Gutenberg and the first happens to be the Douay-Rheims bible. I'm looking at the Text section and just see a blob of documents not divided by any semantic order, seemingly just a linearly spaced indexing of the document into uniform chunks. Does anybody have any insight to their document creation approach? Is there some performance reasons to do the above? Each text seems to be about 1400 odd lines of html. I'm thinking about editing the subtexts to be ordered around the actual books of the Bible but wonder if I would be unwittingly breaking/violating something.


Free books android app tbrJar TBR JAR Read Free books online gutenberg


Load Full (1)

Login to follow story

More posts by @Mike

1 Comments

Sorted by latest first Latest Oldest Best

 

@Mike

10% popularity   0 Reactions

Looks like indeed this was created with some automatic tool. I was snooping around the file in the Miscellaneous folder and found content.opf which has a bunch of traces of this Split on p operation showing up in the comments along with a relatively uniform chunk size.

Furthermore I emailed them about an error in the table of contents and got this response...

David Widger via RT
7:05 AM (3 hours ago)
Hi jxramos,

I find that this was one of the early PG productions which were only in the
ASCII format and had no accompanying html file made by the producer of the text
file. The html file listed with this ebook was one autogenerated and these are
often quite unsatisfactory.

A much better PG edition is:
www.gutenberg.org/files/8300/8300-h/8300-h.htm
The html file was manually produced and the mobile viewer files appear
satisfactory.

I would refer you to PG #8300

Regards,

Project Gutenberg


Free books android app tbrJar TBR JAR Read Free books online gutenberg


Load Full (0)

 

Back to top