The ePUB file format is one of a few ebook file formats. The Nook from Barnes and Noble uses ePUB. The Kindle from Amazon uses Mobi. For a general comparison of ebook file formats, see the Wikipedia Article. The demo for the NatickFOSS meeting will concentrate on ePUB for a few reasons.

  • I own a Nook
  • ePUB is an open standard.
  • ePUB is the format with which I have the most experience.
  • ePUB files are often available without DRM (Digital Rights Management or Digital Restrictions Management). See Project Gutenberg. That means we can look inside. That’s the plan for the meeting.

The ePUB format is an open standard with wide ereader support. Apple’s iBooks for the iPad and Barnes and Noble’s Nook use ePUB formats. Granted, the books they sell are commonly encumbered by DRM. The ePUB format does not require DRM. Some publishers have decided to abandon DRM. The technical publisher O’Reilly and Baen Books, are two examples.
Any more you recommend?

O'Reilly Logo Baen Logo

We will take a look at a simple ebook to keep our effort uncluttered, but the current ePUB version 3 enables all sorts of great features, including the ability to embed movies…Wait. It’s a “book”, isn’t it?”

The format is based on the same elements as the web. In some ways, ePUB software is a modified browser. The core of ePUB (and also of Mobi) is html code with cascading stylesheets (CSS) and some descriptive elements in XML files. All of these are common on the Web, too. Almost every web page has words and images kept in html code. The layout and color scheme are controlled by stylesheets. Extensible Markup Language (XML) is intended to be “meta” information, not usually displayed as part of a web page or ebook. XML contains code for the ereader, to identify file sources, author, title, table of contents, and other such things which a librarian might find especially interesting.

Linux has a good open source ePUB tool called Sigil which takes in text files or html files, formats them to the requirements of ePUB, adds the descriptive XML file automatically. Sigil also has Macintosh and Windows versions.

The following image shows the layout of the Sigil window. Click to enlarge the image and look closely. You should notice that it has many buttons that look very much like the ones in a word processor like LibreOffice Write. There are options specific to the ePUB format, but you would be able to immediately start work on your “Great American Novel” with Sigil.

sigil image

A very detailed exploration of ePUB by Elizabeth Castro EPUB: Straight to the Point is worth buying if you plan to really dig in to making them on your own. She writes about the kind of information you need to make a good-looking ebook. She begins with a section on Adobe InDesign as the development tool, but the details further into the book are useful with Sigil, too.

If anybody wants to explore the Dwarf Planet short story, you can download it and transfer it to your Kindle [mobi] or Nook [ePUB]. If you just want to be perverse and read it with your browser [html], go ahead.

After you have created and saved your work, you can also make conversions to other formats. The excellent open source ebook management tool called Calibre makes converting from one ebook format to another very easy. There are also on line tools for doing conversions. See the resources list at the end of this page.

Calibre Logo

Kindle and Mobi

Mobi is Kindle’s file storage system. The image below shows the file structure. It looks similar to the ePUB file structure we saw while looking in Sigil.

Mobi Files

What if I want to read an ebook on my computer?

I love to see these questions. Keep them coming.

Kindle-PC – [ Mac ] and Nook-PC – [ Mac ] applications are available for Mac and Windows along with iBook and Android devices like tablets and phones. Linux users can use the Kindle CloudReader, though you do need a Kindle account. Remember, most Kindle and Nook ebooks are encumbered with DRM.

The dawn of ebook availability was a while ago. The day has fully begun and the sun shines pretty brightly on us. Publishers of paper-based books are struggling to deal with it. Book stores are struggling. The Borders bookstore chain closed a couple of years ago.

Firefox has an extension which organizes and lets you read ePUB books right in your browser.

epub extension

Self Publishing

Writing is only one step. If you want to “publish” your options are wider today than ever before. You can go the way of famous authors and get an agent who will send your book to publishers, you’ll get involved with editors and contracts, etc. There are alternative paths available. A good place to start to understand the options for self publishing is the author J.A. Konrath who has successful experience with traditional publishing and doing it on his own.

Project Gutenberg
Getting DRM-free ebooks

gutenberg logo

A long time back, Project Gutenberg began converting out-of-copyright works from their paper format into digital text files. More recently the project has been converting the works to ebook formats, html and PDF. These works are often out of print, hard to get. But they are in the public domain, with their copyrights expired. The project is a service to all who love books, the reading part, not the physical tomes.

Electronic files like these ebooks can be legally downloaded to your computer and then can be easily transferred from your computer to a dedicated ebook reader like the Nook or Kindle. Check the manual of your device for the appropriate directions.

As an example of long fiction, here are screenshots of A Tale of Two Cities by Charles Dickens which is available in both ePUB (Nook) and Mobi (Kindle) formats.

Link to the book

Tale of Two Cities

ePUB file contents

This information was part of a presentation given at the NatickFOSS User Group, located in beautiful Natick, Massachusetts (Metrowest Boston). You are invited to attend.


Other Resources:
Online ePUB converter
Python Mobi Extraction Scripts

Advertisements