Every #ebook is slightly different so I'm just going to have to write a little code for each #ebook that I process and extract content from. (These are public domain #ebooks.) This is for the new project that I haven't released the details of.