If you want to skip the preamble, click here to get straight to the prerequisites.
Preamble
In order to create your files, the first thing you need to do is save the web page, along with all its images, css and any images that the css files refer to. You'll find with most web browsers, such as Firefox and Internet Explorer, that if you save complete, the image and css files are saved fine, but any images that the css refers to are not saved.I found that the best browser, by far, for complete saves is Opera, which also save the images, such as background images, that are referred to from css files. (I tested with Opera 11.01, but I'm not sure about previous versions). Unfortunately, however, it has a bug whereby any local links, following a #, are truncated and thus lost. My saviour was the Firefox add-on ScrapBook, which saves everything and handles local and external links perfectly.
For creating the .pdf file, I tried all kinds of stand alone apps, and Firefox add-ons, but in all cases either css would be missing, css images missing, or both. Finally, I found an excellent free tool called wkhtmltopdf, which produces an absolutely perfect file which is also searchable - i.e. it doesn't just create an image of your page and slap it in a .pdf document.
Prerequisites
- Download and Install the Firefox ScrapBook add-on
- If you're creating a .chm file, you'll need the Microsoft HTML Help Workshop
- If you're creating a .pdf file, you'll need wkhtmltopdf
Prepare Your web page
- Navigate to your live web page and click ScrapBook > Capture Page.
A folder will be created that contains the web page (index.html), the images, and all the css combined into one file (index.css). - Open up the page, index.htm, and delete any headers, footers or anything else that you don't need.
- If you'll be creating a .chm file and your web page contains conditional references to Internet Explorer css files, or any of your css files contain IE hacks, you'll need to do a little further work. Because all the css is combined into one index.css file, conditional references cannot be used. Unfortunately, if instead you use hacks, ScrapBook removes the entries. For this reason, you'll need to edit index.css and put the hacks back in.
Create the .chm file
- Open notepad, enter the contents in the box below, and save as helpbuilder.hhp, in the same folder that contains index.html:
[OPTIONS]Note you only need to add references to files referred to in css in the helpbuilder.hhp file. That is, if you refer to an image in HTML, the HTML Help Workshop compiler will automatically pick it up.
[FILES]
index.html
image_file_referred_to_in_css_1.png
image_file_referred_to_in_css_2.png
...
- Double click helpbuilder.hhp and click compile.
Create the .pdf file
- wkhtmltopdf is command line based, so I recommend creating a folder that is easy to navigate to in a command prompt, such as zTemp1. Then, copy all the ScrapBook output files over.
- Open up a command prompt, navigate to zTemp1 and type:
wkhtmltopdf index.html index.pdf
- You now have the .pdf document called index.pdf. Open it up, and see if there are any parts of the document that have clumsy splits between pages. If there are, you can edit the index.html file and do one of the following:
- Create a line break before the block that you don't want splitting with: <div style="page-break-after: always;"><span style="display: none;"> </span></div>
- Wrap the block with the following: <div style="page-break-inside: avoid;">...</div>
- Repeat from step 2, until you're happy with the page breaks.
That's all there is to it! If, like me, you have a bunch of help pages that all use the same css files, and hence the same css referenced images, you only need to create the HTML Help Workshop .hhp file once. Once you've created the file, it takes no time at all to save your web page, using ScrapBook, and then copy the .hhp template to the output directory and compile the file.
No comments:
Post a Comment