1 | pdftohtml(1) General Commands Manual pdftohtml(1)
|
---|
2 |
|
---|
3 |
|
---|
4 |
|
---|
5 | NAME
|
---|
6 | pdftohtml - Portable Document Format (PDF) to HTML converter (version
|
---|
7 | 4.00)
|
---|
8 |
|
---|
9 | SYNOPSIS
|
---|
10 | pdftohtml [options] PDF-file HTML-dir
|
---|
11 |
|
---|
12 | DESCRIPTION
|
---|
13 | Pdftohtml converts Portable Document Format (PDF) files to HTML.
|
---|
14 |
|
---|
15 | Pdftohtml reads the PDF file, PDF-file, and places an HTML file for
|
---|
16 | each page, along with auxiliary images in the directory, HTML-dir. The
|
---|
17 | HTML directory will be created; if it already exists, pdftohtml will
|
---|
18 | report an error.
|
---|
19 |
|
---|
20 | CONFIGURATION FILE
|
---|
21 | Pdftohtml reads a configuration file at startup. It first tries to
|
---|
22 | find the user's private config file, ~/.xpdfrc. If that doesn't exist,
|
---|
23 | it looks for a system-wide config file, typically /usr/local/etc/xpdfrc
|
---|
24 | (but this location can be changed when pdftohtml is built). See the
|
---|
25 | xpdfrc(5) man page for details.
|
---|
26 |
|
---|
27 | OPTIONS
|
---|
28 | Many of the following options can be set with configuration file com-
|
---|
29 | mands. These are listed in square brackets with the description of the
|
---|
30 | corresponding command line option.
|
---|
31 |
|
---|
32 | -f number
|
---|
33 | Specifies the first page to convert.
|
---|
34 |
|
---|
35 | -l number
|
---|
36 | Specifies the last page to convert.
|
---|
37 |
|
---|
38 | -z number
|
---|
39 | Specifies the initial zoom level. The default is 1.0, which
|
---|
40 | means 72dpi, i.e., 1 point in the PDF file will be 1 pixel in
|
---|
41 | the HTML. Using '-z 1.5', for example, will make the initial
|
---|
42 | view 50% larger.
|
---|
43 |
|
---|
44 | -r number
|
---|
45 | Specifies the resolution, in DPI, for background images. This
|
---|
46 | controls the pixel size of the background image files. The ini-
|
---|
47 | tial zoom level is controlled by the '-z' option. Specifying a
|
---|
48 | larger '-r' value will allow the viewer to zoom in farther with-
|
---|
49 | out upscaling artifacts in the background.
|
---|
50 |
|
---|
51 | -skipinvisible
|
---|
52 | Don't draw invisible text. By default, invisible text (commonly
|
---|
53 | used in OCR'ed PDF files) is drawn as transparent (alpha=0) HTML
|
---|
54 | text. This option tells pdftohtml to discard invisible text
|
---|
55 | entirely.
|
---|
56 |
|
---|
57 | -allinvisible
|
---|
58 | Treat all text as invisible. By default, regular (non-invisi-
|
---|
59 | ble) text is not drawn in the background image, and is instead
|
---|
60 | drawn with HTML on top of the image. This option tells pdfto-
|
---|
61 | html to include the regular text in the background image, and
|
---|
62 | then draw it as transparent (alpha=0) HTML text.
|
---|
63 |
|
---|
64 | -opw password
|
---|
65 | Specify the owner password for the PDF file. Providing this
|
---|
66 | will bypass all security restrictions.
|
---|
67 |
|
---|
68 | -upw password
|
---|
69 | Specify the user password for the PDF file.
|
---|
70 |
|
---|
71 | -q Don't print any messages or errors. [config file: errQuiet]
|
---|
72 |
|
---|
73 | -cfg config-file
|
---|
74 | Read config-file in place of ~/.xpdfrc or the system-wide config
|
---|
75 | file.
|
---|
76 |
|
---|
77 | -v Print copyright and version information.
|
---|
78 |
|
---|
79 | -h Print usage information. (-help and --help are equivalent.)
|
---|
80 |
|
---|
81 | BUGS
|
---|
82 | Some PDF files contain fonts whose encodings have been mangled beyond
|
---|
83 | recognition. There is no way (short of OCR) to extract text from these
|
---|
84 | files.
|
---|
85 |
|
---|
86 | EXIT CODES
|
---|
87 | The Xpdf tools use the following exit codes:
|
---|
88 |
|
---|
89 | 0 No error.
|
---|
90 |
|
---|
91 | 1 Error opening a PDF file.
|
---|
92 |
|
---|
93 | 2 Error opening an output file.
|
---|
94 |
|
---|
95 | 3 Error related to PDF permissions.
|
---|
96 |
|
---|
97 | 99 Other error.
|
---|
98 |
|
---|
99 | AUTHOR
|
---|
100 | The pdftohtml software and documentation are copyright 1996-2017 Glyph
|
---|
101 | & Cog, LLC.
|
---|
102 |
|
---|
103 | SEE ALSO
|
---|
104 | xpdf(1), pdftops(1), pdftotext(1), pdfinfo(1), pdffonts(1), pdfde-
|
---|
105 | tach(1), pdftoppm(1), pdftopng(1), pdfimages(1), xpdfrc(5)
|
---|
106 | http://www.xpdfreader.com/
|
---|
107 |
|
---|
108 |
|
---|
109 |
|
---|
110 | 10 Aug 2017 pdftohtml(1)
|
---|