1 | .\" Copyright 1997-2017 Glyph & Cog, LLC
|
---|
2 | .TH pdftohtml 1 "10 Aug 2017"
|
---|
3 | .SH NAME
|
---|
4 | pdftohtml \- Portable Document Format (PDF) to HTML converter
|
---|
5 | (version 4.00)
|
---|
6 | .SH SYNOPSIS
|
---|
7 | .B pdftohtml
|
---|
8 | [options]
|
---|
9 | .I PDF-file
|
---|
10 | .I HTML-dir
|
---|
11 | .SH DESCRIPTION
|
---|
12 | .B Pdftohtml
|
---|
13 | converts Portable Document Format (PDF) files to HTML.
|
---|
14 | .PP
|
---|
15 | Pdftohtml reads the PDF file,
|
---|
16 | .IR PDF-file ,
|
---|
17 | and places an HTML file for each page, along with auxiliary images
|
---|
18 | in the directory,
|
---|
19 | .IR HTML-dir .
|
---|
20 | The HTML directory will be created; if it already exists, pdftohtml
|
---|
21 | will report an error.
|
---|
22 | .SH CONFIGURATION FILE
|
---|
23 | Pdftohtml reads a configuration file at startup. It first tries to
|
---|
24 | find the user's private config file, ~/.xpdfrc. If that doesn't
|
---|
25 | exist, it looks for a system-wide config file, typically
|
---|
26 | /usr/local/etc/xpdfrc (but this location can be changed when pdftohtml
|
---|
27 | is built). See the
|
---|
28 | .BR xpdfrc (5)
|
---|
29 | man page for details.
|
---|
30 | .SH OPTIONS
|
---|
31 | Many of the following options can be set with configuration file
|
---|
32 | commands. These are listed in square brackets with the description of
|
---|
33 | the corresponding command line option.
|
---|
34 | .TP
|
---|
35 | .BI \-f " number"
|
---|
36 | Specifies the first page to convert.
|
---|
37 | .TP
|
---|
38 | .BI \-l " number"
|
---|
39 | Specifies the last page to convert.
|
---|
40 | .TP
|
---|
41 | .BI \-z " number"
|
---|
42 | Specifies the initial zoom level. The default is 1.0, which means
|
---|
43 | 72dpi, i.e., 1 point in the PDF file will be 1 pixel in the HTML.
|
---|
44 | Using \'-z 1.5', for example, will make the initial view 50% larger.
|
---|
45 | .TP
|
---|
46 | .BI \-r " number"
|
---|
47 | Specifies the resolution, in DPI, for background images. This
|
---|
48 | controls the pixel size of the background image files. The initial
|
---|
49 | zoom level is controlled by the \'-z' option. Specifying a larger
|
---|
50 | \'-r' value will allow the viewer to zoom in farther without upscaling
|
---|
51 | artifacts in the background.
|
---|
52 | .TP
|
---|
53 | .B \-skipinvisible
|
---|
54 | Don't draw invisible text. By default, invisible text (commonly used
|
---|
55 | in OCR'ed PDF files) is drawn as transparent (alpha=0) HTML text.
|
---|
56 | This option tells pdftohtml to discard invisible text entirely.
|
---|
57 | .TP
|
---|
58 | .B \-allinvisible
|
---|
59 | Treat all text as invisible. By default, regular (non-invisible) text
|
---|
60 | is not drawn in the background image, and is instead drawn with HTML
|
---|
61 | on top of the image. This option tells pdftohtml to include the
|
---|
62 | regular text in the background image, and then draw it as transparent
|
---|
63 | (alpha=0) HTML text.
|
---|
64 | .TP
|
---|
65 | .BI \-opw " password"
|
---|
66 | Specify the owner password for the PDF file. Providing this will
|
---|
67 | bypass all security restrictions.
|
---|
68 | .TP
|
---|
69 | .BI \-upw " password"
|
---|
70 | Specify the user password for the PDF file.
|
---|
71 | .TP
|
---|
72 | .B \-q
|
---|
73 | Don't print any messages or errors.
|
---|
74 | .RB "[config file: " errQuiet ]
|
---|
75 | .TP
|
---|
76 | .BI \-cfg " config-file"
|
---|
77 | Read
|
---|
78 | .I config-file
|
---|
79 | in place of ~/.xpdfrc or the system-wide config file.
|
---|
80 | .TP
|
---|
81 | .B \-v
|
---|
82 | Print copyright and version information.
|
---|
83 | .TP
|
---|
84 | .B \-h
|
---|
85 | Print usage information.
|
---|
86 | .RB ( \-help
|
---|
87 | and
|
---|
88 | .B \-\-help
|
---|
89 | are equivalent.)
|
---|
90 | .SH BUGS
|
---|
91 | Some PDF files contain fonts whose encodings have been mangled beyond
|
---|
92 | recognition. There is no way (short of OCR) to extract text from
|
---|
93 | these files.
|
---|
94 | .SH EXIT CODES
|
---|
95 | The Xpdf tools use the following exit codes:
|
---|
96 | .TP
|
---|
97 | 0
|
---|
98 | No error.
|
---|
99 | .TP
|
---|
100 | 1
|
---|
101 | Error opening a PDF file.
|
---|
102 | .TP
|
---|
103 | 2
|
---|
104 | Error opening an output file.
|
---|
105 | .TP
|
---|
106 | 3
|
---|
107 | Error related to PDF permissions.
|
---|
108 | .TP
|
---|
109 | 99
|
---|
110 | Other error.
|
---|
111 | .SH AUTHOR
|
---|
112 | The pdftohtml software and documentation are copyright 1996-2017 Glyph
|
---|
113 | & Cog, LLC.
|
---|
114 | .SH "SEE ALSO"
|
---|
115 | .BR xpdf (1),
|
---|
116 | .BR pdftops (1),
|
---|
117 | .BR pdftotext (1),
|
---|
118 | .BR pdfinfo (1),
|
---|
119 | .BR pdffonts (1),
|
---|
120 | .BR pdfdetach (1),
|
---|
121 | .BR pdftoppm (1),
|
---|
122 | .BR pdftopng (1),
|
---|
123 | .BR pdfimages (1),
|
---|
124 | .BR xpdfrc (5)
|
---|
125 | .br
|
---|
126 | .B http://www.xpdfreader.com/
|
---|