source: main/trunk/binaries/windows/bin/xpdf-tools/doc/pdftohtml.txt@ 32207

Last change on this file since 32207 was 32207, checked in by ak19, 6 years ago

Got a basic Windows version of PDFPlugin's new paged_html mode working

File size: 4.0 KB
Line 
1pdftohtml(1) General Commands Manual pdftohtml(1)
2
3
4
5NAME
6 pdftohtml - Portable Document Format (PDF) to HTML converter (version
7 4.00)
8
9SYNOPSIS
10 pdftohtml [options] PDF-file HTML-dir
11
12DESCRIPTION
13 Pdftohtml converts Portable Document Format (PDF) files to HTML.
14
15 Pdftohtml reads the PDF file, PDF-file, and places an HTML file for
16 each page, along with auxiliary images in the directory, HTML-dir. The
17 HTML directory will be created; if it already exists, pdftohtml will
18 report an error.
19
20CONFIGURATION FILE
21 Pdftohtml reads a configuration file at startup. It first tries to
22 find the user's private config file, ~/.xpdfrc. If that doesn't exist,
23 it looks for a system-wide config file, typically /usr/local/etc/xpdfrc
24 (but this location can be changed when pdftohtml is built). See the
25 xpdfrc(5) man page for details.
26
27OPTIONS
28 Many of the following options can be set with configuration file com-
29 mands. These are listed in square brackets with the description of the
30 corresponding command line option.
31
32 -f number
33 Specifies the first page to convert.
34
35 -l number
36 Specifies the last page to convert.
37
38 -z number
39 Specifies the initial zoom level. The default is 1.0, which
40 means 72dpi, i.e., 1 point in the PDF file will be 1 pixel in
41 the HTML. Using '-z 1.5', for example, will make the initial
42 view 50% larger.
43
44 -r number
45 Specifies the resolution, in DPI, for background images. This
46 controls the pixel size of the background image files. The ini-
47 tial zoom level is controlled by the '-z' option. Specifying a
48 larger '-r' value will allow the viewer to zoom in farther with-
49 out upscaling artifacts in the background.
50
51 -skipinvisible
52 Don't draw invisible text. By default, invisible text (commonly
53 used in OCR'ed PDF files) is drawn as transparent (alpha=0) HTML
54 text. This option tells pdftohtml to discard invisible text
55 entirely.
56
57 -allinvisible
58 Treat all text as invisible. By default, regular (non-invisi-
59 ble) text is not drawn in the background image, and is instead
60 drawn with HTML on top of the image. This option tells pdfto-
61 html to include the regular text in the background image, and
62 then draw it as transparent (alpha=0) HTML text.
63
64 -opw password
65 Specify the owner password for the PDF file. Providing this
66 will bypass all security restrictions.
67
68 -upw password
69 Specify the user password for the PDF file.
70
71 -q Don't print any messages or errors. [config file: errQuiet]
72
73 -cfg config-file
74 Read config-file in place of ~/.xpdfrc or the system-wide config
75 file.
76
77 -v Print copyright and version information.
78
79 -h Print usage information. (-help and --help are equivalent.)
80
81BUGS
82 Some PDF files contain fonts whose encodings have been mangled beyond
83 recognition. There is no way (short of OCR) to extract text from these
84 files.
85
86EXIT CODES
87 The Xpdf tools use the following exit codes:
88
89 0 No error.
90
91 1 Error opening a PDF file.
92
93 2 Error opening an output file.
94
95 3 Error related to PDF permissions.
96
97 99 Other error.
98
99AUTHOR
100 The pdftohtml software and documentation are copyright 1996-2017 Glyph
101 & Cog, LLC.
102
103SEE ALSO
104 xpdf(1), pdftops(1), pdftotext(1), pdfinfo(1), pdffonts(1), pdfde-
105 tach(1), pdftoppm(1), pdftopng(1), pdfimages(1), xpdfrc(5)
106 http://www.xpdfreader.com/
107
108
109
110 10 Aug 2017 pdftohtml(1)
Note: See TracBrowser for help on using the repository browser.