pdftohtml.txt@ 32207

Last change on this file since 32207 was 32207, checked in by ak19, 6 years ago
Got a basic Windows version of PDFPlugin's new paged_html mode working
File size: 4.0 KB

pdftohtml(1) General Commands Manual pdftohtml(1)

NAME
       pdftohtml - Portable Document Format (PDF) to HTML converter (version
       4.00)

SYNOPSIS
       pdftohtml [options] PDF-file HTML-dir

DESCRIPTION
       Pdftohtml converts Portable Document Format (PDF) files to HTML.

       Pdftohtml reads the PDF file, PDF-file, and places an HTML file for
       each page, along with auxiliary images in the directory, HTML-dir. The
       HTML directory will be created; if it already exists, pdftohtml will
       report an error.

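Because pdftohtml creates HTML-dir itself and reports an error if it
already exists, a calling script may want to check for the directory
before launching the tool. The following Python sketch assembles the
argument list from the SYNOPSIS; the function name and the choice of
exception are illustrative assumptions, not part of pdftohtml.

```python
from pathlib import Path


def build_pdftohtml_command(pdf_file, html_dir, options=()):
    """Return an argv list of the form: pdftohtml [options] PDF-file HTML-dir.

    Hypothetical helper: it only builds the command, it does not run it.
    """
    out = Path(html_dir)
    if out.exists():
        # pdftohtml expects to create HTML-dir itself, so fail early
        # rather than let the tool report the error.
        raise FileExistsError(f"output directory already exists: {out}")
    return ["pdftohtml", *options, str(pdf_file), str(out)]
```

The returned list can be passed to subprocess.run() on a system where
pdftohtml is installed and on the PATH.
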
CONFIGURATION FILE
       Pdftohtml reads a configuration file at startup. It first tries to
       find the user's private config file, ~/.xpdfrc. If that doesn't exist,
       it looks for a system-wide config file, typically /usr/local/etc/xpdfrc
       (but this location can be changed when pdftohtml is built). See the
       xpdfrc(5) man page for details.

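The lookup order described above, together with the -cfg option from
the OPTIONS section, can be sketched in Python as follows. This is an
illustration of the documented order only, not pdftohtml's own code:
the function name, the injectable exists parameter, and returning None
when no file is found are assumptions made for the example.

```python
import os

# Typical system-wide location; the real path is fixed when pdftohtml is built.
DEFAULT_SYSTEM_CONFIG = "/usr/local/etc/xpdfrc"


def resolve_config(cfg_option=None, exists=os.path.isfile):
    """Return the config file path that would be consulted, or None."""
    if cfg_option is not None:
        # -cfg replaces both the private and the system-wide file.
        return cfg_option
    user_config = os.path.expanduser("~/.xpdfrc")
    for candidate in (user_config, DEFAULT_SYSTEM_CONFIG):
        if exists(candidate):
            return candidate
    return None
```
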
OPTIONS
       Many of the following options can be set with configuration file
       commands. These are listed in square brackets with the description of
       the corresponding command line option.

       -f number
              Specifies the first page to convert.

       -l number
              Specifies the last page to convert.

       -z number
              Specifies the initial zoom level. The default is 1.0, which
              means 72dpi, i.e., 1 point in the PDF file will be 1 pixel in
              the HTML. Using '-z 1.5', for example, will make the initial
              view 50% larger.

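The zoom arithmetic can be illustrated with a short Python sketch. At
zoom 1.0 one PDF point maps to one pixel, so the initial view is the
page size in points multiplied by the zoom level. The function name is
hypothetical, and how pdftohtml rounds fractional pixel values is not
stated here, so the sketch returns unrounded values.

```python
def initial_view_size(width_pt, height_pt, zoom=1.0):
    """Return the initial (width, height) in pixels for a page given in points."""
    # zoom 1.0 means 72 dpi: 1 point == 1 pixel.
    return (width_pt * zoom, height_pt * zoom)


# A US Letter page is 612 x 792 points.
print(initial_view_size(612, 792))        # default zoom
print(initial_view_size(612, 792, 1.5))   # '-z 1.5', 50% larger
```
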
       -r number
              Specifies the resolution, in DPI, for background images. This
              controls the pixel size of the background image files. The
              initial zoom level is controlled by the '-z' option. Specifying
              a larger '-r' value will allow the viewer to zoom in farther
              without upscaling artifacts in the background.

       -skipinvisible
              Don't draw invisible text. By default, invisible text (commonly
              used in OCR'ed PDF files) is drawn as transparent (alpha=0) HTML
              text. This option tells pdftohtml to discard invisible text
              entirely.

       -allinvisible
              Treat all text as invisible. By default, regular
              (non-invisible) text is not drawn in the background image, and
              is instead drawn with HTML on top of the image. This option
              tells pdftohtml to include the regular text in the background
              image, and then draw it as transparent (alpha=0) HTML text.

       -opw password
              Specify the owner password for the PDF file. Providing this
              will bypass all security restrictions.

       -upw password
              Specify the user password for the PDF file.

       -q     Don't print any messages or errors. [config file: errQuiet]

       -cfg config-file
              Read config-file in place of ~/.xpdfrc or the system-wide config
              file.

       -v     Print copyright and version information.

       -h     Print usage information. (-help and --help are equivalent.)

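A wrapper script might translate named settings into the options
listed above. The following Python sketch does this for a subset of
them; the function and parameter names are illustrative assumptions,
and only flags documented in this page are emitted.

```python
def pdftohtml_options(first_page=None, last_page=None, zoom=None,
                      resolution=None, skip_invisible=False,
                      all_invisible=False, quiet=False):
    """Return a list of pdftohtml option arguments for the given settings."""
    args = []
    if first_page is not None:
        args += ["-f", str(first_page)]
    if last_page is not None:
        args += ["-l", str(last_page)]
    if zoom is not None:
        args += ["-z", str(zoom)]
    if resolution is not None:
        args += ["-r", str(resolution)]
    if skip_invisible:
        args.append("-skipinvisible")
    if all_invisible:
        args.append("-allinvisible")
    if quiet:
        args.append("-q")
    return args
```

The result is meant to be placed between the program name and the
PDF-file argument, as in the SYNOPSIS.
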
BUGS
       Some PDF files contain fonts whose encodings have been mangled beyond
       recognition. There is no way (short of OCR) to extract text from these
       files.

EXIT CODES
       The Xpdf tools use the following exit codes:

       0      No error.

       1      Error opening a PDF file.

       2      Error opening an output file.

       3      Error related to PDF permissions.

       99     Other error.

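A script that runs pdftohtml can map the exit codes above to their
descriptions. This Python sketch is one way to do so; the names are
hypothetical, and the fallback text for codes not listed above is an
assumption of the example.

```python
# Exit codes as listed in the EXIT CODES section.
EXIT_CODES = {
    0: "No error.",
    1: "Error opening a PDF file.",
    2: "Error opening an output file.",
    3: "Error related to PDF permissions.",
    99: "Other error.",
}


def describe_exit_code(code):
    """Return the documented meaning of an Xpdf tool exit code."""
    return EXIT_CODES.get(code, f"Undocumented exit code {code}.")
```

For example, the returncode attribute of a completed subprocess.run()
call can be passed straight to describe_exit_code().
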
AUTHOR
       The pdftohtml software and documentation are copyright 1996-2017 Glyph
       & Cog, LLC.

SEE ALSO
       xpdf(1), pdftops(1), pdftotext(1), pdfinfo(1), pdffonts(1),
       pdfdetach(1), pdftoppm(1), pdftopng(1), pdfimages(1), xpdfrc(5)
       http://www.xpdfreader.com/

10 Aug 2017 pdftohtml(1)
