source: main/trunk/greenstone2/bin/linux/xpdf-tools/doc/pdfinfo.1@ 32205

Last change on this file since 32205 was 32205, checked in by ak19, 6 years ago

First set of commits to do with implementing the new 'paged_html' output option of PDFPlugin that uses using xpdftools' new pdftohtml. So far tested only on Linux (64 bit), but things work there so I'm optimistically committing the changes since they work. 2. Committing the pre-built Linux binaries of XPDFtools for both 32 and 64 bit built by the XPDF group. 2. To use the correct bitness variant of xpdftools, setup.bash now exports the BITNESS env var, consulted by gsConvert.pl. 3. All the perl code changes to do with using xpdf tools' pdftohtml to generate paged_html and feed it in the desired form into GS(3): gsConvert.pl, PDFPlugin.pm and its parent ConvertBinaryPFile.pm have been modified to make it all work. xpdftools' pdftohtml generates a folder containing an html file and a screenshot for each page in a PDF (as well as an index.html linking to each page's html). However, we want a single html file that contains each individual 'page' html's content in a div, and need to do some further HTML style, attribute and structure modifications to massage the xpdftool output to what we want for GS. In order to parse and manipulate the HTML 'DOM' to do this, we're using the Mojo::DOM package that Dr Bainbridge found and which he's compiled up. Mojo::DOM is therefore also committed in this revision. Some further changes and some display fixes are required, but need to check with the others about that.

File size: 3.2 KB
Line 
1.\" Copyright 1999-2017 Glyph & Cog, LLC
2.TH pdfinfo 1 "10 Aug 2017"
3.SH NAME
4pdfinfo \- Portable Document Format (PDF) document information
5extractor (version 4.00)
6.SH SYNOPSIS
7.B pdfinfo
8[options]
9.RI [ PDF-file ]
10.SH DESCRIPTION
11.B Pdfinfo
12prints the contents of the \'Info' dictionary (plus some other useful
13information) from a Portable Document Format (PDF) file.
14.PP
15The \'Info' dictionary contains the following values:
16.PP
17.RS
18title
19.RE
20.RS
21subject
22.RE
23.RS
24keywords
25.RE
26.RS
27author
28.RE
29.RS
30creator
31.RE
32.RS
33producer
34.RE
35.RS
36creation date
37.RE
38.RS
39modification date
40.RE
41.PP
42In addition, the following information is printed:
43.PP
44.RS
45tagged (yes/no)
46.RE
47.RS
48form (AcroForm / static XFA / dynamic XFA / none)
49.RE
50.RS
51page count
52.RE
53.RS
54encrypted flag (yes/no)
55.RE
56.RS
57print and copy permissions (if encrypted)
58.RE
59.RS
60page size and rotation
61.RE
62.RS
63file size
64.RE
65.RS
66linearized (yes/no)
67.RE
68.RS
69PDF version
70.RE
71.RS
72metadata (only if requested)
73.RE
74.SH CONFIGURATION FILE
75Pdfinfo reads a configuration file at startup. It first tries to find
76the user's private config file, ~/.xpdfrc. If that doesn't exist, it
77looks for a system-wide config file, typically /usr/local/etc/xpdfrc
78(but this location can be changed when pdfinfo is built). See the
79.BR xpdfrc (5)
80man page for details.
81.SH OPTIONS
82Many of the following options can be set with configuration file
83commands. These are listed in square brackets with the description of
84the corresponding command line option.
85.TP
86.BI \-f " number"
87Specifies the first page to examine. If multiple pages are requested
88using the "\-f" and "\-l" options, the size of each requested page
89(and, optionally, the bounding boxes for each requested page) are
90printed. Otherwise, only page one is examined.
91.TP
92.BI \-l " number"
93Specifies the last page to examine.
94.TP
95.B \-box
96Prints the page box bounding boxes: MediaBox, CropBox, BleedBox,
97TrimBox, and ArtBox.
98.TP
99.B \-meta
100Prints document-level metadata. (This is the "Metadata" stream from
101the PDF file's Catalog object.)
102.TP
103.B \-rawdates
104Prints the raw (undecoded) date strings, directly from the PDF file.
105.TP
106.BI \-enc " encoding-name"
107Sets the encoding to use for text output. The
108.I encoding\-name
109must be defined with the unicodeMap command (see
110.BR xpdfrc (5)).
111This defaults to "Latin1" (which is a built-in encoding).
112.RB "[config file: " textEncoding ]
113.TP
114.BI \-opw " password"
115Specify the owner password for the PDF file. Providing this will
116bypass all security restrictions.
117.TP
118.BI \-upw " password"
119Specify the user password for the PDF file.
120.TP
121.BI \-cfg " config-file"
122Read
123.I config-file
124in place of ~/.xpdfrc or the system-wide config file.
125.TP
126.B \-v
127Print copyright and version information.
128.TP
129.B \-h
130Print usage information.
131.RB ( \-help
132and
133.B \-\-help
134are equivalent.)
135.SH EXIT CODES
136The Xpdf tools use the following exit codes:
137.TP
1380
139No error.
140.TP
1411
142Error opening a PDF file.
143.TP
1442
145Error opening an output file.
146.TP
1473
148Error related to PDF permissions.
149.TP
15099
151Other error.
152.SH AUTHOR
153The pdfinfo software and documentation are copyright 1996-2017 Glyph &
154Cog, LLC.
155.SH "SEE ALSO"
156.BR xpdf (1),
157.BR pdftops (1),
158.BR pdftotext (1),
159.BR pdftohtml (1),
160.BR pdffonts (1),
161.BR pdfdetach (1),
162.BR pdftoppm (1),
163.BR pdftopng (1),
164.BR pdfimages (1),
165.BR xpdfrc (5)
166.br
167.B http://www.xpdfreader.com/
Note: See TracBrowser for help on using the repository browser.