Ticket #670 (closed defect: fixed)

Opened 10 years ago

Last modified 10 years ago

PDF repeatable metadata

Reported by: kjdon Owned by: kjdon
Priority: high Milestone: 2.84 Release
Component: Collection Building: Plugins Severity: enhancement
Keywords: Cc:

Description

From John Rose:

The institutions cooperating with IRD in French-speaking Africa are all it seems putting metadata in their pdf files and then extracting it with PDFPlugin. Luigi badly needs to extract repeatable keywords from the pdf heading information by specifying a separator as is done for ISISPlugin. For example the attached file from the IRD collection has several keywords in the pdf field KEYWORDS, separated by "; ".

Luz has also asked for this.

Need to add an option to HTMLPlugin (and then PDF etc) to use a separator for metadata.

Change History

Changed 10 years ago by kjdon

  • status changed from new to closed
  • resolution set to fixed

I have added -metadata_field_separator option to PDFPlugin and HTMLPlugin

Note: See TracTickets for help on using tickets.