Ticket #313 (assigned feature)

Opened 12 years ago

Last modified 11 years ago

metadata cleaning phase

Reported by: dmn Owned by: dmn
Priority: moderate Milestone: Greenstone 3 wishlist
Component: Collection Building Severity: minor
Keywords: Cc:


Before or after metadata enrichment might it be useful to have a metadata cleaning phase? Possible tasks include:

- integration with metadata control - modification of metadata values - creation of new metadata values from other ones

this would enable building a classifier on cleanTitle whilst still keeping originalTitle around for display

Where should this go?

In BasPlug? so it can be part of every plugin or as a separate phase of the workflow?

Change History

Changed 12 years ago by dmn

  • owner changed from nobody to dmn
  • status changed from new to assigned

Changed 12 years ago by dmn

  • type changed from defect to new feature
  • component changed from Collection Building: Plugins to Collection Building

can we make this work with right click in the Enrich panel so that it can be applied to a folder (and sub-folders) rather than everything.

This is probably a job for the Perl code which should pass the relevant filenames into the Java cleaning code as command-line arguments.

Changed 12 years ago by dmn

  • severity changed from enhancement to minor

Changed 11 years ago by dmn

I have Java code to do this now, needs a Perl wrapper.

Note: See TracTickets for help on using tickets.