Timestamp:
2014-11-28T17:30:00+13:00
Author:
ak19
Message:

Second of 2 commits (the previous was committed as user sjs49) to get diffcol working again on the 64-bit Ubuntu machine that has Perl 5.18, for the Multimedia and MARC-Singlefile collections. This commit updates etc/collect.cfg for both collections, sets a dc.Title for a doc in the Multimedia collection (strawberry.txt in tablature is now set to Strawberry Fields For Ever, whereas it defaulted to Forever as one word), which produced a new metadata.xml inside the tablature folder, and updates the MARC file for the MARC-Singlefile collection. The MARC file has 2 changes: a duplicate subject of Computer Programming was removed from a Steve Reeves paper, and the subject Database management, which occurred frequently with a period at the end and once without, now has a period in that last case too so that it merges with the others.

The collect.cfg file for Multimedia now sorts the AZCompactList classifier by ex.Identifier, sets the removeprefix flag to ignore occurrences of A and The at the start of titles, and modifies the removesuffix flag so that it no longer requires a space between the final character of a title and any subsequent punctuation. All these changes were necessary to ensure that the ordering (of titles and other influential metadata) is always consistent, whether by making values unique, by making more titles group into bookshelves, or by enforcing a sort order.

Location:
other-projects/nightly-tasks/diffcol/trunk/model-collect/Multimedia
Files:
1 added
1 edited

  • other-projects/nightly-tasks/diffcol/trunk/model-collect/Multimedia/etc/collect.cfg

    r28237  r29477
    42      42        plugin  DirectoryPlugin
    43      43
    44              - classify    AZCompactList -metadata dc.Title,ex.Title -removesuffix (?i)(\\s+\\d+)|(\\s+[[:punct:]].*) -sort ex.Title
            44      + classify    AZCompactList -metadata dc.Title,ex.Title -removesuffix (?i)(\\s+\\d+)|(\\s*[[:punct:]].*) -removeprefix (?i)(a|the)\\s* -sort ex.Identifier
    45      45        classify    List -sort_leaf_nodes_using ex.Title -metadata dc.Format -buttonname browse -partition_type_within_level none -bookshelf_type always
    46      46
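
The effect of the removesuffix and removeprefix change on title grouping can be illustrated with a rough Python sketch. This is not Greenstone code. The helper name `grouping_key`, the sample titles, and the way the patterns are anchored (prefix at the start of the title, the whole suffix pattern at the end) are assumptions made for illustration. Python's `re` module has no POSIX `[[:punct:]]` class, so an explicit ASCII punctuation class stands in for it, and the doubled backslashes of collect.cfg are written as single backslashes.

```python
import re
import string

# Stand-in for POSIX [[:punct:]], which Python's re does not support
# (assumption: ASCII punctuation is close enough for this illustration).
PUNCT = "[" + re.escape(string.punctuation) + "]"

# Python translations of the collect.cfg patterns. Case-insensitivity is
# passed as a flag rather than through the inline (?i).
OLD_SUFFIX = r"(\s+\d+)|(\s+" + PUNCT + r".*)"  # r28237: space required
NEW_SUFFIX = r"(\s+\d+)|(\s*" + PUNCT + r".*)"  # r29477: space optional
NEW_PREFIX = r"(a|the)\s*"                      # r29477: new flag


def grouping_key(title, suffix, prefix=None):
    """Hypothetical helper: strip the prefix at the start of the title,
    then strip the suffix at the end of the title."""
    if prefix:
        title = re.sub("^(?:" + prefix + ")", "", title,
                       count=1, flags=re.IGNORECASE)
    return re.sub("(?:" + suffix + ")$", "", title,
                  count=1, flags=re.IGNORECASE)


# Made-up titles, not taken from the Multimedia collection.
for title in ["Hey Jude: live", "Let It Be 2", "The Long and Winding Road"]:
    old_key = grouping_key(title, OLD_SUFFIX)
    new_key = grouping_key(title, NEW_SUFFIX, NEW_PREFIX)
    print(f"{title!r}: old={old_key!r} new={new_key!r}")
```

Under these assumptions, a title whose punctuation directly follows the last word (no space before the colon) is only trimmed by the new suffix pattern, and a leading "The" is only dropped once removeprefix is set, so more titles reduce to the same key and can group into one bookshelf.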