source: other-projects/nightly-tasks/diffcol/trunk/model-collect/MARC-Singlefile/etc/collect.cfg@ 29477

Last change on this file since 29477 was 29477, checked in by ak19, 9 years ago

Second of 2 commits (previous committed as user sjs49) to get diffcol on the 64 bit Ubuntu that has perl 5.18 to work again for the Multimedia and MARC-Singlefile collections. This commit does updates the etc/collect.cfg for both collections, sets a dc.Title for a doc in the Multimedia collection (strawberry.txt in tablature, now set to Strawberry Fields For Ever, whereas it defaulted to Forever as one word) which produced a new metadata.xml inside the tablature folder, and an updated marc file for the MARC-Singlefile collection, which has 2 changes: a duplicate subject of Computer Programming was removed from a Steve Reeves paper and the subject Database management occurred frequently with a period at end and once without period, the last of which has now got a period too for merging with the other. The collect.cfg file for Multimedia sorts the AZCompactList classifier by ex.Identifier and sets the removeprefix flag to ignore occurrences of A and The at the start of titles, and modifies the removesuffix flag to not require a space between the final character of a title and any subsequent punctuation. All these changes were necessary to ensure that the ordering (of titles and other influential metadata) was always consistent by making them unique, by making more titles group into bookshelves, or enforcing a sort order.

File size: 1.9 KB
Line 
1creator
2maintainer
3public true
4
5buildtype mgpp
6infodbtype gdbm
7
8indexes text dc.Title,ex.dc.Title,ex.Title dc.Subject
9defaultindex text
10
11levels document
12
13indexoptions accentfold casefold stem
14
15defaultlevel document
16
17# import options, needed for diffcol testing
18# especially in conjunction with -sort flag to ArchivesInfPlugin
19OIDtype hash_on_full_filename
20sortmeta OID
21
22plugin ZIPPlugin
23plugin GreenstoneXMLPlugin
24plugin TextPlugin
25plugin HTMLPlugin
26plugin EmailPlugin
27plugin PDFPlugin
28plugin RTFPlugin
29plugin WordPlugin
30plugin PostScriptPlugin
31plugin PowerPointPlugin
32plugin ExcelPlugin
33plugin ImagePlugin
34plugin ISISPlugin
35plugin NulPlugin
36plugin EmbeddedMetadataPlugin
37plugin MARCPlugin
38plugin MetadataXMLPlugin
39plugin ArchivesInfPlugin -sort
40plugin DirectoryPlugin
41
42classify List -metadata dc.Title;ex.Title -partition_type_within_level approximate_size
43
44classify AZCompactList -metadata dc.Subject -sort dc.Title
45
46format VList "<td valign=\"top\">[link][icon][/link]</td>
47<td valign=\"top\">[ex.srclink]{Or}{[ex.thumbicon],[ex.srcicon]}[ex./srclink]</td>
48<td valign=\"top\">[highlight]
49{Or}{[dc.Title],[exp.Title],[ex.Title],Untitled}
50[/highlight]{If}{[ex.Source],<br><i>([ex.Source])</i>}</td>"
51
52format HList "[link][highlight][ex.Title][/highlight][/link]"
53
54format DocumentHeading "{Or}{[parent(Top):Title],[Title],untitled}<br>"
55
56format DocumentText "[Text]"
57
58format DocumentButtons "Detach|Highlight"
59
60format SearchTypes "form"
61
62collectionmeta collectionname [l=en] "MARC-Singlefile"
63collectionmeta .text [l=en] "_labeltext_"
64collectionmeta .dc.Title,ex.dc.Title,Title [l=en] "_labelTitle_"
65collectionmeta .document [l=en] "_textdocument_"
66collectionmeta .section [l=en] "_textsection_"
67collectionmeta .document:text [l=en] "_labeltext_"
68collectionmeta .document:dc.Title,Title,ex.dc.Title [l=en] "_labelTitle_"
69collectionmeta .document:Source [l=en] "_labelSource_"
70collectionmeta .document:dc.Title,Title [l=en] "_labelTitle_"
71collectionmeta .dc.Subject [l=en] "_labelSubject_"
Note: See TracBrowser for help on using the repository browser.