source: other-projects/nightly-tasks/diffcol/trunk/model-collect/PDFBox/index/build.cfg@ 27951

Last change on this file since 27951 was 27951, checked in by ak19, 11 years ago

Updating PDFBox collection with the extra metadata extracted (when using the PDFBox extension) sorted in doc.xml, for diffcol to give consistent results on CentOS and Ubuntu.

File size: 388 bytes
1builddate 1375327485
2buildtype mgpp
3earliestdatestamp 1375327479
4indexfieldmap text->TX dc.Title,ex.dc.Title,Title->TI Source->SO
5indexfields text dc.Title,ex.dc.Title,Title Source
6indexlevels Doc
7indexmap text;dc.Title,ex.dc.Title,Title;Source;->idx
8indexstem PDFBox
9infodbtype gdbm
10levelmap document->Doc
11maxnumeric 4
12numbytes 209914
13numdocs 2
14numsections 2
15stemindexes 7
16textlevel Sec
Note: See TracBrowser for help on using the repository browser.