Command: perl -S /research/ak19/GS2bin_5Aug2013/bin/script/full-import.pl -gli -language en -collectdir /research/ak19/GS2bin_5Aug2013/collect Demo-Section-Tagging
import.pl> Detected -sortmeta. To effect the stipulated sorting by metadata (or OID) remember this option should be paired with either the '-reversesort' or '-sort' option to ArchivesInfPlugin.
import.pl> Removing current contents of the archives directory...
import.pl> Global file scan checking directory: /research/ak19/GS2bin_5Aug2013/collect/Demo-Section-Tagging/import
import.pl> Global file scan checking directory: /research/ak19/GS2bin_5Aug2013/collect/Demo-Section-Tagging/import/b17mie
import.pl> Global file scan checking directory: /research/ak19/GS2bin_5Aug2013/collect/Demo-Section-Tagging/import/b18ase
import.pl> Global file scan checking directory: /research/ak19/GS2bin_5Aug2013/collect/Demo-Section-Tagging/import/b20cre
import.pl> Global file scan checking directory: /research/ak19/GS2bin_5Aug2013/collect/Demo-Section-Tagging/import/b21wae
import.pl> Global file scan checking directory: /research/ak19/GS2bin_5Aug2013/collect/Demo-Section-Tagging/import/b22bue
import.pl> Global file scan checking directory: /research/ak19/GS2bin_5Aug2013/collect/Demo-Section-Tagging/import/ec158e
import.pl> Global file scan checking directory: /research/ak19/GS2bin_5Aug2013/collect/Demo-Section-Tagging/import/ec159e
import.pl> Global file scan checking directory: /research/ak19/GS2bin_5Aug2013/collect/Demo-Section-Tagging/import/ec160e
import.pl> Global file scan checking directory: /research/ak19/GS2bin_5Aug2013/collect/Demo-Section-Tagging/import/fb33fe
import.pl> Global file scan checking directory: /research/ak19/GS2bin_5Aug2013/collect/Demo-Section-Tagging/import/fb34fe
import.pl> Global file scan checking directory: /research/ak19/GS2bin_5Aug2013/collect/Demo-Section-Tagging/import/wb34te
import.pl> MetadataXMLPlugin: processing b17mie/metadata.xml
import.pl> HTMLPlugin processing b17mie/b17mie.htm
import.pl> MetadataXMLPlugin: processing b18ase/metadata.xml
import.pl> HTMLPlugin processing b18ase/b18ase.htm
import.pl> MetadataXMLPlugin: processing b20cre/metadata.xml
import.pl> HTMLPlugin processing b20cre/b20cre.htm
import.pl> MetadataXMLPlugin: processing b21wae/metadata.xml
import.pl> HTMLPlugin processing b21wae/b21wae.htm
import.pl> MetadataXMLPlugin: processing b22bue/metadata.xml
import.pl> HTMLPlugin processing b22bue/b22bue.htm
import.pl> MetadataXMLPlugin: processing ec158e/metadata.xml
import.pl> HTMLPlugin processing ec158e/ec158e.htm
import.pl> MetadataXMLPlugin: processing ec159e/metadata.xml
import.pl> HTMLPlugin processing ec159e/ec159e.htm
import.pl> MetadataXMLPlugin: processing ec160e/metadata.xml
import.pl> HTMLPlugin processing ec160e/ec160e.htm
import.pl> MetadataXMLPlugin: processing fb33fe/metadata.xml
import.pl> HTMLPlugin processing fb33fe/fb33fe.htm
import.pl> MetadataXMLPlugin: processing fb34fe/metadata.xml
import.pl> HTMLPlugin processing fb34fe/fb34fe.htm
import.pl> MetadataXMLPlugin: processing wb34te/metadata.xml
import.pl> HTMLPlugin processing wb34te/wb34te.htm
import.pl> *********************************************
import.pl> Import complete
import.pl> *********************************************
import.pl> * 11 documents were considered for processing
import.pl> * 11 were processed and included in the collection
import.pl> Command complete.
import.pl> Extracting new metadata from archive files.
import.pl> Archived metadata extraction complete.
Command: perl -S /research/ak19/GS2bin_5Aug2013/bin/script/full-buildcol.pl -gli -language en -collectdir /research/ak19/GS2bin_5Aug2013/collect Demo-Section-Tagging
buildcol.pl> *** creating the compressed text
buildcol.pl> collecting text statistics (mgpp_passes -T1)
buildcol.pl> ArchivesInfPlugin: processing /research/ak19/GS2bin_5Aug2013/collect/Demo-Section-Tagging/archives/archiveinf-doc.gdb
buildcol.pl> GreenstoneXMLPlugin: processing b17mie.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b18ase.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b20cre.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b21wae.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b22bue.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing ec158e.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing ec159e.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing ec160e.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing fb33fe.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing fb34fe.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing wb34te.dir/doc.xml
buildcol.pl> Stats (Compressing text from text)
buildcol.pl> Total bytes in collection: 3071304
buildcol.pl> Total bytes in text: 3071699
buildcol.pl> creating the compression dictionary
buildcol.pl> compressing the text (mgpp_passes -T2)
buildcol.pl> ArchivesInfPlugin: processing /research/ak19/GS2bin_5Aug2013/collect/Demo-Section-Tagging/archives/archiveinf-doc.gdb
buildcol.pl> GreenstoneXMLPlugin: processing b17mie.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b18ase.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b20cre.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b21wae.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b22bue.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing ec158e.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing ec159e.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing ec160e.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing fb33fe.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing fb34fe.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing wb34te.dir/doc.xml
buildcol.pl> Stats (Compressing text from text)
buildcol.pl> Total bytes in collection: 3071304
buildcol.pl> Total bytes in text: 3071699
buildcol.pl> *** building index text;dc.Title,Title;dc.Subject;dls.Organization;dls.Keyword; in subdirectory idx
buildcol.pl> creating index dictionary (mgpp_passes -I1)
buildcol.pl> ArchivesInfPlugin: processing /research/ak19/GS2bin_5Aug2013/collect/Demo-Section-Tagging/archives/archiveinf-doc.gdb
buildcol.pl> GreenstoneXMLPlugin: processing b17mie.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b18ase.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b20cre.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b21wae.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b22bue.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing ec158e.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing ec159e.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing ec160e.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing fb33fe.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing fb34fe.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing wb34te.dir/doc.xml
buildcol.pl> Stats (Creating index text;dc.Title,Title;dc.Subject;dls.Organization;dls.Keyword;)
buildcol.pl> Total bytes in collection: 3071304
buildcol.pl> Total bytes in text;dc.Title,Title;dc.Subject;dls.Organization;dls.Keyword;: 2789962
buildcol.pl> inverting the text (mgpp_passes -I2)
buildcol.pl> ArchivesInfPlugin: processing /research/ak19/GS2bin_5Aug2013/collect/Demo-Section-Tagging/archives/archiveinf-doc.gdb
buildcol.pl> GreenstoneXMLPlugin: processing b17mie.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b18ase.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b20cre.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b21wae.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b22bue.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing ec158e.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing ec159e.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing ec160e.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing fb33fe.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing fb34fe.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing wb34te.dir/doc.xml
buildcol.pl> Stats (Creating index text;dc.Title,Title;dc.Subject;dls.Organization;dls.Keyword;)
buildcol.pl> Total bytes in collection: 3071304
buildcol.pl> Total bytes in text;dc.Title,Title;dc.Subject;dls.Organization;dls.Keyword;: 2789962
buildcol.pl> create the weights file
buildcol.pl> creating 'on-disk' stemmed dictionary
buildcol.pl> creating stem indexes
buildcol.pl> BuildDir: /research/ak19/GS2bin_5Aug2013/collect/Demo-Section-Tagging/building
buildcol.pl> *** creating the info database and processing associated files
buildcol.pl> ArchivesInfPlugin: processing /research/ak19/GS2bin_5Aug2013/collect/Demo-Section-Tagging/archives/archiveinf-doc.gdb
buildcol.pl> GreenstoneXMLPlugin: processing b17mie.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b18ase.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b20cre.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b21wae.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing b22bue.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing ec158e.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing ec159e.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing ec160e.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing fb33fe.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing fb34fe.dir/doc.xml
buildcol.pl> GreenstoneXMLPlugin: processing wb34te.dir/doc.xml
buildcol.pl> *** outputting information for classifier: CL1
buildcol.pl> *** outputting information for classifier: CL2
buildcol.pl> *** outputting information for classifier: CL3
buildcol.pl> *** outputting information for classifier: CL4
buildcol.pl> *** outputting information for classifier: oai
buildcol.pl> *** creating auxiliary files
buildcol.pl> Copying rss-items.rdf file from archives to building (eventually to index)
buildcol.pl> Command complete.