greenstone@cs.waikato.ac.nz true Gutenberg XML Texts <p>This collection is designed to demonstrate some of the flexibility of the Greenstone 3 architecture. Collections are not restricted to those built using the standard building process, but rather can be anything you want, as long as suitable services are provided.</p> <p> This collection contains 4 documents obtained from the <a href="http://gutenberg.net/">Gutenberg project</a>. Project Gutenberg documents are marked up in XML, and come with DTDs and CSS stylesheets. The collection has not been built using either Greenstone2 or Greenstone3 style building. Instead, simple java programs using the Lucene API have been created to do the indexing, and document retrieval is simply done from the original (actually slightly modified) XML documents. A description of the building process can be found in the <a href="sites/localsite/collect/gberg/README">README</a> file for this collection.</p> <p> This collection has a different appearance to standard collections, accomplished through the use of custom XSLT files, found in the transform directory of the collection. The 'about' page for the collection shows a list of all the documents. This is achieved by using a new about.xsl file. Document display is quite different for this collection: the display can be either table of contents style, or text style. Two new xslts were written to handle the two display types. A new document action also had to be created (XMLDocumentAction), and the interface config file was altered to map the xd action code to this new Action class.</p> <p> &lt;action name='xd' class='XMLDocumentAction'&gt;<br/> &nbsp;&nbsp;&lt;subaction name='toc' xslt='document-toc.xsl'/&gt;<br/> &nbsp;&nbsp;&lt;subaction name='text' xslt='document-text.xsl'/&gt;<br/> &lt;/action&gt;</p> <p>The collection uses a generic Greenstone Lucene search service, and a specific retrieval service class, GutBookRetrieve. This extends the generic XMLRetrieve serviceRack to handle metadata extraction for the Gutenberg documents.</p> <p>The collection configuration file can be seen <a href="sites/localsite/collect/gberg/etc/collectionConfig.xml">here</a>, and the building configuration file can be seen <a href="sites/localsite/collect/gberg/index/buildConfig.xml">here</a>.</p> (section) from On the Origin of Species Charles Darwin The Iliad Homer An Occurrence At Owl Creek Bridge Ambrose Bierce Tales of the Fish Patrol Jack London