Timestamp:
2020-06-14T03:40:21+12:00 (4 years ago)
Author:
ak19
Message:

All GS3 needs to convert docx files to basic html (no images) out of the box. 1. Adding in the Tika jar with its Apache 2.0 licence, a handcrafted notice derived from the license, and a Readme with links and examples of its use. 2. Updating model collectionConfig.xml with a pre-configured UnknownConverterPlugin to use the tika jar to convert docx to basic html. So all future GS3 collections will have this set up in the document pipeline and be ready for docx files. When the chance arises, need to set up a model coll for GS2 that uses the UnknownConverterPlugin in this way too.

Location:
main/trunk/greenstone2/ext/tika
Files:
5 added

Note: See TracChangeset for help on using the changeset viewer.