Changeset 34206


Timestamp: 2020-06-18T19:22:34+12:00
Author: ak19
Message: UnknownConverterPlugin configured to use Tika for doc processing
Location: main/trunk/model-sites-dev/opotiki/collect
Files: 9 edited

  • main/trunk/model-sites-dev/opotiki/collect/textdemo/etc/collectionConfig.xml

    r34136 r34206
    107 107               <plugin name="PDFv2Plugin">
    108 108                   <option name="-convert_to" value="paged_pretty_html"/>
        109 +             </plugin>
        110 +             <!-- Configuring an UnknownConverterPlugin for docx processing with Tika -->
        111 +             <plugin name="UnknownConverterPlugin">
        112 +               <option name="-exec_cmd" value="java -jar $GSDLHOME/ext/tika/tika-app-1.24.1.jar --html %%INPUT_FILE &gt; %%OUTPUT"/>
        113 +               <option name="-convert_to" value="html"/>
        114 +               <option name="-mime_type" value="application/vnd.openxmlformats-officedocument.wordprocessingml.document"/>
        115 +               <option name="-srcicon" value="icondocx"/>
        116 +               <option name="-process_extension" value="docx"/>
    109 117               </plugin>
    110 118               <plugin name="RTFPlugin"/>
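
    The -exec_cmd value added above is XML-escaped (&gt; stands for the shell redirect >) and carries two placeholders, %%INPUT_FILE and %%OUTPUT. The Python sketch below is not Greenstone code. It only illustrates, assuming both placeholders are replaced with the source and output paths, what command line the converter would end up running for a docx file. The helper names plugin_options and expand_exec_cmd, the shell quoting, and the output path tmp/out.html are hypothetical.

```python
import shlex
import xml.etree.ElementTree as ET

# The plugin block from this changeset, trimmed to the options used below.
PLUGIN_XML = """
<plugin name="UnknownConverterPlugin">
  <option name="-exec_cmd" value="java -jar $GSDLHOME/ext/tika/tika-app-1.24.1.jar --html %%INPUT_FILE &gt; %%OUTPUT"/>
  <option name="-convert_to" value="html"/>
  <option name="-process_extension" value="docx"/>
</plugin>
"""


def plugin_options(xml_text):
    """Return the plugin's options as a {name: value} dict.

    The XML parser decodes entities, so &gt; comes back as >.
    """
    root = ET.fromstring(xml_text)
    return {opt.get("name"): opt.get("value") for opt in root.findall("option")}


def expand_exec_cmd(template, input_file, output_file):
    """Fill in the two placeholders with shell-quoted paths.

    Assumption: this mimics the substitution for illustration only;
    the real plugin may quote paths differently.
    """
    return (template
            .replace("%%INPUT_FILE", shlex.quote(input_file))
            .replace("%%OUTPUT", shlex.quote(output_file)))


options = plugin_options(PLUGIN_XML)
command = expand_exec_cmd(options["-exec_cmd"],
                          "import/Tera te po pango.docx",  # hypothetical source path
                          "tmp/out.html")                  # hypothetical output path
print(command)
```

    Quoting the input path matters for this collection because file names such as "Tera te po pango.docx" contain spaces.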
  • main/trunk/model-sites-dev/opotiki/collect/waiatade/etc/collectionConfig.xml

    r34136 r34206
     14  14           <displayItem assigned="true" lang="en" name="description">A Collection of Waiata.
     15  15   Songs (waiata) are sung solo, in unison or at the octave.
     16     - Types of song include lullabies (oriori), love songs (waitata aroha) and laments (waiata tangi).
         16 + Types of song include lullabies (oriori), love songs (waiata aroha) and laments (waiata tangi).
     17  17   Traditionally all formal speeches are followed by a waiata sung by the speaker and their group of supporters.
     18  18   Some of the smaller wind instruments are also sung into, and the sound of the poi (raupo ball swung on the end of a flax cord) provides a rhythmic accompaniment to waiata poi.</displayItem>

     93  93                   <option name="-convert_to" value="paged_pretty_html"/>
     94  94               </plugin>
         95 +             <!-- Configuring an UnknownConverterPlugin for docx processing with Tika -->
         96 +             <plugin name="UnknownConverterPlugin">
         97 +               <option name="-exec_cmd" value="java -jar $GSDLHOME/ext/tika/tika-app-1.24.1.jar --html %%INPUT_FILE &gt; %%OUTPUT"/>
         98 +               <option name="-convert_to" value="html"/>
         99 +               <option name="-mime_type" value="application/vnd.openxmlformats-officedocument.wordprocessingml.document"/>
        100 +               <option name="-srcicon" value="icondocx"/>
        101 +               <option name="-process_extension" value="docx"/>
        102 +             </plugin>
     95 103               <plugin name="RTFPlugin"/>
     96 104               <plugin name="WordPlugin"/>
  • main/trunk/model-sites-dev/opotiki/collect/waiatade/etc/fail.log

    r34088 r34206
      1     - Tera te po pango.docx: WordPlugin failed to convert to html
      2     - 06\Tera te po pango.docx: no plugin could process this file
          1 + 06/Tera te po pango.docx: no plugin could recognise this file