Changeset 34206

Timestamp:
18.06.2020 19:22:34
Author:
ak19
Message:

UnknownConverterPlugin configured to use Tika for doc processing

Location:
main/trunk/model-sites-dev/opotiki/collect
Files:
9 modified

  • main/trunk/model-sites-dev/opotiki/collect/textdemo/etc/collectionConfig.xml

    r34136 r34206  
    @@ -107,4 +107,12 @@
                 <plugin name="PDFv2Plugin">
                     <option name="-convert_to" value="paged_pretty_html"/>
    +            </plugin>
    +            <!-- Configuring an UnknownConverterPlugin for docx processing with Tika -->
    +            <plugin name="UnknownConverterPlugin">
    +              <option name="-exec_cmd" value="java -jar $GSDLHOME/ext/tika/tika-app-1.24.1.jar --html %%INPUT_FILE &gt; %%OUTPUT"/>
    +              <option name="-convert_to" value="html"/>
    +              <option name="-mime_type" value="application/vnd.openxmlformats-officedocument.wordprocessingml.document"/>
    +              <option name="-srcicon" value="icondocx"/>
    +              <option name="-process_extension" value="docx"/>
                 </plugin>
                 <plugin name="RTFPlugin"/>
  • main/trunk/model-sites-dev/opotiki/collect/waiatade/etc/collectionConfig.xml

    r34136 r34206  
    @@ -14,5 +14,5 @@
             <displayItem assigned="true" lang="en" name="description">A Collection of Waiata.
     Songs (waiata) are sung solo, in unison or at the octave.
    -Types of song include lullabies (oriori), love songs (waitata aroha) and laments (waiata tangi).
    +Types of song include lullabies (oriori), love songs (waiata aroha) and laments (waiata tangi).
     Traditionally all formal speeches are followed by a waiata sung by the speaker and their group of supporters.
     Some of the smaller wind instruments are also sung into, and the sound of the poi (raupo ball swung on the end of a flax cord) provides a rhythmic accompaniment to waiata poi.</displayItem>
     
    @@ -93,4 +93,12 @@
                     <option name="-convert_to" value="paged_pretty_html"/>
                 </plugin>
    +            <!-- Configuring an UnknownConverterPlugin for docx processing with Tika -->
    +            <plugin name="UnknownConverterPlugin">
    +              <option name="-exec_cmd" value="java -jar $GSDLHOME/ext/tika/tika-app-1.24.1.jar --html %%INPUT_FILE &gt; %%OUTPUT"/>
    +              <option name="-convert_to" value="html"/>
    +              <option name="-mime_type" value="application/vnd.openxmlformats-officedocument.wordprocessingml.document"/>
    +              <option name="-srcicon" value="icondocx"/>
    +              <option name="-process_extension" value="docx"/>
    +            </plugin>
                 <plugin name="RTFPlugin"/>
                 <plugin name="WordPlugin"/>
  • main/trunk/model-sites-dev/opotiki/collect/waiatade/etc/fail.log

    r34088 r34206  
    @@ -1,2 +1 @@
    -Tera te po pango.docx: WordPlugin failed to convert to html
    -06\Tera te po pango.docx: no plugin could process this file
    +06/Tera te po pango.docx: no plugin could recognise this file
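
Note on the `-exec_cmd` value added in this changeset: it carries two placeholders, `%%INPUT_FILE` and `%%OUTPUT`, and `&gt;` is simply the XML escape for the shell redirect `>`. The sketch below illustrates what the command might look like once those placeholders are filled in for the docx named in fail.log. It is not Greenstone's own code: the helper `fill_exec_cmd`, the output path `tmp/Tera te po pango.html`, and the use of shell quoting are all assumptions made for illustration, and the plugin may substitute or quote paths differently.

```python
import shlex

# Placeholder names exactly as they appear in the -exec_cmd option.
INPUT_PLACEHOLDER = "%%INPUT_FILE"
OUTPUT_PLACEHOLDER = "%%OUTPUT"


def fill_exec_cmd(template: str, input_file: str, output_file: str) -> str:
    """Replace both placeholders with shell-quoted paths.

    Hypothetical helper for illustration only; the real substitution
    happens inside the plugin and may behave differently.
    """
    return (
        template
        .replace(INPUT_PLACEHOLDER, shlex.quote(input_file))
        .replace(OUTPUT_PLACEHOLDER, shlex.quote(output_file))
    )


# The option value from collectionConfig.xml, with &gt; decoded to >.
template = (
    "java -jar $GSDLHOME/ext/tika/tika-app-1.24.1.jar "
    "--html %%INPUT_FILE > %%OUTPUT"
)

# The output path is an assumed example, not taken from the changeset.
command = fill_exec_cmd(
    template,
    "06/Tera te po pango.docx",
    "tmp/Tera te po pango.html",
)
print(command)
```

Because the file name contains spaces, printing the filled-in command and running it by hand in a shell with `GSDLHOME` set is one way to check that Tika itself can convert the document, separately from the collection build.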