Ignore:
Timestamp:
2018-07-17T19:47:42+12:00 (6 years ago)
Author:
ak19
Message:

Our custom pdf-box class PDFToImagesAndText.java now takes two additional flags, textOnly and imagesOnly, which can be used to support paged_text and the original pagedimg_ output formats, besides pagedimgtxt_

File:
1 edited

Legend:

Unmodified
Added
Removed
  • gs2-extensions/pdf-box/trunk/GS_PDFBox_README.txt

    r32197 r32278  
    5353
    5454- grab the svn version of the Greenstone pdfbox extension
    55 - Then from the svn checked out pdfbox folder, run
    56     $pdfbox > javac -cp /path/to/pdfbox/java/lib/java/pdfbox-app.jar -d /path/to/pdfbox/java/build java/src/org/greenstone/pdfbox/PDFBoxToImagesAndText.java
    57 which will compile against the pdfbox-app.jar in the classpath (-cp_ and output the .class file into the directory denoted by -d
     55- Then from the svn checked out pdfbox (trunk/java) folder, run
     56    $ javac -cp `pwd`/lib/java/pdfbox-app.jar -d `pwd`/build src/org/greenstone/pdfbox/PDFBoxToImagesAndText.java
     57
     58which will compile our custom PDFBoxToImagesAndText.java file against the pdfbox-app.jar in the classpath (-cp) and output the .class file into the directory denoted by -d
    5859
    5960To run, that build folder needs to be on the classpath, besides pdfbox-app.jar itself. See PDFBoxConverter.pm
Note: See TracChangeset for help on using the changeset viewer.