Last change
on this file since 34190 was 34190, checked in by ak19, 4 years ago |
- The tessdata folder was being created when compiling tesseract, and needn't be created and populated manually (except for the lang files), so there's less work for CASCADE-MAKE/TESSERACT.sh to do. However, the tessdata folder was being created in the linux/share folder. 'share' is probably a place where people expect tesseract's tessdata to be by default, so am updating the setup scripts to work with that, as I've donw with CASCADE-MAKE/TESSERACT.sh. 2. Adding useful instructions for users on getting more OCR language scripts' support in new file GETTING-OCR-SUPPORT-FOR-MORE-LANGS.txt, now included in the tesseract binary tarball too. Adjusted the README for us. 3. Removing the sample.jpg, converted from sample.tif which I'd downloaded from online and for which I don't know the copyright to. Replacing with sample.tif, a 96 DPI TIF file at 1870x2420 resolution produced from the first page of pdf05-notext.pdf by www.sejda.com/pdf-to-jpg. Moreover, this sample file contains lots of text, in 2 columns, not just 4 words like the original sample file. Good for testing a tesseract built from CASCADE-MAKE on. Also including the pdf05-notext-ocr-with-tikaTesseract.pdf istelf from the tutorial sample files, but only Tika with Tesseract can work on PDFs and not Tesseract by itself, indicated in the filename.
|
-
Property svn:executable
set to
*
|
File size:
1.6 KB
|
Line | |
---|
1 | extdesc="the Tesseract OCR support library extension"
|
---|
2 |
|
---|
3 | full_setup=`pwd`/${BASH_SOURCE}
|
---|
4 | fulldir=${full_setup%/*}
|
---|
5 | fulldir=${fulldir%/.}
|
---|
6 |
|
---|
7 | if [ "x$GSDLHOME" = "x" ] ; then
|
---|
8 | if [ -e ../../setup.bash ] ; then
|
---|
9 | export gsopt_noexts=1 && pushd ../.. && source ./setup.bash && popd
|
---|
10 | export gsopt_noexts=
|
---|
11 | elif [ -e ../../gs3-setup.sh ] ; then
|
---|
12 | export gsopt_noexts=1 && pushd ../.. && source ./gs3-setup.sh && popd
|
---|
13 | export gsopt_noexts=
|
---|
14 | fi
|
---|
15 | fi
|
---|
16 |
|
---|
17 | if [ "x$GEXTTESS" = "x" ] ; then
|
---|
18 | export GEXTTESS=$fulldir
|
---|
19 | export GEXTTESS_INSTALLED=$GEXTTESS/$GSDLOS
|
---|
20 |
|
---|
21 | # tesseract needs the TESSDATA_PREFIX env var set to the languages folder (tessdata)
|
---|
22 | export TESSDATA_PREFIX=$GEXTTESS_INSTALLED/share/tessdata
|
---|
23 |
|
---|
24 | export PATH=$GEXTTESS_INSTALLED/bin:$PATH
|
---|
25 |
|
---|
26 | # if [ "x$LD_LIBRARY_PATH" = "x" ] ; then
|
---|
27 | # export LD_LIBRARY_PATH=$GEXTTESS_INSTALLED/lib
|
---|
28 | # else
|
---|
29 | # export LD_LIBRARY_PATH=$GEXTTESS_INSTALLED/lib:$LD_LIBRARY_PATH
|
---|
30 | # fi
|
---|
31 | # if [ "x$DYLD_LIBRARY_PATH" = "x" ] ; then
|
---|
32 | # export DYLD_LIBRARY_PATH=$GEXTTESS_INSTALLED/lib
|
---|
33 | # else
|
---|
34 | # export DYLD_LIBRARY_PATH=$GEXTTESS_INSTALLED/lib:$DYLD_LIBRARY_PATH
|
---|
35 | # fi
|
---|
36 | #
|
---|
37 | # if [ -e "devel-srcpack" ] ; then
|
---|
38 | # export LD_LIBRARY_PATH=$GEXTTESS_INSTALLED/devel/lib:$LD_LIBRARY_PATH
|
---|
39 | # export DYLD_LIBRARY_PATH=$GEXTTESS_INSTALLED/devel/lib:$DYLD_LIBRARY_PATH
|
---|
40 | # fi
|
---|
41 |
|
---|
42 |
|
---|
43 | extdir=${GEXTTESS##*/}
|
---|
44 |
|
---|
45 | if [ "x$GSDLEXTS" = "x" ] ; then
|
---|
46 | export GSDLEXTS=$extdir
|
---|
47 | else
|
---|
48 | export GSDLEXTS=$GSDLEXTS:$extdir
|
---|
49 | fi
|
---|
50 |
|
---|
51 |
|
---|
52 | echo "+Your environment is now setup for $extdesc"
|
---|
53 | else
|
---|
54 | echo "+Your environment is already setup for $extdesc"
|
---|
55 | fi
|
---|
Note:
See
TracBrowser
for help on using the repository browser.