source: gs2-extensions/tesseract/trunk/README.txt@ 34181

Last change on this file since 34181 was 34181, checked in by ak19, 4 years ago

Committing the cut-down, binaries-only tesseract tarball for x64 linux and README with instructions on how this was created. Hopefully if it works, these will become the instructions for how to recreate such a tarball in future too.

File size: 1.7 KB
Line 
1-------------------------------------------------
2COMPILING TESSERACT GS2-EXTENSION
3& CREATING THE CUT-DOWN BINARY-ONLY TARBALL
4-------------------------------------------------
5
6To compile the Tesseract gs2-extension and then create the "binary" tarball needed to run
7Tesseract, we follow an equivalent version of the instructions for the imagemagick gs2-extension
8at http://trac.greenstone.org/browser/gs2-extensions/imagemagick/trunk/README
9
101. Find a location on your machine
11
122. Check out the tesseract extension from gs2-extensions
13 svn co http://trac.greenstone.org/browser/gs2-extensions/tesseract/trunk tesseract
14
153. Compile it all up (tesseract and dependencies):
16 cd tesseract
17 ./CASCADE-MAKE.sh
18
194. Open a fresh terminal and check that the tesseract now installed in src/linux/bin works:
20
21 cd src
22 source ./setup.bash
23
24This should have set up env vars like GEXTTESS, GEXTTESS_INSTALLED, and TESSDATA_PREFIX
25which Tesseract needs to have set
26
27 tesseract --list-langs
28 tesseract sample.tif out
29
30OCRs sample.tif and generates out.txt from it.
31
32 cat out.txt
33
345. If successful, create a folder at the same level as src alled tesseract
35 cd src
36 cd ..
37 mkdir tesseract
38
39COPY the setup files and MOVE the installed folder (src/linux) into there:
40 cp src/setup.ba* tesseract/.
41 mv src/linux tesseract/.
42
436. Create a tarball of the cut down tesseract folder named tesseract-<os>-<arch>.tar.gz:
44 tar -cvzf tesseract-linux-x64.tar.gz tesseract
45
467. (Add/SVN up and) commit that to svn:
47 svn up
48 svn add tesseract-linux-x64.tar.gz
49 (or svn diff tesseract-linux-x64.tar.gz if there was an earlier version to confirm modified)
50 svn commit -m "MESSAGE" tesseract-linux-x64.tar.gz
51
52
53-------------------------------------------------
Note: See TracBrowser for help on using the repository browser.