source: gs2-extensions/video-and-audio/trunk/src/opt/Terrier/README.txt@ 26956

Last change on this file since 26956 was 26956, checked in by jmt12, 11 years ago

minor update - but still can't remember why I changed SimpleFileCollection.java... maybe it will come to me

===== Image and Video support for Terrier =====

==== Installation Instructions ====

1. Copy the new document plugins into Terrier:

cd <greenstone>/ext/video-and-audio/opt/Terrier/
cp *Document.java <terrier>/src/core/org/terrier/indexing/

2. Recompile:

cd <terrier>
ant

3. Edit the configuration to include the new document plugins. Your usage may vary depending on the type of Terrier installation you are setting up, so as an example I'll show the configuration for a SimpleFileIndexer installation (available as part of the parallel-building extension):

cd <terrier>/etc/
emacs terrier.properties
###################################################################
#SimpleFileCollection specific properties
###################################################################
trec.collection.class=SimpleFileCollection
##use this to define parsers for known file extensions
indexing.simplefilecollection.extensionsparsers=txt:FileDocument, pdf:PDFDocument, html:TaggedDocument,htm:TaggedDocument,xhtml:TaggedDocument,xml:TaggedDocument, jpg:ImageDocument, ts:VideoDocument
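
As a quick sanity check on the step 3 edit, the sketch below looks up which parser class is mapped to a given file extension. The function name parser_for_extension is hypothetical and not part of Terrier or Greenstone; the only name taken from the configuration above is the indexing.simplefilecollection.extensionsparsers property. It assumes the property sits on a single, uncommented line, as in the example.

```shell
# Hypothetical helper (not shipped with Terrier): print the parser class
# mapped to a file extension in a terrier.properties file.
#   $1 = path to terrier.properties
#   $2 = file extension without the dot, e.g. jpg
parser_for_extension() {
    # Take the value of the property, split the comma-separated list into
    # one "ext:Parser" pair per line, trim surrounding whitespace, then
    # print the parser whose extension matches exactly.
    grep '^indexing\.simplefilecollection\.extensionsparsers=' "$1" \
        | cut -d= -f2- \
        | tr ',' '\n' \
        | sed 's/^[[:space:]]*//; s/[[:space:]]*$//' \
        | awk -F: -v ext="$2" '$1 == ext { print $2 }'
}
```

For example, running parser_for_extension <terrier>/etc/terrier.properties jpg should print ImageDocument once the property has been edited as shown, and print nothing for an extension that has no mapping.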

4. Overwrite the simple results JSP file in order to support images and video:

cp <gsdl>/ext/video-and-audio/opt/Terrier/results.jsp <terrier>/src/webapps/simple/results.jsp

5. Build your index including some JPG and TS files.
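
Before building the index in step 5, it can help to confirm that the collection directory actually contains files the new plugins will pick up. This is a minimal sketch, assuming the lowercase jpg and ts extensions from the step 3 mapping; count_media_files is a hypothetical helper, not a Terrier command. Symlinked files are not counted, because find does not follow symbolic links by default.

```shell
# Hypothetical helper: count regular files ending in .jpg or .ts
# anywhere under the directory given as $1.
count_media_files() {
    # wc -l pads its output with spaces on some systems, so strip them.
    find "$1" -type f \( -name '*.jpg' -o -name '*.ts' \) | wc -l | tr -d ' '
}
```

A result of 0 suggests the index will contain no image or video documents, whatever the configuration says.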

6. If you plan to use Terrier's built-in Jetty server, then you may need to replace the launching script, otherwise symlinked files won't work:

cp <gsdl>/ext/video-and-audio/opt/Terrier/http_terrier.sh <terrier>/bin/http_terrier.sh

7. Test the index in your preferred tool, and marvel at the beautiful image and video metadata (or thumbnails, if you are in a web interface).