Last change
on this file since 26186 was 26186, checked in by jmt12, 12 years ago |
Adding in (optional) support for video and image processing in DSpace and Terrier. These kinda belong here as they depend on the video-and-audio support (like MediaInfo, HandbrakeCLI, and Hive2) to work
|
File size:
1.3 KB
|
Line | |
---|
1 | ===== Image and Video support for Terrier =====
|
---|
2 |
|
---|
3 | ==== Installation Instructions ====
|
---|
4 |
|
---|
5 | 1. Copy the two document plugins into Terrier:
|
---|
6 |
|
---|
7 | cd <greenstone>/ext/video-and-audio/opt/Terrier/
|
---|
8 | cp *Document.java <terrier>/src/core/org/terrier/indexing/
|
---|
9 |
|
---|
10 | 2. Recompile:
|
---|
11 |
|
---|
12 | cd <terrier>
|
---|
13 | ant
|
---|
14 |
|
---|
15 | 3. Edit the configuration to include the new document plugins. Your usage may vary depending on what type of Terrier installation you are setting up, but I'll show the example of the configuration for a SimpleFileIndexer installation - available as part of the parallel-building extension:
|
---|
16 |
|
---|
17 | cd <terrier>/etc/
|
---|
18 | emacs terrier.properties
|
---|
19 | ###################################################################
|
---|
20 | #SimpleFileCollection specific properties
|
---|
21 | ###################################################################
|
---|
22 | trec.collection.class=SimpleFileCollection
|
---|
23 | ##use this to define parsers for know file extensions
|
---|
24 | indexing.simplefilecollection.extensionsparsers=txt:FileDocument, pdf:PDFDocument, html:TaggedDocument,htm:TaggedDocument,xhtml:TaggedDocument,xml:TaggedDocument, jpg:ImageDocument, ts:VideoDocument
|
---|
25 |
|
---|
26 | 4. Build your index including some JPG and TS files.
|
---|
27 |
|
---|
28 | 5. Test the index in your preferred tool, and marvel at the beautiful image and video metadata (or thumbnails, if you are in a web interface)
|
---|
Note:
See
TracBrowser
for help on using the repository browser.