Last change
on this file since 33017 was 33017, checked in by cpb16, 5 years ago |
Renamed Downloaders as .sh; Completed a metadata formater script; Creating a metadata extractor java program; Have found error in code for downloading records. Commandline thinks that some ids are directories, need to correct that
|
File size:
629 bytes
|
Rev | Line | |
---|
[33010] | 1 | INSTRUCTIONS
|
---|
| 2 | TEST CORPUS GENERATION
|
---|
| 3 | (java-gen-corpus/..):
|
---|
| 4 |
|
---|
| 5 | "make" has presets compiling and generating datasets.
|
---|
| 6 | See makefile for commands and details
|
---|
| 7 |
|
---|
[33017] | 8 | METADATA-FORMATER
|
---|
| 9 | "./metadata-formater"
|
---|
| 10 | Be sure to run this before processing the hathifiles
|
---|
[33010] | 11 |
|
---|
| 12 |
|
---|
| 13 | DOWNLOADING SET OF PNG'S
|
---|
[33014] | 14 | (COMPX520-DOWNLOADER-PNG.txt , COMPX520-RUN-PNG.sh , ddapiclient2-extended.pl
|
---|
| 15 | "source ./SETUP.bash"
|
---|
| 16 | "./COMPX520-DOWNLOADER-PNG.txt <java-gen-corpus/filename> <page_num>"
|
---|
[33010] | 17 |
|
---|
| 18 |
|
---|
| 19 |
|
---|
| 20 |
|
---|
| 21 | DOWNLOADING SET OF ZIP'S
|
---|
| 22 | (COMPX520-DOWNLOADER-ZIP.txt , COMPX520-RUN-XIP.sh , ddapiclient2-extended-ZIP.pl):
|
---|
| 23 |
|
---|
| 24 | "./COMPX520-DOWNLOADER-ZIP.txt <java-gen-corpus/filename>"
|
---|
| 25 |
|
---|
| 26 |
|
---|
| 27 |
|
---|
| 28 |
|
---|
| 29 |
|
---|
| 30 |
|
---|
| 31 |
|
---|
| 32 |
|
---|
Note:
See
TracBrowser
for help on using the repository browser.