Last change
on this file since 33044 was 33044, checked in by cpb16, 5 years ago |
Streamlined numpages checking and random selection. Corrected COMPX-RUN-X.sh to download all files (naming error). NEXT: Clean up corpus generation and move on to the next phase
|
-
Property svn:executable
set to
*
|
File size:
360 bytes
|
Rev | Line | |
---|
[33014] | 1 | #!/bin/bash
|
---|
| 2 |
|
---|
| 3 | if [ $# != 1 ] ; then
|
---|
| 4 | echo "Usage: ./COMPX520-RUN-META.sh doc_id" 1>&2
|
---|
| 5 | exit 1
|
---|
| 6 | fi
|
---|
| 7 |
|
---|
| 8 | doc_id=$1
|
---|
[33044] | 9 | doc_id_file=`echo $doc_id | sed 's/:/+/' | sed 's/\//=/g'`
|
---|
[33014] | 10 |
|
---|
[33044] | 11 | output_file="java-gen-corpus/download-meta/$doc_id_file-META.txt"
|
---|
[33014] | 12 | echo "Retrieving doc-id-page: $doc_id -> $output_file"
|
---|
| 13 | echo ""
|
---|
| 14 |
|
---|
| 15 | ./dapiclient2-extended-META.pl "$doc_id" > "$output_file"
|
---|
| 16 |
|
---|
| 17 |
|
---|
Note:
See
TracBrowser
for help on using the repository browser.