Last change
on this file since 34385 was 33044, checked in by cpb16, 5 years ago |
Streamlined numpages checking and random selection. Corrected COMPX-RUN-X.sh to download all files (naming error). NEXT: Clean up corpus generation and move on to the next phase
|
-
Property svn:executable
set to
*
|
File size:
360 bytes
|
Line | |
---|
1 | #!/bin/bash
|
---|
2 |
|
---|
3 | if [ $# != 1 ] ; then
|
---|
4 | echo "Usage: ./COMPX520-RUN-META.sh doc_id" 1>&2
|
---|
5 | exit 1
|
---|
6 | fi
|
---|
7 |
|
---|
8 | doc_id=$1
|
---|
9 | doc_id_file=`echo $doc_id | sed 's/:/+/' | sed 's/\//=/g'`
|
---|
10 |
|
---|
11 | output_file="java-gen-corpus/download-meta/$doc_id_file-META.txt"
|
---|
12 | echo "Retrieving doc-id-page: $doc_id -> $output_file"
|
---|
13 | echo ""
|
---|
14 |
|
---|
15 | ./dapiclient2-extended-META.pl "$doc_id" > "$output_file"
|
---|
16 |
|
---|
17 |
|
---|
Note:
See
TracBrowser
for help on using the repository browser.