Show
Ignore:
Timestamp:
02.05.2019 14:51:03 (6 months ago)
Author:
cpb16
Message:

Streamlined numpages checking and random selection. Corrected COMPX-RUN-X.sh to download all files (naming error). NEXT: Clean up corpus generation and move on to the next phase

Files:
1 modified

Legend:

Unmodified
Added
Removed
  • other-projects/is-sheet-music-encore/trunk/COMPX520-RUN-META.sh

    r33017 r33044  
    77 
    88doc_id=$1 
     9doc_id_file=`echo $doc_id | sed 's/:/+/' | sed 's/\//=/g'` 
    910 
    10 output_file="java-gen-corpus/download-meta/$doc_id-META.txt" 
     11output_file="java-gen-corpus/download-meta/$doc_id_file-META.txt" 
    1112echo "Retrieving doc-id-page: $doc_id -> $output_file" 
    1213echo ""