Changeset 32965
- Timestamp:
- 2019-04-01T18:17:36+13:00 (5 years ago)
- Location:
- other-projects/is-sheet-music-encore/trunk/gen-corpus-ids
- Files:
-
- 2 edited
Legend:
- Unmodified
- Added
- Removed
-
other-projects/is-sheet-music-encore/trunk/gen-corpus-ids/HATHI-EXTRACT-FORMAT.sh
r32963 r32965 22 22 echo "... Done" 23 23 echo "" 24 25 echo "====" 26 echo " Next, extract entried that are Music Format, Public Domain and" 27 echo " NOT scanned by Google (so called 'open-open' files):" 28 echo " ./HATHI-EXTRACT-PD-NON-GOOGLE.sh" 29 echo "====" 30 -
other-projects/is-sheet-music-encore/trunk/gen-corpus-ids/HATHI-GET-TAB-DELIM-DUMP.sh
r32964 r32965 3 3 wget --verbose "https://www.hathitrust.org/filebrowser/download/287235" \ 4 4 -O "hathi_full_20190301.txt.gz" 5 6 echo "====" 7 echo " Next, extract Format (and a few related fields) so working with" 8 echo " smaller file size:" 9 echo " ./HATHI-EXTRACT-FORMAT.sh" 10 echo "===="
Note:
See TracChangeset
for help on using the changeset viewer.