source: other-projects/is-sheet-music-encore/trunk/gen-corpus-ids/HATHI-EXTRACT-FORMAT.sh@ 32962

Last change on this file since 32962 was 32962, checked in by davidb, 3 years ago

Scripts for processing the tab-delimited HathiTrust metadata files

  • Property svn:executable set to *
File size: 130 bytes
Line 
1#!/bin/bash
2
3input=${1:-'hathi_full_20190301.txt.gz'}
4
5zcat "$input" \
6 | awk -F '\t' '{print $1 "\t" $3 "\t" $20 "\t" $24} '
Note: See TracBrowser for help on using the repository browser.