source: main/trunk/model-sites-dev/heritage-nz/collect/pdf-reports/PDFS-MISSING-FROM-XLS.sh@ 33033

Last change on this file since 33033 was 33033, checked in by davidb, 5 years ago

Some cross-checking of the Excel spreadsheet and PDFs present, as derived from what is in the archives directory after running IMPORT.sh

  • Property svn:executable set to *
File size: 556 bytes
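#!/bin/bash

# Cross-check the PDFs found in archives/ against the Excel spreadsheet.
# Run from the collection directory after IMPORT.sh has populated archives/.

echo ""
echo "===="
echo "Listing 'archives/HASH*/doc.xml' as indicative of no metadata in spreadsheet:"
echo "===="

# -H keeps the "file:" prefix even if only one doc.xml matches the glob,
# so awk's field positions do not shift.
grep -FH SourceFile archives/HASH*/doc.xml \
  | sed 's/<[^>]\+>//g' \
  | awk '{print " " $2}'

echo ""

# Second path component (the HASH* directory) of every doc.xml that mentions UnknownPlugin.
also_unknown_hash=$(grep -FH UnknownPlugin archives/HASH*/doc.xml | awk -F/ '{print $2}')

echo "===="
echo "Listing those from above that also have no text, as indicated by presence of UnknownPlugin"
echo "===="

# Left unquoted on purpose: the list is split on whitespace, one directory per word.
for f in $also_unknown_hash ; do
  grep -F SourceFile "archives/$f/doc.xml" \
    | sed 's/<[^>]\+>//g'
done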
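To try the pipeline without a real collection, the sketch below builds a throwaway archives/ tree and runs the same grep, sed and awk steps over it. The doc.xml fragments, the directory names HASH0001.dir and HASH0002.dir, and the variables demo_dir, all_sources and unknown_dirs are made up for illustration; real doc.xml files contain many more elements, and the sketch assumes each Metadata element sits on its own indented line.

```shell
#!/bin/bash
# Hypothetical demo tree; nothing here comes from the real collection.
demo_dir=$(mktemp -d)
mkdir -p "$demo_dir/archives/HASH0001.dir" "$demo_dir/archives/HASH0002.dir"

# Made-up doc.xml fragments with one Metadata element per indented line.
cat > "$demo_dir/archives/HASH0001.dir/doc.xml" <<'EOF'
<Section>
  <Metadata name="SourceFile">report-a.pdf</Metadata>
  <Metadata name="Plugin">PDFPlugin</Metadata>
</Section>
EOF
cat > "$demo_dir/archives/HASH0002.dir/doc.xml" <<'EOF'
<Section>
  <Metadata name="SourceFile">report-b.pdf</Metadata>
  <Metadata name="Plugin">UnknownPlugin</Metadata>
</Section>
EOF

cd "$demo_dir" || exit 1

# grep -F is a fixed-string match (what fgrep does); -H forces the "file:" prefix.
# sed strips the XML tags ('*' is used instead of '\+' so non-GNU sed also works),
# leaving "archives/HASH...:  name.pdf", of which awk keeps the second field.
all_sources=$(grep -FH SourceFile archives/HASH*/doc.xml \
  | sed 's/<[^>]*>//g' \
  | awk '{print $2}')

# With '/' as the field separator, field 2 is the HASH* directory name.
unknown_dirs=$(grep -FH UnknownPlugin archives/HASH*/doc.xml | awk -F/ '{print $2}')

echo "$all_sources"
echo "$unknown_dirs"

# Clean up the throwaway tree.
cd - >/dev/null || exit 1
rm -rf "$demo_dir"
```

On this made-up tree the first echo prints report-a.pdf and report-b.pdf, and the second prints HASH0002.dir. A source filename containing spaces would be cut at the first space by the awk step, so the sketch only holds for space-free names.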