Scripts for downloading sample JSON data from the public domain extracted feature set, and for some initial processing of that data using Apache Spark.

rsync -pav --progress --files-from=pd-file-listing-step1000.txt json-files/.
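
The name pd-file-listing-step1000.txt suggests a listing sampled at every 1000th entry of a larger file listing. How that file was produced is not stated here, so the following is only a sketch of one way such a sample could be built. The function name sample_listing and the input file name pd-file-listing.txt are hypothetical and do not come from this repository.

```shell
# Hypothetical helper: print every Nth line of a file listing, starting
# with the first line. Not necessarily how the sampled listing was made.
sample_listing() {
  step="$1"    # sampling interval, e.g. 1000
  input="$2"   # full listing, one relative path per line (assumed format)
  # Keeps lines 1, 1+step, 1+2*step, ...
  awk -v step="$step" '(NR - 1) % step == 0' "$input"
}

# Example usage (input file name is an assumption):
# sample_listing 1000 pd-file-listing.txt > pd-file-listing-step1000.txt
```

The resulting file can then be passed to rsync through --files-from, which reads one path per line relative to the source directory.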
