root/other-projects

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Rev Chgset Date Author Log Message
(edit) @30940 [30940] 3 years davidb Change to using URI not fileIn directly
(edit) @30939 [30939] 3 years davidb Minor tweaks
(edit) @30938 [30938] 3 years davidb Experiment with using Hadoop's FileSystem? class for local  file:// access
(edit) @30937 [30937] 3 years davidb Expanded set of ClusterFileIO methods
(edit) @30936 [30936] 3 years davidb Refinement of Spark Monitor echo statements
(edit) @30935 [30935] 3 years davidb Fixed variable name typo, plus added a couple of 'sleep' pauses of 1 sec
(edit) @30934 [30934] 3 years davidb Providing json-filelist now a compulsory argument, rather than an option
(edit) @30933 [30933] 3 years davidb More careful parsing of file prefix
(edit) @30932 [30932] 3 years davidb Support both  file:// and  hdfs://
(edit) @30931 [30931] 3 years davidb Version that runs using  fil:// tested
(edit) @30930 [30930] 3 years davidb Expansion of useful alias commands for Hadoop and Spark
(edit) @30929 [30929] 3 years davidb Tweaks made while testing the script
(edit) @30928 [30928] 3 years davidb Forgot to set json_filelist
(edit) @30927 [30927] 3 years davidb Fixed silly typo in stdout redirect
(edit) @30926 [30926] 3 years davidb Restructuring of RUN scripts to be more flexible
(edit) @30925 [30925] 3 years davidb Improved instrutions
(edit) @30924 [30924] 3 years davidb Tidy up of code. Removed commented out code
(edit) @30923 [30923] 3 years davidb Rough cut version that reads in each JSON file over HDFS
(edit) @30922 [30922] 3 years davidb Additional rough-cut notes
(edit) @30921 [30921] 3 years davidb Code change to read in JSON file over HDFS
(edit) @30919 [30919] 3 years davidb More consistent naming of folders used
(edit) @30918 [30918] 3 years davidb More flexible command-line args
(edit) @30917 [30917] 3 years davidb Changes resulting from a fresh run at provisioning, which yielded the …
(edit) @30916 [30916] 3 years davidb Some additional details -- note form
(edit) @30915 [30915] 3 years davidb Initial cut at instructions to follow to get code set up and running
(edit) @30914 [30914] 3 years davidb Tidy up of setup description
(edit) @30913 [30913] 3 years davidb Renaming to better represent what the cluster is designed for
(edit) @30912 [30912] 3 years davidb Changed to Unix style line-endings
(edit) @30911 [30911] 3 years davidb Changed name of input directory
(edit) @30910 [30910] 3 years davidb Additional finesse added in as a result of further testing on Vagrant …
(edit) @30909 [30909] 3 years davidb Additional finesse added in as a result of further testing on Vagrant …
(edit) @30908 [30908] 3 years davidb Additional finesse added in as a result of further testing on Vagrant …
(edit) @30907 [30907] 3 years davidb Name change to reflect need for 'bash' not 'sh'
(edit) @30906 [30906] 3 years davidb Bash version of BAT script
(edit) @30905 [30905] 3 years davidb Additional resources
(edit) @30904 [30904] 3 years davidb Extra resource/links added
(edit) @30903 [30903] 3 years davidb Vagrant provisioning files for a 4-node Hadoop cluster. See README.txt …
(edit) @30902 [30902] 3 years davidb Details of what packages are needed
(edit) @30901 [30901] 3 years davidb Template setup file
(edit) @30900 [30900] 3 years davidb For support Java packages
(edit) @30899 [30899] 3 years davidb Files for compilation using Eclipse
(edit) @30898 [30898] 3 years davidb Scripts for downloading sample JSON data from public domain extracted …
(edit) @30897 [30897] 3 years davidb Sub-project for converted HTRC Extract Feature dataset into a form that …
(edit) @30890 [30890] 3 years davidb folder to group together hathitrust related projects
(edit) @30846 [30846] 3 years ak19 Wrong module in script name.
(edit) @30818 [30818] 3 years ak19 Script needs to get rid of another intermediate file.
(edit) @30722 [30722] 3 years ak19 Remove repeated empty lines, leaving just a single blank line between …
(edit) @30720 [30720] 3 years ak19 Getting the nightly gti email to be sent again on the new machine after …
(edit) @30652 [30652] 3 years ak19 Committing outstanding files for diffcol supporting jdb for GS3 diffing. …
(edit) @30613 [30613] 3 years ak19 Don't send nightly email messages about updates to the test language …
(edit) @30611 [30611] 3 years ak19 Modified version of remove_extra_lines script to handle the Updated and …
(edit) @30605 [30605] 3 years ak19 Committing the changes necessary to get the GTI crons to work on the new …
(edit) @30594 [30594] 3 years kjdon modifying the input and results areas to get it looking the same in …
(edit) @30590 [30590] 3 years kjdon this change was made on puka, and when I tried this on commdev, it didn't …
(edit) @30581 [30581] 3 years ak19 GTI related changes to add gs3 demo collection config files' displayitems …
(edit) @30425 [30425] 3 years davidb Save metadata as JSON file. Create sub-directories to spreadout the …
(edit) @30424 [30424] 3 years ak19 Dr Bainbridge improved the code so that the script always gets the latest …
(edit) @30422 [30422] 3 years davidb Tidier treatment of 'bin' and 'audio' directories from an SVN point of …
(edit) @30421 [30421] 3 years davidb Removed from SVN tree
(edit) @30420 [30420] 3 years davidb svn:ignore 'audio' and 'bin'
(edit) @30419 [30419] 3 years davidb Support for JSON added
(edit) @30418 [30418] 3 years davidb Code updated to work through a sequence of pages for one artist
(edit) @30417 [30417] 3 years davidb Switched order of stdout and stderr redirects
(edit) @30416 [30416] 3 years davidb Initial cut at code for scraping music excerpts from AMC site
(edit) @30415 [30415] 3 years davidb Main trunk.
(edit) @30414 [30414] 3 years davidb Top-level folder for Western Sydney University led Music Affect …
(edit) @30410 [30410] 3 years ak19 Finally got the upload of the binary to turn up.
(edit) @30408 [30408] 3 years ak19 Fix (from Puka) for the upload destination machine to work out the …
(edit) @30407 [30407] 3 years ak19 Script for clearing old binaries and logs from the machine where they …
(edit) @30406 [30406] 3 years ak19 Minor change. rke log for expeditee should have the os suffix on all OS.
(edit) @30405 [30405] 3 years ak19 Swap month and day in date string for generated nightly binary, to get …
(edit) @30404 [30404] 3 years ak19 Cosmetic change: more helpful comment for configuring rke-setup on mac.
(edit) @30399 [30399] 3 years ak19 Minor changes to mirror the way the expeditee nightly generation scripts …
(edit) @30398 [30398] 3 years ak19 Parallel changes for the linux machines to use variables in …
(edit) @30397 [30397] 3 years ak19 Fixed values in environment setup file, and used as variables in …
(edit) @30396 [30396] 3 years ak19 Uploading expeditee nightly on linux 32 bit works.
(edit) @30395 [30395] 3 years ak19 Got uploading to work at last.
(edit) @30393 [30393] 3 years ak19 Bat file also needs to delete stale expeditee snapshot before regenerating …
(edit) @30392 [30392] 3 years ak19 Deleting expeditee snapshot before regenerating.
(edit) @30391 [30391] 3 years sjm84 Additional setup.sh.in file for linux-lsb VM environments. To be used …
(edit) @30390 [30390] 3 years sjm84 New expeditee SVN repository URL.
(edit) @30387 [30387] 3 years ak19 First step in creating nightly scripts for generating expeditee nightly …
(edit) @30137 [30137] 4 years jmt12 Obsolete - merged with documentation/wiki
(edit) @30136 [30136] 4 years jmt12 Merged with Greenstone Dokuwiki files in documentation/wiki - sending …
(edit) @30126 [30126] 4 years davidb Some svn:ignore values to cover oof VS automatically generated folders
(edit) @30125 [30125] 4 years davidb VS seems to want this icon in the top level
(edit) @30124 [30124] 4 years davidb branded icon for project
(edit) @30123 [30123] 4 years davidb Refactoring to use better playing-in-the-street (PITS) 'branding'
(edit) @30122 [30122] 4 years davidb Deleting, as it looks like it was mistakenly committed within this folder, …
(edit) @30121 [30121] 4 years davidb Refactoring of files names used
(edit) @30075 [30075] 4 years davidb Initial cut at a setup file
(edit) @30074 [30074] 4 years davidb To be used in the Java code that downloads PDFs using APIs such as Digital …
(edit) @30073 [30073] 4 years davidb GPL v3 license
(edit) @30072 [30072] 4 years davidb Folder for code that does the downloading work
(edit) @30071 [30071] 4 years davidb 'trunk' for Top level folder for the Institutional Repository harveting …
(edit) @30070 [30070] 4 years davidb Top level folder for the Institutional Repository harveting metadata …
(edit) @30069 [30069] 4 years ak19 Jeremy jts1 found that the Payload archive file is just a tar. Though …
(edit) @30064 [30064] 4 years davidb Useful files for compiling and running the code. Inlucde 'ant' for …
(edit) @30063 [30063] 4 years davidb Replacement of NetBeans? based ant compile files with much simpler …
(edit) @30062 [30062] 4 years davidb Removal/Tidy-up of debug statements
Note: See TracRevisionLog for help on using the revision log.