root/gs3-extensions

Rev Chgset Date Author Log Message
@34427 [34427] 3 weeks davidb Brought across from Essentia source, and prepped for use from the …
@34426 [34426] 3 weeks davidb Ignore downloaded zip and unzipped dir
@34425 [34425] 3 weeks davidb More robust version; takes into account dir change from 'jar' to 'jars'
@34424 [34424] 3 weeks davidb Revised name for directory
@34423 [34423] 3 weeks davidb First cut at getting set up with Weka within Mars extension
@34420 [34420] 3 weeks davidb Grab Weka via wget
@34419 [34419] 3 weeks davidb Version of script where an Essentia profile is also specified
@34411 [34411] 6 weeks davidb Inclusion of HPCP calc
@34410 [34410] 6 weeks davidb Rough cut at something along similar lines to essestia-hpcp.py but …
@34409 [34409] 6 weeks davidb Code tidy-up
@34408 [34408] 6 weeks davidb Fine tuning of build script for WaveSurfer
@34407 [34407] 6 weeks davidb NodeJS project to generate Viridis colormap as JSON file
@34406 [34406] 6 weeks davidb Introduction of Viridis colormap
@34405 [34405] 6 weeks davidb Location for some bespoke plugin work to fit in with wavesurfer
@34392 [34392] 6 weeks davidb Changed to default to python v3
@34391 [34391] 6 weeks davidb More careful control over the creation of python venvs
@34390 [34390] 6 weeks davidb More logical folder for this to be in
@34389 [34389] 6 weeks davidb First cut at script to produce a borderless HPCP image of an audio file
@34388 [34388] 6 weeks davidb Work with virtual-env if present; assume python to use is on path
@34387 [34387] 6 weeks davidb Some refinement of the development setup scripts
@34386 [34386] 6 weeks davidb Fixed typo in directory name
@34385 [34385] 6 weeks davidb Better location for these development/compile tools
@34384 [34384] 6 weeks davidb Better location for these development/compile tools
@34383 [34383] 6 weeks davidb Better location for these development/compile tools
@34382 [34382] 6 weeks davidb Better location for these development/compile tools
@34381 [34381] 6 weeks davidb Area for development compilation tools such as cmake and nodejs
@34380 [34380] 6 weeks davidb Area for development compilation tools such as cmake and nodejs
@34379 [34379] 6 weeks davidb Some further refinement of what to print out, after some initial testing
@34378 [34378] 6 weeks davidb No longer need the JSON file copied into the web/ext/audio area
@34377 [34377] 6 weeks davidb Better placement and documentation of what to do with this file
@34375 [34375] 6 weeks davidb Introduction of spectrogram visualization
@34374 [34374] 6 weeks davidb Used to build the wavesurfer-js code from source
@34373 [34373] 6 weeks davidb The result of running gen-heatmap.js
@34372 [34372] 6 weeks davidb NodeJS code to generate a JSON heatmap to be used with WaveSurferJS
@34371 [34371] 6 weeks davidb Top-level scripting and checks so CLI is ready to operate with the MARS …
@34370 [34370] 6 weeks davidb WaveSurfer-JS source files and top-up player
@34369 [34369] 6 weeks davidb Adding in NodeJS to compilation sequence, so wavesurfer-js can be built …
@34368 [34368] 6 weeks davidb No longer needed
@34367 [34367] 6 weeks davidb Now supports https URLs as well
@34362 [34362] 6 weeks davidb First rough cut at some notes
@34361 [34361] 6 weeks davidb Collating of python essentia custom scripts and essentia perl plugin code …
@34360 [34360] 6 weeks davidb Collating of python essentia custom scripts and essentia perl plugin code
@34359 [34359] 6 weeks davidb Needs to be updated to be brought back into line with setup.bash
@34358 [34358] 6 weeks davidb Changed to be a Greenstone3 extension
@34356 [34356] 6 weeks davidb Some initial work computing essentia audio features when the collection is …
@34355 [34355] 6 weeks davidb Scripts for processing audio files and extracting audio features for ML
@34354 [34354] 6 weeks davidb Script to checkout/clone essentia from its GitHub repository
@34353 [34353] 6 weeks davidb Useful in combo with a python2 to create a virtualenv python2 under user …
@34349 [34349] 6 weeks davidb Used to stand up a version of python where extra pip packages have been …
@34348 [34348] 6 weeks davidb Adding in Essentia source code to go along with compile scripts
@34347 [34347] 6 weeks davidb Adding in Essentia compile scripts
@34346 [34346] 6 weeks davidb Further dir that needs to be installed as a header file area
@34345 [34345] 6 weeks davidb Already done in setup.bash
@34344 [34344] 6 weeks davidb Extended to now setup/install Eigen3
@34343 [34343] 6 weeks davidb Tweak to sourcing file
@34342 [34342] 6 weeks davidb Added block to set GSDLOS
@34341 [34341] 7 weeks davidb Shift to using cascade-make
@34340 [34340] 7 weeks davidb Added in cascade-make as an external property
@34339 [34339] 7 weeks davidb Some initial files to compile up essentia, used in the Mars extension to …
@34166 [34166] 5 months ak19 Adding Italian language translations of the gs3colcfg module. Many thanks …
@33997 [33997] 8 months davidb Top-level folder for MARS related Greenstone3 code
@33736 [33736] 11 months kjdon fixed a spelling mistake
@33635 [33635] 12 months ak19 Maori-language-detection doesn't use Greenstone 3 at present, it's not a …
@33634 [33634] 12 months ak19 Rewrote NutchTextDumpProcessor as NutchTextDumpToMongoDB.java, which uses …
@33633 [33633] 12 months ak19 1. TextLanguageDetector now has methods for collecting all sentences and …
@33626 [33626] 12 months ak19 TODOs
@33625 [33625] 12 months ak19 A file listing domains with seedurls containing /mi(/) that are located …
@33624 [33624] 12 months ak19 Some cleanup surrounding the now renamed function createSeedURLsFile, now …
@33623 [33623] 12 months ak19 1. Incorporated Dr Nichols' earlier suggestion of storing page modified …
@33622 [33622] 12 months ak19 File rename
@33621 [33621] 12 months ak19 Committing jotted down mongodb related instructions from what Dr Bainbridge …
@33620 [33620] 12 months ak19 Final crawl, done on vagrant VM node6. Crawl site IDs 01407-01462.
@33618 [33618] 12 months ak19 Adding in the download URL
@33617 [33617] 12 months ak19 Node5 is now full and here is the finished crawl (up to and including site …
@33616 [33616] 12 months ak19 Beginnings of Java class that is to interact with MongoDB. I don't yet …
@33615 [33615] 12 months ak19 1. Worked out how to configure log4j to log both to console and logfile, …
@33609 [33609] 12 months ak19 The tar files containing the crawled sites data shouldn't be called tar.gz …
@33608 [33608] 12 months ak19 1. New script to export from HBase so that we could in theory reimport …
@33607 [33607] 12 months ak19 Updated with the remaining successfully crawled sites on node4 before …
@33606 [33606] 12 months ak19 1. Committing crawl data from node3 (2nd VM for nutch crawling). 2. …
@33605 [33605] 12 months ak19 Node 4 VM still works, but committing first set of crawled sites on there
@33604 [33604] 12 months ak19 1. Better output into possible-product-sites.txt including the overseas …
@33603 [33603] 12 months ak19 Incorporating Dr Nichols' suggestion to help weed out product sites: if tld …
@33602 [33602] 12 months ak19 1. The final csv file, mri-sentences.csv, is now written out. 2. Only …
@33601 [33601] 12 months ak19 Creates the 2nd csv file, with info about webpages. At present stores …
@33600 [33600] 12 months ak19 Work in progress of writing out CSV files. In future, may write the same …
@33599 [33599] 12 months ak19 First one-third sites crawled. Committing to SVN despite the tarred …
@33598 [33598] 12 months ak19 More instructions on setting up Nutch now that I've remembered to commit …
@33597 [33597] 12 months ak19 Committing active version of template file which has a newline at end of …
@33596 [33596] 12 months ak19 Adding in the nutch-site.xml and regex-urlfilter.GS_TEMPLATE template file …
@33588 [33588] 12 months ak19 Committing the MRI sentence model that I'm actually using, the one in my …
@33587 [33587] 12 months ak19 1. Better stats reporting on crawled sites: not just if a page was in MRI …
@33586 [33586] 12 months ak19 Refactored MaoriTextDetector.java class into more general …
@33585 [33585] 12 months ak19 Much simpler way of using sentence and language detection model to work on …
@33584 [33584] 12 months ak19 Committing experimental version 2 using the sentence detector model, …
@33583 [33583] 12 months ak19 Committing experimental version 1 using the sentence detector model, …
@33582 [33582] 13 months ak19 NutchTextDumpProcessor prints each crawled site's stats: number of …
@33581 [33581] 13 months ak19 Minor fix. Noticed when looking for work I did on MRI sentence detection
@33580 [33580] 13 months ak19 Finally fixed the thus-far identified bugs when parsing dump.txt.
@33579 [33579] 13 months ak19 Debugging. Solved one problem.