source: gs3-extensions

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @34542   3 weeks ak19 German language gs3colcfg module of GS interface. Many thanks to Nora …
(edit) @34541   3 weeks ak19 Croatian language gs3colcfg module of GS interface. Many thanks to …
(edit) @34427   2 months davidb Brought across from Essentia source, and preped for use from the …
(edit) @34426   2 months davidb Ignore downloaded zip and unziped dir
(edit) @34425   2 months davidb More robust version; takes into account dir change from 'jar' to 'jars'
(edit) @34424   2 months davidb Revised name for directory
(edit) @34423   2 months davidb First cut at getting set up with Weka within Mars extension
(edit) @34420   2 months davidb Grab Weka via wget
(edit) @34419   2 months davidb Version of script where an Essentia profile is also specified
(edit) @34411   3 months davidb Inclusion of HPCP calc
(edit) @34410   3 months davidb Rough cut at something following in a similar suit to essestia-hpcp.py …
(edit) @34409   3 months davidb Code tidyup
(edit) @34408   3 months davidb Fine tuning of build script for WaveSurfer
(edit) @34407   3 months davidb NodeJS project to generate Viridis colormap as JSON file
(edit) @34406   3 months davidb Introduction of Viridis colormap
(edit) @34405   3 months davidb Location for some bespoke plugin work to fits in with wavesurfer
(edit) @34392   3 months davidb Changed to default to python v3
(edit) @34391   3 months davidb More careful control over the creation of python venvs
(edit) @34390   3 months davidb More logical folder for this to be in
(edit) @34389   3 months davidb First cut at script to produce a borderless HPCP images of audio file
(edit) @34388   3 months davidb Work with virtual-env if present; assume python to use is on path
(edit) @34387   3 months davidb Some refinement of the development setup scripts
(edit) @34386   3 months davidb Fixed typo in directory name
(edit) @34385   3 months davidb Better location for these development/compile tools
(edit) @34384   3 months davidb Better location for these development/compile tools
(edit) @34383   3 months davidb Better location for these development/compile tools
(edit) @34382   3 months davidb Better location for these development/compile tools
(edit) @34381   3 months davidb Area for development compilation tools such as cmake and nodejs
(edit) @34380   3 months davidb Area for development compilation tools such as cmake and nodejs
(edit) @34379   3 months davidb Some further refinement of what to print out, after some initial testing
(edit) @34378   3 months davidb No longer need the JSON file copied into the web/ext/audio area
(edit) @34377   3 months davidb Better placement and document of what to do with this file
(edit) @34375   3 months davidb Introductions of spectrogram visualization
(edit) @34374   3 months davidb Used to build the wavesurfer-js code from source
(edit) @34373   3 months davidb The result of running gen-heatmap.js
(edit) @34372   3 months davidb NodeJS code to generate a JSON heatmap to be used with WaveSurferJS
(edit) @34371   3 months davidb Top-level scripting and checks so CLI is ready to operate with the …
(edit) @34370   3 months davidb WaveSurfer-JS source files and top-up player
(edit) @34369   3 months davidb Adding in NodeJS to compilation sequence, so wavesurfer-js can be …
(edit) @34368   3 months davidb No longer needed
(edit) @34367   3 months davidb Now supports https URLs as well
(edit) @34362   3 months davidb First rough cut at some notes
(edit) @34361   3 months davidb Collating of python essensia custom scripts and essentia perl plugin …
(edit) @34360   3 months davidb Collating of python essensia custom scripts and essentia perl plugin code
(edit) @34359   3 months davidb Needs to be updated to be brought back into line with setup.bash
(edit) @34358   3 months davidb Changed to be a Greenstone3 extension
(edit) @34356   3 months davidb Some initial work computing essensia audio features when the …
(edit) @34355   3 months davidb Scripts for processing audio files can extracting audio features for ML
(edit) @34354   3 months davidb Script to checkout/clone essentia from its git-hub repository
(edit) @34353   3 months davidb Useful in combo with a python2 to create a virtualenv python2 under …
(edit) @34349   3 months davidb Used to stand up a version of python where extra pip packages have …
(edit) @34348   3 months davidb Adding in Essential source code to go along with compile scripts
(edit) @34347   3 months davidb Adding in Essential compile scripts
(edit) @34346   3 months davidb Further dir that needs to be installed as a header file area
(edit) @34345   3 months davidb Already done in setup.bash
(edit) @34344   3 months davidb Extended to now setup/install Eigen3
(edit) @34343   3 months davidb Tweak to sourcing file
(edit) @34342   3 months davidb Added block to set GSDLOS
(edit) @34341   3 months davidb Shift to using cascade-make
(edit) @34340   3 months davidb Added in cascade-make as an external property
(edit) @34339   3 months davidb Some initial files to compile up essentia, used in the Mars extension …
(edit) @34166   6 months ak19 Adding Italian language translations of the gs3colcfg module. Many …
(edit) @33997   9 months davidb Top-level folder for MARS related Greenstone3 code
(edit) @33736   12 months kjdon fixed a spelling mistake
(edit) @33635   13 months ak19 Maori-language-detection doesn't use Greenstone 3 at present, it's not …
(edit) @33634   13 months ak19 Rewrote NutchTextDumpProcessor as NutchTextDumpToMongoDB.java, which …
(edit) @33633   13 months ak19 1. TextLanguageDetector now has methods for collecting all sentences …
(edit) @33626   13 months ak19 TODOs
(edit) @33625   13 months ak19 A file listing domains with seedurls containing /mi(/) that are …
(edit) @33624   13 months ak19 Some cleanup surrounding the now renamed function createSeedURLsFile, …
(edit) @33623   13 months ak19 1. Incorporated Dr Nichols earlier suggestion of storing page modified …
(edit) @33622   13 months ak19 File rename
(edit) @33621   13 months ak19 Comitting jotted down mongodb related instructions from what Dr …
(edit) @33620   13 months ak19 Final crawl, done on vagrant VM node6. Crawl site IDs 01407-01462.
(edit) @33618   13 months ak19 Adding in the download URL
(edit) @33617   13 months ak19 Node5 is now full and here is the finished crawl (up to and including …
(edit) @33616   13 months ak19 Beginnings of Java class that is to interact with MongoDB. I don't yet …
(edit) @33615   13 months ak19 1. Worked out how to configure log4j to log both to console and …
(edit) @33609   13 months ak19 The tar files containing the crawled sites data shouldn't be called …
(edit) @33608   13 months ak19 1. New script to export from HBase so that we could in theory reimport …
(edit) @33607   13 months ak19 Updated with the remaining successfully crawled sites on node4 before …
(edit) @33606   13 months ak19 1. Committing crawl data from node3 (2nd VM for nutch crawling). 2. …
(edit) @33605   13 months ak19 Node 4 VM still works, but committing first set of crawled sites on there
(edit) @33604   14 months ak19 1. Better output into possible-product-sites.txt including the …
(edit) @33603   14 months ak19 Incorporating Dr Nichols suggestion to help weed out product sites: if …
(edit) @33602   14 months ak19 1. The final csv file, mri-sentences.csv, is now written out. 2. Only …
(edit) @33601   14 months ak19 Creates the 2nd csv file, with info about webpages. At present stores …
(edit) @33600   14 months ak19 Work in progress of writing out CSV files. In future, may write the …
(edit) @33599   14 months ak19 First one-third sites crawled. Committing to SVN despite the tarred …
(edit) @33598   14 months ak19 More instructions on setting up Nutch now that I've remembered to …
(edit) @33597   14 months ak19 Committing active version of template file which has a newline at end …
(edit) @33596   14 months ak19 Adding in the nutch-site.xml and regex-urlfilter.GS_TEMPLATE template …
(edit) @33588   14 months ak19 Committing the MRI sentence model that I'm actually using, the one in …
(edit) @33587   14 months ak19 1. Better stats reporting on crawled sites: not just if a page was in …
(edit) @33586   14 months ak19 Refactored MaoriTextDetector.java class into more general …
(edit) @33585   14 months ak19 Much simpler way of using sentence and language detection model to …
(edit) @33584   14 months ak19 Committing experimental version 2 using the sentence detector model, …
(edit) @33583   14 months ak19 Committing experimental version 1 using the sentence detector model, …
(edit) @33582   14 months ak19 NutchTextDumpProcessor prints each crawled site's stats: number of …
(edit) @33581   14 months ak19 Minor fix. Noticed when looking for work I did on MRI sentence detection
Note: See TracRevisionLog for help on using the revision log.