source: gs3-extensions/maori-lang-detection/src/org

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @33469   5 years ak19 Don't want URLs with the word product(s) in them (but production …
(edit) @33468   5 years ak19 More meaningful to (also) write out the keep vs discard URLs into keep …
(edit) @33467   5 years ak19 Improved the code to use a static block to load the needed properties …
(edit) @33466   5 years ak19 1. WETProcessor.main() now processes a folder of *.warc.wet(.gz) …
(edit) @33465   5 years ak19 Committing first version of the WETProcessor.java which takes a …
(edit) @33411   5 years ak19 Newer version now doesn't mirror sites with wget but gets WET files …
(edit) @33410   5 years ak19 Committing some variable name changes before I replace this file with …
(edit) @33405   5 years ak19 Even though we're probably not going to use this code after all, will …
(edit) @33402   5 years ak19 Beginnings of the Java class to wget sites and process its pages to …
(add) @33398   5 years ak19 Committing the actual package structure and the updated README after …
Note: See TracRevisionLog for help on using the revision log.