Timeline
2019-09-03:
- 14:41 Changeset [33454] by
- updated metadata_selection_mode to be …
- 13:16 Changeset [33453] by
- the new and modified strings for revamped List classifier
- 13:15 Changeset [33452] by
- revamp of list classifier. More precise handling of numeric metadata …
- 12:55 Changeset [33451] by
- added a comment
- 12:54 Changeset [33450] by
- removed some unnecessary comments
2019-09-02:
- 17:08 Changeset [33449] by
- terminal version executes correctly. (Didn't include init threshold in …
2019-08-30:
- 18:27 Changeset [33448] by
- Minor clarification and inclusion of helpful command
- 18:03 Changeset [33447] by
- starting to implement terminal version of new morphology. need to fix. …
2019-08-29:
- 19:12 Changeset [33446] by
- 1. Committing working version of export_maori_subset.sh which takes …
- 17:01 Changeset [33445] by
- The first working hadoop spark script for processing common crawl …
- 16:57 Changeset [33444] by
- Have created a preprocess to remove large objects. …
2019-08-28:
- 20:22 Changeset [33443] by
- More notes
- 19:30 Changeset [33442] by
- Updated gutil.jar file (with SafeProcess debugging)
- 19:30 Changeset [33441] by
- Adding further notes to do with running the CC-index examples on spark.
- 19:17 Changeset [33440] by
- Split file to move vagrant-spark-hadoop notes into own file.
- 17:03 Changeset [33439] by
- Have created properties file and accessibility from …
2019-08-27:
- 17:14 Changeset [33438] by
- Forgot to commit a change made for Georgian.
2019-08-26:
- 16:44 Changeset [33437] by
- made progress with morphology. Need to have a better area dimension …
2019-08-23:
- 23:21 Changeset [33436] by
- 3 important changes for 2 separate bugfixes where one bugfix is …
- 21:28 Changeset [33435] by
- Georgian language translations for the language's new glihelp module …
- 21:22 Changeset [33434] by
- Correcting syntax errors in this bash script.
2019-08-20:
- 20:15 Changeset [33433] by
- New Georgian language translation for perlmodules module of the GS …
- 19:35 Changeset [33432] by
- New Georgian language translation for glidict module of the GS …
- 19:18 Changeset [33431] by
- Corrections of automated processing, noticed when processing Georgian …
- 16:14 Ticket #954 (GTI: Correct Existing Translations form needs fixing and enhancement) created by
- GTI's "Correct Existing Translations" form needs 1. fixing: search …
- 14:40 Changeset [33430] by
- Undo call to to_utf8() on the query_string argument (arg[q]) to …
- 11:04 Changeset [33429] by
- fixed a bug in get_or_create_shortname where it wasn't storing the new …
2019-08-19:
- 20:31 Changeset [33428] by
- Working commoncrawl cc-warc-examples' WET wordcount example using …
- 14:25 Changeset [33427] by
- Some initial files on how to get going
- 14:23 Changeset [33426] by
- Folder for details on how to stand up the HTRC DevEnv locally
2019-08-16:
- 22:15 Changeset [33425] by
- A few more links now that I got past getting the vagrant VM with spark …
- 18:19 Changeset [33424] by
- Georgian (code ka) language translations for the gs3interface module …
2019-08-15:
- 20:07 Changeset [33423] by
- Adding in the link to the vagrant VM with Hadoop, Spark for cluster …
- 17:52 Changeset [33422] by
- Some more links.
- 16:39 Changeset [33421] by
- Forgot to fix up svn externals property for the Georgian …
- 16:38 Changeset [33420] by
- Update to svnproperty externals for the Georgian (code: ka) …
- 16:20 Changeset [33419] by
- Last evening, I had found some links about how language-detection is …
- 13:53 Changeset [33418] by
- made progress with morphology, based on one image, need to refine …
2019-08-14:
- 19:55 Changeset [33417] by
- Georgian language translations for the coredm for GS2, gsinstaller …
- 17:48 Changeset [33416] by
- DEC collections weren't getting built on 32 bit linux VM after trying …
- 11:42 Changeset [33415] by
- updated, after unable to commit due to setup.bash being out of date. …
2019-08-13:
- 21:57 Changeset [33414] by
- Adding important links
- 21:57 Changeset [33413] by
- Splitting the get_commoncrawl_nz_urls.sh script back into 2 scripts, …
- 21:54 Changeset [33412] by
- config command for wgetting a single file
- 21:50 Changeset [33411] by
- Newer version now doesn't mirror sites with wget but gets WET files …
- 21:48 Changeset [33410] by
- Committing some variable name changes before I replace this file with …
- 15:59 Changeset [33409] by
- Forgot to commit 2 files with links and shuffling some links around …
- 15:09 Changeset [33408] by
- Some rough notes. Will move into appropriate file later.
- 14:40 Changeset [33407] by
- gutil.jar was rebuilt yesterday in GS3 after a bugfix. Recommitting …
- 12:17 Changeset [33406] by
- if there is a semicolon after the file name, it ends up in the URL …
2019-08-12:
- 20:37 Changeset [33405] by
- Even though we're probably not going to use this code after all, will …
- 20:35 Changeset [33404] by
- 1. Links to other Java ways of extracting text from web content. 2. …
- 15:07 Changeset [33403] by
- Mistake to do with launchdir in SafeProcess: if the environment for …
2019-08-11:
- 22:03 Changeset [33402] by
- Beginnings of the Java class to wget sites and process its pages to …
- 21:16 Changeset [33401] by
- MaoriTextDetector.class file now generated inside its package folder …
- 21:15 Changeset [33400] by
- 1. Setting up log4j.properties based on the macronizer's basic one …
- 20:48 Changeset [33399] by
- Putting properties files into the conf folder and keeping the lib …
- 19:35 Changeset [33398] by
- Committing the actual package structure and the updated README after …
- 19:30 Changeset [33397] by
- 1. Changing package structure and instructions on compiling/running as …
- 18:20 Changeset [33396] by
- Georgian language gs3colcfg module of GS interface. Many thanks to …
- 18:03 Changeset [33395] by
- Georgian language translation work for the gs3interface module of the …
2019-08-09:
- 20:37 Changeset [33394] by
- 1. Started a file on feasibility with the data now available and some …
- 18:57 Changeset [33393] by
- Modified the get_commoncrawl_nz_urls.sh to also create a reduced urls …
2019-08-08:
- 15:15 Changeset [33392] by
- Kathy found a problem whereby she wanted to run consecutive buildcols …
2019-08-07:
- 19:11 Changeset [33391] by
- Some rough bash scripting lines that work but aren't complete.
- 17:31 Changeset [33390] by
- Minor message telling the user to wait for a task that takes some time.
2019-08-06:
- 13:19 Changeset [33389] by
- store csv field array associated with filename, because you might have …
- 11:46 Changeset [33388] by
- tidied up some debug statements
- 11:33 Changeset [33387] by
- removed all my debug statements
- 11:06 Changeset [33386] by
- modified the test for whether this is the selected node or not. can't …
2019-08-05:
- 12:53 Changeset [33385] by
- need to import response node as it is not part of same document
- 12:39 Changeset [33384] by
- backup before intellij working
- 12:20 Changeset [33383] by
- some more work on the help page
- 12:14 Changeset [33382] by
- don't add collection/collname to pref and help link if collname is empty
- 12:13 Changeset [33381] by
- use nice /page/gsdl url for about greenstone page
- 12:12 Changeset [33380] by
- some more mods and strings for collection help page