source: other-projects

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @33803   4 years ak19 geojson mapdata and map for mongodb results on …
(edit) @33802   4 years ak19 With an extra adult site removed and with setting countrycodes that …
(edit) @33801   4 years ak19 1. NutchTextDumpToMongoDB Added an extra field to each document in …
(edit) @33800   4 years ak19 Removed an adult site from crawled contents and added its url to …
(edit) @33799   4 years ak19 1. Adding breadcrumb for next step at end of running …
(edit) @33798   4 years ak19 Adding the geojson related files related to querying mongodb for sites …
(edit) @33797   4 years ak19 Updated json and imaegs files, and new files for when /mi(/) is in the …
(edit) @33796   4 years ak19 Instead of a hack for US' count being too great that its histogram …
(edit) @33794   4 years ak19 Wrote the geojson map data created from the site counts per …
(edit) @33790   4 years ak19 Got the MultiPoint geojson mapdata of the country code counts working: …
(edit) @33789   4 years ak19 Redid the mongodb query to get the countrycode counts for all the …
(edit) @33788   4 years ak19 Adding all the jar files needed to work in Java with geojson Simple …
(edit) @33787   4 years ak19 Documented another mongodb query that I'm using, the one to produce …
(edit) @33778   4 years ak19 Made a beginning on getting the geojson map data automated. Couldn't …
(edit) @33776   4 years ak19 Field Separator (IFS) conflicting with backticks and other ways of …
(edit) @33760   4 years ak19 AUTOCOMMIT by gen-model-colls.sh script. Message: Rebuilding after GLI …
(edit) @33759   4 years ak19 AUTOCOMMIT by gen-model-colls.sh script. Message: Rebuilding after GLI …
(edit) @33723   4 years ak19 On linux 64 bit, the additional wrap command did not work because the …
(edit) @33722   4 years ak19 Adding in additional instructions in mongodb.txt, before I forgot how …
(edit) @33710   4 years ak19 Working queries and map coords for geojson.tools (ironically, Lat and …
(edit) @33698   4 years ak19 Links to more reading
(edit) @33675   4 years ak19 Committing the newer query results (but from before today's …
(edit) @33674   4 years ak19 Changes to support the top 5 predicted langcodes and their confidence …
(edit) @33666   4 years ak19 Having finished sending all the crawl data to mongodb 1. Recrawled the …
(edit) @33657   4 years ak19 Some fixes after brief testing against 1/3 of the crawl. Restarted …
(edit) @33656   4 years ak19 Final minor changes before I start processing the crawls of node2.
(edit) @33655   4 years ak19 Minor change to print statement
(edit) @33654   4 years ak19 Removing jar file that wasn't used after all.
(edit) @33653   4 years ak19 1. As suggested by Dr Bainbridge, made the code changes to use Morphia …
(edit) @33652   4 years ak19 Introducing morphia subpackage
(edit) @33651   4 years ak19 1. Bugfix: overlappingSentences works. 2. storing numSentencesInMaor
(edit) @33646   4 years ak19 Saving the mongodb queries and learning links that Dr Bainbridge found …
(edit) @33645   4 years ak19 Fix to 2 bugs when sending data to MongoDB: 1. overlappingSentences …
(edit) @33644   4 years ak19 Just committing the growing mongodb.txt file with links and …
(edit) @33643   4 years ak19 Brought the template log4j.properties.in back up to speed. I forgot it …
(edit) @33642   4 years ak19 Forgot to commit the java driver for mongodb when I committed the Java …
(edit) @33635   4 years ak19 Maori-language-detection doesn't use Greenstone 3 at present, it's not …
(edit) @33589   4 years cpb16 final01. Need Map results still
(edit) @33521   5 years ak19 AUTOCOMMIT by gen-model-colls.sh script. Message: Redoing the CDS-ISIS …
(edit) @33520   5 years ak19 AUTOCOMMIT by gen-model-colls.sh script. Message: Redoing the CDS-ISIS …
(edit) @33512   5 years ak19 AUTOCOMMIT by gen-model-colls.sh script. Message: Rebuilding all the …
(edit) @33511   5 years ak19 AUTOCOMMIT by gen-model-colls.sh script. Message: Rebuilding all the …
(edit) @33458   5 years cpb16 Running new morphology version after quick meeting with david last …
(edit) @33455   5 years cpb16 Started implementing Davids suggested morphology sequence, codeversion9
(edit) @33449   5 years cpb16 termnal version executes correctly. (Didnt include init threshold in …
(edit) @33447   5 years cpb16 starting to implement terminal version of new morphology. need to fix. …
(edit) @33444   5 years cpb16 Have created a preprocess to remove large objects. …
(edit) @33439   5 years cpb16 Have created properties file and accessibility from …
(edit) @33437   5 years cpb16 made progress with morphology. Need to have a better area dimension …
(edit) @33427   5 years davidb Some initial files on how to get going
(edit) @33426   5 years davidb Folder to details on how to standup the HTRC DevEnv locally
(edit) @33418   5 years cpb16 made progress with morphology, based one image, need to refine …
(edit) @33415   5 years cpb16 updated, after unable to commit due to setup.bash being out of date. …
(edit) @33384   5 years cpb16 backup before intellij working
(edit) @33375   5 years cpb16 Full backup after running first successful highres classifier run
(edit) @33367   5 years cpb16 Pre-hires classification w/o MU
(edit) @33354   5 years davidb Template file for producing OpenOffice spreadsheet format
(edit) @33353   5 years davidb Initial set of files to page scrape and turn in the OpenOffice
(edit) @33352   5 years davidb Top-level folder for code to page-scrape BookStumper site
(edit) @33351   5 years davidb Top-level folder for code to page-scrape BookStumper site
(edit) @33340   5 years cpb16 transferred backup of low res images. Classifiers work as expected. …
(edit) @33332   5 years ak19 AUTOCOMMIT by gen-model-colls.sh script. Message: Recommitting the …
(edit) @33331   5 years ak19 AUTOCOMMIT by gen-model-colls.sh script. Message: Recommitting the …
(edit) @33326   5 years cpb16 Completed linecluster with x position dectection, need to test
(edit) @33325   5 years cpb16 Added x pos checker, needs testing, and remove errors
(edit) @33324   5 years cpb16 Backup for 4th crash of the day. Need to reimplement x corrodinate checker
(edit) @33319   5 years cpb16 added high res download sorter
(edit) @33310   5 years cpb16 developing line clustering. Have completed line cluster algorithm. …
(edit) @33304   5 years cpb16 Backup for computer crash, only lost 5 lines of code in development …
(edit) @33243   5 years cpb16 Had break through with the refined houghlinesP algorithm overall …
(edit) @33221   5 years cpb16 back up pre-houghlineP-refinement progress
(edit) @33170   5 years cpb16 refined houghlineP alogirthm
(edit) @33141   5 years cpb16 Completed end-to-end pipeline and one classifier
(edit) @33138   5 years davidb Scripts that focus on language (for non-music related work)
(edit) @33137   5 years davidb Added a bit more detail to instructions for ssl
(edit) @33136   5 years davidb Extra echo statement added, to help with details printed as script runs
(edit) @33135   5 years davidb These should not be committed into the repository
(edit) @33134   5 years davidb Avoid having hypen at start of filename
(edit) @33133   5 years davidb No need for this backup file in the repository
(edit) @33132   5 years davidb No need for this backup file in the repository
(edit) @33131   5 years davidb No need to keep backup files in repository
(edit) @33110   5 years cpb16 Ground truth complete for SE and BK. Added file to keep track of all …
(edit) @33097   5 years cpb16 Have compiled openCV java from terminal. Have created classifier one. …
(edit) @33082   5 years cpb16 added notes from meeting
(edit) @33070   5 years cpb16 Completed Straight Line Finding Code
(edit) @33069   5 years cpb16 Have started basic line detection using OPENCV and intelliJ
(edit) @33066   5 years cpb16 Corrected makefile and 10page downloader
(edit) @33060   5 years cpb16 Modified ..PNG-10PAGES.sh to display a item download counter
(edit) @33059   5 years cpb16 backup after downloading 5000 MU pages
(edit) @33053   5 years ak19 I still had some stuff of Nathan Kelly's (FileTransfer-WebSocketPair) …
(edit) @33047   5 years cpb16 Corpus generator complete!
(edit) @33044   5 years cpb16 Streamlined numpages checking and random selection. Corrected …
(edit) @33031   5 years cpb16 Completed numpages checking. Generated makefiles and scripts to …
(edit) @33017   5 years cpb16 Renamed Downloaders as .sh; Completed a metadata formater script; …
(edit) @33014   5 years cpb16 Created metadata downloader for numpages metadata
(edit) @33010   5 years cpb16 Updated and Renamed PNG retrieving code, ZIP file downlading is also …
(edit) @33009   5 years cpb16 undo
(edit) @33008   5 years cpb16 Deleted old java file
(edit) @33007   5 years cpb16 Have made java files compatiable with args.
(edit) @33002   5 years cpb16 Added args compatilibility to java programs and updated makefile to …
Note: See TracRevisionLog for help on using the revision log.