source: gs3-extensions/maori-lang-detection/MoreReading/CommonCrawl.txt

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @33425   5 years ak19 A few more links now that I got past getting the vagrant VM with spark …
(edit) @33423   5 years ak19 Adding in the link to the vagrant VM with Hadoop, Spark for cluster …
(edit) @33422   5 years ak19 Some more links.
(edit) @33419   5 years ak19 Last evening, I had found some links about how language-detection is …
(edit) @33414   5 years ak19 Adding important links
(edit) @33409   5 years ak19 Forgot to commit 2 files with links and shuffling some links around …
(edit) @33393   5 years ak19 Modified the get_commoncrawl_nz_urls.sh to also create a reduced urls …
(edit) @33391   5 years ak19 Some rough bash scripting lines that work but aren't complete.
(add) @33376   5 years ak19 Links and extracts I've read so far on the Web Curator Tool (WCT), …
Note: See TracRevisionLog for help on using the revision log.