source: other-projects/maori-lang-detection/hdfs-cc-work/scripts @ 36663
Name | Size | Rev | Age | Author | Last Change |
---|---|---|---|---|---|
batchcrawl.sh | 7.7 KB | 33608 | 5 years | | 1. New script to export from HBase so that we could in theory reimport … |
export_maori_index_csv.sh | 3.4 KB | 33524 | 5 years | | 1. Further adjustments to documenting what we did to get things to run … |
export_maori_subset.sh | 3.0 KB | 33524 | 5 years | | 1. Further adjustments to documenting what we did to get things to run … |
export_maori_subset_from_scratch.sh | 3.3 KB | 33524 | 5 years | | 1. Further adjustments to documenting what we did to get things to run … |
exportHBase.sh | 2.6 KB | 33608 | 5 years | | 1. New script to export from HBase so that we could in theory reimport … |
get_maori_WET_records_for_crawl.sh | 9.1 KB | 33524 | 5 years | | 1. Further adjustments to documenting what we did to get things to run … |
get_Maori_WET_records_from_CCSep2018_on.sh | 1.4 KB | 33534 | 5 years | | Correction: toplevel script has to be placed inside cc-index-table not … |
GS_README | 812 bytes | 33524 | 5 years | | 1. Further adjustments to documenting what we did to get things to run … |
limit10_export_index.sh | 2.6 KB | 33524 | 5 years | | 1. Further adjustments to documenting what we did to get things to run … |
setup.sh | 3.8 KB | 33538 | 5 years | | Some additions to the setup.sh script to query commoncrawl for MRI … |