source: other-projects/maori-lang-detection/hdfs-cc-work/scripts@ 33815

Name Size Rev Age Author Last Change
../
batchcrawl.sh 7.7 KB 33608   4 years ak19 1. New script to export from HBase so that we could in theory reimport …
export_maori_index_csv.sh 3.4 KB 33524   5 years ak19 1. Further adjustments to documenting what we did to get things to run …
export_maori_subset.sh 3.0 KB 33524   5 years ak19 1. Further adjustments to documenting what we did to get things to run …
export_maori_subset_from_scratch.sh 3.3 KB 33524   5 years ak19 1. Further adjustments to documenting what we did to get things to run …
exportHBase.sh 2.6 KB 33608   4 years ak19 1. New script to export from HBase so that we could in theory reimport …
get_maori_WET_records_for_crawl.sh 9.1 KB 33524   5 years ak19 1. Further adjustments to documenting what we did to get things to run …
get_Maori_WET_records_from_CCSep2018_on.sh 1.4 KB 33534   5 years ak19 Correction: toplevel script has to be placed inside cc-index-table not …
GS_README 812 bytes 33524   5 years ak19 1. Further adjustments to documenting what we did to get things to run …
limit10_export_index.sh 2.6 KB 33524   5 years ak19 1. Further adjustments to documenting what we did to get things to run …
setup.sh 3.8 KB 33538   5 years ak19 Some additions to the setup.sh script to query commoncrawl for MRI …
Note: See TracBrowser for help on using the repository browser.