source: other-projects/hathitrust/wcsa/extracted-features-solr/trunk/solr-ingest/src/main/java/org/hathitrust/extractedfeatures@ 31197

Name Size Rev Age Author Last Change
__PerPageJSONForeach.java 2.4 KB 31011   7 years davidb Further RDD flatMap/map restructuring and refactoring, for per-page
ClusterFileIO.java 5.9 KB 31088   7 years davidb Shift to newInstance for FileSystem due to StackOverflow page …
JSONClusterFileIO.java 782 bytes 31089   7 years davidb Change in way the JSON file is read in. Motivation was a …
PerPageJSONFlatmap.java 4.4 KB 31096   7 years davidb Only need to create a volume's pages output directory is _output_dir …
PerPageJSONMap.java 2.5 KB 31045   7 years davidb More careful treatment of what to do when a JSON file isn't there
PerVolumeJSON.java 3.9 KB 31096   7 years davidb Only need to create a volume's pages output directory is _output_dir …
PerVolumeWordStreamFlatmap.java 3.2 KB 31175   7 years davidb Trial to find memory difference between Hashmap and Bloom filters
ProcessForSolrIngest.java 10.6 KB 31100   7 years davidb Change to using solr-cloud-nodes that include port number
ProcessForWhitelist.java 6.2 KB 31175   7 years davidb Trial to find memory difference between Hashmap and Bloom filters
SolrDocJSON.java 6.4 KB 31176   7 years davidb Support added for producing whitelist word count
TestWhitelistBloomFilter.java 3.7 KB 31196   7 years davidb File renaming to make way for newer version of classes needed in the …
TestWhitelistDictionaryMain.java 1011 bytes 31196   7 years davidb File renaming to make way for newer version of classes needed in the …
TESTWhitelistHashmap.java 1.2 KB 31195   7 years davidb File renaming to make way for newer version of classes needed in the …