source: other-projects/hathitrust/wcsa/extracted-features-solr/trunk/solr-ingest/src/main/java/org/hathitrust/extractedfeatures/ProcessForCatalogLangCount.java

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @31371   7 years davidb Trying to get saveAsSequenceFile working
(edit) @31369   7 years davidb Trial new save
(edit) @31368   7 years davidb downsample-100 added
(edit) @31365   7 years davidb Quick code added to downsample
(edit) @31364   7 years davidb removed sample() line
(edit) @31363   7 years davidb Control num of partitions on sort
(edit) @31362   7 years davidb use Spark sample() to make for smaller test with Sequence files
(edit) @31361   7 years davidb Change from String to Text
(edit) @31359   7 years davidb Changed over to use sequenceFiles as input
(add) @31294   7 years davidb Version for language counting the catalog assignment language …
Note: See TracRevisionLog for help on using the revision log.