source: trunk/gsdl/bin

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @14111   17 years sjboddie Added support for additional collection customisation code to be put …
(edit) @14032   17 years xiao Changes made to look for collectionConfig.xml in gs3 mode and …
(edit) @14031   17 years xiao Changes made to look for collectionConfig.xml in gs3 mode and …
(edit) @14028   17 years xiao 1. Changes made to look for collectionConfig.xml in gs3 mode and …
(edit) @13948   17 years nzdl Last updatation lost the get-all-chunks command and code. don't know why.
(edit) @13946   17 years nzdl fixed the submitted source text changed problem
(edit) @13910   17 years kjdon updated jar file for code changes
(edit) @13775   17 years kjdon LuceneWrap.jar renamed to LuceneWrapper.jar, and uses the new package name
(edit) @13774   17 years kjdon org.nzdl.gsdl.LuceneWrap is now org.greenstone.LuceneWrapper
(edit) @13693   17 years mdewsnip Added a new command "get-all-chunks" for getting all of the chunks for …
(edit) @13690   17 years mdewsnip GLI modules are no longer enabled by default, so the GTI works with …
(edit) @13588   17 years kjdon updated jar file for my changes to LuceneQuery - made it use a class …
(edit) @13473   17 years shaoqun added the method used by translator
(edit) @13470   17 years shaoqun make it accept xml files from a command line
(edit) @13318   17 years mdewsnip Now removes all whitespace from the empty line between the chunks.
(edit) @13296   17 years mdewsnip Utility script to convert Excel Unicode text files into the GTI …
(edit) @13223   17 years shaoqun used utf-8 coding for the input stream
(edit) @13216   17 years mdewsnip Now makes the chunk keys XML safe also, to prevent XML errors with the …
(edit) @13186   17 years kjdon removed RecPlug -use_metadata_files option, added MetadataXMLPlug
(edit) @13169   17 years kjdon debug mode now passes debug flag to plugout rather than using …
(edit) @13166   17 years mdewsnip Can now obtain documents from the web containing spaces.
(edit) @13165   17 years mdewsnip Need to unescape filename separators on Windows.
(edit) @13163   17 years kjdon changed groupsize mode back to 3 as per John Rose comment
(edit) @13072   18 years kjdon fixed a bug where the first page was given pagenumber of 2
(edit) @13067   18 years kjdon if we are appending, and a lucene collection, then set builddir to be …
(edit) @13054   18 years mdewsnip Now puts the terms through xmlSafe() as well, to prevent invalid XML …
(edit) @13053   18 years kjdon removed some 'use xxx' statments for modules which are not used
(edit) @12993   18 years mdewsnip Now stores the query results XML in a string buffer before outputting …
(edit) @12991   18 years mdewsnip Ooops... managed to lose the header of the XML output in my recent changes.
(edit) @12989   18 years mdewsnip Follow to close the searcher object.
(edit) @12987   18 years mdewsnip You can now specify the query string as a command-line argument to …
(edit) @12983   18 years mdewsnip Moved the stuff for running the query into a new runQuery function, in …
(edit) @12981   18 years mdewsnip Tidied up command-line option parsing in preparation for allowing the …
(edit) @12980   18 years mdewsnip Now passes the endresults value (if defined) into the …
(edit) @12976   18 years mdewsnip Rearranged some code to make the fact that the term information is now …
(edit) @12965   18 years kjdon scriptutil::check_removeold_and_keepold now has a incremental argument …
(edit) @12964   18 years kjdon added a new option: -incremental, which invokes David's archives.inf …
(edit) @12903   18 years kjdon remove the trailing slash from cache_dir -on windows this stuff things up
(edit) @12878   18 years mdewsnip Fixed a bug where '&' characters in filenames aren't made XML safe.
(edit) @12874   18 years mdewsnip No longer sets the plugin's input encoding back to auto, to prevent it …
(edit) @12873   18 years mdewsnip Can now obtain multiple documents for a record (and assign the …
(edit) @12848   18 years nzdl try the JAVA_HOME variable first to find java, otherwise just use …
(edit) @12846   18 years mdewsnip Minor changes.
(edit) @12844   18 years mdewsnip Incremental building and dynamic GDBM updating code, many thanks to …
(edit) @12821   18 years kjdon changed the gli modes for some options
(edit) @12820   18 years kjdon made index option glimode 4
(edit) @12819   18 years kjdon moved some options around
(edit) @12776   18 years mdewsnip Fixed a bug where misspelled words could be marked as stop words with …
(edit) @12775   18 years mdewsnip Fixed bug where some terms have zero frequency (because they don't …
(edit) @12774   18 years kjdon new jar file after xmlSafe change, see log of GS2LuceneQuery.java
(edit) @12770   18 years mdewsnip Changed the Lucene "-fuzzy" argument to "-fuzziness <value>", for more …
(edit) @12706   18 years mdewsnip Added a "-records_per_folder" option to explode_metadata_database.pl, …
(edit) @12704   18 years davidb convert RTF upgraded so it can also use windows scripting option.
(edit) @12691   18 years kjdon added OIDtype and OIDmetadata to the option list. it was using OIDtype …
(edit) @12656   18 years mdewsnip Put old range filter stuff back, and added "-startresults" and …
(edit) @12653   18 years mdewsnip Made it a little bit easier to use a custom set of stop words with Lucene.
(edit) @12640   18 years mdewsnip Now returns valid XML instead of an error when -listall and …
(edit) @12639   18 years mdewsnip Changed the "-collect" option to "-collection", because it's a million …
(edit) @12629   18 years mdewsnip Merged the "-listall" and "-describeall" code, and made both always …
(edit) @12625   18 years mdewsnip Removed the DTD stuff from the top of the XML output... it's just one …
(edit) @12622   18 years jrm21 if we can't open an output file, also give the operating system's …
(edit) @12619   18 years kjdon it seems that when I added in the option OIDmetadata, I didn't …
(edit) @12616   18 years kjdon now accepts -h as well as --help
(edit) @12615   18 years kjdon changed slightly the checking of how many args we have left after …
(edit) @12614   18 years kjdon changed plugin to classifier in the cut and pasted text
(edit) @12613   18 years kjdon changed slightly the checking of how many args we have left after …
(edit) @12598   18 years shaoqun added mapping_file option for MARCXML plugout
(edit) @12594   18 years shaoqun added code that uses MARCXML mapping file
(edit) @12593   18 years shaoqun a util class that converts a string to its lowercase
(edit) @12574   18 years shaoqun remove the default value for cache_dir because HOME environment is not set
(edit) @12566   18 years mdewsnip Added fix for warnings when submitting to a new file.
(edit) @12545   18 years kjdon changed parse2::parse so that it returns -1 on error, 0 on success, or …
(edit) @12500   18 years kjdon hide the keepold and removeold options from gli
(edit) @12484   18 years mdewsnip The username of the person who did the translations is now recorded in …
(edit) @12483   18 years mdewsnip Changed the way the Updated comments are dealt with, in preparation …
(edit) @12481   18 years mdewsnip Turned tutorial translation off by default.
(edit) @12458   18 years kjdon gzip option is only a flag, so don't pass a value to plugouts
(edit) @12429   18 years mdewsnip Changed the "-filter" argument to use a general Lucene QueryFilter, …
(edit) @12425   18 years mdewsnip Fixed a bug where buildcol would try to continue when invalid …
(edit) @12418   18 years mdewsnip Now returns parse exceptions and too many clauses exceptions as …
(edit) @12415   18 years mdewsnip Moved the code that messes around with the query to add the fuzziness …
(edit) @12408   18 years mdewsnip Added a "-filter" option which can currently be used for specifying …
(edit) @12406   18 years shaoqun uses the cache_dir par rather than ${ENV{'HOME'} to get the cache dir
(edit) @12399   18 years kjdon I rearranged some stuff so that all the essential checks are done …
(edit) @12394   18 years shaoqun class for xslt transformation
(edit) @12390   18 years mdewsnip More fixes, many thanks to John Thompson and DL Consulting Ltd.
(edit) @12387   18 years mdewsnip Fixes for fuzzy searching, many thanks to John Thompson and DL …
(edit) @12377   18 years mdewsnip Now returns query term occurrences correctly, and does fuzzy searching …
(edit) @12375   18 years mdewsnip Ooops... StopWord output went to STDERR instead of STDOUT.
(edit) @12373   18 years mdewsnip Bit tidy up, particularly regarding command-line option parsing.
(edit) @12372   18 years mdewsnip Now returns the stop words that have been removed from the query.
(edit) @12370   18 years kjdon now create the archives directory here rather than expecting plugouts …
(edit) @12364   18 years mdewsnip Now uses the t variable to control whether a "some" or "all" search is …
(edit) @12361   18 years kjdon changed a comment
(edit) @12360   18 years kjdon changed sortmeta type to metadata instead of metadatum, cos the latter …
(edit) @12359   18 years kjdon added a comment
(edit) @12358   18 years kjdon added back in the unshift that I removed in last commit
(edit) @12357   18 years mdewsnip Put back in adding GSDLCOLLECTDIR/perllib to INC, so …
(edit) @12355   18 years kjdon updated to use plugouts instead of docsave
(edit) @12354   18 years kjdon removed docsave reference
Note: See TracRevisionLog for help on using the revision log.