source: trunk/gsdl/bin/script

Diff Rev Age Author Log Message
(edit) @13318   17 years mdewsnip Now removes all whitespace from the empty line between the chunks.
(edit) @13296   17 years mdewsnip Utility script to convert Excel Unicode text files into the GTI …
(edit) @13216   18 years mdewsnip Now makes the chunk keys XML safe also, to prevent XML errors with the …
(edit) @13186   18 years kjdon removed RecPlug -use_metadata_files option, added MetadataXMLPlug
(edit) @13169   18 years kjdon debug mode now passes debug flag to plugout rather than using …
(edit) @13166   18 years mdewsnip Can now obtain documents from the web containing spaces.
(edit) @13165   18 years mdewsnip Need to unescape filename separators on Windows.
(edit) @13163   18 years kjdon changed groupsize mode back to 3 as per John Rose comment
(edit) @13072   18 years kjdon fixed a bug where the first page was given pagenumber of 2
(edit) @13067   18 years kjdon if we are appending, and a lucene collection, then set builddir to be …
(edit) @13053   18 years kjdon removed some 'use xxx' statements for modules which are not used
(edit) @12965   18 years kjdon scriptutil::check_removeold_and_keepold now has an incremental argument …
(edit) @12964   18 years kjdon added a new option: -incremental, which invokes David's archives.inf …
(edit) @12903   18 years kjdon remove the trailing slash from cache_dir - on Windows this stuffs things up
(edit) @12878   18 years mdewsnip Fixed a bug where '&' characters in filenames aren't made XML safe.
(edit) @12874   18 years mdewsnip No longer sets the plugin's input encoding back to auto, to prevent it …
(edit) @12873   18 years mdewsnip Can now obtain multiple documents for a record (and assign the …
(edit) @12848   18 years nzdl try the JAVA_HOME variable first to find java, otherwise just use …
(edit) @12844   18 years mdewsnip Incremental building and dynamic GDBM updating code, many thanks to …
(edit) @12821   18 years kjdon changed the gli modes for some options
(edit) @12820   18 years kjdon made index option glimode 4
(edit) @12819   18 years kjdon moved some options around
(edit) @12770   18 years mdewsnip Changed the Lucene "-fuzzy" argument to "-fuzziness <value>", for more …
(edit) @12706   18 years mdewsnip Added a "-records_per_folder" option to explode_metadata_database.pl, …
(edit) @12704   18 years davidb convert RTF upgraded so it can also use windows scripting option.
(edit) @12691   18 years kjdon added OIDtype and OIDmetadata to the option list. it was using OIDtype …
(edit) @12656   18 years mdewsnip Put old range filter stuff back, and added "-startresults" and …
(edit) @12640   18 years mdewsnip Now returns valid XML instead of an error when -listall and …
(edit) @12639   18 years mdewsnip Changed the "-collect" option to "-collection", because it's a million …
(edit) @12629   18 years mdewsnip Merged the "-listall" and "-describeall" code, and made both always …
(edit) @12625   18 years mdewsnip Removed the DTD stuff from the top of the XML output... it's just one …
(edit) @12622   18 years jrm21 if we can't open an output file, also give the operating system's …
(edit) @12619   18 years kjdon it seems that when I added in the option OIDmetadata, I didn't …
(edit) @12616   18 years kjdon now accepts -h as well as --help
(edit) @12615   18 years kjdon changed slightly the checking of how many args we have left after …
(edit) @12614   18 years kjdon changed plugin to classifier in the cut and pasted text
(edit) @12613   18 years kjdon changed slightly the checking of how many args we have left after …
(edit) @12598   18 years shaoqun added mapping_file option for MARCXML plugout
(edit) @12574   18 years shaoqun remove the default value for cache_dir because HOME environment is not set
(edit) @12566   18 years mdewsnip Added fix for warnings when submitting to a new file.
(edit) @12545   18 years kjdon changed parse2::parse so that it returns -1 on error, 0 on success, or …
(edit) @12500   18 years kjdon hide the keepold and removeold options from gli
(edit) @12484   18 years mdewsnip The username of the person who did the translations is now recorded in …
(edit) @12483   18 years mdewsnip Changed the way the Updated comments are dealt with, in preparation …
(edit) @12481   18 years mdewsnip Turned tutorial translation off by default.
(edit) @12458   18 years kjdon gzip option is only a flag, so don't pass a value to plugouts
(edit) @12425   18 years mdewsnip Fixed a bug where buildcol would try to continue when invalid …
(edit) @12408   18 years mdewsnip Added a "-filter" option which can currently be used for specifying …
(edit) @12406   18 years shaoqun uses the cache_dir parameter rather than $ENV{'HOME'} to get the cache dir
(edit) @12399   18 years kjdon I rearranged some stuff so that all the essential checks are done …
(edit) @12373   18 years mdewsnip Bit tidy up, particularly regarding command-line option parsing.
(edit) @12370   18 years kjdon now create the archives directory here rather than expecting plugouts …
(edit) @12364   18 years mdewsnip Now uses the t variable to control whether a "some" or "all" search is …
(edit) @12361   18 years kjdon changed a comment
(edit) @12360   18 years kjdon changed sortmeta type to metadata instead of metadatum, cos the latter …
(edit) @12359   18 years kjdon added a comment
(edit) @12358   18 years kjdon added back in the unshift that I removed in last commit
(edit) @12357   18 years mdewsnip Put back in adding GSDLCOLLECTDIR/perllib to INC, so …
(edit) @12355   18 years kjdon updated to use plugouts instead of docsave
(edit) @12354   18 years kjdon removed docsave reference
(edit) @12342   18 years kjdon added modegli=3 to maxnumeric option
(edit) @12341   18 years shaoqun fixed the bugs
(edit) @12338   18 years kjdon added the maxnumeric option to buildcol. so it can be set in GLI. …
(edit) @12335   18 years shaoqun now it uses plugouts
(edit) @12334   18 years shaoqun a module that displays plugouts info
(edit) @12333   18 years shaoqun now it uses plugout
(edit) @12290   18 years kjdon had to add 'use FileHandle' to this file - was getting an error about …
(edit) @12275   18 years mdewsnip Added a command-line option for sorting the search results.
(edit) @12266   18 years kjdon added a new option: OIDmetadata, which is used with OIDtype=assigned, …
(edit) @12258   18 years mdewsnip Now references the GS2Lucene classes in the org.nzdl.gsdl.LuceneWrap
(edit) @12003   18 years davidb Scripts upgraded to perform more efficiently with incremental addition.
(edit) @11944   18 years kjdon tidied up the code a bit, added a new option: noinstall, which makes a …
(edit) @11895   18 years kjdon commented out or deleted all references to win32s, net16 and net32 …
(edit) @11858   18 years kjdon create_itemfile, had different regex for windows image files, but all …
(edit) @11844   18 years mdewsnip Added an option to send the message as HTML.
(edit) @11786   18 years kjdon changed a few of the strings
(edit) @11785   18 years kjdon Jeffrey's script for downloading from an external server. uses …
(edit) @11782   18 years kjdon modified version of Jeffrey's downloadinfo - used to describe the …
(edit) @11747   18 years kjdon no netscape option any more. no longer bundle netscape
(edit) @11746   18 years kjdon added in a default for saveas_version - greenstone, which isn't used …
(edit) @11705   18 years mdewsnip Changed the number of characters in the .nul filenames from 4 to 8, so …
(edit) @11683   18 years kjdon added a -describeall option, so we can generate the descriptive output …
(edit) @11667   18 years kjdon changed PlugName element in PluginName in DTD to match the output
(edit) @11634   18 years kjdon strings.rb renamed to strings.properties
(edit) @11602   18 years kjdon added in GLI help entry
(edit) @11528   18 years mdewsnip Now barfs if there are duplicate chunk keys in Greenstone XML files.
(edit) @11526   18 years mdewsnip Fixed a bug parsing multi-line strings with Updated tags (Greenstone …
(edit) @11519   18 years mdewsnip Fixed a bug comparing CVS dates with Update dates.
(edit) @11498   18 years mdewsnip Code for handling new entities in the XML.
(edit) @11487   18 years mdewsnip Added <Updated date="..."> comments to the greenstone XML files, to …
(edit) @11452   18 years mdewsnip Yet more subtle XML escaping -- there must be something weird going on …
(edit) @11449   18 years mdewsnip Prevented from adding extra newlines to the Greenstone XML files every …
(edit) @11448   18 years mdewsnip Some more XML escaping code, for the Greenstone XML files. Hope this …
(edit) @11447   18 years mdewsnip Some fixes for greenstone XML files that I forgot to commit earlier :-(
(edit) @11442   18 years kjdon check the result of read_file before continuing
(edit) @11366   18 years mdewsnip Added settings for tutorial exercises.
(edit) @11350   18 years mdewsnip Removed the "-filename_field" option from …
(edit) @11348   18 years mdewsnip Now only tries to obtain documents locally if the values (formed by …
(edit) @11342   18 years mdewsnip The "-metadata_set" option is no longer hidden from the GLI. For John Rose.
(edit) @11333   18 years mdewsnip Now consistently sets $self->{'gli'} in plugin::begin.