root/gsdl/trunk/perllib

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Rev Chgset Date Author Log Message
(edit) @17574 [17574] 12 years kjdon now calls read_build_cfg() instead of having the code here
(edit) @17573 [17573] 12 years kjdon moved a couple of things around, added read_build_cfg which finds and …
(edit) @17572 [17572] 12 years kjdon moved the make_absolute method to here from buildcol.pl
(edit) @17568 [17568] 12 years kjdon recoding of the text method. more closely matches mgpp one. ZZ field only …
(edit) @17567 [17567] 12 years kjdon if metadata is specified, only add in the ones that are not already …
(edit) @17566 [17566] 12 years kjdon lucene no longer does anything with paragraphs, so we print a warning if …
(edit) @17565 [17565] 12 years kjdon removed some debug statements, and no longer load in the default …
(edit) @17564 [17564] 12 years kjdon fixed up some stuff to do with indexfieldmap. still working on it, but …
(edit) @17549 [17549] 12 years ak19 Changes to sudden wget download termination when OAIDownload.pm is used: …
(edit) @17547 [17547] 12 years ak19 When subroutines useWget and useWgetMonitored receive the STOP signal from …
(edit) @17543 [17543] 12 years mdewsnip Fixed the block_exp regular expression to move the $ symbol, so it doesn't …
(edit) @17537 [17537] 12 years ak19 Subroutine useWgetMonitored updated to include the modifications made …
(edit) @17533 [17533] 12 years oranfry protect against a particular error message poluting XML output
(edit) @17531 [17531] 12 years ak19 Now works with OAIDownload.pm for downloading over OAI. The variable port …
(edit) @17530 [17530] 12 years ak19 Fixed not being able to run wget from the cmd-line via downloadfrom.pl on …
(edit) @17529 [17529] 12 years ak19 Now WgetDownload?.pm uses Sockets to communicate with GLI which launched …
(edit) @17528 [17528] 12 years ak19 New subroutine setIsGLI to store whether or not the download is run from …
(edit) @17527 [17527] 12 years ak19 Now calls new subroutine setIsGLI on the download_obj to indicate whether …
(edit) @17513 [17513] 12 years kjdon extrametadata keys need to be regexs, so windows paths need converting
(edit) @17512 [17512] 12 years kjdon added a method to turn windows filename paths (with single back slash) …
(edit) @17483 [17483] 12 years kjdon I just discovered that if image magick was not installed, you weren't …
(edit) @17480 [17480] 12 years kjdon removed the pc namespace. the metadata is now extracted metadata, and if …
(edit) @17479 [17479] 12 years kjdon put this back to using block expression for now - on windows sets up …
(edit) @17476 [17476] 12 years mdewsnip Support for using MSSQL for infodb databases, many thanks to Jeffrey Ke …
(edit) @17463 [17463] 12 years kjdon some mods to make this a bit more useful in response to request from John …
(edit) @17462 [17462] 12 years kjdon added ProCite?.entry_separator
(edit) @17354 [17354] 12 years ak19 Added SIGTERM and SIGINT handlers to terminate wget child process either …
(edit) @17330 [17330] 12 years kjdon added default values for self->input_encoding and self->default_encoding, …
(edit) @17322 [17322] 12 years kjdon added a -f test on filename in can_process_this_file to prevent this …
(edit) @17321 [17321] 12 years anna Removed a line break at the end of an French element.
(edit) @17320 [17320] 12 years kjdon found and fixed what I think is a bug - in the metadata structures for …
(edit) @17319 [17319] 12 years kjdon tidied this up and removed some old code
(edit) @17313 [17313] 12 years kjdon this seemed to have been forgotten in the 'removing metadata form …
(edit) @17300 [17300] 12 years kjdon removed the metadata argument from metadata_read as its not used and just …
(edit) @17294 [17294] 12 years kjdon added a fix for a bug John T discovered where in get_new_doc_dir you can …
(edit) @17293 [17293] 12 years davidb fixed type in function call: parsefile -> parse_file
(edit) @17290 [17290] 12 years kjdon previous changes to get exploding working (using metadata_read) meant that …
(edit) @17289 [17289] 12 years kjdon moved the actual parsing from read into parse_file so other plugins can do …
(edit) @17288 [17288] 12 years kjdon in add_section_content, we are regenrating doc objs from gdbm database. …
(edit) @17287 [17287] 12 years kjdon added 'if verbosity > 3' to some print statements, and set doctype to be …
(edit) @17286 [17286] 12 years kjdon renamed make_infodatabase to make_infodatabase_dlc so that its not called …
(edit) @17285 [17285] 12 years kjdon fixed a couple of typos in function calls
(edit) @17284 [17284] 12 years ak19 The PerlDoc? seems to indicate that it is necessary to call waitpid after …
(edit) @17283 [17283] 12 years kjdon changed a couple of print statements to be more informative
(edit) @17267 [17267] 12 years anna Updated French translations. Many thanks to John Rose.
(edit) @17250 [17250] 12 years kjdon forgot to pass the arguments to ImageConverter::begin()
(edit) @17249 [17249] 12 years kjdon need to ignore Manifest tag in xml_start_tag
(edit) @17247 [17247] 12 years ak19 The Server Information button produced nothing for some urls, since …
(edit) @17246 [17246] 12 years ak19 Clearer description for the display string OAIDownload.get_doc_exts, …
(edit) @17234 [17234] 12 years ak19 The GLI java class DownloadPane?.java has been changed to alter the …
(edit) @17233 [17233] 12 years ak19 Shorter strings for the various Download Settings in the Download panels. …
(edit) @17232 [17232] 12 years ak19 Added an extra comment to the new quit_yaz subroutine to indicate why the …
(edit) @17231 [17231] 12 years ak19 In subroutine quit_yaz(), while flushing yaz-client's outputstream, only …
(edit) @17230 [17230] 12 years ak19 SRWDownload now finally quits once it has finished. It's no longer running …
(edit) @17229 [17229] 12 years ak19 Moved code for starting up (including opening connections) and quitting …
(edit) @17220 [17220] 12 years ak19 One more occasion where the quit command needs to be sent
(edit) @17219 [17219] 12 years ak19 Previously the yaz-client cmd-line program would not quit (still active …
(edit) @17218 [17218] 12 years ak19 Previously the yaz-client cmd-line program would not quit (still active …
(edit) @17216 [17216] 12 years kjdon trying to get OAI files exploding. Have copied in some code from one of …
(edit) @17214 [17214] 12 years ak19 Significant changes: 1. Textcat can be restricted to a given encoding when …
(edit) @17213 [17213] 12 years ak19 Significant changes to subroutine get_language_encoding to better work out …
(edit) @17212 [17212] 12 years ak19 Removed some unnecessary commented-out code
(edit) @17210 [17210] 12 years kjdon BasDownload? changed to BaseDownload?
(edit) @17209 [17209] 12 years kjdon BasClas? renamed to BaseClassifier?, tidied up constructors
(edit) @17208 [17208] 12 years kjdon file rename BasClas?.pm to BaseClassifier?.pm
(edit) @17207 [17207] 12 years kjdon BasDownload? renamed to BaseDownload?, also tidied up the constructors
(edit) @17206 [17206] 12 years kjdon renamed file BasDownload?.pm to BaseDownload?.pm
(edit) @17205 [17205] 12 years kjdon reordered strings - put all the plugout ones together
(edit) @17204 [17204] 12 years kjdon in use_collection now always set GSDLCOLLECTION. previously was unless …
(edit) @17203 [17203] 12 years kjdon BasPlugout? renamed to BasePlugout?. And tidied up the constructors
(edit) @17202 [17202] 12 years kjdon changed BasPlugout? to BasePlugout? in line with package renaming. Also …
(edit) @17200 [17200] 12 years kjdon renamed BasPlugout? to BasePlugout?
(edit) @17197 [17197] 12 years kjdon previous metadata changes meant that there was no longer URL metadata …
(edit) @17196 [17196] 12 years kjdon set cover_image to false as it makes no sense for images
(edit) @17144 [17144] 12 years kjdon modified export.params, added scripts.gli
(edit) @17143 [17143] 12 years kjdon modified check_removeold_and_keepold so that you don't need to pass in …
(edit) @17127 [17127] 12 years kjdon want to block body background, so added it into tabbg_matches regex for …
(edit) @17126 [17126] 12 years kjdon inherit and use args form ReadTextFile? cos we want the file encoding stuff
(edit) @17120 [17120] 12 years ak19 archivesinf_gdbm commented out until more testing under Windows has been …
(edit) @17117 [17117] 12 years kjdon when indexing a combined field, put the field tags arounds the whole …
(edit) @17112 [17112] 12 years kjdon CJK text segmentation now done at indexing level (in buildproc), not …
(edit) @17111 [17111] 12 years kjdon added a comment
(edit) @17110 [17110] 12 years kjdon changed way cjk separation is done. Not done in plugins any more, but is …
(edit) @17109 [17109] 12 years kjdon moved separate_cjk from colcfg to buildcfg
(edit) @17106 [17106] 12 years mdewsnip No longer writes out the document/section number entries for Lucene, since …
(edit) @17105 [17105] 12 years mdewsnip Not sure why "gdbm-txtgz" was made the default, particularly since …
(edit) @17104 [17104] 12 years mdewsnip Arrrgghhh, someone uglied up my nice tidy code…
(edit) @17103 [17103] 12 years ak19 OAI files should be explodable, so added that back in as an option
(edit) @17099 [17099] 12 years kjdon in get_language_encoding, we extract head from html files. if its not …
(edit) @17088 [17088] 12 years davidb Plugin modified to only print out URL encoded filename if different to …
(edit) @17087 [17087] 12 years davidb Introduction of new GDBM alternative for archives.inf as step towards full …
(edit) @17070 [17070] 12 years anna remove the translation of ImageConverter?.desc element.
(edit) @17069 [17069] 12 years anna make the element texts consistent with the english version.
(edit) @17068 [17068] 12 years anna update several element texts, changed the key names appeared in the text …
(edit) @17066 [17066] 12 years ak19 OAIPlugin now works again: 1. needs to inherit from ReadTextFile? as well …
(edit) @17064 [17064] 12 years anna Make the French version consistent with the current English properties …
(edit) @17063 [17063] 12 years ak19 Special xml element added for images is now commented out since that …
(edit) @17062 [17062] 12 years ak19 When verbosity is set to 0, it now also ignores output generated by a …
(edit) @17059 [17059] 12 years ak19 The invalid MIMEtype image/jpg for generated images are now changed to the …
(edit) @17058 [17058] 12 years ak19 1. Moved the mime_type hashmap out of the guess_mime_type subroutine since …
Note: See TracRevisionLog for help on using the revision log.