|
|
@17579
|
16 years |
kjdon |
removed a debug print statement
|
|
|
@17575
|
16 years |
kjdon |
implemented init_for_incremental_build to read in indexfields and …
|
|
|
@17574
|
16 years |
kjdon |
now calls read_build_cfg() instead of having the code here
|
|
|
@17573
|
16 years |
kjdon |
moved a couple of things around, added read_build_cfg which finds and …
|
|
|
@17572
|
16 years |
kjdon |
moved the make_absolute method to here from buildcol.pl
|
|
|
@17568
|
16 years |
kjdon |
recoding of the text method. more closely matches mgpp one. ZZ field …
|
|
|
@17567
|
16 years |
kjdon |
if metadata is specified, only add in the ones that are not already …
|
|
|
@17566
|
16 years |
kjdon |
lucene no longer does anything with paragraphs, so we print a warning …
|
|
|
@17565
|
16 years |
kjdon |
removed some debug statements, and no longer load in the default …
|
|
|
@17564
|
16 years |
kjdon |
fixed up some stuff to do with indexfieldmap. still working on it, but …
|
|
|
@17549
|
16 years |
ak19 |
Changes to sudden wget download termination when OAIDownload.pm is …
|
|
|
@17547
|
16 years |
ak19 |
When subroutines useWget and useWgetMonitored receive the STOP signal …
|
|
|
@17543
|
16 years |
mdewsnip |
Fixed the block_exp regular expression to move the $ symbol, so it …
|
|
|
@17537
|
16 years |
ak19 |
Subroutine useWgetMonitored updated to include the modifications made …
|
|
|
@17533
|
16 years |
oranfry |
protect against a particular error message poluting XML output
|
|
|
@17531
|
16 years |
ak19 |
Now works with OAIDownload.pm for downloading over OAI. The variable …
|
|
|
@17530
|
16 years |
ak19 |
Fixed not being able to run wget from the cmd-line via downloadfrom.pl …
|
|
|
@17529
|
16 years |
ak19 |
Now WgetDownload.pm uses Sockets to communicate with GLI which …
|
|
|
@17528
|
16 years |
ak19 |
New subroutine setIsGLI to store whether or not the download is run …
|
|
|
@17527
|
16 years |
ak19 |
Now calls new subroutine setIsGLI on the download_obj to indicate …
|
|
|
@17513
|
16 years |
kjdon |
extrametadata keys need to be regexs, so windows paths need converting
|
|
|
@17512
|
16 years |
kjdon |
added a method to turn windows filename paths (with single back slash) …
|
|
|
@17483
|
16 years |
kjdon |
I just discovered that if image magick was not installed, you weren't …
|
|
|
@17480
|
16 years |
kjdon |
removed the pc namespace. the metadata is now extracted metadata, and …
|
|
|
@17479
|
16 years |
kjdon |
put this back to using block expression for now - on windows sets up …
|
|
|
@17476
|
16 years |
mdewsnip |
Support for using MSSQL for infodb databases, many thanks to Jeffrey …
|
|
|
@17463
|
16 years |
kjdon |
some mods to make this a bit more useful in response to request from …
|
|
|
@17462
|
16 years |
kjdon |
added ProCite.entry_separator
|
|
|
@17354
|
16 years |
ak19 |
Added SIGTERM and SIGINT handlers to terminate wget child process …
|
|
|
@17330
|
16 years |
kjdon |
added default values for self->input_encoding and …
|
|
|
@17322
|
16 years |
kjdon |
added a -f test on filename in can_process_this_file to prevent this …
|
|
|
@17321
|
16 years |
anna |
Removed a line break at the end of an French element.
|
|
|
@17320
|
16 years |
kjdon |
found and fixed what I think is a bug - in the metadata structures for …
|
|
|
@17319
|
16 years |
kjdon |
tidied this up and removed some old code
|
|
|
@17313
|
16 years |
kjdon |
this seemed to have been forgotten in the 'removing metadata form …
|
|
|
@17300
|
16 years |
kjdon |
removed the metadata argument from metadata_read as its not used and …
|
|
|
@17294
|
16 years |
kjdon |
added a fix for a bug John T discovered where in get_new_doc_dir you …
|
|
|
@17293
|
16 years |
davidb |
fixed type in function call: parsefile -> parse_file
|
|
|
@17290
|
16 years |
kjdon |
previous changes to get exploding working (using metadata_read) meant …
|
|
|
@17289
|
16 years |
kjdon |
moved the actual parsing from read into parse_file so other plugins …
|
|
|
@17288
|
16 years |
kjdon |
in add_section_content, we are regenrating doc objs from gdbm …
|
|
|
@17287
|
16 years |
kjdon |
added 'if verbosity > 3' to some print statements, and set doctype to …
|
|
|
@17286
|
16 years |
kjdon |
renamed make_infodatabase to make_infodatabase_dlc so that its not …
|
|
|
@17285
|
16 years |
kjdon |
fixed a couple of typos in function calls
|
|
|
@17284
|
16 years |
ak19 |
The PerlDoc seems to indicate that it is necessary to call waitpid …
|
|
|
@17283
|
16 years |
kjdon |
changed a couple of print statements to be more informative
|
|
|
@17267
|
16 years |
anna |
Updated French translations. Many thanks to John Rose.
|
|
|
@17250
|
16 years |
kjdon |
forgot to pass the arguments to ImageConverter::begin()
|
|
|
@17249
|
16 years |
kjdon |
need to ignore Manifest tag in xml_start_tag
|
|
|
@17247
|
16 years |
ak19 |
The Server Information button produced nothing for some urls, since …
|
|
|
@17246
|
16 years |
ak19 |
Clearer description for the display string OAIDownload.get_doc_exts, …
|
|
|
@17234
|
16 years |
ak19 |
The GLI java class DownloadPane.java has been changed to alter the …
|
|
|
@17233
|
16 years |
ak19 |
Shorter strings for the various Download Settings in the Download …
|
|
|
@17232
|
16 years |
ak19 |
Added an extra comment to the new quit_yaz subroutine to indicate why …
|
|
|
@17231
|
16 years |
ak19 |
In subroutine quit_yaz(), while flushing yaz-client's outputstream, …
|
|
|
@17230
|
16 years |
ak19 |
SRWDownload now finally quits once it has finished. It's no longer …
|
|
|
@17229
|
16 years |
ak19 |
Moved code for starting up (including opening connections) and …
|
|
|
@17220
|
16 years |
ak19 |
One more occasion where the quit command needs to be sent
|
|
|
@17219
|
16 years |
ak19 |
Previously the yaz-client cmd-line program would not quit (still …
|
|
|
@17218
|
16 years |
ak19 |
Previously the yaz-client cmd-line program would not quit (still …
|
|
|
@17216
|
16 years |
kjdon |
trying to get OAI files exploding. Have copied in some code from one …
|
|
|
@17214
|
16 years |
ak19 |
Significant changes: 1. Textcat can be restricted to a given encoding …
|
|
|
@17213
|
16 years |
ak19 |
Significant changes to subroutine get_language_encoding to better work …
|
|
|
@17212
|
16 years |
ak19 |
Removed some unnecessary commented-out code
|
|
|
@17210
|
16 years |
kjdon |
BasDownload changed to BaseDownload
|
|
|
@17209
|
16 years |
kjdon |
BasClas renamed to BaseClassifier, tidied up constructors
|
|
|
@17208
|
16 years |
kjdon |
file rename BasClas.pm to BaseClassifier.pm
|
|
|
@17207
|
16 years |
kjdon |
BasDownload renamed to BaseDownload, also tidied up the constructors
|
|
|
@17206
|
16 years |
kjdon |
renamed file BasDownload.pm to BaseDownload.pm
|
|
|
@17205
|
16 years |
kjdon |
reordered strings - put all the plugout ones together
|
|
|
@17204
|
16 years |
kjdon |
in use_collection now always set GSDLCOLLECTION. previously was unless …
|
|
|
@17203
|
16 years |
kjdon |
BasPlugout renamed to BasePlugout. And tidied up the constructors
|
|
|
@17202
|
16 years |
kjdon |
changed BasPlugout to BasePlugout in line with package renaming. Also …
|
|
|
@17200
|
16 years |
kjdon |
renamed BasPlugout to BasePlugout
|
|
|
@17197
|
16 years |
kjdon |
previous metadata changes meant that there was no longer URL metadata …
|
|
|
@17196
|
16 years |
kjdon |
set cover_image to false as it makes no sense for images
|
|
|
@17144
|
16 years |
kjdon |
modified export.params, added scripts.gli
|
|
|
@17143
|
16 years |
kjdon |
modified check_removeold_and_keepold so that you don't need to pass in …
|
|
|
@17127
|
16 years |
kjdon |
want to block body background, so added it into tabbg_matches regex …
|
|
|
@17126
|
16 years |
kjdon |
inherit and use args form ReadTextFile cos we want the file encoding stuff
|
|
|
@17120
|
16 years |
ak19 |
archivesinf_gdbm commented out until more testing under Windows has …
|
|
|
@17117
|
16 years |
kjdon |
when indexing a combined field, put the field tags arounds the whole …
|
|
|
@17112
|
16 years |
kjdon |
CJK text segmentation now done at indexing level (in buildproc), not …
|
|
|
@17111
|
16 years |
kjdon |
added a comment
|
|
|
@17110
|
16 years |
kjdon |
changed way cjk separation is done. Not done in plugins any more, but …
|
|
|
@17109
|
16 years |
kjdon |
moved separate_cjk from colcfg to buildcfg
|
|
|
@17106
|
16 years |
mdewsnip |
No longer writes out the document/section number entries for Lucene, …
|
|
|
@17105
|
16 years |
mdewsnip |
Not sure why "gdbm-txtgz" was made the default, particularly since …
|
|
|
@17104
|
16 years |
mdewsnip |
Arrrgghhh, someone uglied up my nice tidy code…
|
|
|
@17103
|
16 years |
ak19 |
OAI files should be explodable, so added that back in as an option
|
|
|
@17099
|
16 years |
kjdon |
in get_language_encoding, we extract head from html files. if its not …
|
|
|
@17088
|
16 years |
davidb |
Plugin modified to only print out URL encoded filename if different to …
|
|
|
@17087
|
16 years |
davidb |
Introduction of new GDBM alternative for archives.inf as step towards …
|
|
|
@17070
|
16 years |
anna |
remove the translation of ImageConverter.desc element.
|
|
|
@17069
|
16 years |
anna |
make the element texts consistent with the english version.
|
|
|
@17068
|
16 years |
anna |
update several element texts, changed the key names appeared in the …
|
|
|
@17066
|
16 years |
ak19 |
OAIPlugin now works again: 1. needs to inherit from ReadTextFile as …
|
|
|
@17064
|
16 years |
anna |
Make the French version consistent with the current English properties …
|
|
|
@17063
|
16 years |
ak19 |
Special xml element added for images is now commented out since that …
|
|
|
@17062
|
16 years |
ak19 |
When verbosity is set to 0, it now also ignores output generated by a …
|
|
|