|
|
@1420
|
24 years |
davidb |
Moved read_file and read from ConvertToBasPlug to ConvertToPlug.
|
|
|
@1418
|
24 years |
davidb |
Small modification to improve handling of file names with spaces in.
|
|
|
@1417
|
24 years |
davidb |
Additions so ConvertPlug etc. can handle filenames with spaces in them.
|
|
|
@1415
|
24 years |
davidb |
Removed some diagnostic print statements.
|
|
|
@1411
|
24 years |
dmm9 |
added the options for the date extractor
|
|
|
@1410
|
24 years |
davidb |
Introduction of "ConvertTo" family of plugins. This establishes
a new …
|
|
|
@1403
|
24 years |
say1 |
taught HTMLPlug about shtml, asp, cgi, php and html query files …
|
|
|
@1401
|
24 years |
davidb |
Fixed small problem with associated files.
|
|
|
@1400
|
24 years |
davidb |
General tidying of code.
|
|
|
@1396
|
24 years |
say1 |
changed initialisation code for acronyms
|
|
|
@1393
|
24 years |
say1 |
acronym markup functionality
|
|
|
@1384
|
24 years |
paynter |
Changed language extraction to ignoer encoding information, so that …
|
|
|
@1379
|
24 years |
paynter |
Fixed bug that gave gsdlsourcedocument metadata relative path instead …
|
|
|
@1360
|
24 years |
say1 |
clarified status messages
|
|
|
@1358
|
24 years |
nzdl |
Fixed bug I recently introduced into HTMLPlug (<pre> tags were being …
|
|
|
@1335
|
24 years |
say1 |
many acronym changes
|
|
|
@1317
|
24 years |
paynter |
Added -extract_language option, which uses the textcat language …
|
|
|
@1312
|
24 years |
sjboddie |
fixed a bug in the HTML plugin that showed up under windows
|
|
|
@1269
|
24 years |
sjboddie |
Added ZIPPlug plugin for handling input documents that have been …
|
|
|
@1245
|
24 years |
sjboddie |
Fixed a bug that davidb found in a couple of regular expressions
|
|
|
@1244
|
24 years |
sjboddie |
Caught up most general plugins (that's the ones in …
|
|
|
@1243
|
24 years |
sjboddie |
Caught HTMLPlug up with BasPlug. A few minor changes to some …
|
|
|
@1242
|
24 years |
sjboddie |
Added Stuart Yeate's acronym extraction code and made it a standard …
|
|
|
@1235
|
24 years |
nzdl |
* empty log message *
|
|
|
@1231
|
24 years |
gwp |
Bug fix on the H1 metadata option: if the file has no <H1> tag, …
|
|
|
@1230
|
24 years |
gwp |
Added an additional H1 metadata field that extracts the text
between …
|
|
|
@1229
|
24 years |
sjboddie |
fixed bug in options
|
|
|
@1227
|
24 years |
sjboddie |
Modified the perl code for importing arabic encoded documents. Plugins …
|
|
|
@1221
|
24 years |
sjboddie |
Added a new HBSPlug which is kind of a generalisation of HBPlug …
|
|
|
@1220
|
24 years |
sjboddie |
Caught HTMLPlug up with the changes I made to BasPlug. HTMLPlug now …
|
|
|
@1219
|
24 years |
sjboddie |
Made BasPlug take options (these options are available to all plugins …
|
|
|
@1206
|
24 years |
gwp |
A thorough rewrite; some of the metadata was flawed in such a way
that …
|
|
|
@1190
|
24 years |
gwp |
The first 200 chars of body text can now be extracted as metadata
by …
|
|
|
@1020
|
24 years |
sjboddie |
changed paths to collection images (again!)
|
|
|
@1010
|
24 years |
sjboddie |
renamed old html module ghtml -- it clashed with builtin html module …
|
|
|
@1006
|
24 years |
sjboddie |
fixed but in previous changes
|
|
|
@973
|
24 years |
sjboddie |
new path to images
|
|
|
@965
|
24 years |
sjboddie |
fixed bug - added assoc_files option
|
|
|
@918
|
24 years |
kjm18 |
fixed bug where it was creating two doc_obj per file instead of just one.
|
|
|
@900
|
24 years |
sjboddie |
tweaked the way associated files are handled at build time - some …
|
|
|
@897
|
24 years |
sjboddie |
lots of stuff
|
|
|
@863
|
24 years |
sjboddie |
fixed a couple of bugs that I introduced when including Davids stuff
|
|
|
@862
|
24 years |
sjboddie |
fixed a couple of bugs that were preventing muliple document gml files …
|
|
|
@850
|
24 years |
sjboddie |
added use strict - tidied a few things up etc.
|
|
|
@849
|
24 years |
sjboddie |
Fixed a bit of a bug
|
|
|
@847
|
25 years |
sjboddie |
fixed CVS burp
|
|
|
@840
|
25 years |
davidb |
Optimisations to make plugin go faster
|
|
|
@839
|
25 years |
davidb |
added extra_metadata function
|
|
|
@809
|
25 years |
sjboddie |
plugins now take options, maxdocs is always defined
|
|
|
@808
|
25 years |
sjboddie |
New html plugin with options
|
|
|
@796
|
25 years |
sjboddie |
semi-colon;;;;
|
|
|
@734
|
25 years |
sjboddie |
removed old out of date comments
|
|
|
@733
|
25 years |
sjboddie |
just minor changes to book cover image stuff
|
|
|
@732
|
25 years |
sjboddie |
prevent from overriding Title metadata that may have been passed
in …
|
|
|
@721
|
25 years |
davidb |
Support functions to help with the generation of webpages from
Perl …
|
|
|
@709
|
25 years |
sjboddie |
no longer need classifytype metadata added from plugin
|
|
|
@707
|
25 years |
sjboddie |
fixed a windows specific bug
|
|
|
@678
|
25 years |
sjboddie |
added bookcover
|
|
|
@640
|
25 years |
sjboddie |
fixed an error that was causing a 'key: ' line to be sent to plugins
|
|
|
@639
|
25 years |
sjboddie |
added the <pre> tags …
|
|
|
@638
|
25 years |
sjboddie |
Gordon's new email plugin thingy
|
|
|
@620
|
25 years |
sjboddie |
functionality extended so a list of directories to ignore can be …
|
|
|
@617
|
25 years |
sjboddie |
a few fixes
|
|
|
@616
|
25 years |
sjboddie |
some new gb plugins
|
|
|
@594
|
25 years |
sjboddie |
Fixed some silly problems
|
|
|
@593
|
25 years |
sjboddie |
Fixed bug causing everything to die if it tripped over an unreadable …
|
|
|
@592
|
25 years |
sjboddie |
new plugin for GB encoded text
|
|
|
@589
|
25 years |
sjboddie |
fixed bug in regular expression
|
|
|
@585
|
25 years |
sjboddie |
new plugin
|
|
|
@537
|
25 years |
sjboddie |
added GPL headers
|
|
|
@433
|
25 years |
sjboddie |
added gzip option to import.pl
|
|
|
@339
|
25 years |
sjboddie |
Fixed a bug that I created last time
|
|
|
@321
|
25 years |
sjboddie |
Fixed a couple of small bugs
|
|
|
@318
|
25 years |
sjboddie |
fixed a bug causing documents without <body> tags to have no text
|
|
|
@317
|
25 years |
sjboddie |
Added maxdocs option
|
|
|
@288
|
25 years |
sjboddie |
Fixed up a few things to allow collections to be built directly …
|
|
|
@286
|
25 years |
sjboddie |
few changes mostly to get howto and organization classifications …
|
|
|
@285
|
25 years |
sjboddie |
Did some hacking around here while trying to build subsets
of existing …
|
|
|
@250
|
25 years |
sjboddie |
just changed a comment to stop confusing myself
|
|
|
@245
|
25 years |
sjboddie |
Small changes to allow metadata to be passed to plugins from …
|
|
|
@233
|
25 years |
sjboddie |
Altered TOCPlug to work with new gdbm building code. It now reads
a …
|
|
|
@230
|
25 years |
sjboddie |
fixed typo I introduced last time
|
|
|
@229
|
25 years |
sjboddie |
Removed old sorting function
|
|
|
@168
|
25 years |
rjmcnab |
Initial revision.
|
|
|
@139
|
25 years |
sjboddie |
Got building stuff to handle subcollections and language subcollections
|
|
|
@136
|
25 years |
rjmcnab |
Fixed small bug.
|
|
|
@88
|
26 years |
rjmcnab |
Special characters are now escaped in GML files.
|
|
|
@87
|
26 years |
rjmcnab |
Fixed a bug in the sorting and then removed the sorting as its too …
|
|
|
@75
|
26 years |
rjmcnab |
Added support for the new version of doc.pl (which uses the UTF-8 …
|
|
|
@70
|
26 years |
sjboddie |
Put sorting back in for now as removing it caused some problems for …
|
|
|
@68
|
26 years |
sjboddie |
Don't need to be sorting files at build time anymore
|
|
|
@4
|
26 years |
sjboddie |
Initial revision
|