|
|
@8904
|
19 years |
jrm21 |
need to do qp decoding before doing text_into_html so we don't keep …
|
|
|
@8903
|
19 years |
jrm21 |
fix typo in ensure_utf8 function for asian characters
|
|
|
@8902
|
19 years |
jrm21 |
slightly better way of recognising gb charset names (mapped to 'gb')
|
|
|
@8895
|
19 years |
chi |
Modification of the validated METS format in the docmets.xml.
|
|
|
@8894
|
19 years |
chi |
Modification of the validated METS format in the docmets.xml
|
|
|
@8893
|
19 years |
davidb |
Additional check added to plugins read function to remain compatible …
|
|
|
@8892
|
19 years |
davidb |
Addition of new minus option to BasPlug: -associate_ext.
This new …
|
|
|
@8891
|
19 years |
davidb |
Revision of argument types to a few plugin options to better reflect …
|
|
|
@8890
|
19 years |
davidb |
Reading of config files has support for environment variables, however …
|
|
|
@8889
|
19 years |
davidb |
Small modification of image URL manipulation to remain consistent with …
|
|
|
@8886
|
19 years |
mdewsnip |
Bug fix in html2txt function, thanks to Emanuel Dejanu.
|
|
|
@8885
|
19 years |
mdewsnip |
Ooops... fixed up a mismatch between the name of an option and how it …
|
|
|
@8866
|
19 years |
kjdon |
if the list is split into only one bucket, then suppress the HList
|
|
|
@8854
|
19 years |
kjdon |
we now format the metadata used for sorting the import docs, can also …
|
|
|
@8852
|
19 years |
kjdon |
shifted format_metadata_for_sorting from BasClas to sorttools, so …
|
|
|
@8843
|
19 years |
jrm21 |
fix problem for -metadata_fields if tag1<Tag2> given for mapping to a …
|
|
|
@8818
|
19 years |
mdewsnip |
Title tags over multiple lines will now be removed correctly before …
|
|
|
@8814
|
19 years |
mdewsnip |
Updated files for Kea 3.0, thanks to Olena.
|
|
|
@8797
|
19 years |
kjdon |
added new oidtypes 'assigned' (from stephen de gabrielle) and fixed up …
|
|
|
@8796
|
19 years |
kjdon |
added new oidtypes 'assigned' (from stephen de gabrielle) and …
|
|
|
@8795
|
19 years |
kjdon |
if use_sections is on, now we are a bit more relaxed about what the …
|
|
|
@8794
|
19 years |
jrm21 |
remove trailing \n from meta tags (bug reported by Tim Finney, 13 Dec 2004)
|
|
|
@8789
|
19 years |
mdewsnip |
Better documentation of the extract keyphrases (Kea) code, thanks to Olena.
|
|
|
@8776
|
19 years |
kjdon |
fixed a bug whereby you couldn't build more than 11 subcollections
|
|
|
@8767
|
19 years |
jrm21 |
add 'use utf8' so hopefully substr() is smart enough to cut between …
|
|
|
@8764
|
19 years |
chi |
Modifications of the use of BasPlug
|
|
|
@8763
|
19 years |
mdewsnip |
Added a missing curly bracket.
|
|
|
@8762
|
19 years |
mdewsnip |
The files this plugin processes can be exploded by the …
|
|
|
@8761
|
19 years |
mdewsnip |
XML plugin descriptions now include an <Explodes> tag that records …
|
|
|
@8749
|
19 years |
mdewsnip |
Now escapes '<' and '>' characters in metadata values correctly.
|
|
|
@8740
|
19 years |
chi |
Modifications for validated METS format.
|
|
|
@8739
|
19 years |
chi |
A new plugin - BNContentePlug to deal with Portugal BN collections.
|
|
|
@8737
|
19 years |
davidb |
Extension to RecPlug so metadata that goes with a file that is in a …
|
|
|
@8730
|
19 years |
jrm21 |
don't need 2 identical files in cvs
|
|
|
@8729
|
19 years |
kjdon |
changed List so that when it's being used from AZCompactList it does …
|
|
|
@8728
|
19 years |
kjdon |
changed reinit a bit so that things that only need to be done once get …
|
|
|
@8716
|
19 years |
kjdon |
added some changes made by Emanuel Dejanu (Simple Words)
|
|
|
@8684
|
19 years |
mdewsnip |
Ooops... the OAIPlug has never worked properly on Windows! Regular …
|
|
|
@8682
|
19 years |
mdewsnip |
Added a filename_head function that returns the path of a file without …
|
|
|
@8679
|
19 years |
kjdon |
BasPlug cover images are now turned on by default, and the option is …
|
|
|
@8678
|
19 years |
kjdon |
cover images are now turned on by default, and the option is changed …
|
|
|
@8668
|
19 years |
kjdon |
when processing description tags, it used to use …
|
|
|
@8647
|
19 years |
mdewsnip |
Added a "-newest_first" option to DateList for reverse chronological …
|
|
|
@8646
|
19 years |
mdewsnip |
Made ISISPlug.pm a bit more robust to crap files.
|
|
|
@8563
|
20 years |
mdewsnip |
Ripped all the obtaining referenced documents and exploding database …
|
|
|
@8519
|
20 years |
mdewsnip |
Fixed the extra Title metadata problem with David's help.
|
|
|
@8518
|
20 years |
chi |
A new program to deal with export.pl function.
|
|
|
@8517
|
20 years |
chi |
Add and modify methods to deal with exporting GS collections to "METS" …
|
|
|
@8516
|
20 years |
chi |
Add new messages for export.pl function.
|
|
|
@8515
|
20 years |
chi |
Add a new method metadata_read to identify any specific or extra …
|
|
|
@8514
|
20 years |
chi |
Modify the namespace in METS file as "gsdl3"
|
|
|
@8513
|
20 years |
chi |
Add a method metadata_read in order to go straight to BasPlug and …
|
|
|
@8512
|
20 years |
chi |
Add a new metadata_read method in the first pass in order to identify …
|
|
|
@8511
|
20 years |
chi |
A new plugin to import the collections in DSpace format to GS2.
|
|
|
@8510
|
20 years |
chi |
Add a new method metadata_read to deal with specific (or external) …
|
|
|
@8509
|
20 years |
chi |
Add new methods (with a smart_block option) to store the blocked …
|
|
|
@8504
|
20 years |
chi |
Modification of METS format in order to be compatible with GS3. Also, …
|
|
|
@8502
|
20 years |
kjdon |
changed mets:FLocate to mets:FLocat
|
|
|
@8479
|
20 years |
kjdon |
fixed a typo in the arg list which meant it didn't work with -xml
|
|
|
@8446
|
20 years |
kjdon |
new classifier: AutoHierarchy. Does the same thing as Hierarchy …
|
|
|
@8445
|
20 years |
kjdon |
fixed a bug I introduced with the remove_empty_classifications thing - …
|
|
|
@8402
|
20 years |
kjdon |
fixed up the header page stuff with pagedimgplug - docs always have a …
|
|
|
@8366
|
20 years |
kjdon |
added script to the list of tags to process as relative links, and js …
|
|
|
@8365
|
20 years |
kjdon |
put double quotes around values of <a href=xxx> and <img src=xxx>
|
|
|
@8363
|
20 years |
kjdon |
renamed build option 'allclassifications' to …
|
|
|
@8362
|
20 years |
kjdon |
added a new option to the phind classifier: min_occurs. this is the …
|
|
|
@8361
|
20 years |
kjdon |
renamed build option 'allclassifications' to …
|
|
|
@8350
|
20 years |
kjdon |
assign the fall back title after processing any other metadata, so …
|
|
|
@8315
|
20 years |
mdewsnip |
Was adding Source metadata twice.
|
|
|
@8278
|
20 years |
jrm21 |
sanity check for a valid date before trying to add it as metadata, …
|
|
|
@8275
|
20 years |
cs025 |
Avoids problems with 'oai' being visible better than the previous version.
|
|
|
@8252
|
20 years |
kjdon |
changed the pagedimgplug -noheaderpage to -headerpage
|
|
|
@8246
|
20 years |
kjdon |
changed the default to have noheaderpage, so the option is now …
|
|
|
@8245
|
20 years |
kjdon |
a few fixes for problems found on Ian's laptop
|
|
|
@8227
|
20 years |
jrm21 |
all perl things should "use strict;" to catch errors!
$cursection was …
|
|
|
@8226
|
20 years |
jrm21 |
tell HTMLPlug to extract the author metadata, and rename it to Creator.
|
|
|
@8225
|
20 years |
jrm21 |
support tag<tagname> as described in the pluginfo for HTMLPlug. The …
|
|
|
@8221
|
20 years |
cs025 |
Added AllList to provide a universal list of all documents, which …
|
|
|
@8220
|
20 years |
cs025 |
Extensions to underpin OAI - e.g. creation of the OAI classifier, …
|
|
|
@8218
|
20 years |
jrm21 |
use the unicode::ensure_utf8() function on the extracted text so we …
|
|
|
@8217
|
20 years |
jrm21 |
added a safety check to ensure_utf8()
|
|
|
@8171
|
20 years |
mdewsnip |
FileFormat metadata for PostScript files should now be set correctly.
|
|
|
@8170
|
20 years |
mdewsnip |
Fixed some of the new FileFormat metadata so you only get one value …
|
|
|
@8166
|
20 years |
mdewsnip |
Added FileSize metadata in most plugins.
|
|
|
@8154
|
20 years |
kjdon |
added a bit more to the sortmeta description
|
|
|
@8145
|
20 years |
mdewsnip |
Check for ImageMagick being installed and on the path, and bail early …
|
|
|
@8139
|
20 years |
mdewsnip |
Now adds NumPages metadata.
|
|
|
@8138
|
20 years |
mdewsnip |
Added FileFormat metadata.
|
|
|
@8121
|
20 years |
chi |
Add the "FileFormat" metadata to each of the Plugins.
|
|
|
@8119
|
20 years |
jrm21 |
allow multiple callbacks, one for each metadata field (using the …
|
|
|
@8117
|
20 years |
mdewsnip |
Fixed a bug where extra dots in filenames would cause the file …
|
|
|
@8102
|
20 years |
mdewsnip |
Unfinished, but I'm committing it now so I don't lose it.
|
|
|
@8098
|
20 years |
jrm21 |
* guess a title if no \title tag
* \it tag
* fractions in maths mode
|
|
|
@8097
|
20 years |
jrm21 |
added extra accent for \"i
|
|
|
@8094
|
20 years |
jrm21 |
fix errors with uninitialised variables if 'saveas' not specified.
…
|
|
|
@8090
|
20 years |
davidb |
Switching RecPlug over to using XMLParser wrapper rather than …
|
|
|
@8087
|
20 years |
mdewsnip |
On Windows we use the XML::Parser stuff in bin/windows/perl/lib rather …
|
|
|
@8079
|
20 years |
davidb |
docsave.pm had been saving both GA and METS format. if-statement …
|
|
|
@8072
|
20 years |
davidb |
Support for building collections with lucene.
|
|
|
@8071
|
20 years |
davidb |
When title metadata is derived from first 100 chars of text,
extra =~ …
|
|
|