source: main/trunk/greenstone2/perllib/plugins

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @32501   6 years Georgiy Litvinov Workaround to set assign metadata via csv metadata plugin. "Section" …
(edit) @32500   6 years ak19 For a test case, best_encoding come out with prefix/suffix to utf_8, …
(edit) @32499   6 years ak19 Fix for PDFv2 plugin's page buckets.
(edit) @32343   6 years kjdon quantifiers mustn't have \ before {
(edit) @32341   6 years ak19 1. Fixing up regex syntax in DirectoryPlugin for perl 5.26 that comes …
(edit) @32332   6 years kjdon removed replace_images function. this inherits from HTMLPlugin, and …
(edit) @32325   6 years ak19 Dr Bainbridge worked out the solution to HTMLPlugin not handling …
(edit) @32305   6 years ak19 1. When a plugin's built on multiple inheritance, the first n-1 …
(edit) @32303   6 years ak19 Forgot to update the plugin descriptions for the PDF plugins.
(edit) @32290   6 years ak19 1. Making paged_pretty_html the default rather than pretty_html, since …
(edit) @32289   6 years ak19 The PDFPlugin is being deprecated (since PDFv1 and PDFv2 plugins are …
(edit) @32287   6 years ak19 Cleaning up unused strings, some debug statements and recently …
(edit) @32286   6 years ak19 PDFv2Plugin will only work out of the box for GS3 now: PDFBoxConverter …
(edit) @32285   6 years ak19 Fix to sectionalising xpdftools' produced paged_pretty_html: Dr …
(edit) @32284   6 years ak19 PDFv2Plugin doesn't offer a zoom flag anymore, replaced with a dpi …
(edit) @32283   6 years ak19 More stable behaviour by PDFv2Plugin: 1. when pdfbox_conversion is on, …
(edit) @32281   6 years ak19 Undoing accidental commit
(edit) @32280   6 years ak19 Implementing PDFv2paged_text (with pdfbox)
(edit) @32277   6 years ak19 First attempt at PDFv2Plugin.pm.
(edit) @32275   6 years ak19 Moving another fixed English language string into strings.properties …
(edit) @32274   6 years ak19 Related to previous commit, forgot to commit with previous revision. A …
(edit) @32273   6 years ak19 First of the commits to do with restructuring and refactoring the …
(edit) @32224   6 years ak19 Adding PDF to text support for Windows using Xpdf's pdftotext tool. …
(edit) @32223   6 years ak19 When no output mode for PDFPlugin has been set by the user, the output …
(edit) @32222   6 years ak19 q
(edit) @32215   6 years ak19 Before reorganising our PDFPlugin in whatever way we ultimately …
(edit) @32210   6 years ak19 When PDFPlugin is set to paged_html output mode, it now finally …
(edit) @32206   6 years ak19 1. ConvertBinaryFile.pm no longer knows more than necessary about …
(edit) @32205   6 years ak19 First set of commits to do with implementing the new 'paged_html' …
(edit) @32192   6 years kjdon with new result and result_str return values from convert, need to …
(edit) @32186   6 years kjdon if the eval didn't work, all the return values might be undefined, so …
(edit) @32185   6 years kjdon use new return values from ImageConverter::convert
(edit) @32184   6 years kjdon change the return values of convert to match tmp_area_convert_file in …
(edit) @32183   6 years kjdon image height and width might be returned as 'unknown', in particular …
(edit) @32159   6 years ak19 incremental building was not being incremental when no metadata was …
(edit) @32131   6 years kjdon don't want the initial , if trying to match 41 times. this is a syntax …
(edit) @32129   6 years kjdon After () in a regex, {} signifys quantifiers. eg (xx){2,4} - 2-4 …
(edit) @32122   6 years kjdon we had [a-z]{2..} which causes an error in later versions of perl. …
(edit) @32096   6 years ak19 Marking all the uses of sysread() with a comment saying they're a …
(edit) @32028   7 years ak19 1. Bugfix to previous commit: var might not be on an …
(edit) @32026   7 years ak19 Some more placeholder strings for the UnknownConverterPlugin to …
(edit) @31958   7 years kjdon use identify to work out the filetype of the original file, rather …
(edit) @31955   7 years Georgiy Litvinov Prevent page from reloading on open same-section links
(edit) @31926   7 years kjdon added a new option 'store_original_image'. If this is set, and the …
(edit) @31780   7 years ak19 When testing GS3.08's GLI on Ubuntu v 16.04, found its perl v 5.22.1 …
(edit) @31766   7 years ak19 1. Refactored ConvertBinaryFile:tmp_area_convert_file() to do the …
(edit) @31765   7 years ak19 Cosmetic change to error message
(edit) @31764   7 years ak19 Should replace INPUT_FILE placeholder with softlink path tmp_filename …
(edit) @31763   7 years ak19 Fixing somethings before attempting to refactor tmp_area_convert_file
(edit) @31762   7 years ak19 Changed the placeholder names to what Dr Bainbridge suggested, which …
(edit) @31761   7 years ak19 Moved function generate_item_file that's shared between …
(edit) @31760   7 years ak19 Making the plugin active. It's rudimentary, but works when I pass in …
(edit) @31759   7 years ak19 The previous commit put text into doc.xml, but no text was visible in …
(edit) @31757   7 years ak19 Fixed the earlier problems, which, it turned out, had to do with the …
(edit) @31745   7 years ak19 Another change that's needed, this time to add the plugin.
(edit) @31744   7 years ak19 Further changes to new UnknownConverterPlugin that's still in …
(edit) @31743   7 years ak19 Committing first attempt at new UnknownConverterPlugin, which hasn't …
(edit) @31742   7 years ak19 No need to hardcode the plugin name
(edit) @31690   7 years kjdon removing debug statements
(edit) @31689   7 years kjdon removing debug statements
(edit) @31688   7 years kjdon removing debug statements
(edit) @31497   7 years kjdon oops, had commented out a line which meant normal ascii images weren't …
(edit) @31494   7 years kjdon updated text string keys based on new plugin names
(edit) @31493   7 years kjdon removed smart_block option. Its been deprecated for long enough
(edit) @31492   7 years kjdon renamed EncodingUtil to CommonUtil, BasePlugin to BaseImporter. The …
(edit) @31491   7 years kjdon need to normalize the name when we look up in hte block hash too, for macos
(edit) @31487   7 years ak19 Important import statement for the recent commits related to encoding.
(edit) @31480   7 years kjdon util::block_file moved to EncodingUtil::block_raw_filename
(edit) @31479   7 years kjdon inherit from EncodingUtil instead of PrintInfo
(edit) @31478   7 years kjdon blocking stuff moved to here
(edit) @31477   7 years kjdon blocking moved to EncodingUtil. debug stuff still in here. needs tidying up
(edit) @31476   7 years kjdon blocking moved to EncodingUTil
(edit) @31474   7 years kjdon encoding_list is in EncodingUtil now
(edit) @31459   7 years kjdon now inherits from EncodingUtil. When using local directory in metadata …
(edit) @31458   7 years kjdon encoding list now comes from EncodingUtil, not BasePlugin
(edit) @31457   7 years kjdon baseplugin now inherits from EncodingUtil, and all its encoding …
(edit) @31456   7 years kjdon new base plugin for directories and files. DirectoryPLugin needs …
(edit) @31446   7 years ak19 use guess_filesystem_encoding instead of utf8 hard coded. hope it …
(edit) @31445   7 years ak19 added a method guessing_filesystem_encoding. use this to try and work …
(edit) @31444   7 years ak19 block hash filenames should be windows long names
(edit) @31440   7 years kjdon nearly there for handling russian etc subfolders in import. need to …
(edit) @31439   7 years kjdon chnaged a comment
(edit) @31438   7 years kjdon added a couple of comments
(edit) @31420   7 years kjdon lookup_string with extra '1' arg returns perl internal unicode aware …
(edit) @31415   7 years Georgiy Litvinov Modified html links pointed to different section in the same document.
(edit) @31284   7 years davidb Initial cut at plugin for processing HathiTrust METS files
(edit) @31113   7 years ak19 Text item files now handle UTF-8 properly by reading in the file correctly.
(edit) @30857   8 years ak19 Unless new line endings (particularly carriage return characters …
(edit) @30742   8 years kjdon paged docs without images look weird in gs3. need to make a new type, …
(edit) @30681   8 years ak19 3 new strings introduced by Kathy contained the :, which is used as a …
(edit) @30600   8 years ak19 An empty metadata.xml was unrecognised by MetadataXMLPlugin because …
(edit) @30492   8 years Georgiy Litvinov Fix for previous commit.
(edit) @30491   8 years Georgiy Litvinov Removed high and low surrogates from converted html
(edit) @30427   8 years davidb Technique for working out cached-dir name for file updated to allow it …
(edit) @30358   8 years Georgiy Litvinov Fix for -associate_tail_re option. Files with extensions that could be …
(edit) @30022   9 years ak19 Finally committing Dr Bainbridge's suggested fix (tested) to handle …
(edit) @29820   9 years kjdon EmbeddedMEtadataPlugin needs to make raw filenames into unicode for …
(edit) @29818   9 years kjdon removing debug and old test code
(edit) @29817   9 years kjdon removing debug statements
(edit) @29796   9 years kjdon don't need use Win32 and anyway, can't have it when not running on windows
Note: See TracRevisionLog for help on using the revision log.