source: main/trunk/greenstone2/perllib/plugins

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @32215   6 years ak19 Before reorganising our PDFPlugin in whatever way we ultimately …
(edit) @32210   6 years ak19 When PDFPlugin is set to paged_html output mode, it now finally …
(edit) @32206   6 years ak19 1. ConvertBinaryFile.pm no longer knows more than necessary about …
(edit) @32205   6 years ak19 First set of commits to do with implementing the new 'paged_html' …
(edit) @32192   6 years kjdon with new result and result_str return values from convert, need to …
(edit) @32186   6 years kjdon if the eval didn't work, all the return values might be undefined, so …
(edit) @32185   6 years kjdon use new return values from ImageConverter::convert
(edit) @32184   6 years kjdon change the return values of convert to match tmp_area_convert_file in …
(edit) @32183   6 years kjdon image height and width might be returned as 'unknown', in particular …
(edit) @32159   6 years ak19 incremental building was not being incremental when no metadata was …
(edit) @32131   6 years kjdon don't want the initial , if trying to match 41 times. this is a syntax …
(edit) @32129   6 years kjdon After () in a regex, {} signifys quantifiers. eg (xx){2,4} - 2-4 …
(edit) @32122   6 years kjdon we had [a-z]{2..} which causes an error in later versions of perl. …
(edit) @32096   6 years ak19 Marking all the uses of sysread() with a comment saying they're a …
(edit) @32028   7 years ak19 1. Bugfix to previous commit: var might not be on an …
(edit) @32026   7 years ak19 Some more placeholder strings for the UnknownConverterPlugin to …
(edit) @31958   7 years kjdon use identify to work out the filetype of the original file, rather …
(edit) @31955   7 years Georgiy Litvinov Prevent page from reloading on open same-section links
(edit) @31926   7 years kjdon added a new option 'store_original_image'. If this is set, and the …
(edit) @31780   7 years ak19 When testing GS3.08's GLI on Ubuntu v 16.04, found its perl v 5.22.1 …
(edit) @31766   7 years ak19 1. Refactored ConvertBinaryFile:tmp_area_convert_file() to do the …
(edit) @31765   7 years ak19 Cosmetic change to error message
(edit) @31764   7 years ak19 Should replace INPUT_FILE placeholder with softlink path tmp_filename …
(edit) @31763   7 years ak19 Fixing somethings before attempting to refactor tmp_area_convert_file
(edit) @31762   7 years ak19 Changed the placeholder names to what Dr Bainbridge suggested, which …
(edit) @31761   7 years ak19 Moved function generate_item_file that's shared between …
(edit) @31760   7 years ak19 Making the plugin active. It's rudimentary, but works when I pass in …
(edit) @31759   7 years ak19 The previous commit put text into doc.xml, but no text was visible in …
(edit) @31757   7 years ak19 Fixed the earlier problems, which, it turned out, had to do with the …
(edit) @31745   7 years ak19 Another change that's needed, this time to add the plugin.
(edit) @31744   7 years ak19 Further changes to new UnknownConverterPlugin that's still in …
(edit) @31743   7 years ak19 Committing first attempt at new UnknownConverterPlugin, which hasn't …
(edit) @31742   7 years ak19 No need to hardcode the plugin name
(edit) @31690   7 years kjdon removing debug statements
(edit) @31689   7 years kjdon removing debug statements
(edit) @31688   7 years kjdon removing debug statements
(edit) @31497   7 years kjdon oops, had commented out a line which meant normal ascii images weren't …
(edit) @31494   7 years kjdon updated text string keys based on new plugin names
(edit) @31493   7 years kjdon removed smart_block option. Its been deprecated for long enough
(edit) @31492   7 years kjdon renamed EncodingUtil to CommonUtil, BasePlugin to BaseImporter. The …
(edit) @31491   7 years kjdon need to normalize the name when we look up in hte block hash too, for macos
(edit) @31487   7 years ak19 Important import statement for the recent commits related to encoding.
(edit) @31480   7 years kjdon util::block_file moved to EncodingUtil::block_raw_filename
(edit) @31479   7 years kjdon inherit from EncodingUtil instead of PrintInfo
(edit) @31478   7 years kjdon blocking stuff moved to here
(edit) @31477   7 years kjdon blocking moved to EncodingUtil. debug stuff still in here. needs tidying up
(edit) @31476   7 years kjdon blocking moved to EncodingUTil
(edit) @31474   7 years kjdon encoding_list is in EncodingUtil now
(edit) @31459   7 years kjdon now inherits from EncodingUtil. When using local directory in metadata …
(edit) @31458   7 years kjdon encoding list now comes from EncodingUtil, not BasePlugin
(edit) @31457   7 years kjdon baseplugin now inherits from EncodingUtil, and all its encoding …
(edit) @31456   7 years kjdon new base plugin for directories and files. DirectoryPLugin needs …
(edit) @31446   7 years ak19 use guess_filesystem_encoding instead of utf8 hard coded. hope it …
(edit) @31445   7 years ak19 added a method guessing_filesystem_encoding. use this to try and work …
(edit) @31444   7 years ak19 block hash filenames should be windows long names
(edit) @31440   7 years kjdon nearly there for handling russian etc subfolders in import. need to …
(edit) @31439   7 years kjdon chnaged a comment
(edit) @31438   7 years kjdon added a couple of comments
(edit) @31420   7 years kjdon lookup_string with extra '1' arg returns perl internal unicode aware …
(edit) @31415   7 years Georgiy Litvinov Modified html links pointed to different section in the same document.
(edit) @31284   7 years davidb Initial cut at plugin for processing HathiTrust METS files
(edit) @31113   7 years ak19 Text item files now handle UTF-8 properly by reading in the file correctly.
(edit) @30857   8 years ak19 Unless new line endings (particularly carriage return characters …
(edit) @30742   8 years kjdon paged docs without images look weird in gs3. need to make a new type, …
(edit) @30681   8 years ak19 3 new strings introduced by Kathy contained the :, which is used as a …
(edit) @30600   8 years ak19 An empty metadata.xml was unrecognised by MetadataXMLPlugin because …
(edit) @30492   8 years Georgiy Litvinov Fix for previous commit.
(edit) @30491   8 years Georgiy Litvinov Removed high and low surrogates from converted html
(edit) @30427   8 years davidb Technique for working out cached-dir name for file updated to allow it …
(edit) @30358   8 years Georgiy Litvinov Fix for -associate_tail_re option. Files with extensions that could be …
(edit) @30022   9 years ak19 Finally committing Dr Bainbridge's suggested fix (tested) to handle …
(edit) @29820   9 years kjdon EmbeddedMEtadataPlugin needs to make raw filenames into unicode for …
(edit) @29818   9 years kjdon removing debug and old test code
(edit) @29817   9 years kjdon removing debug statements
(edit) @29796   9 years kjdon don't need use Win32 and anyway, can't have it when not running on windows
(edit) @29795   9 years kjdon change to using util method raw_filename_to_unicode. got this working …
(edit) @29763   9 years ak19 on macos, accented chars in filenames are in decomposed form, eg the …
(edit) @29762   9 years ak19 check if the filenames are url encoded - this happens for eg accented …
(edit) @29760   9 years kjdon try decoding against locale rather than utf8. will this work on …
(edit) @29745   9 years kjdon using Encode::decode to make the filenames 'unicode aware'. For …
(edit) @29476   9 years sjs49 First of 2 commits to get diffcol on the 64 bit Ubuntu that has perl …
(edit) @29102   10 years kjdon added the string for PDFPlugin.use_realistic_book option
(edit) @29101   10 years kjdon added -use_realistic_book option. htis makes user you are converting …
(edit) @28836   10 years ak19 A question on the mailing list involved accented characters in custom …
(edit) @28803   10 years ak19 Testing with accented characters in MARC data showed up problems in …
(edit) @28783   10 years ak19 Treatment of 'and' in the MARC*Plugin.pm an issue for Greenstone …
(edit) @28782   10 years ak19 Routine for reading in text files failed to 'decode' from UTF-8 to …
(edit) @28669   10 years ak19 This plugin is similar to CSVPlugin, but for tab-separated metadata files
(edit) @28638   10 years kjdon don't process a doc.xml entry if the group-position > 1: we have …
(edit) @28603   10 years ak19 Found some issues when wanting to add in the CDS-ISIS tutorial …
(edit) @28563   10 years kjdon changing some util:: methods to FileUtils:: methods
(edit) @28560   10 years ak19 1. New subroutine util::set_gnomelib_env that sets the environment for …
(edit) @28489   11 years davidb Support for Cygwin added
(edit) @28381   11 years ak19 Bugfix. When dealing with filenames with special characters that are …
(edit) @28375   11 years davidb A set of changes to help Greenstone building code (perl) run under …
(edit) @28355   11 years ak19 1. Now gsConvert.pl calls the new pptextract.vbs VBScript (which …
(edit) @28319   11 years ak19 The replace-with-src-doc feature had stopped working. It needed …
(edit) @28285   11 years ak19 Deprecated util:: subroutines replaced with their FileUtils equivalents
(edit) @28267   11 years davidb Code change to allow doc.xml files that do not have a DOCTYPE line
(edit) @28265   11 years davidb Revised RE for accepting doc.xml files to allow for time-stamped ones
Note: See TracRevisionLog for help on using the revision log.