source: trunk/gsdl/perllib

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @1735   23 years say1 fixed about a billion little Image things.
(edit) @1733   23 years say1 new plugin for images
(edit) @1732   23 years say1 check metadata before adding
(edit) @1731   23 years jrm21 New and improved! Now gets #include information from std C files as …
(edit) @1730   23 years jrm21 removed a debugging statement left in accidentally…
(edit) @1729   23 years jrm21 title regexp should have started "\s*", not "\s+" - it's optional …
(edit) @1728   23 years jrm21 Minor change so that leading whitespace is skipped when grabbing the …
(edit) @1720   23 years dmm9 Added information to the usage text about date extraction option
(edit) @1719   23 years dmm9 Added information to the usage text about date extraction option
(edit) @1718   23 years dmm9 Added information to the usage text about date extraction option
(edit) @1716   23 years jrm21 minor change to allow the -title option to display correctly on HTML page.
(edit) @1712   23 years say1 cleaned up metadata extraction.
(edit) @1711   23 years say1 fixed minor spelling mistake
(edit) @1710   23 years say1 RecPlug now skips CVS directories.
(edit) @1707   23 years jrm21 Plugin for source code (primarily for putting Greenstone src into a …
(edit) @1706   23 years say1 cleaned up the Title code to strip away standard prefixes inserted by …
(edit) @1705   23 years say1 fixed to handle filenames with multiple dots.
(edit) @1700   23 years say1 changed PSPlug to extract CreationDate, Title and Pages info.
(edit) @1699   23 years say1 fixed the bug in HTML plug which broke images for Dave
(edit) @1694   24 years kjm18 updated to resembled the corresponding mg updated versions
(edit) @1691   24 years jrm21 return "" instead of exit 1 on error. This means that if 1 file …
(edit) @1686   24 years jrm21 HTMLPlug no longer blocks .pdf files. (also updated reference to this …
(edit) @1685   24 years jrm21 PSPlug based heavily on PDFPlug…
(edit) @1679   24 years sjboddie Re-Added recent changes that were lost when the CVS repository was …
(edit) @1677   24 years paynter Added teh BibTex entry type as metadata.
(edit) @1676   24 years paynter Plugins for processing files of bibliography records in BibTex and …
(edit) @1658   24 years paynter Fixed a bug reading the headers that confused "To" with "In-Reply-To".
(edit) @1653   24 years paynter Fixed a few bugs where incorrect variable names were used.
(edit) @1646   24 years paynter Arguments for setting suffix program parameters.
(edit) @1645   24 years paynter Output less verbose & more consistant with buildcol.pl
(edit) @1643   24 years paynter The phind phrase browsing interface is now a Greenstone classifier. …
(edit) @1611   24 years sjboddie Minor bug fix
(edit) @1609   24 years say1 fixed print_uage
(edit) @1608   24 years nzdl Inserted an ugly hack into the Hierarchy classifier to mask a bug …
(edit) @1605   24 years say1 fixed some of my earlier mistakes. sorry Stefan
(edit) @1602   24 years say1 metadata extraction work. (email addresses, generalised HTML tags, …
(edit) @1587   24 years nzdl * empty log message *
(edit) @1586   24 years sjboddie fixed a bug in the cp_r perl routine
(edit) @1572   24 years paynter phind <data> command for use with phind.
(edit) @1515   24 years sjboddie Fixed bug that was introduced with -out option for classifiers
(edit) @1503   24 years davidb A bit of extra error checking.
(edit) @1483   24 years sjboddie added -out option to classifiers
(edit) @1482   24 years davidb Small modification so Index files can be in subdirectories of an …
(edit) @1467   24 years dmm9 pre-Christian date support
(edit) @1454   24 years stefan Lots of changes to perl building code for collectoraction
(edit) @1448   24 years paynter Changed regular expressions for extracting metadata from META tags …
(edit) @1446   24 years paynter Major overhauls; works with the new gsConvert.pl instead of …
(edit) @1442   24 years dmm9 date->Coverage
(edit) @1436   24 years davidb Due to rearrangement of ConvertTo hierarchy, this file is now redundant.
(edit) @1435   24 years davidb Rearrangement of ConvertTo inheritence so HTMLPlug and TextPlug do not …
(edit) @1431   24 years sjboddie Made a few minor adjustments to perl building code for use with …
(edit) @1424   24 years sjboddie Added a -out option to most of the perl building scripts to allow …
(edit) @1420   24 years davidb Moved read_file and read from ConvertToBasPlug to ConvertToPlug.
(edit) @1418   24 years davidb Small modification to improve handling of file names with spaces in.
(edit) @1417   24 years davidb Additions so ConvertPlug etc. can handle filenames with spaces in them.
(edit) @1415   24 years davidb Removed some diagnostic print statements.
(edit) @1412   24 years dmm9 adding the date extractor
(edit) @1411   24 years dmm9 added the options for the date extractor
(edit) @1410   24 years davidb Introduction of "ConvertTo" family of plugins. This establishes a new …
(edit) @1405   24 years say1 fixed acronym bugs
(edit) @1404   24 years say1 fixed acronyms option file. trimmed text at start of bibliographies to …
(edit) @1403   24 years say1 taught HTMLPlug about shtml, asp, cgi, php and html query files …
(edit) @1401   24 years davidb Fixed small problem with associated files.
(edit) @1400   24 years davidb General tidying of code.
(edit) @1396   24 years say1 changed initialisation code for acronyms
(edit) @1393   24 years say1 acronym markup functionality
(edit) @1388   24 years sjboddie fixed a bit of a bug (more of a typo really) in the recent changes …
(edit) @1384   24 years paynter Changed language extraction to ignoer encoding information, so that …
(edit) @1382   24 years paynter Less common languages moved into a subdirectory of textcat so that the …
(edit) @1379   24 years paynter Fixed bug that gave gsdlsourcedocument metadata relative path instead …
(edit) @1377   24 years paynter Added "mirror interval N" command for use with update.pl
(edit) @1374   24 years sjboddie made set_OID use original document text instead of document object
(edit) @1362   24 years say1 removed use statement so other files could be compiled with use strict …
(edit) @1361   24 years say1 rewrote recursively to handle stop words and more cases
(edit) @1360   24 years say1 clarified status messages
(edit) @1358   24 years nzdl Fixed bug I recently introduced into HTMLPlug (<pre> tags were being …
(edit) @1341   24 years paynter Licensing information for TextCat language models.
(edit) @1336   24 years say1 fixed acronym extraction so it is now runs in time linear to the …
(edit) @1335   24 years say1 many acronym changes
(edit) @1317   24 years paynter Added -extract_language option, which uses the textcat language …
(edit) @1316   24 years paynter The textcat language identification package.
(edit) @1315   24 years paynter Language models for the textcat language identification package.
(edit) @1313   24 years sjboddie Added Davids version of AZCompactList which handles multiple value metadata
(edit) @1312   24 years sjboddie fixed a bug in the HTML plugin that showed up under windows
(edit) @1304   24 years sjboddie fixed an intermittent bug (I hope) when building under windows
(edit) @1302   24 years kjm18 buildtype and indexfields added to configuration file entries. these …
(edit) @1301   24 years kjm18 building now writes 'buildtype mgpp' to build.cfg - indicates an mgpp …
(edit) @1287   24 years sjboddie Implemented a -sortmeta option for import.pl to sort archives.inf file …
(edit) @1269   24 years sjboddie Added ZIPPlug plugin for handling input documents that have been …
(edit) @1252   24 years sjboddie Building code now extracts a couple more statistics from mg and …
(edit) @1251   24 years sjboddie Added some stat reporting and a warning message to the build code. Now …
(edit) @1250   24 years sjboddie Tidied up the classfiers slightly, made them a little more object …
(edit) @1246   24 years sjboddie Now prevent "notbuilt" field from going in the build.cfg file unless …
(edit) @1245   24 years sjboddie Fixed a bug that davidb found in a couple of regular expressions
(edit) @1244   24 years sjboddie Caught up most general plugins (that's the ones in …
(edit) @1243   24 years sjboddie Caught HTMLPlug up with BasPlug. A few minor changes to some …
(edit) @1242   24 years sjboddie Added Stuart Yeate's acronym extraction code and made it a standard …
(edit) @1241   24 years sjboddie merged ascii_doc.pm and doc.pm back together (removing basedoc.pm). To …
(edit) @1240   24 years gwp Resolved conflicts between previous two versions.
(edit) @1239   24 years gwp Replaced references to @_ in subroutine parse with a new variable …
Note: See TracRevisionLog for help on using the revision log.