source: trunk/gsdl/perllib@14115

Name Size Rev Age Author Last Change
classify 14084   17 years mdewsnip Added "partition_name_length" option to GenericList, many thanks to …
cpan 14081   17 years lh92 library for HTML Parser module required for HTMLTidy procedure
downloaders 13961   17 years kjdon need to make sure that the tmp directory exists
plugins 14108   17 years lh92 Plugin for processing MediaWiki pages
plugouts 13469   17 years shaoqun the get_top_metadata_list method has been removed from doc.pm to here
rm 12461   18 years kjdon moved rm::Header::PurePerl out of cpan directory into perllib …
textcat 11384   18 years jrm21 new model
acronym.pm 19.0 KB 7645   20 years jrm21 don't fail if we can't load the diagnostics package.
arcinfo.pm 5.2 KB 12328   18 years shaoqun added code to guard undefined value
basebuilder.pm 19.3 KB 14112   17 years sjboddie More modifications to support additional collection-level …
basebuildproc.pm 16.9 KB 12844   18 years mdewsnip Incremental building and dynamic GDBM updating code, many thanks to …
cfgread4gs3.pm 20.8 KB 14105   17 years qq6 fixed a mistake
cfgread.pm 4.9 KB 8890   19 years davidb Reading of config files has support for environment variables, however …
classify.pm 14.9 KB 14112   17 years sjboddie More modifications to support additional collection-level …
ClassifyTreeModel.pm 8.8 KB 12844   18 years mdewsnip Incremental building and dynamic GDBM updating code, many thanks to …
ClassifyTreeNode.pm 21.8 KB 12844   18 years mdewsnip Incremental building and dynamic GDBM updating code, many thanks to …
ClassifyTreePath.pm 4.4 KB 12844   18 years mdewsnip Incremental building and dynamic GDBM updating code, many thanks to …
cnseg.pm 2.2 KB 537   25 years sjboddie added GPL headers
colcfg.pm 6.8 KB 14115   17 years sjboddie * empty log message *
DateExtract.pm 8.3 KB 2018   23 years jrm21 removed "use BasPlug" lines from metadata extractors, as they …
doc.pm 26.1 KB 13770   17 years shaoqun now use the absolute source path to get the last modified time
docprint.pm 2.6 KB 13190   17 years kjdon changed a comment
docproc.pm 2.2 KB 12268   18 years kjdon set_OIDtype now takes a second argument which is the metadata element …
download.pm 2.6 KB 12465   18 years shaoqun fixed the bugs on windows
encodings.pm 4.5 KB 12604   18 years mdewsnip Added definition for new DOS codepage 852 (Central European) encoding.
expinfo.pm 3.9 KB 8518   19 years chi A new program to deal with export.pl function.
GDBMUtils.pm 3.8 KB 12844   18 years mdewsnip Incremental building and dynamic GDBM updating code, many thanks to …
gflock.pm 1.7 KB 1181   24 years sjboddie got end-user collection building to work (almost) on windows 95. …
ghtml.pm 13.5 KB 8886   19 years mdewsnip Bug fix in html2txt function, thanks to Emanuel Dejanu.
giget.pm 5.1 KB 10112   19 years davidb Minor tweak to pretty printing of "Searching Google images for"
gsprintf.pm 7.7 KB 11632   18 years kjdon renamed strings.rb to strings.properties
incremental_build.pm 9.3 KB 13171   17 years kjdon docprint is no longer a docproc, now need to call get_section_xml and …
IncrementalBuildUtils.pm 20.5 KB 12844   18 years mdewsnip Incremental building and dynamic GDBM updating code, many thanks to …
IncrementalDocument.pm 7.2 KB 12844   18 years mdewsnip Incremental building and dynamic GDBM updating code, many thanks to …
iso639.pm 7.4 KB 14114   17 years nzdl Changed Singhalese to Sinhalese
Kea.pm 3.8 KB 11070   18 years mdewsnip A much tidier Kea.pm that now also works on Windows.
lang.pm 5.6 KB 8716   19 years kjdon added some changes made by Emanuel Dejanu (Simple Words)
lucenebuilder.pm 14.6 KB 13590   17 years kjdon mgpp and lucene. made them always use doc and sec levels for the text …
lucenebuildproc.pm 15.8 KB 14068   17 years mdewsnip A new preprocess_text function that is similar to the mgppbuildproc …
manifest.pm 4.0 KB 11994   18 years davidb Improved support for incremental addition: instead of having to run …
metadatautil.pm 3.3 KB 13492   17 years kjdon sort the metadata fields before putting in a table
mgbuilder.pm 18.5 KB 12971   18 years kjdon removed some debug print statements
mgbuildproc.pm 4.9 KB 12371   18 years mdewsnip If sections_index_document_metadata is on, top level sections no …
mgppbuilder.pm 28.2 KB 13813   17 years mdewsnip mgpp_stem_idx now returns 2 instead of -1 when accent folding is …
mgppbuildproc.pm 11.2 KB 13590   17 years kjdon mgpp and lucene. made them always use doc and sec levels for the text …
multiread.pm 8.1 KB 12832   18 years kjdon added in ascii case in read_file - if not done specially, will be …
muread.pm 4.0 KB 627   25 years rjmcnab initial revision.
parsargv.pm 5.6 KB 8716   19 years kjdon added some changes made by Emanuel Dejanu (Simple Words)
parse2.pm 9.9 KB 12546   18 years kjdon changed parse2::parse so that it returns -1 on error, 0 on success, or …
plugin.pm 10.4 KB 14112   17 years sjboddie More modifications to support additional collection-level …
plugout.pm 2.4 KB 13933   17 years shaoqun make it check collection specific super classes first
printusage.pm 10.6 KB 12626   18 years mdewsnip Removed all the DTD stuff from XML output... it's just one more …
remproc.pm 1.5 KB 8716   19 years kjdon added some changes made by Emanuel Dejanu (Simple Words)
scriptutil.pm 2.3 KB 12966   18 years kjdon check_removeold_and_keepold now checks incremental as well
sorttools.pm 5.8 KB 10977   18 years kjdon extended the match for author metadata - can now have Authors, and …
strings.properties 64.0 KB 14107   17 years lh92 Added strings for MediaWikiPlug
strings_ar.properties 39.8 KB 13299   17 years mdewsnip Removed all the English strings, and fixed all the Arabic strings that …
strings_es.properties 73.6 KB 12639   18 years mdewsnip Changed the "-collect" option to "-collection", because it's a million …
strings_fr.properties 63.7 KB 12639   18 years mdewsnip Changed the "-collect" option to "-collection", because it's a million …
strings_mr.properties 31.5 KB 13444   17 years nzdl A start to Marathi perl strings, thanks to Shubhada Nagarkar
strings_ru.properties 98.3 KB 12639   18 years mdewsnip Changed the "-collect" option to "-collection", because it's a million …
textcat.pm 4.5 KB 2235   23 years sjboddie Hacked the textcat package about so that it only reads all the …
unbuildutil.pm 1.6 KB 7589   20 years kjdon some util stuff for the two unbuild scripts (though only v2 uses it at …
unicode.pm 15.2 KB 10983   18 years jrm21 better error message when we can't load an encoding
util.pm 18.4 KB 11179   18 years kjdon added a cp_r_toplevel function, to copy the contents of a directory, …
webpageutil.pm 518 bytes 1010   24 years sjboddie renamed old html module ghtml -- it clashed with builtin html module …
XMLParser.pm 789 bytes 9239   19 years mdewsnip Changed "unshift" to "push", so an existing XML::Parser on the system …