source: trunk/gsdl/perllib/mgbuilder.pm

Revision Log Mode:


Legend:

Added
Modified
Copied or renamed
Diff Rev Age Author Log Message
(edit) @12971   18 years kjdon removed some debug print statements
(edit) @12910   18 years kjdon new indexoptions field in collect.cfg specifies which stem indexes …
(edit) @12340   18 years kjdon maxnumeric is set using set_maxnumeric (by buildcol.pl) rather than …
(edit) @10468   19 years kjdon made a base builder class, adn moved lots of the code to it. hoe I …
(edit) @10158   19 years davidb *builder.pm packages (principally lucenebuilder.pl which inherits from …
(edit) @9938   19 years kjdon added a new field to build.cfg: indexstem. specifies the root of teh …
(edit) @9853   19 years kjdon fixed up maxdocs - now pass an extra parameter to the read function
(edit) @9669   19 years kjdon fixed up the case where you have subcollection partitions and language …
(edit) @9548   19 years kjdon for teh language subcollection stuff, can now specify …
(edit) @8776   19 years kjdon fixed a bug whereby you couldn't build more than 11 subcollections
(edit) @8716   19 years kjdon added some changes made by Emanuel Dejanu (Simple Words)
(edit) @8361   20 years kjdon renamed build option 'allclassifications' to …
(edit) @7904   20 years chi Minor changes to layout of code.
(edit) @7150   20 years mdewsnip Now chooses the English collectionmeta value (if it exists) to be the …
(edit) @6584   20 years kjdon Fiddled around with segmenting for chinese text. Haven't changed how …
(edit) @6545   20 years kjdon added a comment
(edit) @6543   20 years kjdon the bit where you put language items into the indexexparr was …
(edit) @6407   20 years jmt12 Added non-language specific messages useful for progress checking. …
(edit) @6332   20 years jmt12 When -gli argument is provided to calling script these modules will …
(edit) @5768   21 years kjdon added a check for failed indexing - just test to see if the .id file …
(edit) @5225   21 years sjboddie Fixed a couple of bugs in recent changes for building/displaying …
(edit) @4743   21 years sjboddie Build code changes allowing mg collections containing no indexes to be …
(edit) @4192   21 years sjboddie Added maxnumeric build time configuration option. This is useful for …
(edit) @3115   22 years jrm21 Redirect mg(pp)_passes stderr to /dev/null if the "-out xxx" option is …
(edit) @2785   23 years sjboddie The build process now creates a summary of how many files were …
(edit) @2772   23 years kjm18 changes to enable language specific collectionmeta in collect.cfg …
(edit) @2506   23 years dmm9 added writing of collection document list to db (OID browselist)
(edit) @2336   23 years sjboddie added a -no_text option to buildcol.pl to allow collections to be …
(edit) @1973   23 years kjm18 fixed up language stuff
(edit) @1803   23 years paynter Moved the phind classifier's data directory into the index directory. …
(edit) @1799   23 years sjboddie fixed a little bug in the building code that caused an endless loop if …
(edit) @1679   23 years sjboddie Re-Added recent changes that were lost when the CVS repository was …
(edit) @1483   24 years sjboddie added -out option to classifiers
(edit) @1424   24 years sjboddie Added a -out option to most of the perl building scripts to allow …
(edit) @1304   24 years sjboddie fixed an intermittent bug (I hope) when building under windows
(edit) @1252   24 years sjboddie Building code now extracts a couple more statistics from mg and …
(edit) @1251   24 years sjboddie Added some stat reporting and a warning message to the build code. Now …
(edit) @1246   24 years sjboddie Now prevent "notbuilt" field from going in the build.cfg file unless …
(edit) @1243   24 years sjboddie Caught HTMLPlug up with BasPlug. A few minor changes to some …
(edit) @1072   24 years sjboddie Fixed bug - Control B's and C's were only being removed from body of …
(edit) @835   24 years davidb added 'begin' and 'end' function for plugins
(edit) @810   24 years sjboddie plugins now take options, files are associated at build time as well …
(edit) @784   24 years sjboddie added -keepold option
(edit) @782   24 years sjboddie removed gettext.pl - added debug, mode and index options to …
(edit) @780   24 years sjboddie added dontgdbm configuration option
(edit) @677   25 years sjboddie changes for new browsing support
(edit) @626   25 years rjmcnab Added collection specific stage of building
(edit) @537   25 years sjboddie added GPL headers
(edit) @486   25 years rjmcnab Added code to keep track of indexes that aren't built and changed the …
(edit) @461   25 years sjboddie added ability to add collection-level metadata with collectionmeta …
(edit) @378   25 years sjboddie fixed bug in writing out subcollectionmap and languagemap to build.cfg
(edit) @315   25 years sjboddie - removed old infodb stuff - changed the way classifiers work - added …
(edit) @292   25 years rjmcnab Added ability to post-process text before it is fed to be indexed or …
(edit) @290   25 years rjmcnab Added code to retain the ordering of indexes, subcollections, and …
(edit) @246   25 years sjboddie Now checks collect.cfg for a 'doctype' field and if it finds it it's …
(edit) @215   25 years rjmcnab Added code to build a new gdbm format.
(edit) @139   25 years sjboddie Got building stuff to handle subcollections and language subcollections
(edit) @134   25 years rjmcnab Used GSDLCOLLECTDIR which is now defined in util.pm.
(edit) @98   25 years rjmcnab Switched doc.pl over to use unicode. Changed the importing software to …
(edit) @91   25 years rjmcnab Changed the directory structure (collect.cfg and site.cfg now reside …
(edit) @85   25 years rjmcnab Fixed a few small bugs.
(edit) @81   25 years sjboddie fixed minor bug in create_index_mapping()
(edit) @69   25 years sjboddie Sub-collection indexes may now be defined within the collect.cfg file …
(edit) @16   25 years rjmcnab Altered to work with perllib/ (instead of the old directory name lib/)
(add) @4   25 years sjboddie Initial revision
Note: See TracRevisionLog for help on using the revision log.