|
|
@37194
|
17 months |
davidb |
Tested version of file-level document-version history (fldv-history) …
|
|
|
@37187
|
17 months |
davidb |
Reworking of file-level document-version history, in light of a …
|
|
|
@37184
|
17 months |
davidb |
Further refinement of idea, with emphasis on using plugins arguments …
|
|
|
@37182
|
17 months |
davidb |
Internal plugin to make things easier when processing JSON files. …
|
|
|
@37179
|
17 months |
davidb |
Only want to get OIDMetadata if OIDtype is assigned. Also adjusted …
|
|
|
@37148
|
17 months |
davidb |
Had not been careful enough with my refactoring of the code that …
|
|
|
@37051
|
18 months |
davidb |
A variety of changes: added in call to post_process_doc_obj() which is …
|
|
|
@37050
|
18 months |
davidb |
Changed to use the new add_dummpy_text_if_empty routine
|
|
|
@37049
|
18 months |
davidb |
Comment text tidyup
|
|
|
@37048
|
18 months |
davidb |
Useful support routine added that only sets the document field to say …
|
|
|
@37047
|
18 months |
davidb |
Introduction of 'metadata_separate_fields', a plugin option that …
|
|
|
@37028
|
18 months |
davidb |
Whitespace fixup
|
|
|
@36910
|
19 months |
kjdon |
Added some code for tk labels - options to take a single field and …
|
|
|
@36885
|
20 months |
kjdon |
need to make sure maxdocs is not -1 (ie process all docs) before …
|
|
|
@36587
|
22 months |
davidb |
Test first if defined, to avoid unassigned variable warning
|
|
|
@36533
|
22 months |
kjdon |
added CSVPLugin option use_namespace_for_field_names - prepend the …
|
|
|
@36482
|
22 months |
kjdon |
added some new options for CSVPlugin
|
|
|
@36481
|
22 months |
kjdon |
now fixed up the files to use the new names
|
|
|
@36480
|
22 months |
kjdon |
new and improved CSVPlugin - handles input encoding, spaces inside …
|
|
|
@36479
|
22 months |
kjdon |
renaming the old CSVPlugin and MetadataCSVPlugin to Deprecated …
|
|
|
@36470
|
22 months |
davidb |
Tweaks after refactoring. Causes 'use strict' to report error as …
|
|
|
@36380
|
22 months |
kjdon |
there were 2 for-gs311 versions of SplitTextFile. THe other onne had …
|
|
|
@36379
|
22 months |
kjdon |
replacing SplitTextFile with the for-gs311 version
|
|
|
@36374
|
22 months |
kjdon |
missing curly bracket
|
|
|
@36373
|
22 months |
kjdon |
now I have removed commented out code from last commit
|
|
|
@36372
|
22 months |
kjdon |
tidy up of extrametautil, renaming some methods to make them easier to …
|
|
|
@36297
|
2 years |
anupama |
lomdemo-e's XMLRecord value does need the line feed (backslash-n) …
|
|
|
@36293
|
2 years |
anupama |
Forgot to commit minor change for GS3 to LOMPlugin previously. If GS3, …
|
|
|
@36271
|
2 years |
davidb |
Check for any sign of text being bound to the doc, before going ahead …
|
|
|
@36095
|
2 years |
kjdon |
look for files in collection/tmp folder, as well as import, and cache …
|
|
|
@36057
|
2 years |
kjdon |
in the whakatohea project, converting the pdfs to paged_pretty_html …
|
|
|
@35401
|
3 years |
anupama |
Committing Dr Bainbridge's improvements to the Tika-preconfigured …
|
|
|
@35173
|
3 years |
kjdon |
renamed gsConvert.pl option to verbosity (instead of verbose) - why …
|
|
|
@35166
|
3 years |
kjdon |
added code that handles utf16 surrogate pair entities.
|
|
|
@35164
|
3 years |
kjdon |
xpdf seems to output surrogate pairs into the html - these end up …
|
|
|
@34999
|
3 years |
davidb |
Indented to better align with map block
|
|
|
@34998
|
3 years |
davidb |
These changes have now been committed into SVN
|
|
|
@34997
|
3 years |
davidb |
When working with orthogonal indexes, these plugins constructors get …
|
|
|
@34921
|
3 years |
anupama |
Committing the improvements to EmbeddedMetaPlugin's processing of …
|
|
|
@34878
|
3 years |
davidb |
Further changes to work more smoothly with JSONSparqlResultsPlugin, …
|
|
|
@34840
|
3 years |
davidb |
Changed to apply extra-metadata before trying to work out doc-id. …
|
|
|
@34690
|
3 years |
davidb |
When using an orthogonal index, the constructor is run for a second …
|
|
|
@34643
|
3 years |
davidb |
Version of file that is designed to work with planned changes in GS v3.11
|
|
|
@34250
|
4 years |
ak19 |
Having tested the incorporation of Kathy's bugfix to CSVPlugin from …
|
|
|
@34249
|
4 years |
ak19 |
Dr Bainbridge in his commit 32810 had expressed that he intended to …
|
|
|
@34221
|
4 years |
ak19 |
Undid the change of converting tabstops to their entities in …
|
|
|
@34220
|
4 years |
ak19 |
1. TextPlugin takes care to preserve whitespace formatting when …
|
|
|
@34137
|
4 years |
ak19 |
Have only been able to incorporate one of Dr Bainbridge's improvements …
|
|
|
@34131
|
4 years |
ak19 |
Allowing input keep-urls-file to contain a comma followed by country …
|
|
|
@34130
|
4 years |
ak19 |
Some more tidying up while isMRI filtered collection rebuilding
|
|
|
@34129
|
4 years |
ak19 |
Implemented Kathy's suggestions: 1. Explicit ex prefix to ex meta …
|
|
|
@34126
|
4 years |
ak19 |
When I'd modified the code to make the keep_urls_file non-compulsory, …
|
|
|
@34125
|
4 years |
ak19 |
Commit message went awry. Cleaned up some comments to recommit with …
|
|
|
@34124
|
4 years |
ak19 |
Decoding the title and text using the encoding seemed to have turned …
|
|
|
@34123
|
4 years |
ak19 |
Some more minor changes
|
|
|
@34122
|
4 years |
ak19 |
1. After some testing of building the complete commoncrawl collection, …
|
|
|
@34121
|
4 years |
ak19 |
1. Introducing NutchTextDumpPlugin to process the records …
|
|
|
@33721
|
5 years |
ak19 |
Inactive but committing to svn: Newer Locale.pm file, and introducing …
|
|
|
@33389
|
5 years |
kjdon |
store csv field array associated with filename, because you might have …
|
|
|
@33309
|
5 years |
ak19 |
More workarounds for HTML conversion results from Word's …
|
|
|
@33301
|
5 years |
ak19 |
Incorporating Dr Bainbridge's suggested fix for dealing with Word docs …
|
|
|
@33299
|
5 years |
ak19 |
1. Committing Dr Bainbridge's fix to remove duplicated heading titles …
|
|
|
@32984
|
5 years |
davidb |
Commented out debug print statements
|
|
|
@32819
|
5 years |
kjdon |
fixed a comment
|
|
|
@32790
|
5 years |
ak19 |
Suffixing .inactive to the GreenstoneSQL plugs because if perl package …
|
|
|
@32783
|
5 years |
kjdon |
adding missing strings and tidying up some mislabelling
|
|
|
@32778
|
5 years |
ak19 |
Need to set the surrounding div width to be the same/no more than the …
|
|
|
@32777
|
5 years |
ak19 |
Minor. Correction to plugin name displayed
|
|
|
@32761
|
5 years |
kjdon |
when printing out arg values for some other thing, I noticed that site …
|
|
|
@32760
|
5 years |
kjdon |
merge_inheritance, if it finds a conflict in option values, will keep …
|
|
|
@32643
|
6 years |
ak19 |
1. Previous commit (r32640) reintroduced an earlier bug in attempting …
|
|
|
@32640
|
6 years |
ak19 |
Important changes (and commented out debugging statements) to get …
|
|
|
@32595
|
6 years |
ak19 |
Major tidying up: last remaining debug statements, lots of comments, …
|
|
|
@32592
|
6 years |
ak19 |
Renamed gssql.pm to gsmysql.pm. Not subclassing the old gssql into …
|
|
|
@32591
|
6 years |
ak19 |
1. gssql destructor DESTROY doesn't really do anything now, as DBI's …
|
|
|
@32589
|
6 years |
ak19 |
1. SQL db password is not compulsory. 2. Forgot to add the …
|
|
|
@32586
|
6 years |
ak19 |
Renaming 'site_name' parameter used by GS SQL Plugout and Plugin to …
|
|
|
@32584
|
6 years |
ak19 |
Some more tidying up of the code.
|
|
|
@32583
|
6 years |
ak19 |
1. Some tidying up of the code. 2. Removing unnecessary calls to …
|
|
|
@32582
|
6 years |
ak19 |
Now that previous commit(s) put sig handlers in place in gs_sql, have …
|
|
|
@32580
|
6 years |
ak19 |
1. support for port param when connecting to SQL DB. 2. GS SQL Plugout …
|
|
|
@32578
|
6 years |
ak19 |
Optimising. The gssql class internally has only one shared connection …
|
|
|
@32577
|
6 years |
ak19 |
Forgot to call superclass in overridden removeall(). Nothing broke so …
|
|
|
@32575
|
6 years |
ak19 |
1. gssql now does fetching all rows internally upon select. With this …
|
|
|
@32571
|
6 years |
ak19 |
Optimised the SQL DB delete operations in case there are several in …
|
|
|
@32570
|
6 years |
ak19 |
1. Bugfix for when renaming an imported doc and …
|
|
|
@32565
|
6 years |
ak19 |
I think this is a bugfix to plugin.pm::remove_some(): when processing …
|
|
|
@32563
|
6 years |
ak19 |
1. Overhaul of GreenstoneSQLPlugs to handle removeold and incremental …
|
|
|
@32562
|
6 years |
ak19 |
Before major changes to GSSQLPlugs, committing useful comments to …
|
|
|
@32560
|
6 years |
ak19 |
gssql constructor accepts a verbosity parameter
|
|
|
@32559
|
6 years |
ak19 |
Removing db_encoding as parameters to GreenstoneSQLPlugout and …
|
|
|
@32556
|
6 years |
ak19 |
Tested to find DBI connection attempt fails immediately when MySQL …
|
|
|
@32555
|
6 years |
ak19 |
1. In GreenstoneSQLPlugout, removeold is now paramterised (as are …
|
|
|
@32544
|
6 years |
ak19 |
1. GreenstoneSQLPlugin: now sub read() calls the new lazy_get_gssql() …
|
|
|
@32543
|
6 years |
ak19 |
Tidying up and adjusting TODO statements
|
|
|
@32542
|
6 years |
ak19 |
Instead of the docoid being stored in the docsql-<OID>.xml filename, …
|
|
|
@32541
|
6 years |
ak19 |
Using proper parameters to GreenstoneSQLPlugin/Plugout instead of …
|
|
|
@32539
|
6 years |
ak19 |
New plugin parameter site_name (only set for GS3) that is passed to …
|
|
|
@32538
|
6 years |
ak19 |
Previous commit message meant to be: string names of strings shared by …
|
|
|
@32537
|
6 years |
ak19 |
First commit to do with reading back in from the SQL DB. This commit …
|
|
|