Changeset 14923 for gsdl/trunk/perllib/lucenebuildproc.pm
- Timestamp: 2007-12-17T13:47:08+13:00
- File: 1 edited
gsdl/trunk/perllib/lucenebuildproc.pm
r14068  r14923
   446     446  }
   447     447
           448  # It's important that we remove name entities because otherwise the text passed to Lucene for indexing
           449  # may not be valid XML (eg. if HTML-only entities like &nbsp; are used)
           450  $new_text =~ s/&\w{1,10};//g;
           451  # Remove stray '&' characters, except in &#nnnn; or &#xhhhh; entities (which are valid XML)
           452  $new_text =~ s/&([^\#])/ $1/g;
           453
   448     454  return $new_text;
   449     455  }
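To see what the two added substitutions do to typical input, the following is a minimal sketch that applies them outside the module. The wrapper name `strip_entities` and the sample strings are assumptions made for illustration; only the two `s///` lines come from the changeset.

```perl
use strict;
use warnings;

# Hypothetical wrapper (not part of lucenebuildproc.pm): applies the two
# substitutions added in r14923 to a string and returns the result.
sub strip_entities {
    my ($new_text) = @_;

    # Delete named entities: '&', then 1 to 10 word characters, then ';'.
    $new_text =~ s/&\w{1,10};//g;

    # Replace any remaining '&' that is followed by a character other
    # than '#' with a space, keeping the following character.
    $new_text =~ s/&([^\#])/ $1/g;

    return $new_text;
}

# Illustrative inputs only; they are not taken from the Greenstone source.
for my $sample ('a&nbsp;b', 'AT&T', '&#233;t&#xe9;') {
    printf "%-16s => %s\n", $sample, strip_entities($sample);
}
# a&nbsp;b         => ab
# AT&T             => AT T
# &#233;t&#xe9;    => &#233;t&#xe9;
```

Two consequences follow from the patterns as written: the first substitution also deletes the entities that XML itself predefines (for example `&amp;` and `&lt;`), and the second leaves a `&` at the very end of the string untouched, because `[^\#]` needs a following character to match.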