Last change on this file since 3147 was 3147, checked in by jrm21, 22 years ago:
Updated to mention new maintained version at sourceforge.

This is pdftohtml, which was based at:
http://www.ra.informatik.uni-stuttgart.de/~gosho/pdftohtml/

It has recently been picked up again, and is currently based at:
http://pdftohtml.sourceforge.net/

The version is based on version 0.22, with some code included from
version 0.31. It has been modified for Greenstone use, particularly
the file xpdf/HtmlOutputDev.cc, in an attempt to get text and images
in roughly the right place without using javascript or multiple pages.

Known problems:
tables with text.
multi-column pages.
some image types don't get extracted.

John McPherson.
02 May 2001.