pdftotext 3.02 (RISC OS release 1.00 24-Mar-08)

This is a command-line program that extracts plain text from PDF files. It is a
sister application of xpdf (the program !PDF is based on). In addition to what
!PDF does pdf2text performs some sophisticated analysis of text positions to
sort the text in the most sensible order, so it does a significantly better job
than the "Save text" option of !PDF.

Usage: pdftotext <infile> <outfile>
Details about additional options: pdftotext -h

This program requires at least 1400k of memory, which you need to allocate at
the command line. It also requires a copy of the SharedUnixLib module,
available from http://www.riscos.info/downloads/gccsdk/sharedunixlib/system.zip
(to install it, merge the skeleton !System in this file with your !System by
double-clicking on your !Boot application, then click on the System button).

Example usage
-------------
For example, if you have stored the pdftotext executable as
ADFS::HardDisc4.$.Programs.pdftotext, then in order to to process the PDF file
stored at ADFS::HardDisc4.$.Files.Input/pdf and place the resulting text in the
RAM disc into the file RAM::RamDisc0.$.Output you would proceed as follows:

Open a TaskWindow in your favourite editor, then type:

*Wimpslot 1400k 1400k
*run ADFS::HardDisc4.$.Programs.pdftotext ADFS::HardDisc4.$.Files.Input/pdf RAM::RamDisc0.$.Output

In most editors you can shift drag the executable and the PDF file into the
window to enter their full path names instead of typing them in yourself.

pdftotext is released under the GPL. See the file COPYING for details. It was
compiled from the xpdf 3.02 source code without any changes. The source code is
available from http://www.foolabs.com/xpdf/

Please note that this is a quick port without any extensive testing and without
any warranty and support.

Martin Wuerthner
http://www.mw-software.com/
