You should be able to run ocropus from inside emacspeak without too much trouble --- eventually I will check in an appropriate script for use with emacspeak-ocr to use ocropus. At present quality of output is highly variable from ocropus, but it looks promising. >>>>> "Jason" == Jason White <jasonw@xxxxxxxxxxx> writes: Jason> On Wed, Apr 18, 2007 at 10:15:02AM +0200, Lukas Jason> Loehrer wrote: >> >> At least for some pdf files, google does an excellent job >> at preserving math formulas in their "View as HTML" view. Jason> Jason> Interesting. There are also PDF files that contain Jason> only scanned images of text. To read these, you need Jason> OCR software, and it now appears that quality, free as Jason> in freedom, OCR solutions are coming down the Jason> pipeline: Jason> Jason> http://code.google.com/p/ocropus/ Jason> Jason> and it shouldn't be difficult for the Emacs Lisp Jason> enthusiasts on the mailing list to write a function Jason> that will run OCR Opus on a set of image files, or Jason> even scan a page, and then read the output into an Jason> Emacs buffer. Ideally this would be an Emacs mode that Jason> lets you set scanning parameters. Jason> Jason> The OCR software itself isn't expected to be ready for Jason> release until late next year, but I'm sure members of Jason> this list will be helping with the beta testing along Jason> the way. XPDF can extract image files from PDF Jason> documents, which could then be converted to whatever Jason> format the OCR software accepts. Jason> Jason> ----------------------------------------------------------------------------- Jason> To unsubscribe from the emacspeak list or change your Jason> address on the emacspeak list send mail to Jason> "emacspeak-request@xxxxxxxxxxx" with a subject of Jason> "unsubscribe" or "help" -- Best Regards, --raman Email: raman@xxxxxxxxxxx WWW: http://emacspeak.sf.net/raman/ AIM: emacspeak GTalk: tv.raman.tv@xxxxxxxxxxx PGP: http://emacspeak.sf.net/raman/raman-almaden.asc Google: tv+raman IRC: irc://irc.freenode.net/#emacs ----------------------------------------------------------------------------- To unsubscribe from the emacspeak list or change your address on the emacspeak list send mail to "emacspeak-request@xxxxxxxxxxx" with a subject of "unsubscribe" or "help"
If you have questions about this archive or had problems using it, please send mail to:
priestdo@xxxxxxxxxxx No Soliciting!Emacspeak List Archive | 2007 | 2006 | 2005 | 2004 | 2003 | 2002 | 2001 | 2000 | 1999 | 1998 | Pre 1998