开发者 | formasfunction |
---|---|
更新时间 | 2010年4月8日 22:16 |
捐献地址: | 去捐款 |
PHP版本: | 2.9 及以上 |
WordPress版本: | 2.9 |
ocr.php
to the /wp-content/plugins/
directoryPlugins
menu in WordPressPlugins > OCR
link in the sidebar menu in WordPressTesseract OCR is an open source optical character recognition library that the WordPress OCR plugin uses to extract text from images. The library as well as installation instructions can be found at http://code.google.com/p/tesseract-ocr/
Linux:
which tesseract
./opt/local/bin/tesseract
.Plugins > OCR
link in the sidebar menu in WordPressImageMagick is a an open source, server side, image manipulation library.
The WordPress OCR plugin requires the convert
utility specifically.
The library as well as installation instructions can be found at
http://www.imagemagick.org
Linux:
which convert
./opt/local/bin/convert
.Plugins > OCR
link in the sidebar menu in WordPressTesseract is only compatible with TIFF images. Therefor, when a web formatted image (JPG, GIF, PNG, etc) is uploaded, a temporary TIFF image must be created via ImageMagick in order for Tesseract to detect the text within the image. This TIFF is discarded once the OCR has been completed.
The text detected by the OCR plugin is added to the image as a custom field named ocr_text
.
See http://codex.wordpress.org/Custom_Fields for instructions on using the ocr_text
field in your templates.
The text detected by the OCR plugin is available in a text area labeled 'OCR Text' both in the 'Add an Image' model while attaching an image to a post and while editing a previously uploaded image under the 'Media' section of your WordPress install.
The OCR plugin is tailored to detecting text in images with ~12pt text at 72dpi.
ImageMagick is used to upscale the temporary TIFF images fed to Tesseract as Tesseract is generally more accurate with
larger type, even if it's been upscaled from a smaller source. If you wish to disable this option simply set this
configuration option to 100%
and no resizing will occur.
Possibly. The OCR plugin simply hasn't been tested on any other versions.