
What does that say?
Published: 11 April 2007 17:02 GMT
Google is sponsoring an artificial-intelligence research group's work to develop advanced technologies for character recognition.
The open source project, called Ocropus, has several goals, including developing a high-level, easy-to-use handwriting recognition system that can convert handwritten documents to computer text, assisting in the creation of electronic libraries, analysing historical documents and helping vision-impaired people access information. The "ocr" in Ocropus stands for optimal character recognition.
The project is headquartered at the Image Understanding and Pattern Recognition (IUPR) research group at the German Research Center for Artificial Intelligence (DFKI) in Kaiserslautern, Germany. DFKI professor Thomas Breuel is leading the project.
Breuel made the announcement through a post on the Google Code blog. In addition to Google's sponsorship, Ocropus is getting funds from several German government agencies and other public and private entities.
The Ocropus team expects the project to last three years and it will support three PhD students. IUPR is basing the software primarily on two research projects: one, a handwriting recognition system developed in the mid-1990s for use by the US Census Bureau; and two, newer layout analysis methods for character recognition.
Other resources include Tesseract, a decades-old engine for optimal character recognition originally developed by HP Labs and rereleased by Google last year as an open source system.
A preview of the Ocropus system is available on the project's website under an Apache licence, and the IUPR is soliciting open source contributions in order to complete a number of goals. These include creating a desktop application for the system, adding third-party tools and adapting Ocropus to a variety of languages. It's currently English-only.
Caroline McCarthy writes for CNET News.com
We are looking for an enthusiastic graduate with an interest in areas such as artificial intelligence/ pattern recognition/ machine vision /image ...
He/she will be experienced in research, design and implementation of artificial intelligence. He/she will have good writing skills with the ability ...
Knowledge of Search APIs and / or OCR (Optical Character Recognition), would also be desirable. Document reading technologies will be used to index ...
Agenda Setters 2009
Welcome to the ninth annual Agenda Setters poll – silicon.com's list of the top 50 most influential individuals in the technology and IT industries, from techies and CIOs to entrepreneurs and business leaders. Find out more in our latest special report.
Dell PowerVault DL2100 Powered by CommVault - Spec Sheet
Data Protection Strategies: Deduplication for More Efficient Backups
True Convergence Demands a Communication Service Provider that Embraces a Customer-Centric...
Learn how Performance Metrics for Telcomm Expense Management Drive new ROIs and SLAs
Stories from the web...
Copyright © 2008 CBS Interactive Limited. All rights reserved. Top of page
Mark Crichard Doing business with citizen developers: Beware the legal pitfalls Legal Eye: Make sure your business is protected from potential hazards
Tim Ferguson How CIOs can achieve post-recession success Q&A: McKinsey & Company on living in the 'new normal' business world