suitable OCR Tópico cartaz: telefpro
| telefpro Local time: 17:15 português para inglês + ...
I have a document typewritten in 1957 and it is converted into PDF. The printouts cannot be read. Is there any way to read this document better, using some OCR. It is in French.Please advise. | | | | mediamatrix (X) Local time: 08:45 espanhol para inglês + ... The world's top OCR system ... | Sep 18, 2008 |
telefpro wrote:
I have a document typewritten in 1957 and it is converted into PDF. The printouts cannot be read. Is there any way to read this document better, using some OCR. It is in French.Please advise.
... is the human eye.
If you can't read the document, then no OCR software will be able to either.
MediaMatrix | | | esperantisto Local time: 14:45 Membro (2006) inglês para russo + ... SITE LOCALIZER
mediamatrix wrote:
If you can't read the document, then no OCR software will be able to either.
Absolutely right!
However, theoretically, one might convert a PDF into a set of graphic files such as TIFF and try to play with gamma/color correction. But that's alchemy, not an exact science | |
|
|
Jack Doughty Reino Unido Local time: 11:45 russo para inglês + ... In memoriam
With Adobe Acrobat, even if you only have Adobe Acrobat Reader, you can zoom the page out to show the details a lot larger. If this doesn't help, I have no idea what else would. | | | This should do the trick | Sep 18, 2008 |
Though it's in Spanish, you'll certainly be able to understand if you have Office 2003. Click on the link below.
You have your own OCR
| | | I beg to differ | Sep 18, 2008 |
mediamatrix wrote:
If you can't read the document, then no OCR software will be able to either.
I have successfully displayed on screen text that didn't even seem to be there. My OCR software is OmniPage, and it has a built-in function to enhance images before starting the recognition on it (if you know what you are doing, you can also do this with graphic programs like Photoshop).
There are documents that cannot be read by the human eye that can be made readable with the help of software. If I was in telefpro's situation, I would try it.
[Edited at 2008-09-19 15:22] | | | mediamatrix (X) Local time: 08:45 espanhol para inglês + ...
Viktoria Gimbe wrote:
There are documents that cannot be read by the human eye that can be made readable with the help of software. If I was in telefpro's situation, I would try it.
And how do you propose that telefpro should go about validating the OCR output? Even with texts that are easily human-readable, no OCR software is ever 100% accurate. If telefpro can't read the source text is it reasonable to assume that the output from image-enhanced OCR will, by some miracle, be 100% reliable on this particular occasion?
Telefpro is up against a fundamental law of entropy here.
MediaMatrix | |
|
|
That's the part his/her intelligence is needed for | Sep 18, 2008 |
mediamatrix wrote:
And how do you propose that telefpro should go about validating the OCR output?
Well, s/he can read, right? Once s/he gets the OCR input, s/he can read through it to decide whether the output is strong enough to be processed. Isn't that what we should be doing even with texts that are already in an editable format?
The point of telefpro's question is to make the text readable by the human eye, not to turn it into editable text (although s/he may ultimately be interested in that as well). What other method do you propose? I don't know of any other solution. It's a matter of using technology to enhance what is humanly feasible.
In some cases, like the present one, technology can go much farther than the human brain - although this is usually not the case. | | | To report site rules violations or get help, contact a site moderator: You can also contact site staff by submitting a support request » suitable OCR Wordfast Pro | Translation Memory Software for Any Platform
Exclusive discount for ProZ.com users!
Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value
Buy now! » |
| CafeTran Espresso | You've never met a CAT tool this clever!
Translate faster & easier, using a sophisticated CAT tool built by a translator / developer.
Accept jobs from clients who use Trados, MemoQ, Wordfast & major CAT tools.
Download and start using CafeTran Espresso -- for free
Buy now! » |
|
| | | | X Sign in to your ProZ.com account... | | | | | |