Re: png2txt -

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Sat, Jun 28, 2008 at 5:32 PM, Bob Goodwin USA
<bobgoodwin@xxxxxxxxxxxx> wrote:
> fred smith wrote:
>>>>> Is there an F8 application that will convert a .png copy of a text list
>>>>> to a text file?
>>>>
>>>> ----
>>>> png is a picture file and there is no text.
>>>>
>>>> If you want OCR (optical character recognition - software that scans a
>>>> picture for recognizable text and saves the recognized text to a file),
>>>> I would suggest tesseract.
>>>
>>> Thanks, I will look at that.
>>>
>>
>> I believe that Tesseract only understands TIF files, so you will need
>> to convert the png before you can OCR them.
>>
>>
>
> Yes, I discovered that requirement but now I am stumped by -
>
>   The command line is:
>   tesseract <image.tif> <output> [-l langid]
>
> I thought "-l enUS" might work but no go there.
>
> There's no man page, only a README and that doesn't tell me about the langid
> other than it wants it.  Without it I get very strange looking text.

Unfortunately, the OCR programs working in Linux are not very good
yet. In case you have access to Acrobat Professional, use it instead;
the results are usually excellent.

Paul

-- 
fedora-list mailing list
fedora-list@xxxxxxxxxx
To unsubscribe: https://www.redhat.com/mailman/listinfo/fedora-list

[Index of Archives]     [Current Fedora Users]     [Fedora Desktop]     [Fedora SELinux]     [Yosemite News]     [Yosemite Photos]     [KDE Users]     [Fedora Tools]     [Fedora Docs]

  Powered by Linux