為了改善您的使用體驗,本網站正在進行維護,部分功能暫時無法使用。若本站的文件無法解決您的問題,想要向社群發問的話,請到 Twitter 上的 @FirefoxSupport 或 Reddit 上的 /r/firefox 發問,我們的社群成員將很快會回覆您的疑問。

搜尋 Mozilla 技術支援網站

防止技術支援詐騙。我們絕對不會要求您撥打電話或發送簡訊,或是提供個人資訊。請用「回報濫用」功能回報可疑的行為。

了解更多

Does Firefox Automatically perform OCR on PDF Documents?

more options

My bank delivers monthly statements as rasterized copies of their paper statements. They are clearly pixelated and not text. However, when I open one of these PDFs in Firefox I am able to select the rasterized text, as you can see from the attached screenshot clip.

How is this possible?

My bank delivers monthly statements as rasterized copies of their paper statements. They are clearly pixelated and not text. However, when I open one of these PDFs in Firefox I am able to select the rasterized text, as you can see from the attached screenshot clip. How is this possible?
附加的畫面擷圖

所有回覆 (9)

more options

I assume that your bank actually sends real PDF files. If you use Print then in some cases Firefox converts the page to an image.

more options

I would have assumed the same thing except that Sumatra won't t allow me to highlight and copy text and Acrobat will select it but won't copy it. Firefox allows both.

more options

Also I've never seen a pixelated PDF that still contains text. Will wonders never cease?!

由 Helmanfrow 於 修改

more options

If the PDF consists purely of a series of full-page images, unfortunately, Firefox's PDF viewer doesn't have the ability to OCR it.

I suspect your bank applied "security" to the PDF to prevent certain actions, such as copying, editing, and/or printing. (https://helpx.adobe.com/acrobat/how-to/password-protect-pdf.html)

Firefox's PDF viewer is based on the pdf.js JavaScript library, which ignores these "security" restrictions by default. It is a bit of an annoyance to people who create the PDFs, but Mozilla doesn't seem inclined to enforce the restrictions in Firefox.

more options

jscher2000 - Support Volunteer said

I suspect your bank applied "security" to the PDF to prevent certain actions, such as copying, editing, and/or printing. (https://helpx.adobe.com/acrobat/how-to/password-protect-pdf.html)

Yes, I did a little more digging and that's apparently what it is. The document is protected from editing and apparently this can sometimes present text as pixelated images.

由 Helmanfrow 於 修改

more options

jscher2000 - Support Volunteer said

I suspect your bank applied "security" to the PDF to prevent certain actions, such as copying, editing, and/or printing. (https://helpx.adobe.com/acrobat/how-to/password-protect-pdf.html)

Yes, the document is password-protected so that's probably it.

more options

By the way, when you select text in Firefox's PDF viewer, you are selecting a transparent layer of text positioned in front of the page image.

more options

It's funny that "security" can be partially bypassed by simply ignoring it in code.

more options

Helmanfrow said

It's funny that "security" can be partially bypassed by simply ignoring it in code.

Once upon a time, basing "security" on the honor system actually worked, I guess.