419808 – SharePoint: Optical Character Recognition now supports hybrid PDFs (text and image)

SharePoint Logo

*For this entry exists the more relevant or more recent entry MC907534

check before: 2024-10-01

Product:

SharePoint

Platform:

Online, Web, World tenant

Status:

In development

Change type:

Links:

MC907534

Details:

SharePoint OCR feature now extends support to hybrid PDFs, which contain both images and text. Previously, OCR was limited to image-only PDFs, but with this update, you can seamlessly extract and utilize text from hybrid documents.

Change Category:
XXXXXXX ... free basic plan only

Scope:
XXXXXXX ... free basic plan only

Release Phase:
General Availability

Created:
2024-10-02

updated:
2024-10-02

Docu to Check

XXXXXXX ... free basic plan only

MS workload name

XXXXXXX ... free basic plan only

summary for non-techies**

XXXXXXX ... free basic plan only

Direct effects for Operations**

Please, look at the most relevant linked item for details

explanation for non-techies**

Imagine you have a large stack of documents on your desk. Some of these documents are typed out, while others are handwritten or contain images. Previously, you had a special tool that could only read and understand the handwritten or image-based documents, but it couldn't make sense of the typed ones. This meant you had to manually go through the typed documents to find the information you needed.

Now, think of SharePoint's Optical Character Recognition (OCR) feature as that special tool. Initially, it could only read and extract text from image-only PDFs, similar to how you could only read the handwritten notes. However, with the recent update, SharePoint OCR can now handle hybrid PDFs, which are like those documents on your desk that contain both typed text and images. This means that SharePoint can now automatically extract and utilize text from these mixed-content documents, saving you the time and effort of doing it manually.

This improvement is akin to upgrading your tool so it can now read both the handwritten notes and the typed documents, making your job much easier and more efficient. With this new capability, you can search, index, and manage your hybrid PDFs in SharePoint more effectively, just as you would with any other document.

** AI generated content. This information must be reviewed before use.

a free basic plan is required to see more details. Sign up here


A cloudsocut.one plan is required to see all the changed details. If you are already a customer, choose login.
If you are new to cloudscout.one please choose a plan.



Last updated 4 days ago

Share to MS Teams

Login to your account

Welcome Back, We Missed You!