Home page
Business
Pricing
Blog

How to Extract Highlighted Text from a PDF

May 22 2026

Productivity

Sanity Image
Read time

7 min

Tags

Annotate PDF

Data Extraction

Desktop

Document Management


Read time

7 min


Share this post

emaillinkedIntwitter

Learn how to extract highlighted text from a PDF using Xodo PDF Studio. Discover what counts as extractable highlights and how they behave, how to export them to a text file, and tips to fix common issues with scanned or image‑based PDFs.

If you've ever highlighted dozens of passages in a long PDF and then tried to copy them out manually, you know how tedious the process is.

Order gets messy. Page references disappear. And sometimes… nothing copies at all.

Looking for a more reliable method to extract highlighted text from your PDF? This guide shows a reliable desktop workflow using Xodo PDF Studio.

It's designed for anyone who reviews PDFs seriously: students, researchers, legal and ops reviewers, analysts, and those working through long documents offline.

We'll walk you through:

First, let's take a close look at extracting highlighted text for some context.

Why extract highlighted text from a PDF?

Highlights are extracted because they're useful if you can reuse them. Extracting highlighted text lets you:

  • Turn highlights into study notes or summary reports
  • Pull key clauses from contracts and legal documents for review
  • Build research excerpts and packets from academic papers
  • Create review sheets from policy PDFs, manuals, or reports
  • Share key passages with teammates without sharing the full PDF

For annotation‑heavy work, exporting highlights is how review turns into output, especially when you’re working across multiple academic papers, reports, or contracts that may need to be combined without losing markups.

Highlighted text vs comments vs annotations

Not all highlights are created equal and hence, they don't export in the same way. Here’s a quick breakdown of some common PDF annotations and how they behave during export.

Markup Type

What It Is

What It’s Attached To

Exports as Highlighted Text?

Text Highlight

A text markup made with the highlight tool

Selectable text in the PDF

Comment / Note

A written note or comment bubble

A location or selected text

❌ (exports as comment data)

Area Highlight / Shape / Marker

A visual markup drawn over the page

The page surface (not text)

Area Highlight / Shape / Marker

A visual markup drawn over the page

The page surface (not text)

Highlight on scanned PDF (no OCR)

A visual overlay on an image‑based page

An image, not text

Highlight on scanned PDF (after OCR)

A text highlight created after OCR

OCR‑generated selectable text

Before you start highlighting or exporting, it’s worth knowing how scanned PDFs behave differently.

Can you extract highlights from scanned PDFs?

Yes, but only after OCR. Scanned PDFs are image‑based. Even though the page looks like text, the file is really just a picture of a page. There aren't any actual letters for a PDF tool to read, copy, or extract, which is why highlights on scanned PDFs often look correct on screen but can't be exported.

This is typical for scanned contracts, government policy documents or compliance records that were digitized and scanned from multiple files.

OCR (Optical Character Recognition) analyzes the scanned page and converts the image into searchable, selectable text that can be copied and exported.

One caveat: while OCR technology can make text extractable, it doesn't always guarantee perfect results. You may still see issues if:

  • The scan quality is low or blurry
  • The document uses complex layouts or handwriting
  • Text is inside images, tables, or stamps

How to highlight text in Xodo PDF Studio

To highlight text in Xodo PDF Studio, follow the steps below.

  1. Download and install Xodo PDF Studio on Windows, macOS, or Linux.
  2. Open your PDF in Xodo PDF Studio.
  3. Go to the Comment tab > Highlight Text.
  4. Click and drag over selectable text.
  5. Repeat as needed across the document.
Highlighting text in a PDF using Xodo PDF Studio on Windows, macOS, and Linux

If you can't select the text at all, the PDF is likely a scanned PDF.

For a deeper walkthrough of annotation tools, see our guide on How to annotate a PDF with Xodo.

How to export highlighted text from a PDF

In Xodo PDF Studio, you can export highlighted text from your PDF by doing the following:

  1. Open your PDF in Xodo PDF Studio.
  2. On the Comments tab, click on Export.
  3. Select Highlighted Text.
  4. Choose a location and file name for the .txt file output.
  5. Click Save.
Exporting highlighted text from a PDF with Xodo PDF Studio

This is a desktop‑level workflow designed for review, not a generic text dump. Xodo PDF Studio will generate a separate document containing only the text that you've highlighted. The text will be grouped by page, logical reading order, and the original page separation.

How to extract highlighted text if you have a scanned PDF document

You can extract highlighted text from scanned PDFs, as well, with the following steps:

  1. Open the scanned PDF in Xodo PDF Studio.
  2. Go to Document > OCR. Specify any options necessary.
  3. Click on OK to run the OCR process.
  4. Save the searchable PDF.
  5. Apply highlights using the Highlight Text tool.
  6. Save the PDF.
  7. Then go to Comment > Export > Highlighted Text (Text).
  8. Save your text file to your computer.
Extracting highlighted text from a scanned PDF using Xodo PDF Studio

How to use your exported text file

Once exported, the text becomes a working document, while the original PDF can remain annotated for internal review or be shared or printed cleanly without comments once decisions are finalized.

Common uses:

  • Paste into Word documents, Google Docs, or other editors
  • Import into note‑taking tools for reference or training
  • Archive highlights separately from the source PDF

For teams reviewing sensitive documents, this workflow stays fully offline so the content stays secure.

Troubleshooting highlight export in PDFs

If exporting highlights doesn't work as expected, it's usually not a failure. It's a signal about how the PDF was built or how the highlights were created.

The good news: some fixes are simple.

1. Text can't be selected

This one step answers half of all export issues.

If you can click and select individual words with your cursor, the PDF contains real text and highlight exporting should work. If you can't select text at all, the PDF is image‑based.

What to do next:
Perform OCR on your PDF in Xodo PDF Studio, then re‑apply your highlights and export again. OCR turns scanned pages into searchable, selectable text, which is required for highlight extraction.

2. Visual markups aren't exporting properly

Some highlights look correct on screen but aren't connected to text underneath.

Only text highlights applied with the Highlight Text tool can be exported as highlighted text. Visual markups drawn over images, scanned pages, or layout elements don't contain textual data.

Quick self‑check: Click on a highlight. If the underlying words can be selected or copied, it is text‑based. If needed, delete the visual markup and re‑highlight the text after OCR'ing the file.

3. Only some highlights export

Highlights created in older files or in other PDF tools may not include embedded text in a way that exports cleanly. This is common with:

  • PDFs highlighted before OCR was applied
  • Files annotated in a different app

Try this: Re‑highlight one problem section using Xodo PDF Studio, then export again. If it appears correctly, re‑highlight the remaining sections.

4. The text order looks strange

In multi‑column documents, tables, or legal PDFs, text is sometimes stored in an order that does not match how it looks on the page. This isn't an error in your highlights.

What helps:
Review the exported file by page, or reorganize the text after you extract it. For complex layouts, this is expected behavior across PDF tools.

Frequently asked questions

1. Can I copy all highlighted text from a PDF at once?

Yes. Use the Export > Highlighted Text option in Xodo PDF Studio instead of a manual copy‑paste.

2. Can I export highlighted text to Word or other formats?

Yes. Highlights export as plain text. You can paste the text from the file into Word or another editor easily.

3. Are comments included when exporting highlighted text?

No. Comments are excluded unless you export comments explicitly.

4. Why is my exported file empty?

Most often, an exported file is empty because the PDF is scanned or highlights were applied to non‑selectable content.

5. Can I export highlights with page numbers?

Yes. When extracted highlighted text with Xodo PDF Studio, the text is separated by page numbers in the exported file.

6. Does this work offline?

Yes. Xodo PDF Studio is a fully offline desktop editor for Windows, macOS, and Linux platforms.

7. Is this different from extracting all text from a PDF?

Yes. This workflow extracts only highlighted text, not the entire document or content.

Put your PDF highlights to work

Extracting highlighted text from a PDF doesn't require any guesswork.

You’ve learned what counts as exportable highlights, how to pull out only the text you've marked, and how to troubleshoot common issues.

For a quick and easy desktop way to turn review work into something you can actually reuse, give Xodo PDF Studio a try.

Highlight, review, and export text offline and efficiently, all from one application.

Sanity Image

Share this post

emaillinkedIntwitter

Related Articles

Sanity Image

How to Print a PDF Bank Statement

Need a printed copy of your bank statement for taxes, loan applications, or recordkeeping? This guide shows you how to print a PDF bank statement using Xodo tools, your browser, or mobile apps. No technical setup required.

Sanity Image

How to OCR a PDF: Extract Text from PDF Images

Struggling with scanned PDFs? Need to extract text from PDF image files? We’ll show you how to get text from image PDFs that you can edit and search. Our guide covers what you need to know about using Xodo’s OCR tool on your desktop, online, and on mobile devices.

Sanity Image

How to Print a PDF Without Comments or Annotations

PDFs filled with comments, highlights, and review notes can make collaboration documents harder to use. The simplest fix is controlling what gets printed. This guide shows you how to print a PDF without comments using Xodo PDF Studio to produce clean documents without altering the original file.