Most Mac users hit the same wall a few times a week: someone sends a screenshot of an error message, a Slack thread, a flight confirmation, or a slide from a deck — and you need the text from it. You can't just select it the way you would highlight text in a Notion page. The pixels are pixels.
The good news is that macOS has made screenshot OCR genuinely easy since Live Text shipped in macOS 12 Monterey. The somewhat-bad news is that the built-in workflow still involves a few too many steps when you're doing it twenty times a day, and the floating-preview shortcut a lot of people don't know about can shave off a couple of those steps. And then, for anyone who lives in this workflow professionally — engineers grabbing stack traces, support agents pulling info out of customer screenshots, students pulling quotes from PDFs of slides — there's a third method that essentially collapses the whole sequence into one drag-select.
This guide walks through all three, compares them honestly side by side, and tells you which one to reach for in different situations.
A quick refresher on Mac screenshot shortcuts
Before we get into text extraction, the three native screenshot shortcuts are worth committing to muscle memory. They show up in every method below.
- Cmd+Shift+3 — captures the entire screen (every connected display) instantly to your Desktop.
- Cmd+Shift+4 — gives you a crosshair to drag-select a region. Press Space mid-drag to switch to window-capture mode, where you can click any window to grab it cleanly with its drop shadow.
- Cmd+Shift+5 — opens the floating capture toolbar with options for full screen, window, region, screen recording, plus an Options menu. This is the modern flagship shortcut and it's where the second method below lives.
By default, all three save PNG files named Screenshot YYYY-MM-DD at HH.MM.SS.png to your Desktop. You can change that with Cmd+Shift+5 → Options → Save to. We'll come back to this option a few times because it materially changes which method makes sense.
One more thing worth knowing: when you take a screenshot on a recent macOS (Mojave and later), a small thumbnail floats in the bottom-right corner for a few seconds. That thumbnail is more capable than people realize, and it's the second method's secret weapon.
Method 1: Screenshot, then Live Text in Preview
This is the most discoverable workflow — the one Apple's documentation tends to point you toward, and the one most people land on by trial and error.
- Press Cmd+Shift+4 (or Cmd+Shift+3 for the full screen).
- Drag-select your region. The screenshot saves to your Desktop.
- Double-click the file. It opens in Preview by default.
- In Preview, click and drag across the text in the image. Cursor turns into the I-beam over text, just like in a regular text document.
- Cmd+C to copy.
That's about five distinct user actions, and it leaves a Screenshot file behind on your Desktop forever (or until you remember to clean up).
Live Text inside Preview is genuinely good. It's the same Apple Vision pipeline that powers OCR everywhere else on macOS — accurate on English, solid on Chinese, Japanese, Korean, French, German, Spanish, Italian, Portuguese, Ukrainian. It handles handwriting reasonably well in clean cases. It picks up text in screenshots of dark-mode UIs without complaint. And it understands data detectors, so phone numbers and email addresses get right-click menus to dial or compose.
The two real complaints are the friction (five actions) and the file pollution (your Desktop fills up with throwaway PNGs). For someone who needs text from a screenshot once a week, those costs are negligible. For someone doing it constantly, they add up.
When Method 1 is exactly right
- You already have the screenshot saved — someone sent it to you, or you took it earlier.
- You want to keep the image as a reference, not just the text.
- You also want to mark up or annotate the screenshot before extracting text.
- You're on a Mac you don't own and can't install apps on.
Method 2: Cmd+Shift+5 floating preview with Live Text
This is the underused shortcut. Most Mac users know about the floating thumbnail that appears in the bottom-right corner after a screenshot, but most of them either ignore it or click it to open Markup. Fewer people realize that on macOS 13 Ventura and later, you can use Live Text directly on the thumbnail without ever opening Preview.
Here's the flow:
- Press Cmd+Shift+5.
- Pick your capture mode (region capture is the most common).
- Drag-select your area.
- The thumbnail floats in the bottom-right. Click it before it disappears (you have a few seconds, and you can also force it to stay by clicking once more).
- The Markup window opens. The cursor over text becomes an I-beam — drag to select, then Cmd+C.
Same five actions, but no file is written if you turn on the right setting. Cmd+Shift+5 → Options → Save to: Clipboard sends the image to your clipboard instead of disk. Then the floating preview becomes essentially a transient annotation layer — once you copy the text out, you can dismiss the window and nothing persists.
The other useful setting in that Options menu is the floating-preview duration. By default the thumbnail disappears after a few seconds. There's no slider to extend it, but you can pin it open by clicking on it once. If you find yourself constantly missing the thumbnail before it disappears, get into the habit of double-tapping the trackpad-edge corner where it appears — that brings the focus there reliably.
When Method 2 wins
- You take screenshots constantly and don't want a Desktop full of files.
- You want to occasionally annotate before copying — Markup is right there.
- You're on macOS 13 Ventura or newer (the Live Text-in-Markup feature is Ventura-and-up).
The catch
You still have to click the floating thumbnail before it vanishes. And you still need to drag-select inside Markup to get text out — there's no "OCR everything in this image" button. If your screenshot has, say, ten distinct lines of text and you want all of them, you're highlighting all ten. For text-rich screenshots, this gets tedious.
Method 3: A dedicated OCR hotkey (Cheese! OCR)
This is what we make, so factor that in. But here's the workflow honestly:
- Press ⇧⌘E (or whatever hotkey you've configured).
- Drag-select the area containing text.
That's it. No file is created. No window opens. The recognized text is already on your clipboard, ready to paste anywhere. A small confirmation chip appears briefly near where you finished dragging, and the result is also stored in a searchable history that lives in the menu bar.
The reason this is faster than Methods 1 and 2 isn't the OCR itself — Cheese! OCR uses the same Apple Vision framework that powers Live Text, so the recognition step takes effectively the same amount of time. The reason is that everything between "I want this text" and "the text is in my clipboard" is collapsed. No file save. No app switch. No Markup window. No drag-select inside the captured image. The drag you do to select the screenshot region is also the drag that defines the OCR area.
A few other things that matter when this is your daily workflow:
- Searchable history. Every OCR result is saved locally with a thumbnail of the source. You can search across past extractions weeks later when you remember "I OCR'd that error message somewhere."
- Multi-language defaults. English, Simplified Chinese, Japanese, and Korean are recognized out of the box without switching modes. Mixed-language screenshots (which are common in real life — a Slack thread with a code snippet, an English UI with a Japanese error message) work transparently.
- 100% on-device. Cheese! OCR has zero network entitlements, verifiable in the App Store sandbox. Nothing about your screenshot leaves the Mac. For people who screenshot internal dashboards, contracts, customer data, or anything sensitive, this matters.
- One-time price. $5.99 once. No subscription. The Mac App Store handles the receipt and updates.
The honest trade-off
You're installing another app, configuring another hotkey, and paying $5.99. If your screenshot-OCR frequency is low — once a week, once a month — Methods 1 and 2 are perfectly fine and free. The math flips somewhere around "more than once a day." At that frequency, the saved seconds and the searchable history start adding up to real time.
Side-by-side comparison
| Method | Steps | File created? | Multi-lang default | Searchable history | Cost |
|---|---|---|---|---|---|
| Method 1: Preview + Live Text | ~5 actions | Yes (PNG on Desktop) | Yes (Apple Vision) | No | Free |
| Method 2: Cmd+Shift+5 floating preview | ~5 actions | Optional (Save to: Clipboard) | Yes (Apple Vision) | No | Free |
| Method 3: Cheese! OCR hotkey | ~2 actions | No | Yes (en/zh/ja/ko) | Yes | $5.99 one-time |
When each method makes sense
This is the practical heuristic we'd give a friend asking which one to use:
Use Method 1 (Preview + Live Text) when
- You already have the screenshot file — somebody AirDropped it to you, you saved it last week, it's an attachment in an email.
- You want to mark up or crop the image before grabbing text.
- You're working on a borrowed Mac.
- You only do this every now and then.
Use Method 2 (Cmd+Shift+5 floating preview) when
- You're going to keep using built-in tools but want to stop polluting your Desktop. Set Save to: Clipboard once and the friction drops.
- You sometimes need to annotate before copying — Markup gives you arrows and shapes right there.
- You're on macOS 13 Ventura or newer.
Use Method 3 (Cheese! OCR or similar dedicated tool) when
- You extract text from screenshots more than once a day.
- You frequently search for past extractions ("what was that command I OCR'd from a tutorial last week?").
- You handle sensitive content and want a tool with zero network entitlements.
- You work in multiple languages — Slack threads with mixed English and Chinese, Japanese game menus, Korean web articles.
- You're tired of seeing your Desktop fill up with throwaway PNGs.
Power-user tips that apply to all three methods
Set Save to: Clipboard if you want to stop creating files
This is the single best macOS screenshot tweak nobody tells you about. Cmd+Shift+5 → Options → under Save to, pick Clipboard. From that point on, every screenshot goes to your clipboard instead of your Desktop. You can paste directly into Slack, Notion, Mail, anywhere. If you do want a file, hold Control while taking the screenshot — that temporarily forces clipboard mode for that one capture (or, with the setting flipped, forces a file).
Pin the floating preview if you keep missing it
Click the thumbnail once instead of waiting for it to fade. The window stays open until you dismiss it. Useful when you're juggling several captures back to back.
Use Cmd+Shift+4 then Space to capture a window cleanly
Most people drag a region around a window, which captures the background too. Press Cmd+Shift+4, then tap Space, then click the target window — you get a clean PNG with the rounded corners and drop shadow preserved. If you want it without the drop shadow, hold Option while clicking.
Use Cmd+Shift+5 → Capture Selected Window for the same effect from the toolbar
Same idea, but launchable from the floating capture controls. Useful when you forget the keyboard shortcut.
Cheese! OCR-specific: pin frequent captures via the OCR History
If you find yourself OCR'ing the same kind of region repeatedly — say, copying customer IDs out of a recurring dashboard — the searchable history doubles as a timeline of past extractions. Open the menu bar window, Cmd+F to filter, and you can find a result from days ago in seconds.
Cheese! OCR-specific: feed an existing image into the app
If you receive a screenshot via email or AirDrop, you don't have to re-take it. Drop the image onto the Cheese! OCR menu bar window or paste it from the clipboard, and it gets recognized the same way. So even when Method 3 isn't the original capture path, it can still be the extraction path.
Customize hotkeys per workflow
If your fingers are happier with a different combination than ⇧⌘E (some people prefer F-key bindings, some prefer ⌃⌥⌘ combos for fewer collisions), you can change the hotkey in Cheese! OCR's Settings. The same advice applies to macOS itself — you can rebind Cmd+Shift+3/4/5 in System Settings → Keyboard → Keyboard Shortcuts → Screenshots if any of those collide with another tool you use.
None of the three methods is wrong. They're just optimized for different frequencies and different workflows. If you take one screenshot a week to share with a teammate and occasionally need to grab a couple of words out of it, the built-in tools are excellent. If text extraction is part of how you work — pulling quotes from PDFs of slides, grabbing error messages from chat threads, copying numbers out of dashboard screenshots half a dozen times a day — a dedicated hotkey makes a noticeable difference. Try the free workflows first; if you find yourself wishing the keystrokes would collapse, that's the moment a tool like Cheese! OCR is paying for itself.