Spot duplicate Word and Pages files on Mac
Find duplicate .docx, .doc, and .pages files across your Mac, even when they have different filenames.
You drafted the same letter three times. You saved a contract as .docx and later as .pages. Someone emailed you back the file you’d sent them, and now you have both. Word and Pages files multiply quietly and never get cleaned up because everyone’s afraid of losing the “real” version.
Two paths
The native way: Finder + manual compare
- Open Finder. Use Cmd+F and search for “Kind: Document” or by extension (
.docx,.doc,.pages). - Switch to List view and sort by Size.
- Files with the exact same size are duplicate candidates.
- Open suspected pairs in Word or Pages to verify identical contents.
- Drag the redundant copy to the Trash.
This works for ten files. For a real-world Documents folder with hundreds of .docx and .pages files, opening every suspected pair is impractical. And Finder will happily show you two same-size docs that turn out to have completely different content.
The Dupe way
Dupe hashes every Word and Pages file with SHA-256. Two files match if and only if their bytes are identical, regardless of what they’re named or where they sit.
- Install Dupe and open it.
- Click “Add Folder” and add Documents, Desktop, Downloads, and any iCloud Drive folders.
- Click “Scan.”
- Browse the duplicate groups. Each shows file paths so you can see whether a copy is in Documents/Contracts versus Desktop/old.
- Select the copies to remove and click “Move to Trash.”
Important caveats specific to Word and Pages files:
.pagesand.docxof the same content are not duplicates by bytes — different file formats, different hashes. Dupe won’t flag them. That’s correct: they’re different files even if the words inside match.- Two
.docxfiles where one has tracked changes accepted and the other doesn’t are not byte-identical. Dupe leaves them alone. Good — you might want both. - Pages bundles (
.pagesfiles are actually directories on disk) are hashed correctly as units, not as their internal XML pieces.
Safety:
- Trash-only deletion. Restore from Trash within 30 days if you change your mind.
- Word and Pages auto-recovery folders (inside
~/Library/Containers/) are excluded — Dupe never touches app sandbox data. - Hidden folders are skipped automatically.
If you’ve been saving “FINAL_v3_REAL_FINAL.docx” copies for years, a Dupe scan over Documents and Desktop will surface the byte-identical duplicates without confusing them with revisions.
More Dupe tips
-
Apple Photos Duplicates album — what it catches and what it misses
The Photos app Duplicates album is handy, but it has real limits. Here's what it finds, what it doesn't, and how to fill the gaps.
-
Clean up leftover files from uninstalled apps on Mac
Dragging an app to the Trash doesn't remove all its data. Here's where the leftovers live and how to clean them.
-
Clean up your Mac without buying a cleaner app
Most paid cleaner apps do things macOS already does. Here's a free, manual workflow that's just as effective.
-
A no-bullshit guide to cleaning up your Mac's disk
Skip the SEO bait and the sketchy cleaner apps. Here's what actually works to reclaim disk space on a Mac.