Find and Remove Duplicate Files Fast
dupeGuru scans your folders with fuzzy matching and content-based comparison to find duplicate files, photos, and music across Windows, macOS, and Linux.
What is dupeGuru?
A cross-platform duplicate file finder built on fuzzy matching algorithms and content-based scanning
Duplicate files without the guesswork
dupeGuru is a free, open-source application that finds duplicate files on your computer. Unlike basic search tools that only compare filenames, dupeGuru uses fuzzy matching algorithms to catch near-identical files even when names have been changed. It also does byte-level content comparison using MD5 hashes, so it can identify exact copies regardless of what the file is called.
The program runs on Windows (7 through 11), macOS (10.11 and up), and Linux (Ubuntu, Fedora, Arch, plus Flatpak and Snap). It is written in Python with a Qt5 interface, licensed under GPLv3, and costs nothing to use. Originally created by Virgil Dupras at Hardcoded Software, dupeGuru is now maintained by Andrew Senetar and a community of 32 contributors on GitHub, where the project has over 7,400 stars.
Three scan modes for different file types
dupeGuru ships with three distinct scan modes. Standard mode works on any file type and offers both filename-based and content-based matching. Music mode reads audio tag metadata (artist, title, album, track number) to find duplicate songs that may have different filenames or bitrates. Picture mode uses a 15×15 color grid analysis to compare images visually, catching resized copies and minor edits that content hashing would miss.
Built for real-world file management
A reference folder system lets you mark certain directories as untouchable, so dupeGuru will never flag those files for deletion. Results appear in grouped rows with match percentages displayed alongside each duplicate. You can review everything before committing, send items to the system trash instead of permanently deleting, and export your results to HTML or CSV for later review. Folders are added through drag-and-drop, and scan thresholds are adjustable in the preferences panel.
Quick facts
Key Features
dupeGuru packs serious duplicate-finding power into a straightforward interface. Here is what makes it a go-to tool for cleaning up cluttered drives.
Fuzzy Filename Matching
dupeGuru does not require exact filenames to spot duplicates. Its fuzzy matching algorithm catches renamed files, slight typos, and numbering differences like photo.jpg versus photo (1).jpg. You set a match threshold percentage, and dupeGuru handles the rest. This alone catches duplicates that most file comparers miss entirely.
Content-Based Scanning
Beyond filenames, dupeGuru compares files byte-by-byte using MD5 hash comparison. Two files with completely different names but identical content will still show up as duplicates. This is the most reliable detection mode when you need 100% certainty that files are true copies, regardless of what they happen to be named.
Picture Mode
Picture Mode goes further than file hashes. It builds a 15×15 color grid for each image and compares the actual visual content. Resized copies, re-exported JPEGs, and lightly edited versions of the same photo all get flagged. If you have thousands of photos scattered across backup folders, this mode finds the near-duplicates that standard scanning cannot.
Music Mode
Music Mode reads audio metadata tags – artist, title, album, genre, duration – to identify duplicate tracks. A song ripped from CD at 192kbps and the same track downloaded at 320kbps will match on tag data even though the files differ in size and bitrate. You can set which tags matter and how closely they need to match.
Reference Folder System
Mark specific folders as “Reference” to protect their contents from deletion. dupeGuru will find duplicates against those folders but only flag the copies elsewhere. Your curated library stays untouched while the messy backup gets cleaned out.
Grouped Results View
Scan results display as grouped clusters with match percentages shown alongside each file. Delta values in orange highlight differences in size and modification date. You can quickly see which copy to keep and which to remove at a glance.
Safe Deletion
Files go to the system trash by default instead of permanent deletion. You review every selection before confirming the bulk removal. Nothing gets destroyed without your explicit approval, so recovery is always one click away.
Cross-Platform Support
Built on Python and Qt, dupeGuru runs natively on Windows (7 through 11), macOS (10.11+), and Linux (Ubuntu, Fedora, Arch, Flatpak). The interface and features are identical across all three platforms.
Export Results
Save your scan results to HTML or CSV for documentation or later review. This is handy when you need to audit large drives before committing to deletions, or when you want to share findings with a team.
Drag-and-Drop Folders
Adding scan targets is as simple as dragging folders from your file manager into the dupeGuru window. No manual path entry needed. You can mix folders from different drives and set each one as Normal, Reference, or Excluded independently.
Want to see dupeGuru in action? Jump straight to getting started.
System Requirements
dupeGuru runs on Windows, macOS, and Linux. Here are the hardware and software specs you need before installing.
| Component | Minimum | Recommended |
|---|---|---|
| OS (Windows) | Windows 7 (32-bit or 64-bit) | Windows 10 or 11, 64-bit |
| OS (macOS) | macOS 10.11 El Capitan | macOS 12 Monterey or newer |
| OS (Linux) | Ubuntu 18.04 / Fedora 30 / Arch | Ubuntu 22.04+ / Fedora 38+ |
| Processor | Any x86 or x86_64 CPU | Multi-core processor (Intel i3 / AMD Ryzen 3 or better) |
| RAM | 512 MB | 2 GB or more (4 GB+ for large file collections) |
| Disk Space | ~50 MB for installation | 50 MB + extra temp space for scan cache |
| Display | 1024 × 768 | 1280 × 800 or higher |
Ready to get started? Download dupeGuru for your platform.
Download dupeGuru
Grab dupeGuru 4.3.1 for your operating system. All downloads come straight from the official GitHub repository — no bundled software, no ads.
All packages hosted on GitHub Releases. No installer needed for Windows — just unzip and run.
Getting Started with dupeGuru
From download to your first duplicate scan in under ten minutes. This walkthrough covers everything you need to clean up wasted storage space.
Head to our download section above to grab dupeGuru 4.3.1 for your operating system. The Windows 64-bit portable ZIP weighs in at 45.4 MB, while the 32-bit build is 38.7 MB. Linux packages are much smaller — the .deb file for Ubuntu/Debian is only 1.19 MB since it pulls dependencies from your system repos.
Which version should you pick?
If you run Windows 10 or 11, go with the 64-bit portable ZIP. It runs without installation and leaves no registry entries behind. The 32-bit build is there for older machines running Windows 7 or 8 that lack 64-bit support.
Mac users can install through Homebrew with a single terminal command (covered below), or grab the DMG from the GitHub releases page. On Linux, the .deb package works for Ubuntu and Debian, while Fedora users should pick the .rpm. Flatpak is available for other distributions.
All downloads come from the official GitHub repository maintained by Andrew Senetar. The source code is published under the GPLv3 license, so you can verify every line if you want to.
Windows (Portable ZIP)
- Right-click the downloaded dupeguru-win64_4.3.1.zip file and select Extract All (or use 7-Zip if you have it).
- Pick a destination folder. Your Desktop, Documents, or a dedicated Tools folder all work fine.
- Open the extracted folder and double-click dupeguru.exe to launch.
- If Windows SmartScreen shows a blue warning that says “Windows protected your PC,” click More info, then click Run anyway. This appears because the portable build is not code-signed, but the software is safe and open source.
No installation wizard, no registry changes, no admin rights needed. Your settings are stored in the same folder as the executable.
Windows via package managers
If you prefer automated setup, dupeGuru is available through two package managers:
Both commands handle the download, extraction, and PATH setup automatically.
macOS
The quickest way on Mac is Homebrew:
This places dupeGuru.app in your Applications folder. On first launch, macOS Gatekeeper may block it. If that happens, go to System Preferences > Security & Privacy > General and click Open Anyway. You can also right-click the app, select Open, and confirm from the dialog. Works on both Intel and Apple Silicon Macs.
Linux
Install from your distribution’s package manager or use the universal Flatpak option:
sudo apt install ./dupeguru_4.3.1_amd64.deb
# Fedora / RHEL
sudo dnf install ./dupeguru-4.3.1-2.x86_64.rpm
# Arch Linux (AUR)
yay -S dupeguru
# Flatpak (any distro)
flatpak install flathub com.hardcoded_software.dupeGuru
After installation, search for “dupeGuru” in your application menu or run dupeguru from the terminal.
dupeGuru opens straight to its main window with no first-run wizard or registration. The interface is ready to go immediately, but a few preference tweaks will save you time on future scans.
Opening Preferences
On Windows and Linux, go to Edit > Preferences. On macOS, it is dupeGuru > Preferences in the menu bar. Here is what to adjust:
- Filter Hardness (slider, 1-100): This controls how closely filenames must match to be flagged as duplicates. The default sits at 80, which works well for most scans. Drop it to 60-70 if you want to catch more renamed files. Raise it above 90 when you only want near-exact name matches. This setting applies to filename-based scans only — content scans use exact byte comparison regardless.
- Remove empty folders after delete or move: Turn this on. When you delete or move duplicate files, dupeGuru will automatically clean up any folders that become empty afterward.
- Copy and Move behavior: Three options here. “Right in destination” dumps files flat into your target folder. “Recreate relative path” preserves the subfolder structure relative to the scanned root. “Recreate absolute path” mirrors the full directory tree. The relative path option is the most practical for most people.
- Custom Command: Advanced users can wire up an external diff tool here. Use
%das a placeholder for the duplicate path and%rfor the reference path. Handy for comparing files in Beyond Compare or Meld before deleting.
If you are switching from AllDup, Czkawka, or another duplicate finder, there is no import function. dupeGuru starts with a clean slate each session — just point it at the same folders and run a fresh scan.
Let’s walk through finding and removing duplicate files from a cluttered photo collection. This same workflow applies to documents, videos, music, or any other file type.
Choose your scan mode
At the top of the main window, you will see a mode selector with three options: Standard, Music, and Picture. Pick Standard for general files, Music for MP3/FLAC collections (compares audio tags like artist, album, and title), or Picture for image similarity detection. For this walkthrough, select Standard.
Pick a scan type
Below the mode selector is the Scan Type dropdown. Your choices in Standard mode are:
- Content: Compares files byte-by-byte using MD5 hashing. Finds only exact copies regardless of filename. This is the most reliable option and what you should use when you want zero false positives.
- Filename: Uses fuzzy matching to flag files with similar names. Good for catching renamed copies like “report.docx” vs “report_final.docx”.
- Folders: Identifies entire duplicate directories where all contained files match.
Select Content for your first scan.
Add folders to scan
Click the “+” button at the bottom-left corner of the folder list. A file browser opens — navigate to the folder you want to scan and confirm. You can add multiple folders. Drag-and-drop also works: just pull a folder from File Explorer (or Finder on Mac) directly into the list.
Set folder states
This step is optional but important. Click on any folder in the list to select it, then look at the state indicator next to it. Three options:
- Normal: Files here can be marked and deleted.
- Reference: Files here are protected. They will always be treated as the “original” and never flagged for deletion.
- Excluded: Folder is completely skipped during scanning.
If you have a master photo archive and want to find duplicates only in your unsorted downloads folder, mark the archive as Reference and the downloads folder as Normal.
Run the scan and review results
Click the blue Scan button. A progress bar appears while dupeGuru groups files by size and then computes MD5 hashes. Once finished, results appear as grouped rows. The first entry in each group is the reference file (checkbox disabled, cannot be deleted). Indented rows below it are duplicates with active checkboxes.
Each duplicate shows a Match % badge. Content scans always show 100% since they are exact matches. Filename scans show the fuzzy similarity score.
Mark and act on duplicates
- Click individual checkboxes to mark specific duplicates, or go to Edit > Mark All to select everything at once.
- To remove duplicates safely, go to Actions > Send Marked to Recycle Bin. Files land in your trash where you can restore them if needed.
- Alternatively, use Actions > Move Marked to… to relocate duplicates to a holding folder for manual review.
Keyboard shortcuts
| Shortcut | Action |
|---|---|
| Space | Toggle mark on selected duplicates |
| Ctrl+A | Select all entries |
| Delete | Remove selected from results (does not delete files) |
| Double-click | Open file with default application |
Performance optimization
Content scans group files by size before hashing, so they skip unique-sized files entirely. This means large folders with varied file types scan faster than you might expect. If dupeGuru uses too much RAM on large collections (100,000+ files), scan subfolders in batches rather than pointing it at an entire drive.
Picture mode uses a 15×15 color grid comparison that scales quadratically — scanning 10,000 images takes significantly longer than 1,000. Start with smaller folders and work your way up.
Common mistakes to avoid
- Forgetting Reference folders: Without setting a Reference folder, dupeGuru picks the largest file in each group as the keeper. This might not be the copy you actually want to preserve. Always mark your master archive as Reference before scanning.
- Mass-marking without reviewing: Fuzzy filename scans can produce false positives, especially at lower Filter Hardness values. Skim the results before hitting “Send to Recycle Bin.”
- Running from inside the ZIP: On Windows, double-clicking dupeguru.exe inside the compressed archive extracts it to a temporary folder. Settings will not persist. Extract the full ZIP first.
Power-user features
- Delta Values mode: Toggle this in the results view to see relative differences from the reference. File sizes display as “+0.2 MB” or “-0.5 MB,” and non-numerical fields turn orange when they differ. Useful for spotting quality differences between near-duplicates.
- Dupes Only view: Hides reference files from the results, letting you sort all duplicates by any column across groups. Sort by folder path to quickly find which directories hold the most copies.
- Re-Prioritize Results: Go to Actions > Re-Prioritize Results to bulk-reassign which files become references based on criteria like size, folder path, or filename patterns.
- Ignore List: Select file pairs that keep appearing as false matches and use Actions > Add Selected to Ignore List. They will be skipped in future scans.
- Export to CSV/HTML: Use Actions > Export Results to XHTML to generate a shareable report of all found duplicates before you act on them.
Where to get help
The official GitHub repository has an active Issues section where you can search for known problems or report bugs. The official documentation covers advanced topics like building from source and custom commands. Community discussions pop up regularly on r/DataHoarder and r/software where experienced users share scan strategies.
dupeGuru does not auto-update. To check for a new version, visit the download section on this page or watch the GitHub releases page.
Ready to reclaim your storage? Download dupeGuru and start finding duplicates in minutes.
Frequently Asked Questions
Answers to the most common questions about downloading, installing, and using dupeGuru for duplicate file detection.
Is dupeGuru safe to download and install?
Yes, dupeGuru is safe to download when you get it from the official GitHub repository. The project is open source under the GPLv3 license, which means anyone can read and audit the entire Python codebase for backdoors or malicious code. Over 7,400 GitHub users have starred the project, and 32 contributors have worked on it over the years.
The official Windows 64-bit package (version 4.3.1, released July 9, 2022) is a 45.4 MB ZIP file hosted directly on GitHub Releases. VirusTotal scans of the official download consistently show 0 detections across 70+ antivirus engines. The installer contains no bundled adware, browser toolbars, or third-party offers. Windows SmartScreen may show a warning because the executable is not code-signed with an expensive certificate, but that is normal for free open-source software.
- Download only from our download section or the official GitHub releases page
- Avoid third-party download sites like Softonic, CNET, or FileHippo that may wrap installers with adware
- Check the file hash against the published SHA256 on the GitHub release page if you want extra verification
Pro tip: If Windows Defender or SmartScreen flags the download, click “More info” then “Run anyway.” This happens because dupeGuru is not commercially signed, not because there is anything wrong with the file.
For step-by-step installation instructions, see our Getting Started guide.
Is dupeGuru free from malware and spyware?
dupeGuru is completely free from malware, spyware, and adware. The entire source code is publicly available on GitHub at github.com/arsenetar/dupeguru, so security researchers and developers can verify there is nothing hidden in the code.
The project has been around since 2009 (originally by Hardcoded Software / Virgil Dupras) and is now maintained by Andrew Senetar. In over 15 years, no credible security researcher has ever reported malicious code in the official releases. The dupeguru.exe process runs locally on your machine, does not phone home to any server, and does not collect or transmit any personal data. It reads file metadata and contents only for the purpose of finding duplicates.
- No internet connection required to use dupeGuru after downloading
- No user accounts, telemetry, or analytics built into the application
- No background processes running after you close the program
- GitHub’s security overview reports zero known vulnerabilities in the codebase
Pro tip: A common mistake is downloading dupeGuru from unofficial mirror sites. Stick to GitHub Releases or our download section to avoid tampered copies.
Check the system requirements to confirm your PC meets the minimum specs before downloading.
Does dupeGuru work on Windows 11?
Yes, dupeGuru 4.3.1 runs on Windows 11 without issues. It is compatible with Windows 7, 8, 8.1, 10, and 11 in both 32-bit and 64-bit editions. Most users on Windows 10 and 11 should grab the 64-bit build (45.4 MB), which handles large file collections more efficiently thanks to access to more than 4 GB of RAM.
The application uses a Qt5 graphical interface that renders natively on Windows 11. Some users report the window chrome looks slightly dated compared to modern Windows 11 apps, but all functionality works as expected. On rare occasions, users scanning more than one million files have reported slowdowns, though these are Python memory constraints rather than Windows 11 compatibility problems.
- Windows 11 64-bit: use dupeguru-win64_4.3.1.zip (45.4 MB)
- Windows 11 32-bit (uncommon): use dupeguru-win32_4.3.1.zip (38.7 MB)
- No .NET Framework or Java runtime required
- Works on both Intel and AMD processors
Pro tip: On Windows 11, right-click the dupeGuru executable and select “Run as administrator” if you need to scan system folders or Program Files directories where normal user permissions may block file access.
See full hardware and OS requirements on our system requirements page.
Can I use dupeGuru on macOS or Linux?
dupeGuru is a cross-platform application that runs on macOS and Linux in addition to Windows. The same Python codebase powers all three platforms, so the feature set is identical regardless of your operating system.
On macOS 10.11 (El Capitan) and newer, you can install dupeGuru through Homebrew with brew install --cask dupeguru or build from source. On Linux, official packages are available for Ubuntu/Debian (.deb, 1.19 MB), Fedora/RHEL (.rpm, 1.31 MB), and Arch Linux via the AUR. Flatpak and Snap packages also exist for other distributions.
- macOS: Install via Homebrew (
brew install --cask dupeguru) or clone the GitHub repo and build with Python 3 - Ubuntu/Debian: Download the .deb file from GitHub Releases and install with
sudo dpkg -i dupeguru_4.3.1_amd64.deb - Fedora/RHEL: Download the .rpm and install with
sudo rpm -i dupeguru-4.3.1-2.x86_64.rpm - Arch Linux: Install from the AUR with your preferred helper (
yay -S dupeguru)
Pro tip: On Linux, if you hit dependency issues with the .deb package, run sudo apt-get install -f after the install to pull in missing Qt5 libraries automatically.
Visit our download section for direct links to all platform packages.
What are the minimum system requirements for dupeGuru?
dupeGuru is lightweight and runs on modest hardware. The minimum requirements are any x86 processor, 512 MB of RAM, approximately 50 MB of disk space, and a display resolution of 1024×768. For comfortable use with large file collections, 2 GB of RAM and a multi-core CPU are recommended.
The RAM requirement is the most important factor. dupeGuru loads file metadata into memory during scans, so collections with hundreds of thousands of files can push memory usage past 1 GB. Users on r/DataHoarder have noted that scanning terabyte-scale libraries of 500,000+ files works best with 4 GB or more of free RAM. The 64-bit Windows build removes the 2 GB memory ceiling that limits the 32-bit version.
- OS: Windows 7+, macOS 10.11+, or Ubuntu 18.04+
- CPU: Any x86/x64 processor (multi-core recommended for Picture mode)
- RAM: 512 MB minimum, 2 GB+ recommended for large scans
- Disk: ~50 MB for the application, plus temporary space for scan cache
- Display: 1024×768 minimum, 1280×800+ recommended
Pro tip: Close other memory-heavy applications (browsers with many tabs, video editors) before running a large scan. dupeGuru’s Python runtime competes for the same heap memory, and running out causes the “Fiddling with results” step to stall.
Check the full requirements table on our system requirements section.
Is dupeGuru completely free to use?
Yes, dupeGuru is 100% free. There is no premium tier, no trial period, no feature gating, and no in-app purchases. Every feature, including the Standard, Music, and Picture scan modes, is available to all users at no cost.
The software is released under the GNU General Public License v3.0 (GPLv3), which guarantees that it will remain free and open source. You can use dupeGuru for personal, commercial, or educational purposes without paying anything. Some users confuse the existence of dupeguru.com (an informational site) with a commercial version, but there is no paid edition of dupeGuru. The only official releases come from Andrew Senetar’s GitHub repository.
- No registration or account creation required
- No ads displayed inside the application
- No feature limitations based on free vs. paid tiers
- GPLv3 license means the source code is permanently open
Pro tip: If any website asks you to pay for dupeGuru or prompts you to enter credit card information, you are on a scam site. Close it and download from our official download section instead.
Learn about all the features you get for free on our features page.
What license does dupeGuru use and can I use it for business?
dupeGuru uses the GNU General Public License v3.0 (GPLv3), which allows free use in any context, including commercial and business environments. You can install it on as many company workstations as you need without purchasing licenses.
The GPLv3 permits you to copy, distribute, and modify the software. If you modify dupeGuru and distribute your modified version, you must also release your changes under GPLv3. But simply using the application as-is in a corporate setting carries no obligations beyond the license terms. IT administrators at small businesses often deploy dupeGuru to reclaim storage space on shared network drives.
- Personal use: fully permitted, no restrictions
- Commercial/business use: fully permitted, no per-seat licensing
- Redistribution: allowed if you include the GPLv3 license text
- Modification: allowed, but derivative works must also be GPLv3
Pro tip: For enterprise deployments across many machines, use the portable ZIP version. It does not require administrator privileges to install, making it easy to distribute via shared network folders or software deployment tools like SCCM.
See how dupeGuru compares to commercial alternatives in the comparison question below, or explore our features overview.
How do I download and install dupeGuru on Windows?
Installing dupeGuru on Windows takes about two minutes. The Windows version comes as a portable ZIP file, so there is no traditional installer wizard to click through.
The latest stable release is version 4.3.1 (July 9, 2022). The 64-bit package is 45.4 MB and the 32-bit package is 38.7 MB. Both are hosted on GitHub Releases. After extracting the ZIP, you get a folder containing dupeguru.exe and its dependencies. No .NET Framework, Java, or Visual C++ redistributable is needed.
- Go to our download section and click the Windows 64-bit download button
- Save the ZIP file (dupeguru-win64_4.3.1.zip) to a location like your Downloads folder
- Right-click the ZIP file and select “Extract All” (or use 7-Zip / WinRAR)
- Open the extracted folder and double-click dupeguru.exe to launch the application
- If Windows SmartScreen appears, click “More info” then “Run anyway”
Pro tip: Move the extracted dupeGuru folder to a permanent location like C:ToolsdupeGuru before first launch. Right-click dupeguru.exe and select “Pin to taskbar” or “Create shortcut” on your desktop for quick access.
For detailed first-run configuration steps, check our Getting Started guide.
Should I use the portable ZIP or is there an installer version?
The official dupeGuru download for Windows is a portable ZIP file. There is no traditional .exe or .msi installer in the 4.3.1 release. This is actually an advantage for most users because it means dupeGuru does not write to your Windows Registry, does not require administrator privileges, and can be run from a USB drive.
The portable format means you can carry dupeGuru on a flash drive and run it on any Windows PC without installation. Your settings and scan cache are stored in the application folder, keeping everything self-contained. The 64-bit ZIP is 45.4 MB, which expands to roughly 120 MB after extraction. A community-maintained PortableApps.com launcher also exists (dupeGuru Portable 4.3.1 Dev Test 1) if you prefer the PortableApps platform format.
- Official ZIP: download, extract, and run. No admin rights needed.
- PortableApps format: available from portableapps.com for integration with the PortableApps menu
- Chocolatey: Windows package manager users can install with
choco install dupeguru - No installer means no uninstaller needed either. Just delete the folder to remove it completely.
Pro tip: If you want dupeGuru in your Start Menu, create a shortcut to dupeguru.exe manually and place it in %AppData%MicrosoftWindowsStart MenuPrograms.
Ready to grab it? Head to our download section for all available packages.
Why is dupeGuru stuck on “Fiddling with results” and how do I fix it?
The “Almost done! Fiddling with results…” message appears when dupeGuru is processing and grouping duplicate matches before displaying them. On large scans with tens of thousands of matches, this step can take minutes or even hours because the grouping algorithm scales exponentially with the number of results.
This is a known limitation reported in GitHub issue #571. The root cause is Python’s memory management and the Qt GUI’s list rendering overhead. When dupeGuru finds 50,000+ duplicate groups, loading them all into the results table strains both RAM and CPU. Users scanning photo libraries with fuzzy matching at low filter hardness are most likely to hit this bottleneck.
- Increase the Filter Hardness slider (try 80-90%) to reduce the number of fuzzy matches found
- Scan smaller folder batches instead of your entire drive at once
- Use “Contents” scan type instead of “Filename” when possible, as exact matches produce fewer groups
- Close all other applications to free up RAM before scanning
- If scanning photos, set the Picture mode threshold higher to limit weak matches
Pro tip: If dupeGuru has been “fiddling” for over 30 minutes, it is probably still working. Check Task Manager to see if dupeguru.exe is still consuming CPU. If CPU usage is near zero and RAM is maxed out, the process is likely memory-starved and you should restart with a smaller scan scope.
For alternative approaches to large collections, see the comparison with Czkawka in the question below, or consult our features section for details on scan modes.
Why can’t dupeGuru send files to the Recycle Bin?
The most common reason dupeGuru fails to send files to the Recycle Bin is file permissions. You need write access to the files and their parent directories for the trash operation to work. This frequently happens with files on network drives (NAS devices), external USB drives formatted as NTFS with restricted permissions, or system-protected directories.
The official FAQ confirms this is a permissions issue in most cases. On Windows, the Recycle Bin mechanism requires the file system to support it, and some network storage protocols (SMB shares) do not support the standard Windows trash operation. On Linux, the trash implementation varies by desktop environment, and dupeGuru may not locate the correct trash directory on non-standard setups.
- Verify you have write permissions on the files by trying to rename one manually in File Explorer
- For network drives: open dupeGuru Preferences and enable “Directly delete files” to bypass the Recycle Bin entirely
- For external drives: make sure the drive is not write-protected or mounted as read-only
- On Windows, try running dupeGuru as administrator (right-click, “Run as administrator”)
Pro tip: Before enabling “Directly delete files,” test with a small batch first. Unlike the Recycle Bin, direct deletion is permanent and cannot be undone. Always double-check your selections, especially when using the Reference Folder system.
Learn about the Reference Folder safety feature in our Getting Started guide.
dupeGuru is using too much RAM during scans. How do I reduce memory usage?
dupeGuru loads file metadata into memory for comparison, and this can consume significant RAM on large collections. GitHub issue #730 documents cases where scanning 500,000+ files pushed memory usage past 2 GB. The 32-bit Windows build has a hard 2 GB ceiling, while the 64-bit build can use as much RAM as your system has available.
The Picture mode is the heaviest consumer because it generates a 15×15 color grid fingerprint for each image and holds them all in memory for pairwise comparison. Standard mode with Contents scan type is more efficient because it uses MD5 hash comparisons that require less persistent memory. Fuzzy filename matching falls somewhere in between.
- Use the 64-bit build on Windows to remove the 2 GB memory limit
- Scan in smaller batches (e.g., one folder at a time instead of your entire drive)
- Prefer “Contents” scan type over “Filename” fuzzy matching when exact duplicates are sufficient
- Close browsers and other memory-heavy applications before scanning
- On Linux, increase your swap space if RAM runs low during large scans
Pro tip: If you regularly scan collections over 500,000 files, consider using Czkawka instead. Written in Rust, it uses roughly 25% less RAM (122 MB vs. 164 MB in benchmark tests) and completes scans 2-3 times faster than dupeGuru’s Python engine.
Check the system requirements for recommended RAM based on your collection size.
How do I update dupeGuru to the latest version?
dupeGuru does not have a built-in auto-update mechanism. To update, you download the latest release manually from GitHub and replace your existing installation. The current stable version is 4.3.1, released on July 9, 2022.
Since dupeGuru is a portable application on Windows, updating is straightforward: download the new ZIP, extract it, and start using it. Your scan results from the old version will not carry over, but there is no user data or configuration that needs migrating. On Linux, if you installed via a package manager (apt, dnf, or AUR helper), you can check for updates through that package manager, though updates depend on the maintainer packaging the new version.
- Visit our download section or the GitHub Releases page
- Download the package for your operating system
- On Windows: extract the new ZIP to a fresh folder, then delete the old one
- On Linux: install the new .deb or .rpm over the existing version
- On macOS: rebuild from source or update via Homebrew with
brew upgrade dupeguru
Pro tip: Version 4.3.1 is a critical bugfix release. If you are running 4.2.0 through 4.3.0, update immediately. Those versions had a bug (GitHub issue #1015) where the content scan cache could produce false duplicate matches, meaning non-duplicate files were incorrectly flagged as duplicates.
See what features are available in the current release.
Is dupeGuru still maintained and actively developed?
dupeGuru is community-maintained but development has slowed significantly. The last stable release (4.3.1) came out in July 2022. Andrew Senetar took over as lead maintainer from the original developer Virgil Dupras (Hardcoded Software) and has merged occasional pull requests, but no new feature releases have shipped since then.
The GitHub repository still accepts issues and discussions. As of early 2026, there are several hundred open issues, and the project has not been archived or marked as deprecated. Community members still submit patches and translations. However, if you need active development with new features and regular updates, alternatives like Czkawka (last release February 2024, written in Rust) offer more momentum.
- Last release: version 4.3.1 on July 9, 2022
- GitHub status: open, not archived, accepting pull requests
- 32 total contributors over the project’s lifetime
- 7,400+ GitHub stars indicate continued community interest
Pro tip: Even without frequent updates, dupeGuru 4.3.1 is stable and works well for most duplicate-finding tasks. The core functionality, especially content-based and fuzzy filename matching, is mature and reliable. Missing features like a modern UI or built-in video comparison are the main gaps.
Explore what dupeGuru can do today on our features page.
dupeGuru vs Czkawka – which duplicate finder is better?
It depends on what matters most to you. Czkawka is faster and more actively developed. dupeGuru has a more mature fuzzy matching engine and a larger user community built over 15+ years.
In benchmark tests, Czkawka scanned 4.1 GB of files in 8 seconds compared to dupeGuru’s 22 seconds. Czkawka’s RAM usage peaked at 122 MB versus dupeGuru’s 164 MB for the same dataset. Czkawka is written in Rust (compiled, memory-safe), while dupeGuru uses Python (interpreted, slower for CPU-bound tasks). Czkawka also finds empty folders, broken symlinks, temporary files, and similar videos, features dupeGuru lacks entirely.
- Speed: Czkawka is 2-3x faster. Rust vs. Python makes a measurable difference.
- RAM usage: Czkawka uses about 25% less memory on equivalent workloads.
- Features: Czkawka has more built-in tools (empty folders, broken symlinks, temp files). dupeGuru has a more customizable matching engine and the Reference Folder system.
- Development: Czkawka’s last release was February 2024. dupeGuru’s was July 2022.
- Platform support: Both run on Windows, macOS, and Linux. Czkawka has a modern GTK4 UI.
Pro tip: If you deal with massive photo libraries (100,000+ images), try both. dupeGuru’s Picture mode with its 15×15 grid color analysis catches visually similar images that Czkawka may miss, but Czkawka will finish the scan much faster.
See dupeGuru’s unique features on our features page.
What are the best free alternatives to dupeGuru?
The three strongest free alternatives are Czkawka, AllDup, and Duplicate Cleaner Free. Each targets a different type of user, so the best choice depends on your workflow and operating system.
Czkawka is the closest competitor: open source, cross-platform, and faster than dupeGuru thanks to its Rust codebase. AllDup is a Windows-only freeware tool with a simpler interface that many beginners prefer for straightforward duplicate-by-content scans. Duplicate Cleaner has a free version with a polished UI and folder comparison features, though its more powerful options are locked behind a paid Pro license (around $30).
- Czkawka: Open source, cross-platform, fast Rust engine, finds empty folders and broken symlinks too. Best for power users on any OS.
- AllDup: Freeware, Windows-only, content analysis, batch processing, clean interface. Best for Windows users who want something simple.
- Duplicate Cleaner Free: Free version covers basic scanning, polished Windows UI, folder comparison. Paid Pro version adds more filters. Best for users who want a visual, guided experience.
- fdupes (Linux): Command-line tool for Linux power users. No GUI, but extremely fast for scripted workflows.
Pro tip: dupeGuru’s Reference Folder system is something most alternatives lack. If your workflow depends on protecting a “master” folder while deleting duplicates only from other locations, that feature alone may keep you on dupeGuru.
Compare these tools against dupeGuru’s full capability list on our features page.
How does the Reference Folder system work in dupeGuru?
The Reference Folder system is dupeGuru’s built-in safety net for protecting important files. When you set a folder’s state to “Reference” in the Folder Selection panel, dupeGuru will never mark files from that folder for deletion. Only files from “Normal” folders can be selected for removal.
This is particularly useful when you have a curated photo library or a backup drive that you consider the “master” copy, and you want to find and delete only the duplicates scattered across other locations. In the results table, reference files appear with their mark checkbox disabled, so you physically cannot accidentally delete them. If all duplicates of a file exist only in reference folders, dupeGuru shows the group but marks nothing.
- Add your folders to dupeGuru using the “+” button
- Click the folder state dropdown next to each folder path
- Set your master/important folders to “Reference”
- Set folders where duplicates should be deleted to “Normal”
- Run the scan. Only files in “Normal” folders will be selectable for deletion.
Pro tip: You can also promote individual files to reference status after a scan. Select a duplicate in the results, go to Actions, and click “Make Selected into Reference.” This is helpful when you realize mid-review that a specific copy is the one you want to keep.
Walk through a complete example in our Getting Started guide.
How do I scan for duplicate photos using Picture mode?
dupeGuru’s Picture mode finds visually similar images even when file sizes, resolutions, or formats differ. It works by creating a 15×15 pixel color grid fingerprint for each image and comparing these fingerprints with a configurable similarity threshold.
Unlike a simple hash comparison, Picture mode catches cases like: the same photo saved as both JPEG and PNG, photos cropped differently, images resized to different dimensions, and even edited versions with minor color adjustments. The tradeoff is speed. Every image must be compared against every other image, so scanning 10,000 photos takes significantly longer than a standard content scan of 10,000 documents.
- Select “Picture” from the Application Mode dropdown at the top
- Add the folders containing your images
- Set the Filter Hardness slider: 90%+ for near-identical images, 70-85% for visually similar ones
- Choose scan type: “Contents” for visual comparison, “Filename” for name-based matching
- Click “Scan” and wait. CPU usage will spike during the comparison phase.
- Review results. Match percentages (e.g., 96%) indicate visual similarity.
Pro tip: Start with Filter Hardness at 85% and scan a small test folder of 100-200 images first. Review the matches to calibrate what percentage threshold catches real duplicates without flagging unrelated photos. Then apply that setting to your full library.
See all scan modes explained on our features page.
Can I export dupeGuru scan results or recover accidentally deleted files?
Yes, dupeGuru can export scan results to both HTML and CSV formats. After a scan completes, go to File and select “Export Results to HTML” or “Export Results to CSV.” The exported file includes filenames, folder paths, file sizes, and match percentages for all duplicate groups found.
If you accidentally deleted files through dupeGuru, recovery depends on how you deleted them. If you used the default “Send to Trash” option, your files are in the Windows Recycle Bin (or Linux/macOS trash) and can be restored from there. If you enabled “Directly delete files” in Preferences, the deletion is permanent and you would need a file recovery tool like Recuva or TestDisk to attempt recovery from the raw disk, with no guarantee of success.
- Export to CSV for spreadsheet analysis in Excel or Google Sheets
- Export to HTML for a formatted report you can share with colleagues
- Deleted via Trash: open Recycle Bin, find the files, right-click, and select “Restore”
- Deleted permanently: try Recuva (free) immediately. Do not write new data to the drive, as that reduces recovery chances.
Pro tip: Before any bulk deletion, export results to CSV first. This gives you a complete record of what was found and where. If something goes wrong, you have a map of every file path to help locate backups or attempt recovery.
Learn how to set up safe deletion workflows in our Getting Started guide.
Still have questions? Check the dupeGuru GitHub Discussions or download dupeGuru and try it yourself.