What This Workflow Does
Duplicate files silently consume valuable Google Drive storage, create confusion during collaboration, and waste hours of manual cleanup time. This automation solves that problem systematically.
The workflow continuously monitors your specified Google Drive folders, identifies duplicate files using content verification (MD5 checksums), and takes action based on your rules—either moving duplicates to trash or flagging them with "DUPLICATE-" prefixes. It handles the tedious work so your team can focus on what matters.
Beyond simple cleanup, this system establishes ongoing file hygiene, preventing storage bloat before it becomes a costly problem. It's particularly valuable for teams sharing drives, agencies managing client assets, or any business where multiple people upload similar files.
How It Works
The automation follows a logical four-step process that mimics how a meticulous human would clean files, but with perfect consistency and at scale.
1. Trigger & Monitoring
A Google Drive trigger node watches your designated folder (or entire drive) for new file activity. You configure the polling interval—typically every 15-60 minutes—balancing responsiveness with API usage considerations.
2. Configuration & Rules
A configuration node lets you set key decisions: keep the "first" or "last" uploaded version of duplicate files, and choose between "trash" (deletion) or "flag" (renaming) actions. These rules ensure the automation aligns with your business processes.
3. Duplicate Detection
The workflow retrieves file metadata and calculates MD5 checksums—a unique fingerprint of file content. Files with identical checksums are confirmed as exact duplicates, regardless of filename differences. Google Docs/Sheets are intelligently excluded from this comparison.
4. Action & Cleanup
Based on your configuration, duplicates are either moved to Google Drive trash (with recovery possible) or renamed with "DUPLICATE-" prefixes for manual review. The original file you chose to keep remains untouched and accessible.
Pro tip: Start with "flag" mode for 30 days to build confidence in the detection accuracy before switching to automatic deletion. Create a separate "Archive" folder instead of trash if you need longer recovery windows.