What This Workflow Does
This automation solves the massive time drain of video content creation. Instead of spending hours scripting, recording, editing, and rendering, you provide a simple topic idea and receive a complete 60-second faceless video ready for publishing.
The system intelligently coordinates multiple AI tools: Gemini writes the script, ElevenLabs generates the voiceover, Leonardo creates matching visuals, and Shotstack assembles everything with perfect timing. What traditionally required a team of specialists now happens automatically in minutes.
For content creators, marketers, educators, and businesses, this means scaling video production without proportional increases in time, cost, or team size. You maintain consistent quality while dramatically increasing output capacity.
How It Works
1. Input & Script Generation
You enter a topic or idea into the workflow. Google Gemini analyzes this input and generates a concise 60-second script optimized for faceless video format, including natural pacing and scene transitions.
2. Voiceover Creation
The script passes to ElevenLabs, which converts text to high-quality, emotionally nuanced speech. The audio file is uploaded to Google Drive and made accessible for the next stages while simultaneously being transcribed for timing accuracy.
3. Visual Generation & Timing
OpenAI Whisper transcribes the voiceover, then Gemini creates timestamped image prompts matching the script content. Leonardo AI generates corresponding visuals for each scene based on these precise descriptions.
4. Assembly & Final Output
Leonardo stitches images into scene videos, then Shotstack assembles everything with proper timing, transitions, and effects. The final polished video downloads automatically to your storage, ready for publishing.
Who This Is For
Content creators needing daily YouTube Shorts, TikTok, or Instagram Reels without appearing on camera. Marketing agencies scaling client content production without hiring additional editors. Educators and trainers creating consistent instructional materials. Solopreneurs building personal brands through video without video editing skills. Business teams producing internal communications, product updates, or customer onboarding content.
Pro tip: Start with 2-3 videos weekly to establish consistency, then scale to daily production as you refine your prompts and workflow settings. The system improves with usage.
What You'll Need
- Active accounts with API access for Google Gemini, ElevenLabs, OpenAI Whisper, Leonardo AI, and Shotstack
- Google Cloud Console setup with Drive API enabled
- n8n instance (cloud or self-hosted) with internet connectivity
- Basic understanding of how to import JSON workflows into n8n
- Storage destination for final video files (local or cloud)
Quick Setup Guide
- Download the template file using the button above
- Import the JSON into your n8n instance via the workflow import function
- Configure credentials for each service in their respective nodes
- Test with a simple topic in the "Set Idea" node
- Execute the workflow and monitor progress through each phase
- Download your first generated video from the final node
- Adjust prompts and timing parameters based on initial results
Key Benefits
95% time reduction: Transform 4-hour editing sessions into 15-minute automated processes, freeing up creative energy for strategy rather than production.
Consistent quality at scale: Maintain professional standards across dozens or hundreds of videos without quality degradation from editor fatigue or tight deadlines.
Cost-effective content strategy: Replace $5,000+/month editing costs with predictable API expenses, typically under $300/month for substantial volume.
Rapid experimentation and iteration: Test different topics, styles, and formats quickly without sunk time costs, allowing data-driven content strategy.
Future-proof production: As AI models improve, your automated videos automatically benefit from enhanced capabilities without workflow changes.