What This Workflow Does
This automation solves a common problem for distributed, multilingual teams: voice messages sent in one language on Telegram need to be accessible to everyone in Slack. Manually transcribing and translating these messages is time-consuming and creates communication delays.
The workflow automatically detects when a voice message arrives in a Telegram group or channel, downloads the audio file, transcribes it to text using OpenAI's Whisper model, detects the source language, translates it to your target language using GPT-4o-mini, and posts the cleaned-up translation with attribution into your designated Slack channel. It preserves the speaker's identity and adds appropriate language flags for clarity.
This creates a seamless bridge between voice-based communication in Telegram and text-based collaboration in Slack, breaking down language barriers without changing how team members naturally communicate.
How It Works
The automation follows a logical sequence to transform voice messages into translated text:
1. Telegram Voice Message Trigger
The workflow monitors a specified Telegram channel or group for new voice messages. When a message arrives, it captures the audio file, sender information, and timestamp.
2. Audio Download & Preparation
The audio file is downloaded from Telegram's servers and prepared for processing. The workflow handles various audio formats and ensures the file is within Whisper's size limits (under 25MB).
3. Speech-to-Text Transcription
Using OpenAI's Whisper API, the audio is converted to text. Whisper excels at handling accents, background noise, and conversational speech across numerous languages.
4. Language Detection & Translation
The workflow detects the source language, then uses GPT-4o-mini to translate the text to your target language (default: Japanese β English). GPT provides context-aware translations that preserve meaning better than simple word-for-word translation.
5. Slack Message Formatting & Delivery
The translated text is formatted with the original sender's name, language flags (π―π΅βπΊπΈ), and timestamp, then posted to your specified Slack channel. Team members can read the translation alongside any follow-up discussion.
Who This Is For
This workflow is ideal for distributed teams, international companies, bilingual project managers, and global communities that use both Telegram and Slack. It's particularly valuable for:
- Multilingual project teams with members in different countries
- Customer support teams handling voice messages from international clients
- Content creators and agencies collaborating across language barriers
- Educational institutions with international student communities
- Development teams using Telegram for quick updates but Slack for documentation
What You'll Need
- Telegram Bot Token: Create a bot via @BotFather and add it to your Telegram group/channel
- OpenAI API Key: For accessing Whisper and GPT-4o-mini models
- Slack Bot Token: Create a Slack app with chat:write, files:write, and channels:history permissions
- n8n instance: Self-hosted or n8n.cloud account
- Target languages configured: Define which languages to translate between
Pro tip: For sensitive conversations, consider using OpenAI's enterprise API with data processing agreements, or explore self-hosted Whisper alternatives for complete data privacy.
Quick Setup Guide
Get this automation running in under 15 minutes:
- Import the template: Download the JSON file above and import it into your n8n instance
- Configure credentials: Add your Telegram Bot Token, OpenAI API Key, and Slack Bot Token to n8n's credentials manager
- Set channel IDs: Update the Telegram chat ID and Slack channel ID in the workflow nodes
- Adjust language settings: Modify the "Detect Language" and "Translate" nodes if you need different language pairs
- Test with a voice message: Send a test voice message in your Telegram group to verify the full flow works
- Activate the workflow: Toggle the workflow to "Active" and monitor the executions tab for any errors
Key Benefits
Eliminates manual transcription work that typically takes 5-10 minutes per voice message. The automation handles everything in seconds, freeing team members for higher-value work.
Reduces language barriers in real-time by providing instant translations. Team members can communicate naturally in their preferred language without worrying about excluding others.
Improves meeting and update accessibility by creating searchable, translated text records of voice communications that can be referenced later.
Maintains communication context with proper attribution, timestamps, and language indicators, ensuring everyone understands who said what and when.
Scalable across multiple teams and languages β once configured, the workflow can be duplicated for different language pairs or channel combinations without additional development.