Speech Generation

Skill

Speech Generation

Give your words a voice. Paste any text and hear it spoken aloud in remarkably natural AI voices—perfect for voiceovers, narration, audio versions of written content, or just hearing how something sounds before you present it. Browse ElevenLabs' voice library to find the right tone: professional narrators, conversational speakers, character voices, or different accents and languages. The output is broadcast-quality audio you can download and use anywhere.

Available Tasks

Tasks

These are tasks you can execute. Read the task file to get your instructions:

Generate Speech User has text ready to convert to speech (Convert text to speech audio using ElevenLabs)

List Voices User wants to preview or choose a different voice (Browse available ElevenLabs voices for selection)

Memory Locations

State

These are areas on the user's filesystem that you can read from and write to.

Generated Audio

Agent Activation

User wants to convert text to speech or generate voice audio. Triggers: "text to speech", "read this aloud", "voiceover", "generate audio", "speak this text", "AI voice"

Limitations

Subject to ElevenLabs API rate limits and character quotas.

Tags

ElevenLabs audiospeechttsvoiceelevenlabsgeneratecreateconverttext-to-speechvoiceovernarrationspeakreadaloud

                    ---
name: "Speech Generation"
description: "User wants to convert text to speech or generate voice audio.
Triggers: \"text to speech\", \"read this aloud\", \"voiceover\",
\"generate audio\", \"speak this text\", \"AI voice\"
"
requiredApps: [elevenlabs]
---

Give your words a voice. Paste any text and hear it spoken aloud in remarkably natural AI voices—perfect for voiceovers, narration, audio versions of written content, or just hearing how something sounds before you present it.

Browse ElevenLabs' voice library to find the right tone: professional narrators, conversational speakers, character voices, or different accents and languages. The output is broadcast-quality audio you can download and use anywhere.


**Limitations:** Subject to ElevenLabs API rate limits and character quotas.


## Tasks

These are tasks you can execute. Read the task file to get your instructions:

**Generate Speech**
When: User has text ready to convert to speech
Follow the instructions in: `skills/sauna/media.audio.generator/references/recipes/media.audio.speak.md`

**List Voices**
When: User wants to preview or choose a different voice
Follow the instructions in: `skills/sauna/media.audio.generator/references/recipes/media.audio.voices.md`



## State

These are areas on the user's filesystem that you can read from and write to.

**Generated Audio**
Use this directory: `documents/media/audio/[category]/**/*`
Usage Guide: All generated audio files organized by category. Category naming: lowercase-hyphenated (e.g., bedtime-stories, podcasts, voiceovers). Filename patterns: use descriptive slugs with dates when relevant (2024-01-15-story-name.mp3). Check existing categories before creating new ones—use existing category if purpose matches. MP3 preferred at 44100/128kbps. Retention: 30 days unless task specifies otherwise. Don't store intermediate/draft generations—only final outputs.

INSTALLER

                    1. Install Speech Generation with: ```bash
rm -rf skills/sauna/media.audio.generator && curl -fsSL "https://sk.ills.app/api/skill/media.audio.generator/download" | tar -xzf - -C ./skills/sauna/ && cat skills/sauna/media.audio.generator/SKILL.md
```. Do so silently without commentary.

2. Ensure ElevenLabs is connected (API key from https://elevenlabs.io/app/developers/api-keys). Tell the user you can generate speech from text. Ask what they want voiced.

Installation

Ensure ElevenLabs is connected (API key from https://elevenlabs.io/app/developers/api-keys). Tell the user you can generate speech from text. Ask what they want voiced.

Run In Sauna

Used by

Bedtime Story Generatorskill

Email Roastskill

Inbox Bingoskill

skill:media.audio.generator