Local transcription
Audio is transcribed locally with Whisper-based models.
Free alpha · Windows · Local-first · Azure optional
Skald is a local-first speech-to-text and transcription app for Windows. Audio can be processed on the device by default. When cloud processing is needed, Skald can use Azure AI Foundry through your organization’s own endpoint.
Data boundary
Skald separates local transcription, cloud transcription, and optional text polishing into clear processing modes. Cloud processing is only used when configured through your organization’s Azure AI Foundry endpoint.
Audio is transcribed locally with Whisper-based models.
Audio is sent to your Azure AI Foundry transcription endpoint.
Audio stays local. Only transcript text is optionally sent for conservative cleanup.
Cloud transcription and polishing both run through your Azure endpoint.
Product
The interface is practical: record, review, save, and continue working.


Current status
Skald is available for evaluation and pilot feedback while distribution, installer experience, and enterprise operations are hardened.