← All modalities
Transcripts · Captions · Notes
STT (Speech → Text)
Soon
Convert speech to text — for transcripts, captions, call analytics, and voice input.
Overview
STT (Speech → Text)
Use it for meeting transcripts, video captions, call-center analytics, voice memo to text, and voice input. Multilingual, with word- and segment-level timestamps. Supports file upload and streaming.
- Endpoint
- /v1/audio/transcriptions
- Example model
- stt-large-v3
API
API example
curl
curl https://api.openalchemy.io/v1/audio/transcriptions \
-H "Authorization: Bearer $OPENALCHEMY_API_KEY" \
-H "X-Project-Id: $YOUR_PROJECT_ID" \
-F "model=stt-large-v3" \
-F "file=@meeting.m4a" \
-F "response_format=json"Status
Quota and pricing (per project)
Allotments and rate limits apply to the project that issued the API key. A second domain-scoped policy layer constrains which origins may invoke each modality.
START TODAY
Ready to turn inference cost into something closer to alchemy?
The free tier lets you spin up one project and run your first 1,000 requests with no credit card.