Transcripts · Captions · Notes

STT (Speech → Text)

Status: coming soon

Convert speech to text for transcripts, captions, call analytics, and voice input.

Overview

Use it for meeting transcripts, video captions, call-center analytics, voice-memo transcription, and voice input. The service is multilingual, returns word- and segment-level timestamps, and accepts audio either as a file upload or as a stream.

Endpoint: /v1/audio/transcriptions
Example model: stt-large-v3

API

API example

curl

curl https://api.openalchemy.io/v1/audio/transcriptions \
  -H "Authorization: Bearer $OPENALCHEMY_API_KEY" \
  -H "X-Project-Id: $YOUR_PROJECT_ID" \
  -F "model=stt-large-v3" \
  -F "file=@meeting.m4a" \
  -F "response_format=json"
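
The request above can be wrapped in a small shell function that checks its inputs before calling the API. This is only a sketch: the function name `transcribe` is hypothetical, and the assumption that the JSON response carries the transcript in a top-level `text` field is not confirmed by this page. It also assumes `jq` is installed.

```shell
#!/usr/bin/env bash
# Sketch: wrap the transcription request with basic input checks.
# Assumptions (not confirmed by this page): the JSON response has a
# top-level "text" field, and jq is available on the PATH.

transcribe() {
  local audio_file="$1"
  local response

  # Fail early on problems that would otherwise surface as opaque HTTP errors.
  if [ -z "${OPENALCHEMY_API_KEY:-}" ]; then
    echo "OPENALCHEMY_API_KEY is not set" >&2
    return 2
  fi
  if [ ! -f "$audio_file" ]; then
    echo "audio file not found: $audio_file" >&2
    return 3
  fi

  # --fail makes curl exit non-zero on HTTP 4xx/5xx responses;
  # -sS hides the progress meter but still prints error messages.
  response=$(curl -sS --fail https://api.openalchemy.io/v1/audio/transcriptions \
    -H "Authorization: Bearer $OPENALCHEMY_API_KEY" \
    -H "X-Project-Id: ${YOUR_PROJECT_ID:-}" \
    -F "model=stt-large-v3" \
    -F "file=@${audio_file}" \
    -F "response_format=json") || return 4

  # Print only the transcript text (assumed field name).
  printf '%s\n' "$response" | jq -r '.text'
}
```

After sourcing the script, `transcribe meeting.m4a > meeting.txt` would write the transcript to a file. The distinct return codes (2 for a missing key, 3 for a missing file, 4 for a failed request) let a calling script tell configuration mistakes apart from API errors.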

Status

Quota and pricing (per project)

Allotments and rate limits apply to the project that issued the API key. A second domain-scoped policy layer constrains which origins may invoke each modality.

Read the docs for this modality →

START TODAY

Ready to turn inference cost into something closer to alchemy?

The free tier lets you spin up one project and run your first 1,000 requests with no credit card.
