Bring your own agent — Claude Code, OpenClaw Hermes, Cline, Aider, Continue, or anything that speaks OpenAI. Point it at one base URL, drop in your key, and code against open models on shared GPUs.
OpenAlchemy APIOne OpenAI- and Anthropic-compatible inference layerapi.openalchemy.io
Open modelsLlama · Qwen · DeepSeek · Whisper · Flux
your agent → api.openalchemy.io → open-weight models on the Grid
THREE STEPS
From zero to your first request
1
Create a project & key
Sign in to the console, create a project, and mint an API key. The free tier needs no credit card.
2
Point your agent at us
Set the base URL and API key in your agent's settings — one OpenAI-compatible URL, or the Anthropic-compatible one for Claude Code.
3
Pick a model & build
Choose a model id such as llama-3.1-8b-instruct and start sending requests. Switch models any time without changing code.
TWO WAYS TO CONNECT
One key, two compatible surfaces
Most agents speak the OpenAI API. Claude Code speaks the Anthropic Messages API. OpenAlchemy exposes both, so whichever your tool uses, the same key works.
OpenAI-compatible
OpenAI-compatible
Drop this base_url and key into the OpenAI SDK or any OpenAI-compatible agent — Hermes, Cline, Aider, Continue, Codex CLI.
Base URL
https://api.openalchemy.io/v1
API key (env)
OPENALCHEMY_API_KEY
Anthropic-compatible
Anthropic-compatible
Set this as ANTHROPIC_BASE_URL so Claude Code routes its Messages API calls to OpenAlchemy with no proxy in between.
Base URL
https://api.openalchemy.io
API key (env)
OPENALCHEMY_API_KEY
AGENT SETUP
Copy-paste config for every agent
Each snippet uses the OPENALCHEMY_API_KEY you created and the example model llama-3.1-8b-instruct. Swap in any model from the catalog.
Claude Code
Anthropic-compatible
Anthropic's terminal-native coding agent. Route it through the Anthropic-compatible endpoint.
bash
# Claude Code speaks the Anthropic Messages API — point it at
# OpenAlchemy's Anthropic-compatible endpoint and run as usual.
export ANTHROPIC_BASE_URL="https://api.openalchemy.io"
export ANTHROPIC_AUTH_TOKEN="$OPENALCHEMY_API_KEY"
export ANTHROPIC_MODEL="llama-3.1-8b-instruct"
claude
Point Hermes' OpenAI-compatible provider at OpenAlchemy and choose a model.
bash
# Hermes uses an OpenAI-compatible provider. Export the base URL and
# key, then select the model inside Hermes.
export OPENAI_BASE_URL="https://api.openalchemy.io/v1"
export OPENAI_API_KEY="$OPENALCHEMY_API_KEY"
# Provider: OpenAI-compatible
# Base URL: https://api.openalchemy.io/v1
# Model: llama-3.1-8b-instruct
hermes
The VS Code autonomous coder (and the Roo Code fork). Use the 'OpenAI Compatible' provider.
ini
# VS Code → Cline (or Roo Code) settings → API Provider:
API Provider = OpenAI Compatible
Base URL = https://api.openalchemy.io/v1
API Key = $OPENALCHEMY_API_KEY
Model ID = llama-3.1-8b-instruct
Yes. The /v1 surface mirrors the OpenAI REST API — chat completions, embeddings, audio, and more — so existing OpenAI SDKs and tools work by changing only the base_url and key.
Can I use it with Claude Code?
Yes. OpenAlchemy also exposes an Anthropic-compatible endpoint. Set ANTHROPIC_BASE_URL to https://api.openalchemy.io and ANTHROPIC_AUTH_TOKEN to your key — no proxy needed.
Which models can I use?
Open-weight models across modalities — Llama, Qwen, DeepSeek, Gemma for chat, plus Whisper, embeddings, rerankers and more. List them at GET /v1/models and pass the id in your request.
Do I need a credit card?
No. The free tier lets you create one project and run your first 1,000 requests without a card.
How is billing and access scoped?
Usage, quotas, and request logs are scoped per project; rate limits and access control apply per domain. Your key carries that scope automatically.
Does streaming work?
Streaming responses (SSE) are rolling out — some endpoints currently return 501 for stream:true. Non-streaming requests are fully supported today.
START BUILDING
Wire up your agent in minutes
Create a free project, grab a key, and point your coding agent at open models running on the Grid.