All Rights Reserved 2026 © The Grid
Available Instruments
Each instrument is served by multiple qualifying models that meet its specifications. Your /chat/completions API requests are routed to the lowest-cost provider and model that satisfies those specifications. All instruments have zero data retention and support:
TEXT TO TEXT
STREAMING
FUNCTION CALLING
JSON MODE
Instrument      Quality   Latency   Throughput   Price (30D Average)
text-max        ≥ 53      ≤ 3.5     ≥ 30         --
text-prime      ≥ 38      ≤ 4.62    ≥ 40         --
text-standard   ≥ 18      ≤ 1.32    ≥ 100        --
Quality benchmark sourced from Artificial Analysis. Token prices on The Grid are variable.
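The routing rule described above (the cheapest model that meets an instrument's specifications) can be illustrated with a small sketch. This is not The Grid's actual routing code: the thresholds come from the table, but the `Spec` and `Candidate` types, the `route` function, and every candidate model name, metric, and price are hypothetical. The source does not state the units for latency or throughput, so the values are compared as bare numbers.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class Spec:
    """Thresholds for one instrument, copied from the table above."""
    min_quality: float
    max_latency: float      # units not stated in the source
    min_throughput: float   # units not stated in the source


@dataclass(frozen=True)
class Candidate:
    """A hypothetical provider/model pair with measured metrics and a price."""
    name: str
    quality: float
    latency: float
    throughput: float
    price: float  # hypothetical; real prices on The Grid are variable


SPECS = {
    "text-max": Spec(min_quality=53, max_latency=3.5, min_throughput=30),
    "text-prime": Spec(min_quality=38, max_latency=4.62, min_throughput=40),
    "text-standard": Spec(min_quality=18, max_latency=1.32, min_throughput=100),
}


def qualifies(candidate: Candidate, spec: Spec) -> bool:
    """True if the candidate meets all three thresholds of the spec."""
    return (
        candidate.quality >= spec.min_quality
        and candidate.latency <= spec.max_latency
        and candidate.throughput >= spec.min_throughput
    )


def route(instrument: str, candidates: Iterable[Candidate]) -> Optional[Candidate]:
    """Return the cheapest qualifying candidate, or None if none qualifies."""
    spec = SPECS[instrument]
    eligible = [c for c in candidates if qualifies(c, spec)]
    return min(eligible, key=lambda c: c.price, default=None)
```

Under these assumptions, a model that is cheap but falls short on any one threshold is never selected for that instrument, while the same model can still win on price for an instrument with looser thresholds.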
Getting Started
Everything you need to start using The Grid in your inference workflows. Create an API key and make your first request.
Make your first API call
cURL
Make your first chat completions request:
curl -L https://api.thegrid.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer <YOUR_API_KEY>" \
  -d '{
    "model": "text-max",
    "max_tokens": 50,
    "messages": [
      {"role": "system", "content": "Be extremely brief. One sentence max."},
      {"role": "user", "content": "Why does AI inference need a market?"}
    ]
  }'
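For readers working in Python, the cURL request above might translate to something like the following sketch, which uses only the standard library. The URL, headers, and request body mirror the cURL example. The helper names (`build_request`, `extract_reply`, `ask`) are hypothetical, and `extract_reply` assumes the response follows the common OpenAI-style chat completions shape (`choices[0].message.content`), which this page does not confirm.

```python
import json
import urllib.request

API_URL = "https://api.thegrid.ai/v1/chat/completions"


def build_request(
    api_key: str,
    user_message: str,
    model: str = "text-max",
    max_tokens: int = 50,
) -> urllib.request.Request:
    """Build the same POST request as the cURL example, without sending it."""
    payload = {
        "model": model,
        "max_tokens": max_tokens,
        "messages": [
            {"role": "system", "content": "Be extremely brief. One sentence max."},
            {"role": "user", "content": user_message},
        ],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def extract_reply(response_body: bytes) -> str:
    """Pull the assistant text out of a response body.

    ASSUMPTION: the body is OpenAI-style JSON with the text at
    choices[0].message.content. Adjust if the real response differs.
    """
    data = json.loads(response_body)
    return data["choices"][0]["message"]["content"]


def ask(api_key: str, user_message: str) -> str:
    """Send the request and return the reply text (performs a network call)."""
    request = build_request(api_key, user_message)
    with urllib.request.urlopen(request, timeout=30) as response:
        return extract_reply(response.read())
```

Calling `ask("<YOUR_API_KEY>", "Why does AI inference need a market?")` would send the request. Keeping `build_request` separate from the network call makes the payload easy to inspect or log before anything is sent.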