Documentation Index
Fetch the complete documentation index at: https://docs.asgcompute.com/llms.txt
Use this file to discover all available pages before exploring further.
ASG Inference
Access state-of-the-art AI models with per-token pricing and instant access.Overview
ASG Inference provides:- 100+ models — GPT-5.2, Claude Sonnet 4, Gemini 2.5 Pro, DeepSeek R1 and more
- OpenAI-compatible — Drop-in replacement
- Per-token billing — Pay exactly for usage
- Automatic fallback — Reliability across providers
Quick Example
Available Models
Pass the full model identifier in themodel parameter:
| Model | Best For | Cost |
|---|---|---|
openai/gpt-4o-mini | Quick responses, chat | $ |
openai/gpt-4.1 | General purpose | $$ |
openai/gpt-5.2 | Complex reasoning | $$$ |
anthropic/claude-sonnet-4 | Coding, analysis | $$$ |
google/gemini-2.5-pro | Multimodal, long context | $$ |
deepseek/deepseek-r1 | Math, reasoning | $$ |
Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
model | string | Yes | Model identifier (see table above) |
messages | array | Yes | Conversation messages |
max_tokens | number | No | Max output tokens (default: 1024) |
temperature | number | No | Randomness (0-2, default: 1) |
stream | boolean | No | Enable streaming (default: false) |
Response
Streaming
For real-time responses, setstream: true in arguments. Streaming responses use Server-Sent Events.