Skip to main content

ASG GPU

🔜 Coming Soon — Dedicated GPU provisioning is on the roadmap. The tools described below are planned but not yet active. See Tool Availability for current status.
Rent dedicated GPU instances on-demand with per-second billing via MCP tool calls.

Planned Capabilities

  • H100 80 GB, A100 80 GB, RTX 4090 24 GB — Premium GPUs for every workload
  • Per-second billing — No hourly minimums, pay only for what you use
  • Instant provisioning — Target <30 seconds from tool call to SSH
  • SSH access — Full root control of your pod
  • Multi-GPU — Scale to 8× GPU nodes for large training runs

Planned Tools

ToolDescriptionStatus
gpu_provisionProvision a GPU pod🔜 Coming Soon
gpu_heartbeatExtend a lease🔜 Coming Soon
gpu_terminateTerminate a pod🔜 Coming Soon

Available Today

While GPU provisioning is coming soon, you can use these tools right now:
  • inference_chat — AI completions with 100+ models
  • sandbox_execute — Isolated code execution
  • optify_vram_estimate — Estimate VRAM requirements for models

Pricing

GPU pricing will be announced when the service launches. See Pricing.
Interested in early access? Contact us at hello@asgcompute.com to join the GPU waitlist.