Instance lifecycle
| Status | Description |
|---|---|
| awaiting_allocation | No allocation available on the capacity right now |
| running | VM is up and accessible |
| terminated | Allocation ended or instance was manually terminated |
| failed | Something went wrong |
awaiting_allocation starts running when the capacity’s allocation schedule covers
the current time. For example, if you buy compute starting at now, the instance starts within
about a minute. If the order starts in the future, the instance waits until then. The image
download and boot process takes up to 10 minutes before SSH is available.
Create an instance
Images
List available images to find one to use with your instance.Cloud-init
Pass a startup script or cloud-config YAML via--cloud-init to configure the VM on first boot.
Shell script — inject your SSH public key:
startup.yaml
Check instance status
--created-after and --created-before accept a datetime, Unix timestamp, or a duration like 24h or 7d (meaning 24 hours or 7 days ago, respectively). You can also use --since and --until as aliases.
Get details on a specific instance.
SSH into an instance
The instance must be in
running status and the image must be fully booted. This can take up to 10
minutes after the instance starts.View logs
--after and --before accept flexible time expressions: "now", "in -1h", a date like mar 1,
ISO 8601, or a Unix timestamp.
Terminate an instance
Stop the VM immediately. The instance moves toterminated status.
Replace an instance
Replace an instance with a new one on the same capacity. The original instance is terminated and a new instance is created with the specified image.Cloud-init is not carried over from the original instance. If you omit
--cloud-init, the
replacement boots with no startup script and you won’t be able to SSH into it.Delete an instance
Permanently remove an instance.Replace an instance
Replace an instance with a new one on the same capacity and SKU. The original instance is terminated and a fresh instance is created with a new image (and optionally a new startup script and name).If you omit
--cloud-init, the replacement starts with no startup script — you will not be able to SSH into it. Pass a cloud-init file to configure SSH access.Instance priority
When a capacity’s quota drops below the number of running instances, the system terminates instances to match. Instance priority controls which ones survive. Higher-priority instances are kept longer; lower-priority ones are terminated first. Priority has four arms, orderedyield < normal < preferred < critical. Every instance defaults to
normal. The predicted termination time is returned in expected_shutdown_at on the instance
response and already reflects priority.
| Arm | Meaning |
|---|---|
yield | Surrender this instance first when capacity shrinks. |
normal | Default. No special preference. |
preferred | Keep this instance over normal ones when possible. |
critical | Keep this instance over everything else. |
priority_level in the body of
POST /preview/v2/instances to set it at create time.
priority_level to default to normal.
Update one instance with PATCH /preview/v2/instances/{id}.
priority_level to leave it unchanged. Pass "normal" to reset to the default.
Update many instances atomically by PATCHing the collection.
422 and no instance is touched. Up to 10,000 entries per call. An empty data
returns 200 with no writes. When the same id appears more than once, the last occurrence wins.
The response includes the updated instance for each entry, so you can confirm the result in one
round-trip.
When two instances have the same priority, the newer one is terminated first.
critical and procurement automations
critical cannot be set on an instance whose capacity has an active procurement automation —
procurement-backed capacities resize automatically, so critical can’t be honored there. The
constraint applies in both directions:
- Setting
priority_level: criticalon such an instance returns422. - Creating a procurement on a capacity that already has
criticalinstances returns422— downgrade the affected instances (for example, topreferred) first.
preferred instead when the capacity is procurement-backed, or disable the procurement before
setting critical.
Setting priority requires the Node: Write permission. Read-only tokens see priority_level on GET
responses but receive 403 on writes.
Limitations
- No persistent storage. If the underlying machine dies, you get a replacement but not the same disk.
- No public IPs. To serve inference, set up a proxy in another cloud and connect it via VPN.
- No InfiniBand.
- Instances don’t share a VPC or VLAN. Configure a VPN if you need connectivity between instances.
- Boot time can be up to 10 minutes for image download and startup.
- No GPU monitoring. GPUs may occasionally fall off the bus — terminate and recreate the instance, or contact us.