Skip to content

Instant Clusters

You can now deploy high-performance GPU training clusters with Infiniband interconnect from your Verda Cloud Console, the same way you would deploy a single GPU instance.

The only available contract length is: Pay As You Go.

Instant clusters are available with Nvidia H200 SXM5, Nvidia B200 SXM6, or Nvidia B300 GPUs. Each worker node has eight InfiniBand links — 400 Gb/s each on H200 and B200 (3.2 Tb/s per node), or 800 Gb/s each on B300 (6.4 Tb/s per node) — plus a 100 Gbit/s Ethernet network. The uplink to the Internet is symmetric 2 Gb/s.

Our instant clusters range from 16 to 128 GPUs. Each cluster has up to 16 worker nodes, with 8 GPUs per worker node, one jump host and a service node. Each worker node has local NVMe storage and access to a configurable shared filesystem with up to 50TB of storage.

For larger cluster setups contact our support.

Clusters have Slurm or Kubernetes pre-installed for easy job management and Grafana dashboard for monitoring and alerts. The instant clusters are currently available in FIN-03 location.

View more:

Deploying an Instant cluster

Slurm

Environments

Containers

Monitoring

Good to know

Local Users

Release Notes