Instant Clusters¶
You can now deploy high-performance GPU training clusters with Infiniband interconnect from your Verda Cloud Console, the same way you would deploy a single GPU instance.
The only available contract length is: Pay As You Go.
Instant clusters are available with Nvidia H200 SXM5, Nvidia B200 SXM6, or Nvidia B300 GPUs. Each worker node has eight InfiniBand links — 400 Gb/s each on H200 and B200 (3.2 Tb/s per node), or 800 Gb/s each on B300 (6.4 Tb/s per node) — plus a 100 Gbit/s Ethernet network. The uplink to the Internet is symmetric 2 Gb/s.
Our instant clusters range from 16 to 128 GPUs. Each cluster has up to 16 worker nodes, with 8 GPUs per worker node, one jump host and a service node. Each worker node has local NVMe storage and access to a configurable shared filesystem with up to 50TB of storage.
For larger cluster setups contact our support.
Clusters have Slurm or Kubernetes pre-installed for easy job management and Grafana dashboard for monitoring and alerts. The instant clusters are currently available in FIN-03 location.
View more: