Cloud GPUs

NVIDIA RTX 4090 cloud GPUs. From €- per hour. In France, UAE, and USA.

A100 performance for the price of a 4090. Per-second billing. No queues. No egress fees.

A100 performance for the price of a 4090

Single-GPU benchmark on Hivenet infrastructure:

TTFT: 349.9 ms at 1 req/s (single-request baseline).

Peak throughput: 737 tokens/s while delivering 737 tokens/s under sustained load.¹

¹ Benchmark methodology and test conditions here.

RTX 4090 specs

Spec

Value

Why it matters

Architecture

Ada Lovelace

4nm process — efficient under sustained heavy load

Memory

24 GB GDDR6X

Fits Llama-3 70B (4-bit quantization) on a single card

Bandwidth

1,008 GB/s

Prevents tensor stalls on large batch inference

FP16 throughput

165 TFLOPS

Headroom for diffusion models at 1024×1024

TDP

450 W

Lower than an A100 40 GB at equivalent inference throughput

Swipe left to see more

Launch a 4090 now →

Popular use cases

Fine‑tune large language models

Start a QLoRA run in under 60 seconds. Pause and resume anytime — no charge for idle time.

Train diffusion and video models

24 GB VRAM supports 14 GB KV cache at full precision. No quantization required for most diffusion models at 1024×1024.

Run private chatbots

Inference stays in your account. No third-party API logs.

Upscale long‑form video

1,008 GB/s memory bandwidth handles 4K frames without I/O stalls.

Ready to launch?

RTX 4090

- - - /h

1 × - 8 ×

vCPU - - - GB

RAM - - - GB

Disk space - - - GB

Bandwidth - Mb/s

Per-second billing. No egress fees. Storage included.

Questions?

Reach us at support@hivenet.com or through the in-app chat.