Rent it · Buy it

GPU infrastructure
for AI teams.

Rent GPUs by the second when you need them, or buy the hardware you need.

Per-second billing Hardware you own

NVIDIA A100H100RTX 4090vLLMPyTorchCUDA 12DockerOllamaComfyUIJupytergVisor

Two ways in

Rent the compute, or own the hardware

Some teams just need GPUs for a job and want them gone by morning. Others want racks they control. We do both.

Pay for access

Rent Compute

For teams that need GPUs now and don't want to own anything. Start a node in seconds and only pay for the seconds you use.

Serving LLMs and model endpoints
Training and fine-tuning runs
Batch and data jobs

Own the equipment

Buy Hardware

For teams that would rather own the machines. We find the parts, build the boxes, test them, and ship them to your rack.

GPUs and full GPU servers
RAM, storage, and networking
Complete builds, configured to spec

Nodes online

Jobs completed

Running now

99.9%

Uptime

The platform

Your GPU control plane

See what's online, launch it in a click, and run every endpoint, job, and invoice from one place.

hz-ai.io/marketplace

HZ AI

Marketplace

Endpoints

Jobs

Billing

Marketplace

128 GPUs online across 14 regions

All regionsAny GPU

NVIDIA H100 80GBSXM5

16 vCPU · 192GB · Frankfurt

online$2.89/hrDeploy

NVIDIA A100 80GBPCIe

12 vCPU · 128GB · Oregon

online$1.89/hrDeploy

RTX 4090 24GBAda

8 vCPU · 64GB · Singapore

online$0.54/hrDeploy

How it works

Three steps to a running GPU

Live marketplace

H100 80GB

Frankfurt

$2.89/hrSelected

A100 80GB

Oregon

$1.89/hr

RTX 4090

Singapore

$0.54/hr

Use cases

What people actually run on it

Fine-tuning, rendering, inference, batch jobs. The heavy stuff, at a fraction of cloud prices.

LLM Inference

Deploy vLLM, Ollama, or TGI endpoints. Serve Llama 3, Mixtral, and Phi on real GPUs.

vLLMOllamaTGILlama 3

Model Fine-Tuning

LoRA or full training runs. Persistent volumes keep your checkpoints safe between sessions.

LoRAPyTorchDeepSpeed

Image Generation

Stable Diffusion, ComfyUI, and more. Serve as an endpoint or batch thousands of images.

SDXLComfyUIFlux

Video & 3D Rendering

Blender, Unreal, or custom pipelines distributed across multiple GPU nodes.

BlenderUnrealOptiX

Data Processing

RAPIDS, GPU Spark, or custom ETL. Process terabytes with GPU acceleration.

RAPIDSSparkDask

Research

Jupyter, PyTorch, experiment tracking. SSH in and work like it's a local machine.

JupyterPyTorchW&B

Live marketplace

Available right now

Buy Hardware

Prefer to own? We sell the hardware too.

GPUs, servers, RAM, storage, networking, or a whole node, built and tested to your spec. We send an itemized quote before anything ships.

GPUs Servers RAM Storage Networking Full builds

Security

Built so tenants can't touch each other

Isolation and zero-trust at every layer, on by default.

gVisor Sandbox

Optional userspace syscall interception — the same isolation model as Google Cloud Run.

Container Isolation

Per-job containers, dropped capabilities, read-only rootfs, enforced PID limits.

Constant-Time Auth

SHA-256 hashed API keys with constant-time validation and instant revocation.

Automatic TLS

Caddy provisions HTTPS automatically. HSTS enforced, mTLS between control planes.

Per-Second Billing

Real provider pricing, 5% platform fee, transparent invoicing with CSV export.

Memory-Safe Agent

Rust agent, zero GC pauses, Docker via library — no shell injection surface.

Rate Limiting

Per-buyer limits and idempotency keys, Redis-backed for sub-millisecond checks.

Full Observability

Prometheus metrics, Grafana dashboards, and alerting rules out of the box.

For GPU owners

Have GPUs? Put them to work.

Install the agent, set your price, and earn whenever a buyer runs a job on your hardware. You keep 95% of every transaction.

One-command install on Ubuntu / Debian
Set your own price per GPU-hour
Auto-detected specs shown to buyers
Fiat payouts, transparent earnings

provider setup

$ curl -fsSL https://hz-ai.io/install.sh | bash
✓ Installing HZ AI Agent v0.1.0…
✓ Detected 2× NVIDIA A100 80GB
✓ Agent registered and running

$ hzai-agent --price 1.89
Listing updated · $1.89 / gpu-hr
Waiting for jobs…

↳ Job received · buyer@corp.io
Earned $3.78 (2 GPU-hours)▋

FAQ

Questions, answered

You prepay with credits. When a job runs, compute is billed per second at the provider's GPU-hour rate. Unused credits stay in your account.

Yes. On-demand instances and dedicated reservations give you full SSH access. Your public key is configured in Settings.

Whatever providers list — today that spans consumer cards (RTX 4090, 3090) and data-center GPUs (A100, H100, A10G). The marketplace shows real-time availability.

Each job runs in an isolated container with a read-only root filesystem. Volumes are scoped per buyer and pinned per node, so no other buyer can reach your data.

Register a provider account, run the one-command installer on your Linux machine, set your price, and you're live. The agent auto-detects your GPU specs.

Any public image from Docker Hub, NVIDIA NGC, or GitHub Container Registry. Private-registry support is on the roadmap.

Spin up your first GPU
in under a minute.

Create an account and deploy your first workload in seconds.

GPU infrastructure
for AI teams.

Rent the compute, or own the hardware

Rent Compute

Buy Hardware

Your GPU control plane

Three steps to a running GPU

Pick a node

Deploy

Pay per second

What people actually run on it

LLM Inference

Model Fine-Tuning

Image Generation

Video & 3D Rendering

Data Processing

Research

Available right now

Prefer to own? We sell the hardware too.

Built so tenants can't touch each other

gVisor Sandbox

Container Isolation

Constant-Time Auth

Automatic TLS

Per-Second Billing

Memory-Safe Agent

Rate Limiting

Full Observability

Have GPUs? Put them to work.

Questions, answered

Spin up your first GPU
in under a minute.

GPU infrastructurefor AI teams.

Rent the compute, or own the hardware

Rent Compute

Buy Hardware

Your GPU control plane

Three steps to a running GPU

Pick a node

Deploy

Pay per second

What people actually run on it

LLM Inference

Model Fine-Tuning

Image Generation

Video & 3D Rendering

Data Processing

Research

Available right now

Prefer to own? We sell the hardware too.

Built so tenants can't touch each other

gVisor Sandbox

Container Isolation

Constant-Time Auth

Automatic TLS

Per-Second Billing

Memory-Safe Agent

Rate Limiting

Full Observability

Have GPUs? Put them to work.

Questions, answered

Spin up your first GPUin under a minute.

GPU infrastructure
for AI teams.

Spin up your first GPU
in under a minute.