AI Infrastructure You Actually Own

Q: When will Alpha Cube ship?

Initial run units are scheduled to begin shipping late 2026. Reservations are filled in the order they are received; we’ll confirm a delivery window with you once your unit enters assembly.

Q: How does the air-gapped security actually work?

The Alpha Cube and Alpha Cube Pro only ship with a wired network surface: LAN Ethernet you control. All inbound files, including models and datasets, move through an air lock workflow. Data is staged on a separate quarantine partition, scanned, and explicitly released into the protected cube environment under the Vault security specifications. Authentication runs locally, so a network outage or compromised third-party software can never lock you out of your own computer.

Q: Which models does Vault OS support?

Vault OS runs current frontier-class and open-weight models locally where weights or approved runtimes are available, including Claude Opus 4.7, Claude Sonnet 4.6, GPT-5.4, Gemini 3.1, Llama 4, DeepSeek V4, Qwen 3.5, Mistral Large 3, Gemma 4, and your own fine-tunes. Vault OS manages agentic AI, model deployment, chat sessions, and background jobs including training, evaluation, and ingestion. Dev Mode provides direct system access for custom configurations.

Q: What is the refund or cancellation policy for pre-orders?

Pre-order deposits are non-refundable. They reserve your place in the production queue and go directly to the production costs of your unit. The remaining balance is invoiced prior to shipment.

Q: How does the ROI compare to cloud API spending?

For a team of heavy AI users, a single Alpha Cube typically pays for itself within the first 12 to 18 months versus blended cloud spend, then continues running at zero marginal token cost for the life of the hardware. See the calculator above.

Q: What happens if a cube fails or needs service?

The Vault AI Systems warranty covers hardware failures and issues for 24 months. Additional services or advanced issues will be handled by our support team.

Vault Alpha Cube runs open-weight AI models inside your network, built to the same standard as the data it protects. Plug it into your network, run cutting edge AI models locally, and eliminate data exposure from the outside world.

Pre-order See the platform

Security

Your AI. Your Data. Your Network.

Cloud AI sends prompts, documents, and proprietary data to infrastructure outside your control. For regulated teams, that transfer can put attorney-client privilege, compliance posture, financial records, and medical data at risk. Vault Alpha Cube keeps inference inside your business, behind your firewall, and under your authority.

Physical Isolation

No wireless radios. Ethernet only.

Air-lock media intake

Model weights, datasets, and files are staged, scanned, and released only after approval.

Physical Access Control

Locking chassis. Tamper-evident seals.

Local Authentication

No SSO round-trips through someone else’s identity provider.

Vault OS

Meet GEM, your assistant inside Vault OS.

Prompt, follow up, and hand off heavier work the way you would with any modern LLM. GEM searches your files, queues background jobs, manages models, and handles fine-tuning with the same instincts you use in cloud chat tools, except the conversation never leaves your network.

Pricing

Same chassis.
Same boundary.

Both models use the same 18-inch anodized aluminum enclosure and the same air-gapped architecture. Choose the compute footprint that matches your workload.

Alpha Cube

$42,950

$5,000 deposit to pre-order

For teams running frontier 70B-class models with strong concurrency. The default starting point for most organizations.

2× NVIDIA RTX 5090
32-core Threadripper Pro
256 GB RAM
8 TB local NVMe storage
Single PSU · standard power

Pre-order Alpha Cube

Alpha Cube Pro

$56,950

$5,000 deposit to pre-order

Doubles the compute headroom for organizations training larger models, running heavier concurrent workloads, or hosting more agents.

4× NVIDIA RTX 5090
32-core Threadripper Pro
512 GB RAM
8 TB local NVMe storage
Single PSU · enhanced power

Pre-order Alpha Cube Pro

Platform

The full platform, at a glance.

Six pieces that turn an aluminum enclosure into a working AI environment: an air-gapped network posture, dense local compute, Vault OS for running models, fixed-cost token economics, multi-cube clustering, and an on-chassis status display.

01 / Air-Gapped

Data never leaves the building.

Disconnected by design. Every model, prompt, document, and inference stays behind your own firewall. Nothing about your work product is uploaded, mirrored, or sent back as telemetry.

02 / Dense Compute

Frontier-class horsepower in an eighteen-inch chassis.

Frontier-grade performance in an 18-inch cube. Alpha Cube Pro scales to four NVIDIA GeForce RTX 5090 Founders Edition GPUs and 256 GB of RAM, enough headroom for 70B-class models and concurrent agent workloads.

06 / Display

Uptime hours 46

Tokens 12.4M

Utilization 63%

Node-2 Working

Temperature 135°

Status and thermals at a glance.

An on-device AMOLED display surfaces what’s running, who’s connected, and how hot the silicon is.

05 / Clustering

More cubes, one local resource pool.

Run multiple cubes side by side and Vault OS pools their compute as one local resource for larger deployments.

04 / TOKENOMICS

Own the hardware instead of metered inference.

Cloud AI costs rise every time a team retries a prompt, adds an agent, or runs a larger job. Alpha Cube turns inference cost from recurring token billing into a one-time hardware purchase you already own.

03 / Vault OS

Llama, Mistral, Qwen, or your own weights.

Run frontier open-weight models such as Llama, Mistral, and Qwen, or upload your own fine-tunes. Vault OS handles deployment, agent orchestration, and background training jobs.

Calculator

Run the numbers on ownership.

Estimate annual cloud-AI spend, compare it with a one-time Vault hardware purchase, and see the configuration sized for your team.

AI-active employees

AI spend per employee, per year

Heavy daily AI use lands around $600/mo across Cursor, Claude, Copilot, and API overage.

$7,200 per heavy user per year

Heavy daily AI use, including coding agents, document research, and model evaluation, typically lands around $600 per month across Cursor, Claude, Copilot, and API overage. Quoted in nominal list pricing before customer-specific discounts. Capacity defaults assume 10 heavy concurrent users per Alpha Cube and 20 per Alpha Cube Pro.

Annual cloud spend

Recommended Vault cost

3-year savings

5-year savings

No cube recommended yet

Move the dial to compute your configuration.

Pre-order

FAQ

Everything you need to know.

When will Alpha Cube ship?

How does the air-gapped security actually work?

Which models does Vault OS support?

What is the refund or cancellation policy for pre-orders?

How does the ROI compare to cloud API spending?

What happens if a cube fails or needs service?

Pre-order Contact us