Spark hardware
OpenAI-compatible endpoint
Base URL
—
Model ID
—
curl example
Swap in progress
0:00
Starting…
Show technical logs
Always-on services
LLM swap
Browse vLLM-compatible models
· NVFP4-quantized models (e.g.
RedHatAI/...) are best for Blackwell hardware
Where
For solo models, download to wherever you'll run them. For cluster models (-tp 2), both Sparks need the weights — "Both" downloads to one Spark and rsyncs to the other in parallel.
Downloading…
0:00
Connecting…
Show technical logs
Updates to eugr/spark-vllm-docker
— the upstream project that orchestrates vLLM on your Sparks (launch-cluster.sh, recipes, mods). These are not firmware, OS, or model updates.
Checking for updates…
Pending commits
Explained by the loaded LLM
Applying update…
0:00