Deploy applications and background jobs on your SoraNova cluster
This example uses a backend image (my-backend:latest) and a frontend image (my-frontend:latest), defined in my-app.hcl.
The ${service.backend} variable injects the backend’s endpoint URL into the frontend container as the BACKEND_URL environment variable.

To deploy:
The daemon collects logs from /var/log and Docker, and can be configured to push them to an external HTTP endpoint.
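To make the "external HTTP endpoint" concrete, here is a minimal sketch in Python of a receiver that accepts pushed logs over HTTP POST. Everything specific here is an assumption for illustration: the port, the LogSink and make_sink names, and the idea that each request body is stored as opaque text. The source does not state which payload format or path the daemon uses.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer

# In-memory buffer, for illustration only; a real sink would persist or forward.
received = []


class LogSink(BaseHTTPRequestHandler):
    """Hypothetical receiver: stores each POSTed body as one text entry."""

    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        payload = self.rfile.read(length)
        received.append(payload.decode("utf-8", errors="replace"))
        self.send_response(204)  # accepted, no response body
        self.end_headers()

    def log_message(self, format, *args):
        pass  # silence the default per-request stderr logging


def make_sink(port=9000):
    """Build the server; port 9000 is an arbitrary choice, 0 picks a free port."""
    return HTTPServer(("0.0.0.0", port), LogSink)
```

To run it, call make_sink().serve_forever() and point the daemon's HTTP output at that host and port.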
The GPU example uses a backend image (my-gpu-backend:latest) and a frontend image (my-frontend:latest), defined in my-gpu-app.hcl.
Note: Your custom images must each implement a health check endpoint that returns a 200 status code when the service is healthy. The orchestration platform uses this endpoint to determine when your service is up and ready to receive traffic.

The ${service.gpu-backend.api.url} variable injects the backend’s endpoint URL into the frontend container as the BACKEND_URL environment variable.

To deploy:
The daemon example is defined in gpu-echo-daemon.hcl.
You can replace echo-server:latest with any image you want. The daemon will run on all nodes with a GPU that meets the specified requirements. When new machines are provisioned, they will automatically start running this daemon if they have the required resources.
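The placement rule described above can be pictured with a small conceptual model in Python. This is not the platform's scheduler: the Node class, the is_eligible and nodes_running_daemon functions, and the GPU model names and memory sizes are all hypothetical, chosen only to show how "every node whose GPU meets the requirements" behaves when a new machine joins.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Node:
    name: str
    gpu_model: Optional[str]  # None means the node has no GPU
    gpu_memory_gib: int = 0


def is_eligible(node: Node, min_memory_gib: int, model: Optional[str] = None) -> bool:
    """A node qualifies if it has a GPU, matches the model (if one is given),
    and has at least the requested GPU memory."""
    if node.gpu_model is None:
        return False
    if model is not None and node.gpu_model != model:
        return False
    return node.gpu_memory_gib >= min_memory_gib


def nodes_running_daemon(nodes: List[Node], min_memory_gib: int,
                         model: Optional[str] = None) -> List[str]:
    """Names of all nodes on which the daemon would be placed."""
    return [n.name for n in nodes if is_eligible(n, min_memory_gib, model)]


cluster = [Node("cpu-1", None), Node("gpu-1", "A100", 80), Node("gpu-2", "T4", 16)]
print(nodes_running_daemon(cluster, min_memory_gib=40))  # ['gpu-1']

# A newly provisioned machine that meets the requirements is picked up as well.
cluster.append(Node("gpu-3", "H100", 80))
print(nodes_running_daemon(cluster, min_memory_gib=40))  # ['gpu-1', 'gpu-3']
```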
To deploy:
Use the service block for frontend/backend workloads. Use ${service.<name>} to inject service URLs.
Use the resources block to control CPU and memory usage.
Use the device block to specify GPU requirements. You can target specific GPU models or specify memory requirements.
Set sharing_strategy = "mps" to enable CUDA MPS for multi-process sharing on the same GPU.
Each workload is defined in its own HCL file (for example, my-app.hcl or vector-daemon.hcl).
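On the container side, two of the points in this guide translate into a few lines of application code: reading the injected BACKEND_URL environment variable, and serving a health check endpoint that returns 200. The following is a minimal sketch in Python using only the standard library. The /health path, port 8080, the fallback URL, and the function names are assumptions for illustration; the source does not specify which path the platform probes, so match whatever your workload definition declares.

```python
import os
from http.server import BaseHTTPRequestHandler, HTTPServer

HEALTH_PATH = "/health"  # assumption: the probed path is not given in this guide


def get_backend_url(default="http://localhost:8000"):
    """Return the backend URL injected as BACKEND_URL, or a local fallback."""
    return os.environ.get("BACKEND_URL", default)


class Handler(BaseHTTPRequestHandler):
    def do_GET(self):
        if self.path == HEALTH_PATH:
            status, body = 200, b"ok"  # healthy: ready to receive traffic
        else:
            status, body = 404, b"not found"
        self.send_response(status)
        self.send_header("Content-Type", "text/plain")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        pass  # silence the default per-request stderr logging


def make_server(port=8080):
    """Build the server; port 8080 is an arbitrary choice, 0 picks a free port."""
    return HTTPServer(("0.0.0.0", port), Handler)
```

To run it inside a container, call make_server().serve_forever() as the entry point and use get_backend_url() wherever the frontend needs to reach the backend.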