Install Llama Stack
This document describes how to install and deploy Llama Stack Server on Kubernetes using the Llama Stack Operator.
Upload Operator
Download the Llama Stack Operator installation file (e.g., llama-stack-operator.alpha.ALL.xxxx.tgz).
Use the violet command to publish to the platform repository:
Install Operator
-
Go to the
Administratorview in the Alauda Container Platform. -
In the left navigation, select
Marketplace/Operator Hub. -
In the right panel, find
Alauda build of Llama Stackand clickInstall. -
Keep all parameters as default and complete the installation.
Deploy Llama Stack Server
After the operator is installed, deploy Llama Stack Server by creating a LlamaStackDistribution custom resource:
Note: Prepare the following in advance; otherwise the distribution may not become ready:
- Secret: Create a Secret (e.g.,
deepseek-api) in the same namespace with the LLM API token. Example:kubectl create secret generic deepseek-api -n default --from-literal=token=<LLM_API_KEY>.- Storage Class: Ensure the
defaultStorage Class exists in the cluster; otherwise the PVC cannot be bound and the resource will not become ready.
After deployment, the Llama Stack Server will be available within the cluster. The access URL is displayed in status.serviceURL, for example: