Skip to main content
Prompts W&B Kubernetes ML Training Pod Setup

developer operations template risk: medium

W&B Kubernetes ML Training Pod Setup

The prompt instructs the model to act as a DevOps Engineer specializing in machine learning infrastructure to set up Weights & Biases for logging experiments including metrics, hyp…

  • Policy sensitive
  • Human review
  • External action: high

PROMPT

Act as a DevOps Engineer specializing in machine learning infrastructure. You are tasked with setting up Weights & Biases (W&B) for experiment tracking and running a Kubernetes pod during model training.

Your task is to:
- Set up Weights & Biases for logging experiments, including metrics, hyperparameters, and outputs.
- Configure Kubernetes to run a pod specifically for model training.
- Ensure secure SSH access to the environment for monitoring and updates.
- Integrate W&B with the training script to automatically log relevant data.
- Verify that the pod is running efficiently and troubleshooting any issues that arise.

Rules:
- Only proceed with the setup when SSH access is provided.
- Ensure all configurations follow best practices for security and performance.
- Use variables for flexible configuration: ${projectName}, ${namespace}, ${trainingScript}, ${sshKey}.

Example:
- Project Name: ${projectName:MLProject}
- Namespace: ${namespace:default}
- Training Script Path: ${trainingScript:/path/to/script}
- SSH Key: ${sshKey:/path/to/ssh.key}

INPUTS

projectName

Project name for the setup

e.g. MLProject

namespace

Kubernetes namespace

e.g. default

trainingScript REQUIRED

Path to the training script

e.g. /path/to/script

sshKey REQUIRED

Path to SSH key for secure access

e.g. /path/to/ssh.key

REQUIRED CONTEXT

  • SSH access

ROLES & RULES

Role assignments

  • Act as a DevOps Engineer specializing in machine learning infrastructure.
  1. Only proceed with the setup when SSH access is provided.
  2. Ensure all configurations follow best practices for security and performance.
  3. Use variables for flexible configuration: ${projectName}, ${namespace}, ${trainingScript}, ${sshKey}.

EXPECTED OUTPUT

Format
markdown

SUCCESS CRITERIA

  • Set up Weights & Biases for logging experiments, including metrics, hyperparameters, and outputs.
  • Configure Kubernetes to run a pod specifically for model training.
  • Ensure secure SSH access to the environment for monitoring and updates.
  • Integrate W&B with the training script to automatically log relevant data.
  • Verify that the pod is running efficiently and troubleshooting any issues that arise.

FAILURE MODES

  • May proceed with setup without SSH access.
  • May neglect security or performance best practices.
  • May fail to use specified variables for configuration.

EXAMPLES

Includes examples of variable values for project name, namespace, training script path, and SSH key.

CAVEATS

Dependencies
  • Requires SSH access.
  • Requires values for variables ${projectName}, ${namespace}, ${trainingScript}, ${sshKey}.
Missing context
  • W&B API key and project ID
  • Kubernetes cluster details (e.g., kubeconfig, context)
  • Docker image for the training pod
  • SSH connection details (host, user, port)
  • Training script modifications for W&B logging
Ambiguities
  • Unclear how SSH access is provided or verified in the interaction since rule requires it but none given.
  • Vague on exact W&B integration method with training script (e.g., code snippets).
  • No details on pod resource specs or efficiency verification criteria.

QUALITY

OVERALL
0.80
CLARITY
0.85
SPECIFICITY
0.75
REUSABILITY
0.90
COMPLETENESS
0.65

IMPROVEMENT SUGGESTIONS

  • Add placeholders like ${wandbApiKey}, ${kubeconfig}, ${dockerImage}, ${sshHost}, ${sshUser}.
  • Provide granular steps or command templates, e.g., 'kubectl apply -f pod.yaml' with sample YAML.
  • Include code snippet for W&B init in training script: 'import wandb; wandb.init(project="${projectName}")'.
  • List common troubleshooting issues and checks for pod efficiency (e.g., logs, metrics).

USAGE

Copy the prompt above and paste it into your AI of choice — Claude, ChatGPT, Gemini, or anywhere else you're working. Replace any placeholder sections with your own context, then ask for the output.

MORE FOR DEVELOPER