Troubleshooting

Common fixes for CLI, agent, and deployment hiccups.

CLI diagnostics

Run `deepbox doctor` to capture environment info, API reachability, and token validity.

Stuck experiments

Check the scheduler queue, review node logs, and consider pausing competing workflows.

GPU provisioning

Ensure your cloud quotas cover bursty jobs and enable automatic fallback to spot fleets.

Was this page helpful?