CommError, Run does not exist
and ERROR Error uploading
Your W&B Run ID might be defined if these two error messages are both returned. As an example, you might have a similar code snippet defined somewhere in your Jupyter Notebooks or Python script:
Cuda out of memory
Refactor your code to use process-based executions if you see this error message. More specifically, rewrite your code to a Python script. In addition, call the W&B Sweep Agent from the CLI, instead of the W&B Python SDK.
As an example, suppose you rewrite your code to a Python script called train.py
. Add the name of the training script (train.py
) to your YAML Sweep configuration file (config.yaml
in this example):
train.py
Python script:
wandb agent
with the CLI instead of the Python SDK (wandb.agent
). Replace sweep_ID
in the code snippet below with the Sweep ID that was returned in the previous step:
anaconda 400 error
The following error usually occurs when you do not log the metric that you are optimizing:
wandb.log
) this metric. In addition, ensure you use the exact metric name that you defined the sweep to optimize within your Python script or Jupyter Notebook. For more information about configuration files, see Define sweep configuration.