Use cases
You can use artifacts throughout your entire ML workflow as inputs and outputs of runs. You can use datasets, models, or even other artifacts as inputs for processing.
Use Case | Input | Output |
---|---|---|
Model Training | Dataset (training and validation data) | Trained Model |
Dataset Pre-Processing | Dataset (raw data) | Dataset (pre-processed data) |
Model Evaluation | Model + Dataset (test data) | W&B Table |
Model Optimization | Model | Optimized Model |
The proceeding code snippets are meant to be run in order.
Create an artifact
Create an artifact with four lines of code:- Create a W&B run.
- Create an artifact object with the
wandb.Artifact
API. - Add one or more files, such as a model file or dataset, to your artifact object.
- Log your artifact to W&B.
dataset.h5
to an artifact called example_artifact
:
- The
type
of the artifact affects how it appears in the W&B platform. If you do not specify atype
, it defaults tounspecified
. - Each label of the dropdown represents a different
type
parameter value. In the above code snippet, the artifact’stype
isdataset
.
See the track external files page for information on how to add references to files or directories stored in external object storage, like an Amazon S3 bucket.
Download an artifact
Indicate the artifact you want to mark as input to your run with theuse_artifact
method.
Following the preceding code snippet, this next code block shows how to use the training_dataset
artifact:
You can pass a custom path into the
root
parameter to download an artifact to a specific directory. For alternate ways to download artifacts and to see additional parameters, see the guide on downloading and using artifacts.Next steps
- Learn how to version and update artifacts.
- Learn how to trigger downstream workflows or notify a Slack channel in response to changes to your artifacts with automations.
- Learn about the registry, a space that houses trained models.
- Explore the Python SDK and CLI reference guides.