Governance Studio
Governance Studio is the DFS Pro workspace for repeatable data processing pipelines. Use it when a workflow needs governed input datasets, reusable methods, run history, and node-level execution evidence.
Typical uses:
- prepare reviewed data before a BI report;
- apply a reusable method to a validated dataset;
- produce an output dataset for fusion or AI Agent workflows;
- inspect run evidence after a processing failure;
- standardize a data preparation workflow across sites.
Before you start
Confirm:
dfs:readanddfs:writeaccess;- DFS Pro package access;
- at least one prepared dataset;
- at least one published or draft method suitable for the workflow;
- a named owner for the resulting output.
Open Governance Studio
Go to:
Data Integration > Governance Studio
The list view shows pipelines, status, node count, update time, and delete action.
Create a pipeline
- Select New Pipeline.
- Enter a pipeline name.
- Add a description.
- Save.
- Open the canvas.
Use a name that states the operational outcome.
Examples:
Facility sensor quality preparationWork order and asset feature pipelineInspection evidence normalizationEnergy meter reporting preparation
Add dataset nodes
In the canvas:
- Select the Dataset Nodes palette.
- Search for the dataset.
- Drag the dataset to the canvas.
- Confirm the label in the node inspector.
Use validated datasets where possible. Draft datasets can be useful during early design, but production pipelines should have a stewarded input set.
Add method nodes
- Select the Processing Nodes palette.
- Search for a method.
- Drag the method to the canvas.
- Connect dataset nodes to method nodes.
- Select the method node to review label and method details.
Use methods from the Method Library when the same data processing logic should be reused or versioned.
Save the pipeline
Select Save after arranging nodes and edges.
Before saving for shared use, confirm:
- each node label is understandable;
- the flow direction is clear;
- methods match the input dataset shape;
- the owner can explain each processing step.
Validate the pipeline
Select Validate before running.
Review:
- validation status;
- node count;
- edge count;
- method count;
- sink count;
- errors;
- warnings.
Fix validation errors before running the pipeline. Treat warnings as review items for the pipeline owner.
Run the pipeline
Select Run Pipeline when validation has passed or the owner accepts the remaining warnings.
During a run, the pipeline can show running, completed, or failed status. Use the run drawer to inspect execution history.
Inspect run history
Open Runs to review previous pipeline runs.
Check:
- run ID;
- status;
- start time;
- duration;
- triggering user when visible;
- registered output datasets;
- error message when present.
Use run history during troubleshooting and for evidence review.
Inspect node execution logs
After a run, select a node and open its execution log.
Review:
- node ID;
- node label;
- node status;
- method key when present;
- duration;
- rows in;
- rows out;
- error message;
- preview rows when available.
Node logs are useful when the full pipeline fails but only one dataset or method caused the issue.
Troubleshooting
| Symptom | Check |
|---|---|
| Dataset is missing from palette | Confirm dataset access, status, and tenant scope. |
| Method is missing from palette | Confirm method category, status, and permissions. |
| Validation fails | Review missing edges, missing methods, invalid dataset references, and warnings. |
| Run fails | Open run history, inspect node logs, and fix the first failed node. |
| Output dataset is unavailable | Check registered outputs in run history and downstream dataset permissions. |
Next step
Use Audit and Metrics to review operational health after pipelines are running.