Skip to main content

DFS Pro

DFS Pro turns connected data into governed data assets and repeatable data workflows. Use it when operational data needs lifecycle control, stewardship, version history, fusion, review, audit, or BI reporting.

What users do in DFS Pro

TaskUI areaResult
Create governed data assetsDataset CenterA dataset exists with source type, schema, profile, owner, and lifecycle status.
Validate data assetsDataset DetailA stewarded dataset can be used downstream.
Review schema changesDataset versions and change impactDownstream users understand what changed.
Build reusable processing logicMethod LibraryA method can be tested, published, versioned, and used in fusion tasks.
Fuse multiple datasetsData FusionReviewed outputs combine records from multiple sources.
Resolve uncertaintyReview QueueConflicts, low-confidence results, source disagreements, and manual flags are reviewed.
Fix rejected rowsRejected RowsUpstream corrections can be tracked and reprocessed.
Track evidenceAudit Trail and MetricsChanges, run outcomes, and operational health are traceable.
Build reportsDFS Pro BIReviewed datasets can drive dashboards and scheduled reports.

Lite to Pro workflow

DFS Lite connector -> mapped and synced data -> DFS Pro dataset
-> method or fusion task -> review queue -> validated output

Move from DFS Lite to DFS Pro when a data feed needs any of the following:

  • data steward;
  • dataset lifecycle;
  • schema versioning;
  • profile and preview;
  • lineage or change-impact review;
  • multi-source fusion;
  • review queue;
  • BI reporting.
  1. Create a connector in DFS Lite.
  2. Map and sync source data.
  3. Check data quality.
  4. Create a DFS Pro dataset from the connector output or imported data.
  5. Preview and profile the dataset.
  6. Assign a steward.
  7. Validate the dataset.
  8. Use it in a fusion task, AI Agent workflow, or BI report.
PageUse
DatasetsCreate and validate governed datasets.
Fusion TasksCombine multiple datasets with reviewable matching logic.
Review QueueResolve conflicts, low-confidence outputs, source disagreements, and rejected rows.