Create an AI Agent-Ready Dataset
FactVerse AI Agent workflows need data that is traceable, fresh enough for the task, mapped to the right operational identity, and reviewed by the right owner. Use this recipe to prepare a DFS Pro dataset for AI Agent workflows.
Outcome
At the end of this workflow, you should have:
- a defined Agent question or workflow;
- source systems connected or imported;
- mappings reviewed;
- a DFS Pro dataset created;
- profile and quality issues reviewed;
- steward assigned;
- dataset validated;
- source limitations documented for Agent users.
Step 1: Define the Agent workflow
Start with the operational task, not the data table.
Examples:
- summarize equipment status before inspection;
- prepare predictive maintenance evidence;
- compare recent alarms with work orders;
- explain why an energy or facility value changed;
- prepare a reviewed work-order draft.
Record:
- user role;
- site or tenant boundary;
- assets or systems involved;
- expected output;
- reviewer;
- allowed action boundary.
Step 2: Identify required evidence
List the evidence the Agent should use.
| Evidence type | Examples |
|---|---|
| Asset context | Asset ID, equipment name, aliases, location, system relationship. |
| Signals | Sensor values, meter values, status, event history. |
| Work records | Work orders, inspection records, close notes, operator actions. |
| Documents | SOP, maintenance manual, drawings, compliance notes. |
| Quality notes | Missing values, stale values, source limitations. |
Step 3: Connect or import source data
Use DFS Lite connectors for live or scheduled sources. Use DFS Pro dataset creation for imported or prepared data.
For each source, confirm:
- source owner;
- update frequency;
- timestamp field;
- identity field;
- unit and value range;
- permission to use the data in the Agent workflow.
Step 4: Map operational identity
AI Agent output is only useful when identity is stable.
Review:
- source equipment names;
- asset IDs;
- aliases;
- location hierarchy;
- point names;
- work-order asset references;
- document references.
If source systems use different names for the same asset, create or update mappings before validating the dataset.
Step 5: Create DFS Pro dataset
In Dataset Center:
- Create dataset.
- Choose source type.
- Add name and description.
- Add or confirm source ID.
- Save.
- Open detail page.
- Preview rows.
- Review profile.
Dataset description should include:
- source system;
- purpose;
- owner;
- expected refresh cadence;
- known limitations;
- intended Agent workflow.
Step 6: Review quality and profile
Check:
- row count;
- required columns;
- null ratio;
- distinct IDs;
- timestamp range;
- stale records;
- quality issues;
- schema changes.
For Agent workflows, document limitations in plain language. Example: "Work-order close notes are available from the CMMS feed, but part replacement details are only available after nightly sync."
Step 7: Assign steward and validate
Assign a steward who can approve the dataset for the Agent workflow.
Validate when:
- identity mapping is reviewed;
- source timestamps are understood;
- quality issues are acceptable for the task;
- dataset schema is stable enough;
- downstream reviewer accepts the assumptions.
Step 8: Link to Agent workflow
When using the dataset in an Agent workflow, keep these visible:
- dataset name and version;
- source timestamp;
- quality notes;
- reviewer;
- allowed output type;
- action approval requirement.
For MCP workflows, confirm scopes and endpoint access separately in the MCP documentation.
Related pages
| Page | Use |
|---|---|
| DFS Pro Datasets | Create and validate the dataset. |
| Data Quality | Review source freshness and quality. |
| FactVerse AI Agent Data Readiness | Check whether source evidence is ready for Agent use. |
| MCP Scope Matrix | Plan endpoint and scope access for MCP clients. |