
Mapping Data Flows Across Design, Construction, and Operations
Most firms generate more data in a single year than they will ever meaningfully use. Projects produce drawings, models, schedules, specifications, correspondence, change orders, submittals, meeting minutes, and financial records in quantities that accumulate faster than any individual can track. But the accumulation of data is not the same as the availability of information. In most firms, the data is there. The ability to learn from it, to act on it in real time, and to move it from where it is created to where it is needed — that is largely absent.
The reason is that data in the built environment does not flow. It accumulates in disconnected silos at each phase of a project's life. The design phase generates data that lives in BIM software and document management systems. The construction phase generates data that lives in field management apps, RFI logs, and contractor communication threads. The operations phase — if it is considered at all from a data perspective — generates maintenance records, warranty claims, and facility management data that is completely disconnected from everything that came before.
Building the Data Flow Map
A data flow map is a visual and structured documentation of what information is generated at each stage of the work, in what format, and where it needs to go for the next stage to proceed without manual rework. It is the starting point for any serious workflow improvement effort, because it makes visible what is otherwise assumed — the specific paths that data is supposed to travel, and the specific places where those paths do not exist.

Building the map requires a phase-by-phase walkthrough of a representative project. For each phase — from initial programming or design through construction and handover — the analysis documents every significant piece of information that is created: what system captures it, what format it takes, who needs it next, and what they do with it.
The process of building this map almost always surfaces discoveries that surprise the people who commission it. Data that everyone assumed was being transferred automatically turns out to require manual intervention. Information that is captured in one system and needed in another is being re-entered by hand multiple times per week. Decisions that should be informed by data from a previous phase are being made without it, because no one established a mechanism for that data to cross the phase boundary.
What the Map Reveals
The most common finding is redundant data entry. The same piece of information — a project budget figure, a schedule milestone, a specification value — is entered manually into multiple systems that do not communicate with each other. Each manual entry is an opportunity for error, and over the course of a complex project with hundreds of data points tracked across multiple systems, the cumulative error rate is significant.
A related finding is data loss at phase transitions. When a project moves from design to construction, information that was carefully maintained in design-phase systems often does not transfer to construction-phase systems in a usable form. Field teams work from printed documents rather than live models. Change order impacts do not automatically update the project schedule. Specification values have to be re-entered into submittal tracking systems. The information existed — it was just not transferred in a way that the receiving system could use.
The third common finding is the absence of feedback loops. Information that is generated in later project phases — actual costs versus estimated, actual durations versus scheduled, change order rates by project type — is rarely fed back to the systems used in earlier phases to inform future estimates and decisions. Each project starts without the benefit of fully analyzed experience from previous ones.
From Map to Integration
The data flow map translates directly into an integration roadmap: a prioritized list of connections to build between systems, so that data moves automatically from where it is created to where it is needed. The prioritization should weight the frequency of the data transfer, the cost of the current manual process, and the complexity of building the integration.
In most cases, the highest-priority integrations are straightforward to build. Moving data between two systems that both have API access is a matter of mapping the fields, writing the connection logic, and testing the output. The technical work is not the constraint. The constraint is knowing which connections to build first — which requires the map.
