This work discusses a crucial activity for Heat Integration (HI): Data Extraction (DE). Overall, data acquisition is a complex process involving many stages and stakeholders. The paper analyses the treatment of DE in the published literature. The analysis established that many publications just present the results from already performed DE and others limit the provided reasoning only to a few key issues. Further, the importance of the preservation of data integrity and semantics is highlighted. This can be done by documenting the reasoning and the choices made during DE, as well as distributing the DE reasoning between the general discussion and stream-specific information placement into the Process Streams Table for HI, preserving the context. The approach is illustrated on a published example.