Data and information are becoming more and more important in companies. Data has developed into a key production factor for many enterprises in recent years, thus making the storage and utilization of this data just as crucial.
Not only the sheer volume of data being created but also new kinds of data make the traditional management thereof using data warehouse concepts an impossibility. While with DWH concepts, incoming data is subjected to complex quality assurance and integration procedures before they can be stored in predefined structures (schema on write), data is stored as raw information in a data lake, and not converted and prepared until they are actually needed (schema on read). This means that large volumes of heterogeneous data can be stored far more efficiently, and specific data can be made available and linked together quickly and efficiently for analysis when required.
So Data Lake opens up entirely new possibilities for using data and information profitably in companies. But at the same time, what is becoming more important are questions of the source, trustworthiness, protection and life-cycle management of the data.