Jessie Rudd, Technical Business Analyst at PBT Group
Even though data warehouses and data lakes are considered large [data] storage repositories, this is where the similarities stop. While the latter offers significant business opportunities, not many organisations understand how to effectively unlock its potential.
A data lake is unstructured data that comes direct from the source. The data structures and requirements are not defined in any way until the data is needed. By its nature, it is exceptionally agile and provides data scientists with a platform to extract meaningful insights.
In South Africa, the only companies that have been able to benefit (to a limited extent) from data lakes have been those operating in the telecommunications sector. This is purely based on the amount of data they have at their disposal and their budgets to acquire the resources needed to store it.
However, whether they choose to utilise it or not, remains to be seen. After all, South Africa is still quite traditional in how it approaches data management. So, while telecoms operators understand what data lakes are, their willingness to access it effectively is still up for debate.
Data ownership
Complicating matters is that data analysts are organised and prefer to work in a structured environment whereas data lakes require a more scientific approach based on curiosity. The mindset needed to really do the proverbial wading in to a data lake is quite different to how many organisations and analysts view data currently.
It does not help that many organisations are still up in the air about the benefits of big data. There is still an ongoing debate on the merits of structured versus unstructured data and what the practicalities are for the daily running of a business. Unless a company has a dedicated group of people delving into big data (or even a data lake for that matter), there is not enough leeway to really get the benefits associated to it.
Swimming in the lake
A data lake does offer companies a powerful platform to do a lot of things with data, but it does require a leap of faith in how to access it. If you do not know what you are looking for in the data lake, you are never going to find it. Companies have limited budgets. And in a tough economy, they want to remain focused. For smaller businesses, lakes are simply not a viable option even though it is built on the unstructured data they are leveraging.
An option would be to divide the lake into smaller data ponds where each kind of data is pooled together. This means scientists or analysts can go to the right pond looking for specific data. So, even though the data is still a complete mess, at least it will be of the right kind.
Whether the telecoms operators are wiling to experiment with this and play around with data lakes will drive a lot of the growth potential for the immediate future. But it must be focused on getting the basics right and create a platform from there.
Understanding data lakes

Understanding data lakes
Related Articles
Data governance must be built in and not left to the last minute
Data governance must be built in and not left to the last minute Julian Thomas, Principal Consultant at PBT Group No company can afford to treat data governance as an afterthought. And yet, time again, governance considerations are sidelined. Typically, these are only addressed when audits loom or when compliance flags are raised at the […]
Bridging the gap: How AI is redefining automation in business
Bridging the gap: How AI is redefining automation in business Nicky Pantland, Data Analyst at PBT Group Automation has long been a critical driver of business efficiency, streamlining repetitive tasks and enhancing productivity. However, its rigid, rule-based nature has often limited its potential. The emergence of artificial intelligence (AI) is transforming automation from a static […]
The transformative power of data ownership in modern businesses
The transformative power of data ownership in modern businesses Nathi Dube, Director, PBT Innovation at PBT Group As we all know, data is fundamental to the success of any organisation. It drives decision-making, innovation, and growth. However, the journey from raw data to actionable insights hinges on one critical factor: data ownership. If there are […]
Building a future-proof data governance framework for AI
Building a future-proof data governance framework for AI Petrus Keyter, Data Governance Consultant at PBT Group Artificial intelligence (AI) is reshaping the business landscape. This advanced technology is redefining how companies govern and use data. However, with its dependency on large datasets and its ability to inform decision-making, AI integration requires a more robust, adaptable […]
The relevance of data literacy in the context of AI
The relevance of data literacy in the context of AI Jan de Villiers, Head of Cloud Academy at PBT Group Even though artificial intelligence (AI) provides organisations across all industry sectors with opportunities to improve decision-making and enhance operational efficiencies, its success is reliant on the availability and quality of data used. For AI to […]
The role of data products in data-driven decision making
The role of data products in data-driven decision making Nathi Dube, Director, PBT Innovation at PBT Group In today’s competitive business landscape, the ability to make data-driven decisions offers businesses a significant advantage. Insights gleaned from high-quality, well-curated data can drive smarter strategies, improved customer experiences, and long-term customer loyalty. Yet, simply having data isn’t […]