Artificial intelligence (AI) is now on the forefront of how enterprises work with information to assist reinvent operations, enhance buyer experiences, and keep a aggressive benefit. It’s now not a nice-to-have, however an integral a part of a profitable data strategy. Step one for profitable AI is entry to trusted, ruled information to gas and scale the AI. With an open data lakehouse architecture strategy, your groups can maximize worth from their information to efficiently undertake AI and allow higher, quicker insights.
Why does AI want an open information lakehouse structure?
Think about this, a forecast by IDC exhibits that international spending on AI will surpass $300 billion in 2026, leading to a compound annual development price (CAGR) of 26.5% from 2022 to 2026. One other IDC study confirmed that whereas 2/3 of respondents reported utilizing AI-driven information analytics, most reported that lower than half of the info below administration is out there for this sort of analytics. In truth, in accordance in an IDC DataSphere examine, IDC estimated that 10,628 exabytes (EB) of knowledge was decided to be helpful if analyzed, whereas solely 5,063 exabytes (EB) of knowledge (47.6%) was analyzed in 2022.
A data lakehouse structure combines the efficiency of knowledge warehouses with the pliability of knowledge lakes, to address the challenges of today’s complex data landscape and scale AI. Sometimes, on their very own, information warehouses may be restricted by excessive storage prices that restrict AI and ML mannequin collaboration and deployments, whereas information lakes can lead to low-performing information science workloads.
Nonetheless, when bringing collectively the ability of lakes and warehouses in a single strategy — the info lakehouse — organizations can see the advantages of extra dependable execution of analytics and AI initiatives.
A lakehouse ought to make it simple to mix new information from quite a lot of completely different sources, with mission vital information about prospects and transactions that reside in current repositories. New insights and relationships are discovered on this mixture. Additionally, a lakehouse can introduce definitional metadata to make sure readability and consistency, which allows extra reliable, ruled information.
All of this helps using AI. And AI, each supervised and unsupervised machine studying, is usually one of the best or typically solely solution to unlock these new massive information insights at scale.
How does an open information lakehouse structure assist AI?
Enter IBM watsonx.data, a fit-for-purpose information retailer constructed on an open information lakehouse, to scale AI workloads, for all of your information, anyplace. Watsonx.information is a part of IBM’s AI and information platform, watsonx, that empowers enterprises to scale and speed up the affect of AI throughout the enterprise.
Watsonx.information allows customers to entry all information by way of a single level of entry, with a shared metadata layer deployed throughout clouds and on-premises environments. It helps open information and open desk codecs, enabling enterprises to retailer huge quantities of knowledge in vendor-agnostic codecs, equivalent to Parquet, Avro, and Apache ORC, whereas leveraging Apache Iceberg to share giant volumes of knowledge by way of an open desk format constructed for high-performance analytics.
By leveraging a number of fit-for-purpose question engines, organizations can optimize pricey warehouse workloads, and can now not must maintain a number of copies of knowledge for numerous workloads or throughout repositories for analytics and AI use circumstances.
Lastly, as a self-service, collaborative platform, your groups are now not restricted to solely information scientists and engineers working with information, however now can lengthen the work to non-technical customers. Later this 12 months, watsonx.data will infuse watsonx.ai generative AI capabilities to simplify and speed up the best way customers work together with information, with the power to make use of pure language to find, increase, refine and visualize information and metadata powered by a conversational, pure language interface.
Subsequent steps to your information and AI technique
Take the time to verify your enterprise information and AI technique is prepared for the dimensions of knowledge and affect of AI with an open information lakehouse strategy. With watsonx.information, you possibly can expertise the advantages of an information lakehouse to assist scale AI workloads for all of your information, anyplace.
Request a live 30-minute demo for watsonx.data
Access the IDC study on the datalakehouse approach here





