Delivering Innovation with Object-Based Analytics
Thu, 25 Mar 2021 18:17:06 -0000
As analytic workloads continue to grow, more pressure is placed on data teams to build an effective data strategy.
One large financial customer of ours is challenging their data teams to help fight false-positive card declines, which, according to Javelin Research, cost issuers and merchants over $118B. However, preventing fraud while cutting down on false positives can be a fine line to walk. The key to solving this problem with an analytics model is to start with the data.
At Dell Technologies, we have helped our customers work through these challenges for many years, from building fraud detection to enabling life-saving healthcare. We understand that getting the data strategy right can help teams build models to solve their real-world problems. Dell Technologies has engaged in joint engineering and validation efforts to integrate our leading distributed object storage product, Dell EMC ECS, with industry leaders in the analytics space.
Today we are happy to announce a collaboration with unified analytics warehousing leader Vertica, which will allow our customers to deliver cloud innovation on-premises, gain greater operational flexibility and efficiency, and scale infrastructure resources independently. Working with Vertica will allow our joint analytics customers to deliver a flexible and efficient architecture by separating compute and storage.
Collaborating on Object-Based Analytics with Vertica
Vertica is a unified analytics warehouse built to deliver blazing-fast query performance regardless of scale or concurrency requirements. It is a highly scalable analytical database that works well in many deployment situations: on-premises, on top of a Hadoop or S3 data lake, and in any public cloud. Vertica features powerful SQL-based analytics, time-series, geospatial, and in-database machine learning capabilities. Vertica removes typical barriers to analytics for some of the world’s most prominent data-centric organizations.
Vertica in Eon Mode decouples compute from storage to give customers the benefits of cloud architecture for analytic workloads. Previously available only in the public clouds, Eon Mode is now offered with on-premises object storage solutions as well.
“Our customers trust us to provide the greatest freedom in how they consume the highest performance analytics – flexibility for the broadest deployment options, whether it’s deploying Vertica on any major public cloud or on-premises with more leading object storage options,” said Colin Mahony, senior vice president and general manager of Vertica.
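To make the compute and storage separation more concrete: in Eon Mode, the database nodes reach communal storage through an S3-compatible endpoint, and the connection details are typically supplied in a small parameter file when the database is created. The sketch below only renders such a file. The parameter names `awsauth`, `awsendpoint`, and `awsenablehttps` follow Vertica's documented convention for S3-compatible stores; the helper name, the credentials, and the endpoint are hypothetical placeholders, and the validated settings for ECS should be taken from the white paper rather than from this example.

```python
def render_auth_params(access_key: str, secret_key: str,
                       endpoint: str, use_https: bool) -> str:
    """Render the text of an S3 connection parameter file for an Eon Mode database.

    All values passed in here are placeholders for illustration only.
    """
    lines = [
        # Object-store credentials in "access_key:secret_key" form.
        f"awsauth = {access_key}:{secret_key}",
        # Host and port of the S3-compatible endpoint (for example, an ECS node or load balancer).
        f"awsendpoint = {endpoint}",
        # 1 to talk to the endpoint over HTTPS, 0 for plain HTTP.
        f"awsenablehttps = {1 if use_https else 0}",
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical values; substitute the real object user and endpoint.
    print(render_auth_params("OBJECT_USER", "SECRET_KEY",
                             "ecs.example.internal:9020", use_https=False))
```

Because the compute nodes hold only this pointer to communal storage (plus a local cache), nodes can be added or removed without moving the data itself.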
Help Data Teams Solve Real-World Problems
Data teams can begin taking advantage of our joint collaboration today. Today’s announcement allows customers to:
- Deliver cloud innovation on-premises with Vertica in Eon Mode for Dell EMC ECS.
- Separate compute and storage with ECS and Vertica in Eon Mode for operational flexibility and efficiency.
- Scale infrastructure resources independently. Storage can grow without adding expensive compute, and compute can be scaled up or down with variable or intermittent workloads.
Vertica in Eon Mode for Dell EMC ECS gives companies a consistent platform for analytics across all of their environments, whether their data resides in the cloud, on-premises, or in a hybrid architecture. Check this white paper to learn about the technologies and environment used to confirm compatibility between Vertica in Eon Mode and the Dell EMC ECS platform.
Related Blog Posts
Dell and Databricks Announce a Multicloud Analytics and AI Solution
Mon, 22 May 2023 16:58:09 -0000
Dell and Databricks' partnership will bring customers cloud-based analytics and AI using Databricks with data stored in Dell Object Storage.
The biggest business opportunity for enterprises today lies in harnessing data for business insight and gaining a competitive edge. At the same time, the data landscape is more distributed and fragmented than ever. Data is spread out across multiple environments including on-premises and multiple public clouds, thus complicating the ability to access and process data efficiently.
Enterprises require solutions that enable a multicloud data strategy by design. That means leveraging data wherever it is stored, across clouds, with a consistent management, security, and governance experience to build analytical and AI/ML-based workloads.
At Dell, the business of data is not new to us. We store and process a large majority of the world’s data on our systems. And we work with customers across the globe every day to accelerate time to value from their data. Dell is building an open ecosystem of partners who together can help address next-gen challenges in data management.
Dell and Databricks
Today, during the opening keynote at Dell Technologies World 2023, Dell Technologies announced a strategic and multi-phase partnership with Databricks.
I want to share some additional details around that announcement.
Customers today can leverage native Databricks capabilities to process, analyze and share data stored in Dell object storage, located on-prem or in a cloud-adjacent datacenter like Faction, without moving data into the cloud. This unlocks significant benefits for customers, including compute on demand to process on-premises data assets, the ability to securely share data within and outside the enterprise, reduced data movement and fewer copies, compliance with data localization regulations, multicloud resiliency, the adoption of open architecture standards, and an overall reduction in the cost and complexity of their data landscape. Key to this integration is support for Delta Sharing, an open standard for securely sharing live data with any computing platform.
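As a rough illustration of what "open standard" means in practice: a Delta Sharing recipient needs only a small JSON profile (server endpoint plus bearer token) and a table address of the form `<profile-file>#<share>.<schema>.<table>`. The sketch below builds both using only the Python standard library. The endpoint, token, and share, schema, and table names are hypothetical, and nothing here is specific to how Dell object storage is wired into the integration.

```python
import json


def build_profile(endpoint: str, bearer_token: str) -> dict:
    """Return a Delta Sharing profile in the format defined by the open protocol."""
    return {
        "shareCredentialsVersion": 1,
        "endpoint": endpoint,
        "bearerToken": bearer_token,
    }


def table_url(profile_path: str, share: str, schema: str, table: str) -> str:
    """Compose the address a Delta Sharing client uses to locate a shared table."""
    return f"{profile_path}#{share}.{schema}.{table}"


if __name__ == "__main__":
    # Hypothetical endpoint and token, for illustration only.
    profile = build_profile("https://sharing.example.com/delta-sharing/", "<token>")
    with open("example.share", "w", encoding="utf-8") as fh:
        json.dump(profile, fh, indent=2)
    print(table_url("example.share", "finance", "cards", "transactions"))
```

With the open-source `delta-sharing` Python client installed, a URL like this can be passed to `delta_sharing.load_as_pandas(...)` to read the shared table, assuming the provider has granted access.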
And that’s not all - our teams at Dell and Databricks are excited to engineer a deeper integration that will deliver a truly seamless experience of using Dell object storage within the Databricks Lakehouse Platform. This will completely transform the way customers manage on-premises data with cloud platforms.
Dell and Databricks realize it is a multi-faceted world, and customers benefit from being able to access data wherever it resides to unlock competitive differentiation. Dell and Databricks will closely partner in the market to bring these solutions to our joint customers.
“Databricks is focused on helping businesses extract the most valuable insights from their data, wherever it resides,” said Adam Conway, senior vice president, product at Databricks. “This partnership provides the ability to leverage cloud and on-premises data together with best-of-breed technologies, and to securely share that data through Delta Sharing. Combining the best of Dell and Databricks changes the data landscape for customers as they operate in today’s multicloud world.”
Data Management with Dell Technologies
This partnership is a great addition to Dell’s open ecosystem of technologies in the data space. Together with Dell’s market-leading portfolio of storage, compute and services, this ecosystem aims to provide best-in-class data management solutions to our customers and help them build a multicloud data strategy by design. The data space is buzzing with innovations and technologies aimed at improving the user experience, productivity, and business value by orders of magnitude. Partner with Dell Technologies to create the right multicloud data strategy for your enterprise and unleash the next wave of transformation and competitive edge for your business. Visit us at dell.com/datamanagement to stay up to date on the latest in this space.
To learn more about how Dell and Databricks can help your organization streamline its data strategy, read the "Power Multicloud Data Analytics and AI using Dell Object Storage and Databricks" white paper, or contact the Dell Technologies data management team at Dell.data.management@dell.com.
Navigating the modern data landscape: the need for an all-in-one solution
Mon, 18 Mar 2024 19:56:59 -0000
There are two revolutions brewing inside every enterprise. We are all very familiar with the first one - the frenzied rush to expand an organization's AI capabilities, which leads to an exponential growth in data creation, a rise in availability of high-performance computing systems with multi-threaded GPUs, and the rapid advancement of AI models. The situation creates a perfect storm that is reshaping the way enterprises operate. Then, there is a second revolution that makes the first one a reality – the ability to harness this awesome power and gain a competitive advantage to drive innovation. Enterprises are racing towards a modern data architecture that seeks to bring order to their chaotic data environment.
The Need For An All-In-One Solution
Data platforms are constantly evolving. Yet despite a plethora of options such as data lakes, data warehouses, cloud data warehouses and even cloud data lakehouses, enterprises are still struggling. This is because the choices available today are suboptimal.
Cloud-native solutions offer simplicity and scalability, but migrating all data to the cloud can be a daunting task and can end up being significantly more expensive over the long term. The potential loss of control over proprietary data, particularly in the realm of AI, is a major concern as well. On the other hand, traditional on-premises solutions require significantly more expertise and resources to build and maintain. Many organizations simply lack the skills and capabilities needed to construct a robust data platform in-house.
A customer once told me – “We’ve heard from so many vendors but ultimately there is no easy button for us.”
When Dell Technologies set out to build that easy button, we started with what enterprises needed most: infrastructure, software, and services, all seamlessly integrated. We created a tailor-made solution with right-sized compute and a highly performant query engine that is pre-integrated and pre-optimized to streamline IT operations. We incorporated built-in enterprise-grade security that can also integrate with third-party security tools. To enable rapid support, we staffed a bench of experts offering end-to-end maintenance and deployment services. We also knew the solution needed to be future-proof, not only anticipating future innovations but also accommodating the diverse needs of users today. To support this idea, we made the choice to use open data formats, which means an organization’s data is no longer locked in to a proprietary format or vendor. To make the transition easier, the solution makes use of built-in enterprise-ready connectors that ensure business continuity. Ultimately, our goal was to deliver an experience that is easy to install, easy to use, easy to manage, easy to scale, and easy to future-proof.
Dell Data Lakehouse’s Core Capabilities
Let’s dig into each component of the solution.
- Data Analytics Engine, powered by Starburst: A high-performance distributed SQL query engine, built on Starburst and based on Trino, which can run fast analytic queries against data lakes, lakehouses and distributed data sources at internet scale. It integrates global security with fine-grained access controls, supports ad hoc and long-running ELT workloads, and is a gateway to building high-quality data products and powering AI and analytics workloads. Dell’s Data Analytics Engine also includes exclusive features that help dramatically improve performance when querying data lakes. Stay tuned for more info!
- Data Lakehouse System Software: This new system software is the central nervous system of the Dell Data Lakehouse. It simplifies lifecycle management of the entire stack, drives down IT OpEx with pre-built automation and integrated user management, provides visibility into the cluster health and ensures high availability, enables easy upgrades and patches and lets admins control all aspects of the cluster from one convenient control center. Based on Kubernetes, it’s what converts a data lakehouse into an easy button for enterprises of all sizes.
- Scale-out Lakehouse Compute: Purpose-built Dell compute and networking hardware, matched to compute-intensive data lakehouse workloads, comes pre-integrated into the solution. Scale compute independently of storage by adding more as needs grow.
- Scale-out Object Storage: Dell ECS, ObjectScale and PowerScale deliver cyber-secure, multi-protocol, resilient and scale-out storage for storing and processing massive amounts of data. Native support for Delta Lake and Iceberg ensures read / write consistency within and across sites for handling concurrent, atomic transactions.
- Dell Services: Accelerate AI outcomes with help at every stage from trusted experts. Align a winning strategy, validate data sets, quickly implement your data platform and maintain secure, optimized operations.
  - ProSupport: Comprehensive, enterprise-class support on the entire Dell Data Lakehouse stack, from hardware to software, delivered by highly trained experts around the clock and around the globe.
  - ProDeploy: Expert delivery and configuration ensure that you are getting the most from the Dell Data Lakehouse on day one. With 35 years of experience building best-in-class deployment practices and tools, backed by elite professionals, we can deploy 3x faster¹ than in-house administrators.
  - Advisory Services Subscription for Data Analytics Engine: Receive a proactive, dedicated expert to maximize the value of your Dell Data Analytics Engine environment, guiding your team through the design and rollout of new use cases to optimize and scale your environment.
  - Accelerator Services for Dell Data Lakehouse: Fast-track ROI with guided implementation of the Dell Data Lakehouse platform to accelerate AI and data analytics.
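To illustrate how a Trino-based engine such as the Data Analytics Engine can query distributed sources: Trino addresses every table as `catalog.schema.table`, so a single SQL statement can join a lakehouse table with a table in an operational database. The sketch below only assembles such a statement as a string. The catalog, schema, table, and column names are hypothetical, and it says nothing about the Dell-specific features mentioned above.

```python
def qualified_name(catalog: str, schema: str, table: str) -> str:
    """Trino identifies a table by catalog, schema, and table name."""
    return f"{catalog}.{schema}.{table}"


def build_federated_query(lake_table: str, operational_table: str, join_key: str) -> str:
    """Build a SQL statement joining a lakehouse table with an operational table."""
    return (
        f"SELECT c.{join_key}, count(*) AS event_count\n"
        f"FROM {lake_table} AS e\n"
        f"JOIN {operational_table} AS c ON e.{join_key} = c.{join_key}\n"
        f"GROUP BY c.{join_key}"
    )


if __name__ == "__main__":
    # Hypothetical catalogs: an Iceberg catalog on object storage and a PostgreSQL catalog.
    events = qualified_name("iceberg", "sales", "events")
    customers = qualified_name("postgresql", "crm", "customers")
    print(build_federated_query(events, customers, "customer_id"))
```

The resulting text could be submitted through any Trino-compatible client; the point is that the data stays where it is and only the query crosses system boundaries.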
Learn More
With the combination of these capabilities, Dell continues to innovate alongside our customers to help them exceed their goals in the face of data challenges. We aim to help our customers take advantage of the AI revolution and the rapid change in the market, so they can harness the power of their data, gain a competitive advantage, and drive innovation. Enterprises are racing towards a modern data architecture, and it's critical they don’t get stuck at the starting line.
For detailed information on this exciting product, refer to our technical guide. For other information, visit Dell.com/datamanagement.
Source
1 Based on a May 2023 Principled Technologies study “Using Dell ProDeploy Plus Infrastructure can improve deployment times for Dell Technology”