Home > Workload Solutions > SQL Server > White Papers > Solution Insight: SQL Server 2022 Data Analytics on Dell PowerEdge with AMD EPYC 7473X Processors and Dell ECS > Solution overview
Microsoft SQL Server is widely used across all industries, and these data sources are often mission critical. This means that backup and restore of SQL Server database is crucial. SQL Server 2022 backup and restore with S3-compatible object storage provides additional flexibility which can be backed up to the cloud. To use this feature, T-SQL provides the TO URL syntax for backup and FROM URL syntax for restore.
Data virtualization is a broad term used to describe an approach to data management. It allows an application to retrieve and manipulate data without requiring technical details about the data, such as how it is formatted at the source or where it is physically located. Data virtualization involves abstracting different sources through a single data access layer. There are tools and software available that organizations are adopting to integrate different types of data virtually. Data integration enables data mining and data analytics, and it is critical for predictive analytics tools that use machine learning (ML) and artificial intelligence (AI).
SQL Server 2022 PolyBase makes data virtualization possible for data scientists to use T-SQL for analytic workloads. PolyBase does this by querying data directly from other sources such as Oracle, Teradata, Hadoop cluster, and S3-compatible object storage without separately installing client connection software. It allows T-SQL queries to join the data from external sources to relational tables in an instance of SQL Server. The use of T-SQL OPENROWSET or EXTERNAL TABLE syntax delivers a powerful tool to query data in S3-compatible storage.
In this validation, AMD EPYC 7473X processors were chosen for the SQL Server 2022 database instances because running T-SQL queries for data analytics require quick response time. The AMD EPYC 7473X processors have 24-core per socket @2.8GHz with L3 cache size of 768 MB which offers enough horsepower for data analytic workloads.
Data analytic workloads require optimal CPUs for data processing. It is also important for these workloads to have a flexible and scalable S3-compatible object storage.
Dell Elastic Cloud Storage (ECS) is a software-defined, cloud-scale, object storage platform that delivers S3, Atmos, CAS, Swift, NFSv3, and HDFS storage services on a single, modern platform. It provides simple RESTful API access for storage services. Dell ECS provides significant value for organizations seeking a platform that supports rapid data growth. Dell ECS advantages and features include:
Table 1. Dell ECS features
Cloud scale |
|
Flexible deployment |
|
Enterprise grade |
|
TCO reduction |
|