Home > Storage > PowerFlex > White Papers > NVIDIA Riva on Red Hat OpenShift with Dell PowerFlex > NVIDIA Riva
NVIDIA Riva is a GPU-accelerated SDK for building speech AI applications that can be customized to deliver real-time performance. Riva offers pretrained speech models in NVIDIA NGC that can be fine-tuned on a custom dataset.
Models can be exported, optimized, and deployed as a speech service on premises or in the cloud with a single command using Helm charts. Riva’s high-performance inference is powered by NVIDIA TensorRT optimizations and served using the NVIDIA Triton Inference Server, which are both part of the NVIDIA AI platform.
Riva services are available for low-latency streaming, and high-throughput offline use cases. Riva is fully containerized and can scale to hundreds and thousands of parallel streams. For more information about Riva, see NVIDIA Riva. To learn more about purchasing Riva, contact NVIDIA Sales.