Download the Llama 2 Model
The model is available on Hugging Face. To obtain access to the Llama 2 model, we completed the license agreement required by Meta AI.
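Once the license request is approved and the Hugging Face account is linked, the model weights can be fetched with the Hugging Face CLI. This is a minimal sketch, not the exact commands used in this study; the repository id `meta-llama/Llama-2-7b-chat-hf` and the local directory name are assumptions:

```shell
# Install the Hugging Face Hub CLI (assumed environment: Python with pip available)
pip install -U "huggingface_hub[cli]"

# Authenticate with the Hugging Face account that was granted Llama 2 access
huggingface-cli login

# Download the chat model weights to a local directory (repo id assumed)
huggingface-cli download meta-llama/Llama-2-7b-chat-hf --local-dir ./llama-2-7b-chat-hf
```

The download is gated, so the login step must use the same account on which the Meta AI license was accepted.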
The model's GPU memory consumption on our system is shown in the following table.
| Model | Model Precision | GPUs Used | GPU Memory Consumed | Platform |
|---|---|---|---|---|
| Llama 2-7B-chat | FP16 | 1 x A100-40GB | 14.08 GiB | PowerEdge R760xa |
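The measured footprint is consistent with holding the weights in FP16 at 2 bytes per parameter, plus runtime overhead. A rough back-of-the-envelope check, assuming Llama 2-7B has about 6.74 billion parameters (a commonly cited figure, not stated in this document):

```python
def fp16_weight_gib(n_params: float) -> float:
    """GiB needed to store n_params weights at 2 bytes each (FP16)."""
    return n_params * 2 / 2**30

# Assumed parameter count for Llama 2-7B: ~6.74e9
weights_gib = fp16_weight_gib(6.74e9)
print(f"{weights_gib:.2f} GiB")  # ~12.55 GiB for weights alone
```

The gap between the ~12.55 GiB of raw weights and the 14.08 GiB observed would be accounted for by the CUDA context, activation buffers, and framework overhead.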