Chapters
Executive summary
Solution overview
Methodology
Workload description
Hardware configurations tested
Software configuration
Key takeaways
Conclusion
Resources
This document details an analysis from Dell Technologies and Intel of the next-token latency requirements of real-time chatbots and fine-tuning LLMs.