The Art of Speed: Unleashing Peak Performance in Generative AI – Ekaterina Sirazitdinova, NVIDIA

Modern generative AI networks are growing in size and complexity to enhance accuracy and precision.
Data Innovation Summit 2024 Data Innovation Summit 2024
Data Innovation Summit 2024

Session Outline

Modern generative AI networks are growing in size and complexity to enhance accuracy and precision. As a consequence, such larger AI models result in reduced throughput and escalated memory needs. In the fast-paced landscape of AI, optimizing and scaling inferencing workloads becomes paramount. This enlightening talk at the Data Innovation Summit 2024, explores a solution to address this need, which involves optimizing AI models for performance and maximizing the utilization of available resources.

Key Takeaways

  • Understand why it is essential to optimize performance of your LLMs.
  • Learn specific techniques one can utilize to boost the performance of LLMs.
  • Discover free to use software products instrumental for LLM performance optimization.
Add a comment

Leave a Reply