Session Outline
Modern generative AI networks are growing in size and complexity to improve accuracy and precision. As a consequence, these larger models suffer reduced throughput and increased memory demands. In the fast-paced landscape of AI, optimizing and scaling inference workloads is paramount. This talk at the Data Innovation Summit 2024 explores a solution to this need: optimizing AI models for performance and maximizing the utilization of available resources.
Key Takeaways
- Understand why it is essential to optimize the performance of your LLMs.
- Learn specific techniques you can use to boost LLM performance.
- Discover free-to-use software products instrumental for LLM performance optimization.
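One widely used family of techniques for the optimizations mentioned above is quantization: storing model weights in low-precision integers instead of 32-bit floats, cutting memory use and often improving throughput. The talk's specific techniques are not listed here, so the sketch below is a generic, minimal illustration of symmetric int8 post-training quantization with made-up weight values, not material from the session itself.

```python
# Minimal sketch of symmetric int8 post-training quantization,
# one common technique for shrinking LLM memory footprints.
# The weight values below are illustrative, not from a real model.

def quantize_int8(weights):
    """Map float weights to the int8 range [-127, 127] via one scale factor."""
    scale = max(abs(w) for w in weights) / 127.0
    q = [round(w / scale) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float weights from the int8 values."""
    return [v * scale for v in q]

weights = [0.42, -1.27, 0.08, 0.99]
q, scale = quantize_int8(weights)
approx = dequantize(q, scale)
# Each recovered weight is close to the original, at a quarter of the storage.
```

Real deployments use per-channel scales and calibration data, but the core idea, trading a small amount of precision for large memory and bandwidth savings, is the same.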