The Infoq Podcast

Meryem Arik on LLM Deployment, State-of-the-art RAG Apps, and Inference Architecture Stack

Informações:

Sinopsis

In this podcast, Meryem Arik, Co-founder/CEO at TitanML, discusses the innovations in Generative AI and Large Language Model (LLM) technologies including current state of large language models, LLM Deployment, state-of-the-art Retrieval Augmented Generation (RAG) apps, and inference architecture stack for LLM applications. Read a transcript of this interview: https://bit.ly/3X5ZVPu Subscribe to the Software Architects’ Newsletter for your monthly guide to the essential news and experience from industry peers on emerging patterns and technologies: www.infoq.com/software-architects-newsletter Upcoming Events: InfoQ Dev Summit Boston (June 24-25, 2024) Actionable insights on today’s critical dev priorities. devsummit.infoq.com/conference/boston2024 InfoQ Dev Summit Munich (Sept 26-27, 2024) Practical learnings from senior software practitioners navigating Generative AI, security, modern web applications, and more. devsummit.infoq.com/conference/munich2024 QCon San Francisco (November 18-22, 2024) Get prac