Learn to deploy and optimize open-source large language models on AWS infrastructure. Master essential tools like llama.cpp and UV for efficient model deployment, and understand key concepts from model conversion to production optimization. This course bridges the gap between research models and production deployment.
Performer
Noah Gift, presenter
Notes
Online resource; title from title details screen (O’Reilly, viewed December 2, 2024)