Packt

Optimizing, Deploying, and Governing LLMs in the Enterprise

Packt

Optimizing, Deploying, and Governing LLMs in the Enterprise

Included with Coursera Plus

Gain insight into a topic and learn the fundamentals.
Advanced level

Recommended experience

8 hours to complete
Flexible schedule
Learn at your own pace
Gain insight into a topic and learn the fundamentals.
Advanced level

Recommended experience

8 hours to complete
Flexible schedule
Learn at your own pace

What you'll learn

  • Develop strategies for optimizing and accelerating LLM inferencing patterns at scale.

  • Learn how to monitor LLM performance and troubleshoot in production systems.

  • Gain insights into responsible AI practices and the ethical considerations of deploying LLMs in enterprises.

Details to know

Shareable certificate

Add to your LinkedIn profile

Recently updated!

April 2026

Assessments

7 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

There are 7 modules in this course

This module explores the critical role of data in developing and fine-tuning large language models (LLMs). Learners will discover strategies for data sourcing, augmentation, quality control, annotation, and bias mitigation, supported by real-world case studies and practical coding examples. By the end, participants will understand how to craft robust data pipelines that enhance LLM performance and fairness.

What's included

1 video11 readings1 assignment

This module explores the practical aspects of deploying large language models (LLMs) in enterprise environments, focusing on efficiency, compliance, and performance optimization. Learners will discover techniques such as model quantization, edge computing, and caching, while also addressing regulatory requirements and performance audits. Real-world examples and hands-on exercises illustrate how to manage and monitor LLM deployments effectively.

What's included

1 video8 readings1 assignment

This module explores practical strategies for accelerating and optimizing large language model (LLM) inference, focusing on memory-efficient formats, deployment engines, and cross-platform solutions. Learners will compare leading frameworks, understand model compilation and quantization, and examine real-world use cases for scalable, low-latency deployments. Emerging trends and advanced optimization techniques are also discussed to prepare learners for cutting-edge AI deployment challenges.

What's included

1 video9 readings1 assignment

This module explores the design and deployment of interconnected large language model (LLM) systems, highlighting key architectural patterns, enabling technologies, and advanced techniques for knowledge sharing and cost efficiency. Learners will examine real-world examples such as autonomous agents, programmable pipelines, and hybrid symbolic-LLM systems to understand how modern AI solutions achieve scalability, adaptability, and reliability.

What's included

1 video11 readings1 assignment

This module explores the essential practices for deploying and maintaining large language models (LLMs) in real-world production environments. Learners will gain insights into monitoring key metrics, ensuring reliability and security, optimizing costs, and scaling architectures for global deployment. Practical strategies and industry insights are provided to help build robust, efficient, and compliant LLM systems.

What's included

1 video10 readings1 assignment

This module explores the ethical, technical, and regulatory challenges associated with large language models (LLMs). Learners will examine fairness by design, post hoc output filtering, real-time content moderation, and documentation practices to ensure responsible AI deployment. The module also covers strategies for enhancing safety, robustness, and the implementation of constitutional AI principles.

What's included

1 video10 readings1 assignment

This module explores the latest advancements in multimodal artificial intelligence, focusing on how systems integrate and process diverse data types such as text and images. Learners will examine key enabling technologies, including cross-modal attention mechanisms, contrastive learning, and efficient fusion techniques, and see real-world applications in domains like healthcare. By the end, participants will understand both the technical foundations and practical implications of multimodal AI.

What's included

1 video8 readings1 assignment

Instructor

Packt - Course Instructors
Packt
1,728 Courses488,803 learners

Offered by

Packt

Why people choose Coursera for their career

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."
Coursera Plus

Open new doors with Coursera Plus

Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy

Frequently asked questions