Generative AI for Connected and Autonomous Vehicles

by Kyungtae Han

ISBN 978-1-394-42668-3 · Wiley

This book is a practical, applications-first guide to generative AI for connected and autonomous vehicle (CAV) systems, from large language models and multimodal generative models through prompt engineering, retrieval, tool use, agentic systems and protocols, fine-tuning, simulation, evaluation, and safety. It is designed for graduate courses, advanced undergraduate electives, and industry self-study. The hands-on labs are designed to run locally with open-source software and modest computing resources.

Order from Wiley →

Hands-On Labs

The companion labs are the core of the book's hands-on track: self-contained Jupyter notebooks that run locally on a laptop (CPU, Apple Silicon, or CUDA) using open-source tools. They are released under the MIT license, so readers and instructors can clone, run, and adapt them freely.

40+ hands-on labs across the book
Local-first CPU, Apple Silicon, or CUDA
No API keys open-source models running locally
MIT licensed clone, run, and adapt

A sample of what the labs build, grounded in connected and autonomous vehicle scenarios:

Labs on GitHub (public at publication) →

The repository goes public when the book is released; the link above will be live then.

Quickstart once the repository is available:

git clone https://github.com/han-kt/genai-for-cav-labs.git
cd genai-for-cav-labs
uv sync          # in a chosen lab directory
uv run jupyter lab

Each lab's README covers setup and prerequisites in detail.

What's Inside

A look at the book's full scope, to help readers and instructors decide whether it fits their needs. It is organized into seven parts spanning 22 chapters, grouped into four conceptual layers that progress from generative and multimodal models, through the model interface and agentic systems, to adaptation and validation. Each chapter pairs concepts with a CAV-grounded application and a runnable lab. Chapters are largely self-contained, so the structure below also maps the reading and teaching paths that follow.

Parts & Chapters

The seven parts and their chapters, grouped into the book's four conceptual layers.

LayerPartChapters
Generative & Multimodal Models Part 1Foundations 1. Introduction to Generative AI for CAV Systems
2. Large Language Models
Part 2Multimodal Representation 3. Diffusion and Flow Matching
4. GANs, VAEs, and Autoregressive Models
5. Vision-Language Models
6. Video Large Language Models
Model Interface Part 3Language-Based Interaction 7. Prompt Engineering
8. Retrieval-Augmented Generation
9. Function Calling and Tool Use
Agentic Systems & Protocols Part 4Agentic Intelligence 10. GenAI Agent Architectures
11. Multi-Agent Systems
12. Orchestration Frameworks
Part 5Communication Protocols 13. Communication Protocols
14. Model Context Protocol (MCP)
15. Agent-to-Agent (A2A) Protocols
16. Capability Composition with MCP and A2A
Adaptation & Validation Part 6Adaptation and Deployment 17. Supervised Fine-Tuning
18. Reinforcement Learning Fine-Tuning
19. Knowledge Distillation
Part 7Validation and Safety 20. Simulation Platforms
21. Evaluation of Generative AI Systems
22. Ethical and Safety Considerations

Appendices A–E (in the printed book) consolidate the mathematical, probabilistic, and reinforcement learning foundations, an automotive–AI concept bridge, and a glossary.

Reading Pathways

Different readers can enter the material along different paths:

Graduate / Research

Read all seven parts in sequence for comprehensive theoretical and practical coverage; the best fit for coursework and research.

Upper-Division Undergraduate

An applied LLM-systems track: Parts 1, 3, 4, and 7, with the multimodal models of Part 2 as optional enrichment.

Industry Practitioner

Foundations, then language interaction, agentic systems and protocols, and adaptation and validation; consult Part 2 as specific applications require it.

Sample Teaching Schedules

Two example one-semester (15-week) layouts are shown below. Additional teaching schedules and editable syllabus templates are available to adopting instructors.

Graduate Seminar: Full Sequence

WkTopicCh.
1Introduction to Generative AI for CAVs1
2Large Language Models2
3Diffusion and Flow Matching3
4GANs, VAEs, Autoregressive Models4
5Vision-Language and Video Models5–6
6Prompt Engineering7
7Retrieval-Augmented Generation8
8Function Calling and Tool Use9
9GenAI Agent Architectures10
10Multi-Agent Systems, Orchestration11–12
11Communication Protocols, MCP13–14
12A2A and Capability Composition15–16
13Supervised and RL Fine-Tuning17–18
14Knowledge Distillation, Simulation19–20
15Evaluation and Ethics; project21–22

Undergraduate: LLM-Systems Track

WkTopicCh.
1Introduction to Generative AI for CAVs1
2Large Language Models2
3How LLMs Generate2
4Prompt Engineering7
5Retrieval-Augmented Generation8
6Function Calling and Tool Use9
7Review and midterm; project kickoff
8GenAI Agent Architectures10
9Multi-Agent Systems11
10Orchestration Frameworks12
11Connecting Agents: MCP14
12Adapting Models: Fine-Tuning17
13Simulation Platforms20
14Evaluation21
15Ethics and Safety; project22

Part 2 (Chapters 3–6) is optional enrichment for students interested in image, video, and multimodal models.

Wiley Resources

Instructor and supplementary materials are provided through Wiley. The book's Wiley page is the home for ordering information, supplementary resources, and the instructor materials for adopters.

Visit the Wiley book page →