4th DriveX Workshop In conjunction with CVPR 2026

Foundation Models for V2X-based Cooperative Autonomous Driving

A premier forum uniting academic, industry, and standards communities to shape the next generation of cooperative, foundation-model-driven autonomous driving and intelligent transportation systems.

Wednesday, June 3, 2026 Denver, Colorado, USA In conjunction with CVPR 2026

Call for Papers Challenge Tracks

Curated keynote lineup from academia & industry

Focus on real-world V2X datasets & benchmarks

Safety, robustness, and trustworthy autonomy

Introduction Schedule Speakers Paper Track Challenge Organizers Program Committee Sponsors

Introduction

The 4th edition of the DriveX Workshop focuses on how foundation models and V2X-based cooperative systems can redefine perception, prediction, planning, and decision-making for autonomous driving and intelligent transportation infrastructure.

Traditional single-vehicle pipelines have achieved impressive progress in 3D detection and tracking, yet they remain constrained by limited viewpoints, occlusions, and domain shifts. Cooperative driving systems, powered by V2X communication and roadside/edge intelligence, extend sensing range, enrich scene context, and enable shared representations across vehicles and infrastructure.

In parallel, foundation models, including vision, vision-language, and multi-modal large models, unlock powerful generalization capabilities: open-vocabulary understanding, scalable pretraining, zero-shot adaptation, and interpretable reasoning about complex road scenes. Emerging end-to-end and agentic systems such as large driving models promise unified perception-to-control frameworks but raise new questions in trustworthiness, reliability, calibration, and evaluation at urban scale.

DriveX 2026 convenes researchers and practitioners from computer vision, robotics, communications, transportation, AI safety, and policy to:

Design foundation models natively aware of cooperative perception and real-world V2X constraints.
Bridge large-scale datasets, simulation, and deployment for multi-agent coordination.
Discuss standards, benchmarks, and open challenges for safe, human-centric cooperative autonomy.

Topics of Interest

Foundation models for cooperative autonomous driving and intelligent transportation systems
Vision-language models for traffic scene understanding and explanation
LLM- and agent-based support for perception, prediction, planning, and V2X coordination
Cooperative perception, V2X communication, and infrastructure-assisted driving
Datasets, benchmarks, and evaluation protocols for foundation models and cooperative perception
3D occupancy, 3D detection, 3D semantic segmentation, and holistic 3D scene understanding
End-to-end pipelines and large driving models for multi-agent decision-making
Safety, robustness, uncertainty estimation, and open-set reasoning in cooperative systems
Vehicle-to-Infrastructure (V2I) and Vehicle-to-Everything (V2X) interaction and standards

Schedule (Tentative)

Time	Session
08:00 – 08:10	Opening Remarks – Welcome & Workshop Overview
08:10 – 08:30	Opening Keynote Keynote
08:30 – 09:50	Keynotes 1 - Prof. Dr. Daniel Cremers (TUM) Keynote
08:50 – 09:10	Keynotes 2 - Dr. Mingxing Tan (Waymo) Keynote
09:20 – 09:40	Keynotes 3 - Dr. Jamie Shotton (Wayve) Keynote
09:40 – 10:00	Keynotes 4 - Prof. Dr. Marco Pavone (Stanford & NVIDIA) Keynote
10:00 – 11:00	Poster Session I & Coffee Break Posters
11:00 – 11:20	Keynote 5 - Prof. Dr. Angela Dai (TUM) Keynote
11:20 – 12:00	Panel I Panel
12:00 – 13:00	Lunch Break & Networking
13:00 – 13:20	Keynotes 6 - Prof. Dr. Bolei Zhou (UCLA) Keynote
13:20 – 13:40	Keynotes 7 - Prof. Dr. Manmohan Chandraker (UCSD) Keynote
13:40 – 14:00	Keynotes 8 - Prof. Dr. Sharon Li (UW–Madison) Keynote
14:00 – 14:20	Keynotes 9 - Prof. Dr. Holger Caesar (TU Delft) Keynote
14:20 – 15:00	Oral Presentations 1–4 Oral
15:00 – 16:00	Poster Session II & Coffee Break
16:00 – 16:20	Keynotes 10 - Prof. Dr. Alina Roitberg (Uni Stuttgart) Keynote
16:20 – 16:40	Keynotes 11 - Prof. Dr. Jiaqi Ma (UCLA) Keynote
16:40 – 17:20	Panel II Panel
17:20 – 17:30	Oral Presentation 5Oral
17:30 – 17:40	Awards Ceremony – Best Paper, Poster, Keynote, Challenge
17:40 – 18:00	Closing Remarks & Group Photo
19:00 – 21:00	Workshop Reception & Networking Reception

Final schedule, room allocation, and speaker order will be announced closer to the workshop date.

Confirmed Keynote Speakers

Prof. Dr. Daniel Cremers

Technical University of Munich, Germany

Dr. Mingxing Tan

Waymo, USA

Dr. Jamie Shotton

Wayve, Vancouver

Prof. Dr. Marco Pavone

Stanford University & NVIDIA, USA

Prof. Dr. Angela Dai

Technical University of Munich, Germany

Prof. Dr. Bolei Zhou

University of California Los Angeles (UCLA), USA

Prof. Dr. Manmohan Chandraker

University of California San Diego (UCSD), USA

Prof. Dr. Sharon Li

University of Wisconsin-Madison

Prof. Dr. Holger Caesar

Delft University of Technology, Netherlands

Prof. Dr. Alina Roitberg

University of Stuttgart, Germany

Prof. Dr. Jiaqi Ma

University of California Los Angeles (UCLA), USA

Paper Track

DriveX 2026 invites high-quality contributions on foundation models, V2X-based cooperative perception, large driving models, and related topics outlined above.

We welcome:

Novel full papers (max 8 pages, excluding references) for publication in the official proceedings.
4-page extended abstracts or 8-page versions of recently published work (non-archival, not included in proceedings).

Submissions must follow the official CVPR 2026 style: LaTeX or Typst.

Submission Portal: tbd
Submission Opens: February 1, 2026 (23:59 PST)
Paper Submission Deadline: March 1, 2026 (23:59 PST)
Notification to Authors: March 10, 2026
Camera-Ready Deadline: March 20, 2026

Paper Awards

🏆 Best Paper Award
🏆 Best Paper Runner-Up Award
🏆 Best Poster Award
🏆 Best Keynote Presentation Award

DriveX Challenge

The DriveX Challenge fosters rigorous, reproducible benchmarking of cooperative perception and planning on real-world V2X datasets. Tracks are designed in close collaboration with dataset creators and industry partners.

V2I-Based Cooperative Perception
Infrastructure–vehicle fusion using TUMTraf V2X CP. Focus on cooperative 3D detection and tracking with infrastructure-mounted LiDAR, radar, and cameras, emphasizing occlusion handling, long-range awareness, and reliability under real-world conditions.
Accident Scene Understanding & Safety Reasoning
Built upon TUMTraf Accid3nD. Participants design models for high-risk scenarios, proactive risk assessment, and early accident prediction using cooperative perception signals to support Vision-Zero mobility.
End-to-End Multi-Agent Autonomous Driving
Using V2XPnP and V2V4Real, teams explore end-to-end policies and trajectory planning with single-vehicle, multi-vehicle, and vehicle–infrastructure inputs. The track highlights how cooperative intelligence improves policy learning, coordination, and safety.

Competition Timeline

Competition Announcement: January 1, 2026
Submission Deadline: March 1, 2026 (23:59 PST)
Notification to Participants: March 31, 2026

Top-performing teams will be invited to present at the workshop. Detailed rules, baselines, and submission instructions will be released on the official challenge page.