


Workshops
Workshop
Aron Monszpart · Eric Brachmann · Map-free Workshop
Abstract

The Map-free Visual Relocalization workshop investigates topics related to metric visual relocalization relative to a single reference image instead of relative to a map. This problem is of major importance to many higher-level applications, such as Augmented/Mixed Reality, SLAM, and 3D reconstruction. It is timely because both industry and academia are debating whether and how to build HD maps of the world for these tasks. Our community is working to reduce the need for such maps in the first place.

We host the first Map-free Visual Relocalization Challenge 2024 competition with two tracks: map-free metric relative pose from a single query image to a single reference image (proposed by Arnold et al. in ECCV 2022) and from a query sequence to a single reference image (new). While the former is the more challenging and thus more interesting research topic, the latter represents a more realistic relocalization scenario, in which the querying system may fuse information from query images and tracking poses over a short time window and baseline. We invite papers to be submitted to the workshop.
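
A metric relative pose is typically scored by rotation angle and translation error in meters. As a minimal illustration (not the challenge's official metric, which the organizers define separately), the sketch below computes the geodesic rotation error and Euclidean translation error between an estimated and a ground-truth pose, with rotations given as 3x3 matrices:

```python
import math

def rotation_angle_deg(R_est, R_gt):
    """Geodesic angle (degrees) between two 3x3 rotation matrices.

    Uses trace(R_est^T @ R_gt) = Frobenius inner product of the matrices.
    """
    trace = sum(R_est[i][j] * R_gt[i][j] for i in range(3) for j in range(3))
    # Clamp for numerical safety before acos.
    c = max(-1.0, min(1.0, (trace - 1.0) / 2.0))
    return math.degrees(math.acos(c))

def translation_error_m(t_est, t_gt):
    """Euclidean distance (meters) between estimated and true camera positions."""
    return math.dist(t_est, t_gt)
```

For example, an estimate rotated 90 degrees about the z-axis relative to ground truth yields a 90-degree rotation error, regardless of translation.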

Workshop
Shiqi Yang
Abstract

In recent years, we have witnessed significant advancements in the field of visual generation, which have shaped the research landscape presented at computer vision conferences such as ECCV, ICCV, and CVPR. However, in a world where information is conveyed through a rich tapestry of sensory experiences, the fusion of audio and visual modalities has become essential for understanding and replicating the intricacies of human perception and for diverse real-world applications. Indeed, the integration of audio and visual information has emerged as a critical area of research in computer vision and machine learning, with numerous applications across domains such as multimedia analysis, virtual reality, immersive gaming environments, lifelike simulations for medical training, advertising, and cinema.

Despite these strong motivations, little attention has been given to research focusing on understanding and generating audio-visual modalities compared to traditional, vision-only approaches and applications. Given the recent prominence of multi-modal foundation models, embracing the fusion of audio and visual data is expected to further advance current research efforts and practical applications within the computer vision community, which makes this workshop an encouraging addition to ECCV that will catalyze advancements in this burgeoning field.

In this workshop, we aim to shine …

Workshop
Robin Hesse
Abstract

Deep neural networks (DNNs) are an essential component in the field of computer vision and achieve state-of-the-art results in almost all of its sub-disciplines. While DNNs excel at predictive performance, they are often too complex to be understood by humans, leading to them often being referred to as “black-box models”. This is of particular concern when DNNs are applied in safety-critical domains such as autonomous driving or medical applications. With this problem in mind, explainable artificial intelligence (XAI) aims to gain a better understanding of DNNs, ultimately leading to more robust, fair, and interpretable models. To this end, a variety of different approaches, such as attribution maps, intrinsically explainable models, and mechanistic interpretability methods, have been developed. While this important field of research is gaining more and more traction, there is also justified criticism of the way in which the research is conducted. For example, the term “explainability” in itself is not properly defined and is highly dependent on the end user and the task, leading to ill-defined research questions and no standardized evaluation practices. The goals of this workshop are thus two-fold:

1. Discussion and dissemination of ideas at the cutting-edge of XAI research (“Where are we?”)
2. A …

Workshop
Tomas Hodan
Abstract
Workshop
Andrea Fusiello
Abstract
Workshop
Martin R Oswald
Abstract
Workshop
Yichen Li
Abstract
Workshop
Niclas Zeller
Abstract
Workshop
Despoina Paschalidou
Abstract
Workshop
Mohamed Elhoseiny
Abstract
Workshop
Iuri Frosio
Abstract

Our scope is to bring together people working in Computer Vision (CV) and, more broadly, Artificial Intelligence (AI) to discuss the adoption of CV/AI methods for videogames, which represent both a large capital market within the creative industries and a crucial domain for AI research. Our workshop will cover various aspects of videogame development and consumption, ranging from game creation, game servicing, and player experience management to bot creation, cheat detection, and human-computer interaction mediated by large language models. We believe that focusing on CV for videogames will cohesively bring together related works with foreseeable and practical impact on today's market. We will therefore give priority to submissions specifically devoted to the application of state-of-the-art CV/AI methods FOR videogames, and lower priority to submissions on the adoption of videogames as test beds for the creation and testing of CV/AI methods. We also plan to favour the presentation of novel datasets that can spark further research in this field.

The committee and keynote speakers include multiple genders and researchers from different geographical areas (USA, EU, Asia), from both industry (NVIDIA, Activision, Blockade Labs, Microsoft, Snap) and academia (Universities of Trento, Malta, …

Workshop
Deblina Bhattacharjee
Abstract
Workshop
Stuart James
Abstract
Workshop
Roberto Pierdicca
Abstract
Workshop
Andre Araujo
Abstract
Workshop
Henghui Ding
Abstract
Workshop
Hongxu Yin
Abstract
Workshop
Giuseppe Fiameni
Abstract
Workshop
Yao Feng
Abstract
Workshop
Leena Mathur
Abstract
Workshop
Linlin Yang
Abstract

Our HANDS workshop will gather vision researchers working on perceiving hands performing actions, including 2D & 3D hand detection, segmentation, pose/shape estimation, tracking, etc. We will also cover related applications including gesture recognition, hand-object manipulation analysis, hand activity understanding, and interactive interfaces.

The eighth edition of this workshop will emphasize the use of large foundation models (e.g., CLIP, Point-E, Segment Anything, Latent Diffusion Models) for hand-related tasks. These models have revolutionized the capabilities of AI and made groundbreaking contributions to multimodal understanding, zero-shot learning, and transfer learning. However, there remains untapped potential in exploring their applications to hand-related tasks. Our official website is https://hands-workshop.org.

Workshop
Alexander Krull
Abstract
Workshop
Lucia Schiatti
Abstract
Workshop
Anand Bhattad
Abstract
Workshop
Michael Dorkenwald
Abstract

From GPT to DINO to diffusion models, the past years have seen major advances in self-supervised learning, with many new methods reaching astounding performances on standard benchmarks. Still, the field of SSL is rapidly evolving, with new learning paradigms emerging at unprecedented speed. At the same time, works on coupled data, such as image-text pairs, have shown large potential in producing even stronger models capable of zero-shot tasks and benefiting from the methodology developed in SSL. Despite this progress, it is also apparent that major challenges remain unresolved and that it is not clear what the next step is going to be. In this workshop, we want to highlight and provide a forum to discuss potential research directions, from radically new self-supervision tasks, data sources, and paradigms to surprising counter-intuitive results. Through invited speakers and oral paper talks, our goal is to provide a forum to discuss and exchange ideas where both the leaders in this field and the new, younger generation can equally contribute to discussing the future of this field.

Workshop
Andrea Pilzer
Abstract

This UNcertainty quantification for Computer Vision (UNCV) Workshop aims to raise awareness and generate discussion regarding how predictive uncertainty can, and should, be effectively incorporated into models within the vision community. The workshop will bring together experts from machine learning and computer vision to create a new generation of well-calibrated and effective methods that "know when they do not know".
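
One standard way to quantify whether a model "knows when it does not know" is the expected calibration error (ECE), which compares predicted confidence to empirical accuracy within confidence bins. The sketch below is a minimal, stdlib-only illustration of that idea (the workshop itself does not prescribe this metric):

```python
def expected_calibration_error(confidences, correct, n_bins=10):
    """ECE: bin predictions by confidence, then take the weighted average
    of |average confidence - accuracy| over the bins."""
    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        idx = min(int(conf * n_bins), n_bins - 1)  # top edge goes to last bin
        bins[idx].append((conf, ok))
    total = len(confidences)
    ece = 0.0
    for b in bins:
        if not b:
            continue
        avg_conf = sum(c for c, _ in b) / len(b)
        accuracy = sum(1 for _, ok in b if ok) / len(b)
        ece += (len(b) / total) * abs(avg_conf - accuracy)
    return ece
```

A perfectly calibrated model (e.g., 90% confidence and 90% accuracy within a bin) scores 0, while a model that is 90% confident yet always wrong scores 0.9.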

Workshop
Diego Garcia-Olano
Abstract
Workshop
Shangzhe Wu
Abstract
Workshop
Vasileios Belagiannis
Abstract
Workshop
Lucia Cascone
Abstract
Workshop
Federica Proietto Salanitri
Abstract
Workshop
Zane Durante
Abstract
Workshop
Viorica Patraucean
Abstract

Following the successful 2023 edition, we organise the second Perception Test Challenge to benchmark multimodal perception models on the Perception Test (blog, github) - a diagnostic benchmark created by Google DeepMind to comprehensively probe the abilities of multimodal models across:
* video, audio, and text modalities
* four skill areas: Memory, Abstraction, Physics, Semantics
* four types of reasoning: Descriptive, Explanatory, Predictive, Counterfactual
* six computational tasks: multiple-choice video-QA, grounded video-QA, object tracking, point tracking, action localisation, sound localisation

Workshop
Ryuichiro Hataya
Abstract
Workshop
Antonio Alliegro
Abstract
Workshop
Marco Cotogni
Abstract

In an era of rapid advancements in Artificial Intelligence, the imperative to foster Trustworthy AI has never been more critical. The first "Trust What You learN (TWYN)" workshop seeks to create a dynamic forum for researchers, practitioners, and industry experts to explore and advance the intersection of Trustworthy AI and DeepFake Analysis within the realm of Computer Vision. The workshop aims to delve into the multifaceted dimensions of building AI systems that are not only technically proficient but also ethical, transparent, and accountable. The dual focus on Trustworthy AI and DeepFake Analysis reflects the workshop's commitment to addressing the challenges posed by the proliferation of deepfake technologies while simultaneously promoting responsible AI practices.

Workshop
Mamatha Thota
Abstract
Workshop
Vivek Sharma
Abstract

The focus of this workshop is to bring together researchers from industry and academia who focus on both distributed and privacy-preserving machine learning for vision and imaging. These topics are of increasingly large commercial and policy interest. It is therefore important to build a community for this research area, which involves collaborating researchers that share insights, code, data, benchmarks, training pipelines, etc., and together aim to improve the state of privacy in computer vision.

Workshop
Yiming Wang
Abstract
Workshop
Tzofi Klinghoffer
Abstract

Neural fields have been widely adopted for learning novel view synthesis and 3D reconstruction from RGB images by modeling transport of light in the visible spectrum. This workshop focuses on neural fields beyond conventional cameras, including (1) learning neural fields from data from different sensors across the electromagnetic spectrum and beyond, such as lidar, cryo-electron microscopy (cryoEM), thermal, event cameras, acoustic, and more, and (2) modeling associated physics-based differentiable forward models and/or the physics of more complex light transport (reflections, shadows, polarization, diffraction limits, optics, scattering in fog or water, etc.). Our goal is to bring together a diverse group of researchers using neural fields across sensor domains to foster learning and discussion in this growing area.
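
The "physics-based differentiable forward models" mentioned above often reduce, for conventional cameras, to the classic emission-absorption volume-rendering quadrature that NeRF-style methods differentiate through. As a hedged, sensor-agnostic sketch (real implementations vectorize this and add sensor-specific physics), rendering one ray from sampled densities and colors looks like:

```python
import math

def volume_render(sigmas, colors, deltas):
    """Discrete emission-absorption quadrature along a single ray.

    sigmas: volume densities at samples, colors: radiance at samples,
    deltas: distances between consecutive samples.
    Returns (accumulated color, remaining transmittance).
    """
    color = 0.0
    transmittance = 1.0  # fraction of light surviving to the current sample
    for sigma, c, delta in zip(sigmas, colors, deltas):
        alpha = 1.0 - math.exp(-sigma * delta)  # opacity of this segment
        color += transmittance * alpha * c
        transmittance *= 1.0 - alpha
    return color, transmittance
```

Swapping this forward model for one matching another sensor (e.g., time-of-flight returns for lidar, or projection integrals for cryoEM) while keeping the learned field is precisely the kind of extension the workshop targets.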

Workshop
Francesca Palermo
Abstract

As Smart Eyewear devices become increasingly prevalent, optimizing their functionality and user experience through sophisticated computer vision applications is crucial. These devices must not only effectively process real-time data but also operate under power and computational constraints while ensuring user privacy and ethical standards are upheld.

The "Eyes of the Future: Integrating Computer Vision in Smart Eyewear (ICVSE)" workshop, at ECCV 2024, aims to advance the field of Smart Eyewear by integrating cutting-edge computer vision technologies. This workshop addresses the need to bridge theoretical research and practical implementations in Smart Eyewear, a technology that will transform user interactions in everyday life through enhanced perception and augmented reality experiences.

The need for this workshop stems from the rapid advancements in both computer vision and wearable technology sectors, necessitating a dedicated forum where interdisciplinary insights and experiences can be shared to accelerate practical applications. Thus, ICVSE not only aims to showcase novel research but also to inspire a roadmap for future developments in Smart Eyewear technology.

Workshop
Guido Borghi · Marcella Cornia · Federico Becattini · Claudio Ferrari · Tomaso Fontanini
Abstract