Edoardo Zorzi
Sapienza University of Rome, Italy
Ph.D. Student.

Collaborative Instance-object Navigation (CoIN) challenge at Embodied Agent and Dialog Workshop @ ECCV 2026.
1) determine whether or not the object shown in the image matches the given object description (when sufficient information is available)
2) ask the user a clarifying question when the description and the image are ambiguous
The goal of the agents is to correctly identify the image that matches the object description while asking the user as few questions as possible.

Challenge end date: August 15th, 2026
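
As a rough illustration of the two agent actions described above, the following minimal sketch maps a match score to a decision. Everything in it is an assumption made for illustration and is not part of the challenge API: the `Action` and `Decision` types, the `decide` function, the idea that the agent produces a score in [0, 1], and the `low`/`high` thresholds are all hypothetical.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Action(Enum):
    """Hypothetical action set mirroring the two tasks above."""
    MATCH = "match"        # image matches the description
    NO_MATCH = "no_match"  # image does not match the description
    ASK = "ask"            # ask the user a clarifying question


@dataclass
class Decision:
    action: Action
    question: Optional[str] = None  # only set when action is ASK


def decide(match_score: float, low: float = 0.2, high: float = 0.8) -> Decision:
    """Map an assumed match score in [0, 1] to a decision.

    Scores at or above `high` are treated as a match, scores at or below
    `low` as a non-match, and anything in between as ambiguous, which
    triggers a clarifying question. The thresholds are illustrative only.
    """
    if not 0.0 <= match_score <= 1.0:
        raise ValueError("match_score must be in [0, 1]")
    if match_score >= high:
        return Decision(Action.MATCH)
    if match_score <= low:
        return Decision(Action.NO_MATCH)
    # Placeholder question; a real agent would generate one from the
    # image and the description.
    return Decision(Action.ASK, question="Can you describe the object in more detail?")
```

Under these assumptions, widening the gap between `low` and `high` makes the agent ask more questions, while narrowing it makes the agent commit earlier at the risk of more wrong answers, which is the trade-off the challenge goal describes.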
TBD
TBD
TBD
Sapienza University of Rome, Italy
Ph.D. Student.

Fondazione Bruno Kessler, Trento, Italy
Senior Researcher.

If you use the provided materials, please cite the relevant paper below.
@InProceedings{taioli2025coin,
author = {Taioli, Francesco and Zorzi, Edoardo and Franchi, Gianni and Castellini, Alberto and Farinelli, Alessandro and Cristani, Marco and Wang, Yiming},
title = {Collaborative Instance Object Navigation: Leveraging Uncertainty-Awareness to Minimize Human-Agent Dialogues},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
month = {October},
year = {2025},
pages = {18781--18792}
}
@misc{zorzi2026coinqa,
author = {Zorzi, Edoardo and Taioli, Francesco and Wang, Yiming and Cristani, Marco and Farinelli, Alessandro and Castellini, Alberto and Bazzani, Loris},
title = {Benchmarking Interaction, Beyond Policy: a Reproducible Benchmark for Collaborative Instance Object Navigation},
eprint = {2604.00265},
archivePrefix = {arXiv},
url = {https://arxiv.org/pdf/2604.00265},
month = {March},
year = {2026}
}