|
CoMind: Understanding Collaborative Human Activity from Multiple Minds and Views
Alexey Gavryushin, Dingxi Zhang, Zhao Huang, Alexandros Delitzas, Jiaqi Chen, Ben Ellis, Cedric Zöllner, Manthan Patel, Manuel Kaufmann, Marc Pollefeys, Xi Wang
ECCV, 2026
project page /
paper coming soon
Simultaneous dual egocentric video feeds paired with 3D scene reconstructions and object scans, with three social-reasoning benchmarks: joint attention estimation, socially conditioned action anticipation, and collaborative handover prediction.
|
|
EgoVerse: An Egocentric Human Dataset for Robot Learning from Around the World
Ryan Punamiya*, Simar Kareer*, Zeyi Liu, Josh Citron, Ri-Zhao Qiu, Xiongyi Cai, Alexey Gavryushin, Jiaqi Chen, Davide Liconti, Lawrence Y. Zhu, Patcharapong Aphiwetsa, Baoyu Li, Aniketh Cheluva, Pranav Kuppili, Yangcen Liu, Dhruv Patel, Aidan Gao, Hye-Young Chung, Ryan Co, Renee Zbizika, Jeff Liu, Xiaomeng Xu, Haoyu Xiong, Geng Chen, Sebastiano Oliani, Chenyu Yang, Xi Wang, James Fort, Richard Newcombe, Josh Gao, Jason Chong, Garrett Matsuda, Aseem Doriwala, Marc Pollefeys, Robert Katzschmann, Xiaolong Wang, Shuran Song, Judy Hoffman, Danfei Xu
Preprint, 2025
project page /
pdf
1,362 hours of human demos across 240 scenes and 2,087 people — a collaborative platform that lets anyone contribute egocentric data to train robots at scale, and shows that cotraining with human data improves robot learning.
|
|
REACT3D: Recovering Articulations for Interactive Physical 3D Scenes
Zhao Huang, Boyang Sun, Alexandros Delitzas, Jiaqi Chen, Marc Pollefeys
IEEE Robotics and Automation Letters (RA-L), 2026
project page /
arXiv
Takes a static 3D scan and automatically figures out which parts open, how they move, and what's behind them — making any scene simulation-ready without manual annotation.
|
|
NeRF On-the-go: Exploiting Uncertainty for Distractor-free NeRFs in the Wild
Weining Ren*, Zihan Zhu*, Boyang Sun, Jiaqi Chen, Marc Pollefeys, Songyou Peng
CVPR, 2024
project page /
arXiv
Casually captured photos full of pedestrians, cars, and changing light? No problem — we use uncertainty to filter out the mess and still get clean novel views.
|
|
"Where am I?" Scene Retrieval with Language
Jiaqi Chen, Daniel Barath, Iro Armeni, Marc Pollefeys, Hermann Blum
ECCV, 2024
arXiv
Describe a place in words — "the kitchen with the red fridge" — and we'll find it in a collection of 3D scene graphs. Language-based localization without coordinates.
|
|
A 3D Mixed Reality Interface for Human-Robot Teaming
Jiaqi Chen, Boyang Sun, Marc Pollefeys, Hermann Blum
ICRA, 2024
arXiv
A mixed-reality interface that lets you see through walls to where your robots are, watch their maps build in real-time, and drag-and-drop them to new goals.
|
|
Robot Calligraphy using Pseudospectral Optimal Control in Conjunction with a Novel Dynamic Brush Model
Sen Wang, Jiaqi Chen, Xuanliang Deng, Seth Hutchinson, Frank Dellaert
IROS, 2020 (Best Entertainment and Amusement Paper Award Finalist)
arXiv
Teaching a robot arm to write beautiful Chinese calligraphy by turning each brushstroke into a trajectory optimization problem over factor graphs with a dynamic brush model.
|
|