Program


Monday, June 2nd


13:00-13:40 Registration

13:40-14:00 Session 1: Opening

Chair:

Daniel Thalmann

Location: Amphie 23

14:00-15:00 Session 2: How to train large scale 3D human and object foundation models

How to train large scale 3D human and object foundation models by Prof. Gerard Pons-Moll (University of Tübingen, Tübingen AI Center MPII)

Understanding 3D humans interacting with the world has been a long-standing goal in AI and computer vision for decades. Lack of 3D data has been the major barrier to progress. This is changing with the increasing number of 3D datasets featuring images, videos and multi-view with 3D annotations, as well as large-scale image foundation models. However, learning models from such sources is non-trivial. Some of the challenges are: 1) datasets are annotated with different 3D skeleton formats and outputs, and 2) image foundation models are 2D, and extracting 3D information from them is hard. I will present solutions to each of these two challenges. I will introduce a universal training procedure to consume any skeleton format, a diffusion-based method tailored to lift foundation models to 3D (human and also general objects), and a mechanism to probe 3D foundation model features for geometry and texture awareness based on 3D Gaussian splatting reconstruction. I will also show a method to systematically create 3D human benchmarks on demand for evaluation (STAGE).

Chair:

Daniel Thalmann

Location: Amphie 23

15:05-16:20 Session 3A: Fluid & Physical Simulation I

CAVW

Chair:

Yalan Zhang

Location: Amphie 23

15:05	Haokai Zeng, Dongyu Yang, Yanrui Xu, Yalan Zhang, Zhongmin Wang, Feng Tian, Xiaokun Wang and Xiaojuan Ban An Adaptive Boundary Material Point Method with Surface Particle Reconstruction (abstract) PRESENTER: Haokai Zeng
15:30	Bastien Saillant, Florence Zara, Fabrice Jaillet and Guillaume Damiand Going further with Vertex Block Descent (abstract) PRESENTER: Bastien Saillant
15:55	Sun-Lay Gagneux, Khalid Djado and Richard Egli Simulation of Ball Levitation with SPH PRESENTER: Sun-Lay Gagneux

15:05-16:20 Session 3B: AI in Education & Interfaces

CAVW

Chair:

Christos Mousas

Location: Amphie 24

15:05	Jeongha Lee, Ghazanfar Ali and Jaein Hwang A Retrieval-Augmented Generation System for Accurate and Contextual Historical Analysis: AI-Agent for the Annals of the Joseon Dynasty (abstract) PRESENTER: Jeongha Lee
15:30	Dake Liu, Huiwen Zhao, Wen Tang and Wenwen Yang AIKII: An AI-enhanced Knowledge Interactive Interface for Knowledge Representation in Educational Games (abstract) PRESENTER: Dake Liu
15:55	Thuc Long Ha, Juan Verde, Julien Bert and Hadrien Courtecuisse Toward Fluoroscopy Guided Robotic Needle Insertion for Radio Frequency Ablation (abstract) PRESENTER: Thuc Long Ha

16:20-16:35 Coffee break

16:35-18:20 Session 4A: Geometry, Rendering & Mesh Processing

CAVW

Chair:

Frederic Cordier

Location: Amphie 23

16:35	Mingxiao Hu, Linlin Ge and Xujie Li Fuzzy Sampling with Qualified Uniformity Properties for Implicitly Defined Curves and Surfaces (abstract) PRESENTER: Mingxiao Hu
17:00	Mengyao Zhang, Wenting Li, Yong Zhao, Xin Si and Jingliang Zhang A robust 3D mesh segmentation algorithm with anisotropic sparse embedding (abstract) PRESENTER: Mengyao Zhang
17:25	Zezheng Chen, Huizhi Zhu, Fei Luo and Chunxia Xiao ReDACT: Reconstructing Detailed Avatar with Controllable Texture PRESENTER: Zezheng Chen
17:50	Yuxi Zhou, Bowen Gao, Hongxin Zhang, Xiaoliang Luo, Lvchun Wang and Wei Chen A Real-time Virtual-Real Fusion Rendering Framework in Cloud-Edge Environments (abstract) PRESENTER: Yuxi Zhou

16:35-18:20 Session 4B: Conversational Agents & Virtual Reality

CAVW

Chair:

Daniel Thalmann

Location: Amphie 24

16:35	Mehmet Efe Sak, Sinan Sonlu and Uğur Güdükbay Talk with Socrates: Relation Between Perceived Agent Personality and User Personality in LLM-based Natural Language Dialogue Using Virtual Reality (abstract) PRESENTER: Mehmet Efe Sak
16:58	Sagar Ashok Vankit, Vivian Genaro Motti, Tiffany Do, Samaneh Zamanifard, Deyrel Diaz, Andrew Duchowski, Bart Knijnenburg and Matias Volonte Path Modeling of Visual Attention, User Perceptions, and Behavior Change Intentions in Conversations with Embodied Agents in VR (abstract) PRESENTER: Sagar Ashok Vankit
17:21	Antoine Oger, Geoffrey Gorisse, Sylvain Fleury and Olivier Christmann MemorIA, an Architecture for Creating Interactive AI Historical Agents in Educational Contexts (abstract) PRESENTER: Antoine Oger
17:44	Samaneh Zamanifard, Sagar Vankit, Deyrel Diaz, Christos Mousas, Kelly Richardson, Andrew Duchowski and Matias Volonte Exploring the Impact of Multimodal Long Conversations in VR on Attitudes Towards Behavior Change, Memory Retention, and Cognitive Load (abstract) PRESENTER: Sagar Vankit
18:07	Akira Miya, Kunio Yamamoto and Masaki Oshita User Interface for Controlling Crowd in Metaverse Using Spatial Controller (abstract) PRESENTER: Masaki Oshita

18:30-19:45 Cocktail

Tuesday, June 3rd


08:30-09:30 Session 5: Keynote: Xubo Yang

Harmonized XR: Seamlessly Bridging Physical and Perceptual Realism by Prof. Xubo Yang (Shanghai Jiao Tong University)

Extended Reality (XR) represents a spectrum of immersive technologies that seamlessly blend the digital and physical worlds, creating environments where users can interact with virtual content as if it were part of their reality. This keynote synthesizes cutting-edge research across visual perception, physical simulation, and interactive rendering to explore how XR can achieve both physical realism (accurate representation of physical phenomena) and perceptual realism (alignment with human visual and sensory perception).

We begin by addressing the challenges of visual fidelity in XR through innovative techniques that enhance occlusion, color accuracy, and rendering efficiency, ensuring that virtual content aligns seamlessly with human perception. Next, we delve into advancements in simulation methodologies that bring unprecedented physical accuracy to virtual environments, enabling the realistic representation of complex phenomena such as fluids, bubbles, and surface tension effects. Finally, we explore interactive experiences that bridge the gap between physical and perceptual realism by optimizing virtual interactions to align with natural human behavior and visual focus.

By integrating these advancements, XR can achieve a harmonious balance between physical and perceptual realism, creating immersive environments that are not only computationally efficient but also deeply engaging and believable. This keynote will highlight the interplay between these dimensions, offering a comprehensive roadmap for the future of XR technologies.

Chair:

Frederic Cordier

Location: Amphie 23

09:35-10:50 Session 6A: 3D Face and Talking Head Modeling

3 CAVW 1 LNCS

Chair:

Uğur Güdükbay

Location: Amphie 23

09:35	Jiajie Wu, Frederick W. B. Li, Gary K.L. Tam, Bailin Yang, Fangzhe Nan and Jiahao Pan Talking Face Generation with Lip and Identity Priors (abstract) PRESENTER: Jiajie Wu
09:53	Bailin Yang, Jiahao Pan, Fangzhe Nan and Jiajie Wu Speech-Driven 3D Facial Animation with Regional Attention for Style Capture PRESENTER: Jiahao Pan
10:11	Xingfei Xue, Xuesong Wang, Weizhou Liu, Xingce Wang, Junli Zhao and Zhongke Wu Coarse-to-Fine 3D Craniofacial Landmark Detection via Heat Kernel Optimization (abstract) PRESENTER: Xingfei Xue
10:29	Xiwen Shi, Hao Zhao, Yi Jiang, Hao Xu, Ziyi Yang, Yiqian Wu, Qingbiao Wu and Xiaogang Jin GSFaceMorpher: High-Fidelity 3D Face Morphing via Gaussian Splatting (abstract) PRESENTER: Xiwen Shi

09:35-10:50 Session 6B: Cultural Heritage & Artistic Generation

CAVW

Chair:

Huiwen Zhao

Location: Amphie 24

09:35	Yuan Ma, Zhixuan Wang, Yinghan Shi and Meili Wang Chinese Painting Generation with A Stroke-by-Stroke Renderer and a Semantic Loss (abstract) PRESENTER: Yuan Ma
10:00	Hui Liang and Rui Wang Research on Multi-Feature Fusion Shadow Puppet Motifs Generation Based on CSPMotifsGAN and Cultural Heritage Preservation (abstract) PRESENTER: Rui Wang
10:25	Jiahui Pan, Bailin Yang, Frederick W. B. Li and Fangzhe Nan CLPFusion: A Latent Diffusion Model Framework for Realistic Chinese Landscape Painting Style Transfer (abstract) PRESENTER: Jiahui Pan

10:50-11:05 Coffee break

11:05-12:20 Session 7A: Fluid & Physical Simulation II

CAVW

Chair:

Hadrien Courtecuisse

Location: Amphie 23

11:05	Yalan Zhang, Yuhang Xu, Xiaokun Wang, Angelos Chatzimparmpas and Xiaojuan Ban Decoupling Density Dynamics: A Neural Operator Framework for Adaptive Multi-Fluid Interactions (abstract) PRESENTER: Yuhang Xu
11:30	Naruo Nishio, Syuhei Sato, Kaisei Sakurai and Keiko Nakamoto A Control Simulation of Multiple Bubbles for Representing Desired Shapes (abstract) PRESENTER: Syuhei Sato
11:55	Qianwei Wang, Yanrui Xu, Xiangyu Sheng, Chao Yao, Yu Guo, Jian Chang, Jianjun Zhang and Xiaokun Wang A versatile energy-based SPH surface tension with spatial gradients (abstract) PRESENTER: Qianwei Wang

11:05-12:20 Session 7B: Human Behavior and Animation in Virtual and Mixed Reality Environments

Chair:

Junghyun Han

Location: Amphie 24

11:05	Ruochen Cao, Ziyuan Feng, Changyue Ma, Xin Wen, Yanrong Hao, Chenchen Zhang, Zequn Liang, Ziarmal Hussain and Rui Cao Virtual Guides and Crowd Behaviors: Understanding Evacuation Decision-Making in Virtual Reality PRESENTER: Ziyuan Feng
11:23	Jihui Jiao, Rui Zeng, Ju Dai and Junjun Pan BACH: Bi-stage Data-driven Piano Performance Animation for Controllable Hand motion (abstract) PRESENTER: Jihui Jiao
11:41	Cai Cheng-En, Sai-Keung Wong and Tzu-Yu Chen Risk-Aware Pedestrian Behavior Using Reinforcement Learning in Mixed Traffic (abstract) PRESENTER: Tzu-Yu Chen
11:59	Alessandro Visconti, Roberta Macaluso, Gabriele Di Bartolomei, Davide Calandra and Fabrizio Lamberti Improving Fidelity of Close Social Interaction Animations in Social VR with a Machine Learning-based Refinement Framework PRESENTER: Roberta Macaluso

12:20-14:00 Lunch

14:00-15:40 Session 8: AniNex Workshop I: Immersive Media, Culture & Education

4 CAVW 1 LNCS

Chair:

Jian Chang

Location: Amphie 23

14:00	Hui Liang and Longfei Yang Scene-EEGCNN: Visualization of Zen Meditation Experience Based on EEG-Cultural Heritage Integration (abstract) PRESENTER: Longfei Yang
14:20	Jiahao Du, Lihua You and Jianjun Zhang Exploring the Therapeutic Potential of VR-Based ASMR Animation: A Comparative Study on Relaxation and Sleep Aid (abstract) PRESENTER: Jiahao Du
14:40	Hui Liang, Yukun Li and Jialin Fu Immersion Discrepancies in Educational Serious Games Among Children's Age Groups (abstract) PRESENTER: Yukun Li
15:00	Michael Adjeisah, Eshani Kawshika Fernando and Jian Chang Immersive Video Game Experience through Naturalistic and Emotive Dialogue Agent (abstract) PRESENTER: Michael Adjeisah
15:15	Anil Bas, Oleg Fryazinov, Xiaosong Yang and Callum Rex Reid Photorealistic 3D Head Reconstruction via 2D Gaussians (abstract) PRESENTER: Anil Bas

15:40-16:00 Coffee break

16:00-18:00 Session 9: AniNex Workshop II: Simulation, Interaction & Visual Understanding

Chair:

Anil Bas

Location: Amphie 23

16:00	Jiamin Wang, Haoping Wang, Xiaokun Wang, Yalan Zhang, Jiří Kosinka, Steffen Frey, Alexandru Telea and Xiaojuan Ban Peridynamics-Based Simulation of Viscoelastic Solids and Granular Materials (abstract) PRESENTER: Jiamin Wang
16:20	Boyuan Cheng, Shang Ni, Jian Jun Zhang and Xiaosong Yang Automating Visual Narratives: Learning Cinematic Camera Perspectives from 3D Human Interaction (abstract) PRESENTER: Boyuan Cheng
16:40	Xin Luo and Qingshen Li Intelligent Compilation System for Chinese Character Animation Based on Dynamic Data Sets (abstract) PRESENTER: Xin Luo
17:00	Yanfeng Zheng, Pengjie Wang, Hao Liu and Xiaosong Yang Unsupervised Salient Object Detection with Pseudo-Labels Refinement (abstract) PRESENTER: Hao Liu
17:20	Nicolay Rusnachenko, James Franklin, Theophilus Akudjedu, Neel Doshi, Michael Board, Jian Chang and Jian Jun Zhang Using Large Language Models for Evaluation of Radiological Textual Reports (abstract) PRESENTER: Nicolay Rusnachenko
17:32	Aradhya Saini, Jian Chang and Hammadi Nait-Charif AssetMask: Mask R-CNN-based approach for Asset detection in railroad track health monitoring (abstract) PRESENTER: Aradhya Saini
17:44	Ehtzaz Chaudhry, Cliff Kilgore, Michele Board, Jian Chang, Michael Board and Jian Jun Zhang LLM-Powered VR Nursing Training for Dynamic Risk Assessment (abstract) PRESENTER: Ehtzaz Chaudhry

19:00-20:15 Sightseeing tour of Strasbourg by boat

20:30-22:00 Dinner at the Kammerzell restaurant

Wednesday, June 4th


08:30-09:30 Session 10: Keynote: Jehee Lee

Generative GaitNet and Beyond: Foundational Models for Human Motion Analysis and Simulation by Prof. Jehee Lee (Seoul National University)

Understanding the relationship between human anatomy and motion is fundamental to effective gait analysis, realistic motion simulation, and the creation of human body digital twins. We will begin with Generative GaitNet (SIGGRAPH 2022), a foundational model for human gait that drives a comprehensive full-body musculoskeletal system comprising 304 Hill-type musculotendons. Generative GaitNet is a pre-trained, integrated system of artificial neural networks that operates in a 618-dimensional continuous space defined by anatomical factors (e.g., mass distribution, body proportions, bone deformities, and muscle deficits) and gait parameters (e.g., stride and cadence). Given specific anatomy and gait conditions, the model generates corresponding gait cycles via real-time physics-based simulation. Next, we will discuss Bidirectional GaitNet (SIGGRAPH 2023), which consists of forward and backward models. The forward model predicts the gait pattern of an individual based on their physical characteristics, while the backward model infers physical conditions from observed gait patterns. Finally, we will present MAGNET (Muscle Activation Generation Networks)—another foundational model (SIGGRAPH 2025)—designed to reconstruct full-body muscle activations across a wide range of human motions. We will demonstrate its ability to accurately predict muscle activations from motions captured in video footage. We will conclude by discussing how these foundational models collectively contribute to the development of human body digital twins, and explore their future potential in personalized rehabilitation, surgery planning, and human-centered simulation.

Chair:

Hyewon Seo

Location: Amphie 23

09:35-10:50 Session 11A: Detection & Recognition

Chair:

Michael Adjeisah

Location: Amphie 23

09:35	Hongyu Liu and Zhenyu Gu Perspective Matters: Investigating the Effects of Vibrotactile Mode Design on User Experience in Action-Role Playing Game and Media PRESENTER: Hongyu Liu
09:53	Yejuan Xie, Xinrui Wu, Yichen Zhang, Rongrong Chen, Tulika Saha, Yuehan Dou and Chengtao Ji Exploring Cultural Heritage with AR: The TAM Case Study of Nvshu PRESENTER: Yejuan Xie
10:11	Lanqi Xu, Yifan Zhang, Xu Lang, Jianing Liu, Baiheng Liu, Xianxuan Lin, Jing Zhang, Zheng Wang and Tianming Wu A Design Study on Contextual and Interactive Serious Games for Children’s Learning of Chinese Character Culture PRESENTER: Xu Lang
10:29	Siyao Du, Haoxiang Yang, Yajie Deng, Liuxuan Xie, Yanzhe Kong, Haohan Zhang and Hammadi Nait-Charif Summon Arcane: An AI-Driven Pixel Art Game with Interactive Narrative and Immersive Summoning Experience PRESENTER: Haoxiang Yang

09:35-10:50 Session 11B: AR/VR for Interaction

Chair:

Frederic Cordier

Location: Amphie 24

09:35	Rui Liu, Fangbo Lu, Wanchuang Luo, Tianjian Cao, Hailian Xue and Meili Wang YOLOv8-HAC: Safety helmet detection model for complex underground coal mine scene (abstract) PRESENTER: Rui Liu
10:00	Zhongguang Zhang, Tingwei Wu, Qifei Zhang, Li Wang and Zhao Wang STA-TAD: Spatial-Temporal Adapter on ViT for Temporal Action Detection PRESENTER: Tingwei Wu
10:25	Xiaohui Tan, Weiqi Xu, Jiazheng Wu and Qichuan Geng AU-guided Feature Aggregation for Micro-Expression Recognition (abstract) PRESENTER: Weiqi Xu

10:50-11:05 Coffee break

11:05-12:20 Session 12A: Cross-Modal and Semantic Representation Learning

4 LNCS

Chair:

Xiaosong Yang

Location: Amphie 23

11:05	Haoyuan Du, Xia Yu, Wei Yu, Dan Xue and Yuhan Lin Potential Representation Learning for Visible-Infrared Person Re-Identification in Virtual Surveillance Systems PRESENTER: Haoyuan Du
11:30	Xudong He, Li Wang, Zhao Wang and Jun Xiao Hybrid-Granularity Image-Music Retrieval Using Contrastive Learning between Images and Music PRESENTER: Xudong He
11:55	Yudai Ichimura and Syuhei Sato Text-driven Tree Modeling via CLIP-based Optimization PRESENTER: Yudai Ichimura

11:05-12:20 Session 12B: Image Restoration & Enhancement

2 CAVW 2 LNCS

Chair:

Bin Sheng

Location: Amphie 24

11:05	Hangbin Xu, Changjun Zou and Chuchao Lin UTMCR: 3U-Net Transformer with Multi-Contrastive Regularization for Single Image Dehazing (abstract) PRESENTER: Hangbin Xu
11:23	Chuchao Lin, Changjun Zou and Hangbin Xu SCNet: A Dual-Branch Network for Strong Noisy Image Denoising Based on Swin Transformer and ConvNeXt (abstract) PRESENTER: Chuchao Lin
11:41	Xun Chen, Yushi Li, Yunyao Shen, Rong Chen, Chao Xu, Xiaobo Jin, Along Jin and Yu Han ShadowCraft-NeRF: Occlusion and Shadow Mitigation via SAM-Guided NeRF PRESENTER: Xun Chen
11:59	Haoran Jia, Chen Baijun and Nan Xiang Visualizing the Invisible: An Efficient Framework for Microscopic Visualization (abstract) PRESENTER: Haoran Jia

12:20-14:00 Lunch

14:00-15:40 Session 13A: Human Motion & Gesture Synthesis

CAVW

Chair:

Hyewon Seo

Location: Amphie 23

14:00	Ghazanfar Ali, Hwangyoun Kim and Jae-In Hwang RIDGE: Rule-Infused Deep Learning for Realistic Co-Speech Gesture Generation (abstract) PRESENTER: Ghazanfar Ali
14:20	Jiawen Peng, Zhuoran Liu, Jingzhong Lin and Gaoqi He Precise Motion Inbetweening via Bidirectional Autoregressive Diffusion Models (abstract) PRESENTER: Jiawen Peng
14:40	Rui Zeng, Ju Dai, Junxuan Bai and Junjun Pan Motion In-betweening via Recursive Keyframe Prediction (abstract) PRESENTER: Rui Zeng
15:00	Hong Son Nguyen, Daeun Cheong, Andrew Chalmers, Myoung Gon Kim, Taehyun Rhee and Junghyun Han Interaction with Virtual Objects using Human Pose and Shape Estimation (abstract) PRESENTER: Hong Son Nguyen
15:20	Siyao Du, Boyuan Cheng, Yi Wen, Zixuan Zhou and Xiaosong Yang Motion Style Transfer: Methods, Challenges, and Future Directions PRESENTER: Siyao Du

14:00-15:40 Session 13B: 3D Reconstruction & Representation

CAVW

Chair:

Zhao Wang

Location: Amphie 24

14:00	Haowei Xue and Meili Wang LGNet: Local-and-Global Feature Adaptive Network for Single Image Two-Hand Reconstruction (abstract) PRESENTER: Haowei Xue
14:25	Mengyao Zhang, Jie Zhou, Tingyun Miao, Yong Zhao, Xin Si and Jingliang Zhang Joint-learning: A Robust Segmentation Method for 3D Point Clouds under Label Noise (abstract) PRESENTER: Tingyun Miao
14:50	Huiqiang Hu, Changyan He, Xiaojun Liu, Jinyuan Jia and Ting Yu Weisfeiler-Lehman kernel augmented product representation for queries on large-scale BIM scenes (abstract) PRESENTER: Xiaojun Liu
15:15	Xinying Dai and Li Yao DTGS: Defocus-Tolerant View Synthesis using Gaussian Splatting (abstract) PRESENTER: Xinying Dai

15:40-15:55 Session 14: Best Paper Awards and Closing

Chair:

Frederic Cordier

Location: Amphie 23