Runyi Li - Homepage

Education & Experience

Academic training first, then research visits and internships.

Education

Ph.D. Incoming 2026, Department of Information Science and Technology, University of Tokyo. Supervised by Prof. Toshihiko Yamasaki.
M.S. Peking University, 2026. Supervised by Prof. Jian Zhang.
B.E. Sichuan University, 2023.

Research Visits & Internships

ByteDance: Research intern at TikTok Group, working on unified vision understanding and generation with multimodal LLMs and diffusion models.
Würzburg: Visiting student at the University of Würzburg, working with Prof. Radu Timofte on generative AI for low-level vision.
RabbitPre: Intern working on trustworthy AI and AIGC protection.

Research

A compact map of the themes that connect my recent projects.

Current Direction

I work on generative models that can understand, synthesize, and protect visual content. Recently, my projects have covered autoregressive visual generation, diffusion-based image restoration, multimodal forgery analysis, watermarking, and 3D Gaussian representations.

I am especially interested in models that plan before they generate, preserve identity and provenance, and remain useful under realistic degradation or manipulation.

Autoregressive Generation

Visual generation models with planning, foresight, and stronger temporal structure.

Low-Level Vision

Diffusion-based super-resolution, restoration, and controllable fidelity-realness trade-offs.

Trustworthy AI

Forgery localization, copyright protection, watermarking, and source attribution.

3D & Multimodal AI

3D Gaussian splatting, multimodal LLM reasoning, and visual-audio protection.

Publications

Selected papers grouped by research theme. Equal contribution is marked with *.

Generative AI and Autoregressive Models

Video-Mirai: Autoregressive Video Diffusion Models Need Foresight

Yonghao Yu, Lang Huang, Runyi Li, Zerun Wang, Toshihiko Yamasaki

arXiv 2026

Paper Project

We close the representation-level planning gap in causal video generators by supervising current states with future-frame foresight during training.

Mirai: Autoregressive Visual Generation Needs Foresight

Yonghao Yu, Lang Huang, Zerun Wang, Runyi Li, Toshihiko Yamasaki

CVPR 2026

Paper Project Code

This study highlights that visual autoregressive models need foresight.

World Models and Agentic AI

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Meng Chu, Xuan Billy Zhang, Kevin Qinghong Lin, ..., Runyi Li, ..., Jiaya Jia

arXiv 2026

Paper Homepage Resources

We synthesize foundations, capability levels, governing laws, evaluation principles, and open problems for agentic world modeling.

Generative AI for Low-Level Vision

OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model

Runyi Li *, Xuhan Sheng *, Weiqi Li, Jian Zhang

ECCV 2024 Oral

Paper Project Code

We achieve zero-shot omnidirectional image super-resolution with both fidelity and realness.

RealOSR: Latent Guidance Boosts Diffusion-based Real-world Omnidirectional Image Super-Resolution

Xuhan Sheng *, Runyi Li *, Bin Chen, Weiqi Li, Xu Jiang, Jian Zhang

IEEE JCSVT submission

Paper

We introduce RealOSR, an efficient diffusion-based framework for real-world omnidirectional image super-resolution with latent guidance.

CTSR: Controllable Fidelity-Realness Trade-off Distillation for Real-World Image Super Resolution

Runyi Li, Bin Chen, Jian Zhang, Radu Timofte

arXiv 2025

Paper

We achieve controllable real-world image super-resolution, striking a trade-off between fidelity and realness.

LAFR: Efficient Diffusion-based Blind Face Restoration via Latent Codebook Alignment Adapter

Runyi Li, Bin Chen, Jian Zhang, Radu Timofte

arXiv 2025

Paper

We achieve efficient real-world face restoration via latent-space alignment, using only 600 training images from FFHQ.

Generative AI for Trustworthiness and Safety

FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models

Zhipei Xu, Xuanyu Zhang, Runyi Li, Zecheng Tang, Qing Huang, Jian Zhang

ICLR 2025

Paper Code Project

We build an explainable framework for image manipulation localization via multimodal LLMs.

OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking

Xuanyu Zhang, Zecheng Tang, Zhipei Xu, Runyi Li, Youmin Xu, Bin Chen, Feng Gao, Jian Zhang

CVPR 2025

Paper

We propose OmniGuard, a hybrid manipulation localization framework built on augmented versatile deep image watermarking.

GS-Hider: Hiding Messages into 3D Gaussian Splatting

Xuanyu Zhang, Jiarui Meng, Runyi Li, Zhipei Xu, Yongbing Zhang, Jian Zhang

NeurIPS 2024

Project Paper

We propose a 3DGS steganography framework that hides a 3D scene or image in the original 3D scene and decodes it from 3D Gaussians.

EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection

Xuanyu Zhang, Runyi Li, Jiwen Yu, Youmin Xu, Weiqi Li, Jian Zhang

CVPR 2024

Paper Project Code

We propose a versatile deep forensic watermark against AIGC editing methods such as Stable Diffusion inpainting, ControlNet, and SDXL.

V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection

Xuanyu Zhang, Youmin Xu, Runyi Li, Jiwen Yu, Weiqi Li, Zhipei Xu, Jian Zhang

ACM MM 2024

Paper

We propose a versatile deep forensic watermark against AIGC editing methods for video and audio.

GaussianSeal: Rooting Adaptive Watermarks for 3D Gaussian Generation Model

Runyi Li, Xuanyu Zhang, Chuhan Tong, Zhipei Xu, Jian Zhang

Machine Intelligence Research

Paper

We propose an adaptive watermarking framework for 3D Gaussian generation models.

Protect-Your-IP: Scalable Source-Tracing and Attribution against Personalized Generation

Runyi Li, Xuanyu Zhang, Zhipei Xu, Yongbing Zhang, Jian Zhang

IEEE TIFS submission

Paper

We propose an IP protection framework against personalized generation, jointly supporting source tracing and scalable attribution.

Publications Before 2023

Effectiveness of eHealth Self-management Interventions in Patients with Heart Failure: Systematic Review and Meta-analysis

Siru Liu, Jili Li, Dingyuan Wan, Runyi Li, Zhan Qu, Yundi Hu, Jialin Liu

Journal of Medical Internet Research, 2022

Paper

This study systematically reviews evidence for the effectiveness of eHealth self-management in heart failure patients.

Breast Cancer X-ray Image Staging Based on EfficientNet with Multi-scale Fusion and CBAM Attention

Runyi Li, Sen Wang, Zizhou Wang, Lei Zhang

Journal of Physics, 2021

Paper

We applied EfficientNet with CBAM attention to breast cancer medical image classification.

Image Caption and Medical Report Generation Based on Deep Learning: A Review and Algorithm Analysis

Runyi Li, Zizhou Wang, Lei Zhang

IEEE CISAI, 2021

Paper

A concise review of image captioning methods and medical report generation.

Academic Service

Reviewer for ICLR 2025 & 2026, AAAI 2026, CVPR 2025 & 2026, NeurIPS 2025 & 2026, ICCV 2025, ACM MM 2025, IEEE JSTSP, and IEEE TIFS.

Selected Honours

2026 Outstanding Graduate, Peking University
2025 Merit Student, Peking University
2025 Scholarship of Ping'An Bank, Peking University & Ping'An Bank
2024 Merit Student, Peking University
2024 Scholarship of Public Welfare, Peking University & Guangdong Province
2023 Outstanding Graduate, Sichuan Province
2023 Outstanding Graduate, Sichuan University
2023 National Scholarship, Ministry of Education of China
2021 Bronze Award, Kaggle NFL Contest 2021
2021 Scholarship of Soochow-Singapore Industrial Park, Soochow City & Sichuan University

Runyi Li 李潤一
