Fulong Ye

I am currently a Researcher at the ByteDance Seed Vision Application Team, working on Seedance 2.0. Previously, I was part of the ByteDance Intelligent Creation Team, where I focused on human-centric generation and editing tasks, with my primary contribution being the DreamID series (including DreamID, DreamID-V and DreamID-Omni). Prior to that, I interned at BAAI and Zhipu AI, working on foundational image generation models.

I received my Master's degree from BUPT in 2024, under the supervision of Prof.Xiaojie Wang, and my Bachelor's degree from Xidian University in 2021.

My research interests lie in Large Multimodal Models, as well as all product-related topics associated with multimodality. I am always open to collaborations and discussions on multimodal technology and product innovation. Feel free to reach out!

News

[03/18/2026] DreamID-Omni is supported in ComfyUI!
[03/13/2026] DreamID-Omni is supported in vllm!
[03/13/2026] We release DreamID-Omni!
[02/12/2026] Seedance 2.0 is Online!
[01/08/2026] DreamID-V is supported in ComfyUI!
[01/05/2026] We release DreamID-V!

Selected Online Projects

Seedance2.0 🔥

Unified multimodal audio-video joint generation Model

DreamID 👑

Personalized AI Photo in Jimeng

DreamID-V 👸

Personalized AI Video in CapCut

Video Transfer⚡️

Video Effects Transfer in Douyin

Personalized AI Image 👸

AI Image in Douyin

Wool curls for everything! 🧑‍🦱

Video Effects in CapCut

Publications

(* equal contribution)

DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation
Xu Guo*, Fulong Ye*, Qichao Sun*, Liyang Chen, Bingchuan Li, Pengze Zhang, Jiawei Liu, Songtao Zhao, Qian He, Xiangwang Hou
Tech Report, 2026
Paper (arXiv) / Project Page / 🔥Codes (GitHub)

DreamID-V:Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
Xu Guo*, Fulong Ye*, Xinghui Li*, Pengqi Tu, Pengze Zhang, Qichao Sun, Songtao Zhao, Qian He
Tech Report, 2026
Paper (arXiv) / Project Page / 🔥Codes (GitHub)

OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer
Pengze Zhang, Yanze Wu, Mengtian Li, Xu Bai, Songtao Zhao, Fulong Ye, Chong Mou, Xinghui Li, Zhuowei chen, Qian He, Mingyun Gao
CVPR, 2026
Paper (arXiv) / Project Page / 🔥Codes (GitHub)

InstructX: Towards Unified Visual Editing with MLLM Guidance
Chong Mou, Qichao Sun, Yanze Wu, Pengze Zhang, Xinghui Li, Fulong Ye, Songtao Zhao, Qian He
Tech Report, 2025
Paper (arXiv) / Project Page / 🔥Codes (GitHub)

DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning
Fulong Ye, Miao Hua, Pengze Zhang, Xinghui Li, Qichao Sun, Songtao Zhao, Qian He, Xinglong Wu
Siggrah Asia, 2025
Paper (arXiv) / Project Page / 🔥Codes (GitHub)

AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models
Xinghui Li, Qichao Sun, Pengze Zhang, Fulong Ye, Zhichao Liao, Wanquan Feng, Songtao Zhao, Qian He
CVPR, 2025
Paper (arXiv) / Project Page / 🔥Codes (GitHub)

AltDiffusion: A Multilingual Text-to-Image Diffusion Model
Fulong Ye, Guang Liu, Xinya Wu, Ledell Wu
AAAI, 2024
Paper (arXiv) / 🔥Codes (GitHub)

/ 🤗 demo

AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities
Zhongzhi Chen, Guang Liu, Bo-Wen Zhang, Fulong Ye, Qinghong Yang, Ledell Wu
ACL, 2023
Paper (arXiv) / 🔥Codes (GitHub)

Whether you can locate or not? Interactive Referring Expression Generation
Fulong Ye, Yuxing Long, Fangxiang Feng, Xiaojie Wang
ACM MM, 2023
Paper (arXiv)

SPRING: Situated Conversation Agent Pretrained with Multimodal Questions from Incremental Layout Graph
Yuxing Long, Binyuan Hui, Fulong Ye, Yangyang Li, Zhuoxin Han, Caixia Yuan, Yongbin Li, Xiaojie Wang
AAAI, 2023
Paper (arXiv)

Awards

2019.11 National First Prize, National College Students Mathematical Modeling Competition
2018.10 Champion of the University Group, the 40th Hong Kong Rowing Championships
2023.12 Schlumberger Corporate Scholarship, School of Artificial Intelligence, BUPT (Top 3% of the school)

Experiences

ByteDance

Jan. 2026 - Now

Seed Vision Application Team

ByteDance

Apr. 2024 - Jan. 2025

Intelligent Creation Team

Zhipu AI

Nov. 2023 - Mar. 2024

AI Lab

BAAI

Oct. 2022 - Nov. 2023

NLP & Multimodal Team

Contact

I can be contacted directly at fulong_ye [at] 163.com. I typically respond within a few days.