My name is Hongbo Kang (康洪菠). I am a Ph.D. candidate at Tianjin University, advised by Prof. Kun Li. I also collaborate closely with Prof. Yu-Kun Lai at Cardiff University. I received my M.S. from Chongqing University of Technology, co-supervised by Prof. Yong Wang and Prof. Wenming Yang (Tsinghua University), and my B.Eng. from Jishou University.

My research focuses on human-centered reconstruction and simulation, exploring data-driven methods for the perception and decision-making of individuals and crowds. Specifically, it covers individual and crowd reconstruction, crowd simulation and the construction of virtual crowd datasets, and closed-loop crowd simulation in autonomous driving scenarios.

🔥 News

  • 2026.05:  🎉 My paper “Crowd4D: Scene-Aware Monocular 4D Crowd Reconstruction” has been accepted to ICML!
  • 2026.02:  🎉 Our paper “MuRE: Multi-Relationship Encoder for 3D Human Pose Estimation” has been accepted to CVIU!
  • 2026.01:  🎉 Our paper “DRPose: A Diffusion-based Pose Refinement Framework for 3D Human Pose Estimation” has been accepted to TCSVT!
  • 2025.12:  🏆 I am honored to have been selected for the CAST Youth Science and Technology Talent Cultivation Project (Doctoral Student Special Program)!
  • 2025.12:  🎉 Our paper “DBMambaPose: Decoupled Spatial-Temporal Bidirectional State Space Model for Efficient 3D Human Pose Estimation” has been accepted to PR!
  • 2025.08:  🎉 My paper “DyCrowd: Towards Dynamic Crowd Reconstruction from a Large-scene Video” has been accepted to TPAMI!
  • 2025.07:  🎉 Our paper “RESCUE: Crowd Evacuation Simulation via Controlling SDM-United Characters” has been accepted to ICCV as a Highlight!
  • 2025.05:  🎉 My paper “Double-chain Graph Convolution Transformer for 3D Human Pose Estimation” has been accepted to TMM!
  • 2024.09:  📌 I started my Ph.D. in Prof. Kun Li’s team at Tianjin University.

📝 Publications

  • Selected Publications (* Co-first author, ✉️ Corresponding author)
ICML

Crowd4D: Scene-Aware Monocular 4D Crowd Reconstruction

Hongbo Kang, Tianyi Zhou, Qingyang Yang, Hongwei Wen, Jing Huang, Yu-Kun Lai, Kun Li✉️

Abstract: Recovering scene-consistent 4D crowd motion from monocular video in large-scale scenes remains challenging due to severe depth ambiguity and complex s...

International Conference on Machine Learning, 2026 CCF-A [Project Page][Code]

TCSVT

DRPose: A Diffusion-based Pose Refinement Framework for 3D Human Pose Estimation

Yong Wang*, Xuguang Liu*, Xiaoqing Wang, Doudou Wu, Wenming Yang, Hongbo Kang✉️

Abstract: Recently, two-stage 3D human pose estimation using monocular cameras has gained significant attention. However, the inherent uncertainty in the upscal...

IEEE Transactions on Circuits and Systems for Video Technology, 2026 CCF-B [Code]

TPAMI

DyCrowd: Towards Dynamic Crowd Reconstruction from a Large-scene Video

Hao Wen*, Hongbo Kang*, Jian Ma, Jing Huang, Yuanwang Yang, Haozhe Lin, Yu-Kun Lai, Kun Li✉️

Abstract: 3D reconstruction of dynamic crowds in large scenes has become increasingly important for applications such as city surveillance and crowd analysis. H...

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025 CCF-A [Project Page][Code]

ICCV

RESCUE: Crowd Evacuation Simulation via Controlling SDM-United Characters

Xiaolin Liu*, Tianyi Zhou*, Hongbo Kang, Jian Ma, Ziwen Wang, Jing Huang, Wenguo Weng, Yu-Kun Lai, Kun Li✉️

Abstract: Crowd evacuation simulation is critical for enhancing public safety, and demanded for realistic virtual environments. However, existing methods fail t...

International Conference on Computer Vision, 2025 CCF-A Highlight [Project Page][Code]

TMM

Double-chain Graph Convolution Transformer for 3D Human Pose Estimation

Hongbo Kang, Yong Wang✉️, Mengyuan Liu, Doudou Wu, Peng Liu, Wenming Yang

Abstract: Reconstructing 3D poses from 2D poses lacking depth information is particularly challenging due to the complexity and diversity of human motion. The k...

IEEE Transactions on Multimedia, 2025 CCF-A [Code]

ICASSP

Diffusion-based Pose Refinement and Multi-Hypothesis Generation for 3D Human Pose Estimation

Hongbo Kang, Yong Wang✉️, Mengyuan Liu, Doudou Wu, Peng Liu, Xinlin Yuan, Wenming Yang

Abstract: Previous probabilistic models for 3D Human Pose Estimation (3DHPE) aimed to enhance pose accuracy by generating multiple hypotheses. However, most of ...

International Conference on Acoustics, Speech, and Signal Processing, 2024 CCF-B [Code]

TMM

Global and local spatio-temporal encoder for 3D human pose estimation

Yong Wang*, Hongbo Kang*✉️, Doudou Wu, Wenming Yang, Longbin Zhang

Abstract: Transformers have been used for 3D human pose estimation with excellent performance; however, most transformers focus on encoding the global spatio-te...

IEEE Transactions on Multimedia, 2023 CCF-A

  • Other Publications

CVIU MuRE: Multi-Relationship Encoder for 3D Human Pose Estimation

Yong Wang, Doudou Wu, Hongbo Kang, Peng Liu, Wenming Yang✉️

Abstract: In the mission of 2D-to-3D lifting for human pose estimation, the self-attention mechanism has demonstrated remarkable efficacy in capturing global in...

Computer Vision and Image Understanding, 2026 CCF-B

PR DBMambaPose: Decoupled Spatial-Temporal Bidirectional State Space Model for Efficient 3D Human Pose Estimation

Xiaoqing Wang*, Yong Wang*, Xuguang Liu, Hongbo Kang, Wenming Yang✉️

Abstract: Transformer-based 3D human pose estimation (HPE) methods face efficiency-accuracy trade-offs due to self-attention’s quadratic complexity. While State...

Pattern Recognition, 2026 CCF-B [Code]

Neurocomputing ICFNet: Interactive-complementary fusion network for monocular 3D human pose estimation

Yong Wang*, Peng Liu*✉️, Hongbo Kang, Doudou Wu, Duoqian Miao

Abstract: Most existing methods for 3D human pose estimation from monocular images focus on learning the spatial correlation of either the global or local joint...

Neurocomputing, 2025 CCF-C [Code]

DCN Hierarchical flow learning for low-light image enhancement

Xinlin Yuan, Yong Wang, Yan Li, Hongbo Kang, Yu Chen, Boran Yang

Abstract: Low-light images often have defects such as low visibility, low contrast, high noise, and high color distortion compared with well-exposed images. If ...

Digital Communications and Networks, 2024

🎖 Awards

  • 2025.12 CAST Youth Science and Technology Talent Cultivation Project Doctoral Student Special Program
  • 2024.06 Outstanding Graduate Student of Chongqing (Top 1%)
  • 2023.10 National Scholarship (Top 1%)
  • 2021.10 PaddlePaddle Developers Experts (PPDE) [AI Studio]

🎓 Academic Service

  • Reviewer: IJCV, TMM, TCSVT, PR, MM, etc.