Welcome to my homepage! I am currently an Assistant Professor in the Department of Computer Science and Engineering at Texas A&M University. Previously, I spent two wonderful years as a Postdoctoral Fellow in the Robotics Institute at Carnegie Mellon University, working with Fernando De la Torre. I completed my Ph.D. in the Department of Computer Science and Engineering at The Ohio State University with Harry Chao and Dong Xuan. Before that, I received my M.S. and B.Eng. degrees from BUPT and Tianjin University, respectively. I have spent time interning at NVIDIA AV, Volvo Cars, FXPAL, and Alibaba. For more information, please see a copy of my CV (Last update: 08/2024).
My research interests are machine learning and its applications to computer vision, multimodal understanding, human modeling for extended reality, and cyber-physical systems. More updates are coming soon!
08/2024 Teaching CSCE 689 Special Topics in Vision Foundation Models in Fall 2024
07/2024 FabricDiffusion on high-fidelity garment generation accepted to SIGGRAPH Asia 2024
07/2024 Two papers accepted to ECCV 2024: Generalizable Human Gaussians and LLM-powered Text-to-Image Generation
02/2024 One paper accepted to CVPR 2024 on reliable human image generation
10/2023 ITI-GEN was a Best Paper Finalist at ICCV 2023 (one of 17 papers out of 8260 submissions)
08/2023 ITI-GEN on Inclusive Text-to-Image Generation accepted to ICCV 2023 as an Oral Presentation
07/2022 Defended my PhD dissertation on "Learning with Imperfect Data and Supervision for Visual Perception and Understanding"
07/2022 Check out what our Buckeye AutoDrive Team has done in the Year 1 AutoDrive Challenge II
07/2022 Recognized as an outstanding reviewer (Top 10%) by ICML 2022
07/2022 One paper accepted to ECCV 2022
03/2022 One paper accepted to CVPR 2022 as an oral presentation
03/2022 Honored to receive the Graduate Research Award (Mike Liu Scholarship) from our CSE department
11/2021 Recognized as an outstanding reviewer by BMVC 2021
10/2021 Invited research talk at LVIS challenge workshop
10/2021 Selected to participate in the Doctoral Consortium at ICCV 2021
10/2021 Excited to be part of Buckeye AutoDrive to compete in SAE AutoDrive Challenge II
09/2021 NorCal on model calibration for long-tailed object detection accepted to NeurIPS 2021
09/2021 SimpleAug on visual question answering accepted to EMNLP 2021
07/2021 MosaicOS on long-tailed object detection accepted to ICCV 2021
05/2021 Joined NVIDIA autonomous vehicle team as a perception intern
02/2021 Check out our efforts on ultrasonic-based contact tracing (OSU News, project) and WLAN-log-based superspreader detection
07/2020 One paper accepted to BMVC 2020 on disease localization
06/2020 Started my internship at Volvo Cars AI Research, working on 3D object detection for autonomous driving
03/2020 CSE annual research poster exhibition award (1st place)
07/2019 One paper accepted as an oral presentation to BMVC 2019
05/2019 Summer internship at FX Palo Alto Laboratory (FXPAL), working on AI in medicine & imaging
Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation Junyan Wang, Zhenhong Sun, Zhiyu Tan, Xuanbai Chen, Weihua Chen, Hao Li, Cheng Zhang, Yang Song Conference on Computer Vision and Pattern Recognition (CVPR), 2024 Paper arXiv Project Code
ITI-GEN: Inclusive Text-to-Image Generation Cheng Zhang, Xuanbai Chen, Siqi Chai, Chen Henry Wu, Dmitry Lagun, Thabo Beeler, Fernando De la Torre International Conference on Computer Vision (ICCV), 2023 (Oral Presentation, Best Paper Finalist) Paper arXiv Project Code
Learning with Free Object Segments for Long-Tailed Instance Segmentation Cheng Zhang, Tai-Yu Pan, Tianle Chen, Jike Zhong, Wenjin Fu, Wei-Lun Chao European Conference on Computer Vision (ECCV), 2022 Paper arXiv
On Model Calibration for Long-Tailed Object Detection and Instance Segmentation Tai-Yu Pan*, Cheng Zhang*, Yandong Li, Hexiang Hu, Dong Xuan, Soravit Changpinyo, Boqing Gong, Wei-Lun Chao Conference on Neural Information Processing Systems (NeurIPS), 2021 Paper arXiv Code
Discovering the Unknown Knowns: Turning Implicit Knowledge in the Dataset into Explicit Training Examples for Visual Question Answering Jihyung Kil, Cheng Zhang, Dong Xuan, Wei-Lun Chao Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021 Paper arXiv Code
MosaicOS: A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection Cheng Zhang*, Tai-Yu Pan*, Yandong Li, Hexiang Hu, Dong Xuan, Soravit Changpinyo, Boqing Gong, Wei-Lun Chao International Conference on Computer Vision (ICCV), 2021 Paper arXiv Code Poster
WLAN-Log-Based Superspreader Detection in the COVID-19 Pandemic Cheng Zhang, Yunze Pan, Yunqi Zhang, Adam C Champion, Zhaohui Shen, Dong Xuan, Zhiqiang Lin, Ness Shroff Elsevier High-Confidence Computing Journal (HCC), 2021 Paper arXiv
Thoracic Disease Identification and Localization using Distance Learning and Region Verification Cheng Zhang, Francine Chen, Yan-Ying Chen British Machine Vision Conference (BMVC), 2020 Paper arXiv Talk Patent (Filed by Fujifilm in US, JP, and CN)
An Empirical Study on Leveraging Scene Graphs for Visual Question Answering Cheng Zhang, Wei-Lun Chao, Dong Xuan British Machine Vision Conference (BMVC), 2019 (Oral Presentation) Paper arXiv Poster Talk
MV-Sports: A Motion and Vision Sensor Integration-Based Sports Analysis System Cheng Zhang, Fan Yang, Gang Li, Qiang Zhai, Yi Jiang, Dong Xuan International Conference on Computer Communications (INFOCOM), 2018 Paper Demo