Cheng Zhang, Texas A&M University

chzhang at tamu dot edu

Google Scholar | GitHub

About Me


Welcome to my homepage. I am an Assistant Professor in the Department of Computer Science and Engineering at Texas A&M University. Before joining TAMU, I spent two wonderful years as a Postdoctoral Fellow in the Robotics Institute at Carnegie Mellon University, working with Fernando De la Torre. I did my Ph.D. in the Department of Computer Science and Engineering at The Ohio State University with Harry Chao and Dong Xuan. I received my M.S. and B.Eng. degrees from BUPT and Tianjin University, respectively. I have spent time interning at NVIDIA AV, Volvo Cars, FXPAL, and Alibaba. For more information, please see a copy of my CV (Last update: 08/2024).

My research interests are machine learning and its applications to computer vision, multimodal understanding, human modeling for extended reality, and cyber-physical systems. More updates are coming soon.

If you are interested in joining our group, please read this.

News


09/2024     One paper accepted to NeurIPS 2024 on mitigating concept co-occurrence biases in visual data

08/2024     Teaching CSCE 689 Special Topics in Vision Foundation Models in Fall 2024

07/2024     FabricDiffusion on high-fidelity garment generation accepted to SIGGRAPH Asia 2024

07/2024     Two papers accepted to ECCV 2024: Generalizable Human Gaussians and LLM-powered Text-to-Image Generation

02/2024     One paper accepted to CVPR 2024 on reliable human image generation

10/2023     ITI-GEN was a Best Paper Finalist at ICCV 2023 (one of 17 papers out of 8260 submissions)

08/2023     ITI-GEN on Inclusive Text-to-Image Generation accepted to ICCV 2023 as Oral Presentation

07/2022     Defended my PhD dissertation on "Learning with Imperfect Data and Supervision for Visual Perception and Understanding"

07/2022     Check out what our Buckeye AutoDrive Team has done in the Year 1 AutoDrive Challenge II

07/2022     Recognized as an outstanding reviewer (Top 10%) by ICML 2022

07/2022     One paper accepted to ECCV 2022

03/2022     One paper accepted to CVPR 2022 as an oral presentation

03/2022     Honored to receive Graduate Research Award (Mike Liu Scholarship) from our CSE department

11/2021     Recognized as an outstanding reviewer by BMVC 2021

10/2021     Invited research talk at LVIS challenge workshop

10/2021     Selected to participate in the Doctoral Consortium at ICCV 2021

10/2021     Excited to be part of Buckeye AutoDrive to compete in SAE AutoDrive Challenge II

09/2021     NorCal on model calibration for long-tailed object detection accepted to NeurIPS 2021

09/2021     SimpleAug on visual question answering accepted to EMNLP 2021

07/2021     MosaicOS on long-tailed object detection accepted to ICCV 2021

05/2021     Joined NVIDIA autonomous vehicle team as a perception intern

02/2021     Check out our efforts on ultrasonic-based contact tracing (OSU News, project) and WLAN-log-based superspreader detection

07/2020     One paper accepted to BMVC 2020 on disease localization

06/2020     Started my internship at Volvo Cars AI Research, working on 3D object detection for autonomous driving

03/2020     CSE annual research poster exhibition award (1st place)

07/2019     One paper accepted to BMVC 2019 as an oral presentation

05/2019     Summer internship at FX Palo Alto Laboratory (FXPAL), working on AI in medicine & imaging

Papers  


        Visual Data Diagnosis and Debiasing with Concept Graphs
Rwiddhi Chakraborty, Yinong Wang, Jialu Gao, Runkai Zheng, Cheng Zhang, Fernando De la Torre
Advances in Neural Information Processing Systems (NeurIPS), 2024

Paper    arXiv    Code   


        FabricDiffusion: High-Fidelity Texture Transfer for 3D Garments Generation from In-The-Wild Images
Cheng Zhang, Yuanhao Wang, Francisco Vicente, Chenglei Wu, Jinlong Yang, Thabo Beeler, Fernando De la Torre
ACM SIGGRAPH Asia, 2024

Paper    arXiv    Project    Code   


        Generalizable Human Gaussians for Sparse View Synthesis
Youngjoong Kwon, Baole Fang, Yixing Lu, Haoye Dong, Cheng Zhang, Francisco Vicente, Albert Mosella-Montoro, Jianjin Xu, Shingo Takagi, Daeil Kim, Aayush Prakash, Fernando De la Torre
European Conference on Computer Vision (ECCV), 2024

Paper    arXiv    Project    Code   


        An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation
Zhiyu Tan, Mengping Yang, Luozheng Qin, Hao Yang, Ye Qian, Qiang Zhou, Cheng Zhang, Hao Li
European Conference on Computer Vision (ECCV), 2024

Paper    arXiv    Project    Code   


        Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation
Junyan Wang, Zhenhong Sun, Zhiyu Tan, Xuanbai Chen, Weihua Chen, Hao Li, Cheng Zhang, Yang Song
Conference on Computer Vision and Pattern Recognition (CVPR), 2024

Paper    arXiv    Project   


        ITI-GEN: Inclusive Text-to-Image Generation
Cheng Zhang, Xuanbai Chen, Siqi Chai, Chen Henry Wu, Dmitry Lagun, Thabo Beeler, Fernando De la Torre
International Conference on Computer Vision (ICCV), 2023 (Oral Presentation, Best Paper Finalist)

Paper    arXiv    Project    Code   


        Learning with Free Object Segments for Long-Tailed Instance Segmentation
Cheng Zhang, Tai-Yu Pan, Tianle Chen, Jike Zhong, Wenjin Fu, Wei-Lun Chao
European Conference on Computer Vision (ECCV), 2022

Paper    arXiv   


        Fingerprinting Deep Neural Networks Globally via Universal Adversarial Perturbations
Zirui Peng*, Shaofeng Li*, Guoxing Chen, Cheng Zhang, Haojin Zhu, Minhui Xue
Conference on Computer Vision and Pattern Recognition (CVPR), 2022 (Oral Presentation)

Paper    arXiv   


        On Model Calibration for Long-Tailed Object Detection and Instance Segmentation
Tai-Yu Pan*, Cheng Zhang*, Yandong Li, Hexiang Hu, Dong Xuan, Soravit Changpinyo, Boqing Gong, Wei-Lun Chao
Advances in Neural Information Processing Systems (NeurIPS), 2021

Paper    arXiv    Code   


        Discovering the Unknown Knowns: Turning Implicit Knowledge in the Dataset into Explicit Training Examples for Visual Question Answering
Jihyung Kil, Cheng Zhang, Dong Xuan, Wei-Lun Chao
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Paper    arXiv    Code   


        MosaicOS: A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection
Cheng Zhang*, Tai-Yu Pan*, Yandong Li, Hexiang Hu, Dong Xuan, Soravit Changpinyo, Boqing Gong, Wei-Lun Chao
International Conference on Computer Vision (ICCV), 2021

Paper    arXiv    Code    Poster   


        WLAN-Log-Based Superspreader Detection in the COVID-19 Pandemic
Cheng Zhang, Yunze Pan, Yunqi Zhang, Adam C Champion, Zhaohui Shen, Dong Xuan, Zhiqiang Lin, Ness Shroff
Elsevier High-Confidence Computing Journal (HCC), 2021

Paper    arXiv   


        Thoracic Disease Identification and Localization using Distance Learning and Region Verification
Cheng Zhang, Francine Chen, Yan-Ying Chen
British Machine Vision Conference (BMVC), 2020

Paper    arXiv    Talk    Patent (Filed by Fujifilm in US, JP, and CN)   


        An Empirical Study on Leveraging Scene Graphs for Visual Question Answering
Cheng Zhang, Wei-Lun Chao, Dong Xuan
British Machine Vision Conference (BMVC), 2019 (Oral Presentation)

Paper    arXiv    Poster    Talk   


        MV-Sports: A Motion and Vision Sensor Integration-Based Sports Analysis System
Cheng Zhang, Fan Yang, Gang Li, Qiang Zhai, Yi Jiang, Dong Xuan
International Conference on Computer Communications (INFOCOM), 2018

Paper    Demo