Minghui Hu

Currently, I am a Ph.D Candidate at Nanyang Technological University in Singapore, advised by Prof. P.N.Suganthan. Previousely, I received my MSc. degree in Electric and Electrical Engineering from Nanyang Technological University in 2019. I'm also serving as a research scientist at Temasek Lab @ NTU, under the supervision of Dr. Sirajudeen s/o Gulam Razul.

I am fortunate to collaborate closely with Prof. T.J.Cham at Nanyang Technological University.

Email  /  CV  /  Google Scholar  /  Github  /  LinkedIn

profile photo
Research

My research focuses on generative models, multi-modality learning, and its applications in many domains, particularly 2D Image Generation and Music/Audio Generation. Prior to this, I also had some work about random neural networks.

Unified Discrete Diffusion for Simultaneous Vision-Language Generation
Minghui Hu, Chuanxia Zheng, Heliang Zheng, Tat-Jen Cham, Chaoyue Wang, Zuopeng Yang, Dacheng Tao, P.N.Suganthan
ICLR, 2023  
project page / arXiv / PDF

We construct a unified discrete diffusion model for simultaneous vision-language generation.

Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation
Minghui Hu, Yujie Wang, Tat-Jen Cham, Jianfei Yang, P.N.Suganthan
CVPR, 2022  
arXiv / PDF

Instead of AutoRegresive Transformers, we use Discrete Diffusion Model to obtain a better global context for image generation.


Yep it's another Jon Barron website.
Last updated Feb. 2023.