Research
My research focuses on generative models, multi-modality learning, and its applications in many domains, particularly 2D Image Generation and Music/Audio Generation. Prior to this, I also had some work about random neural networks.
|
|
Unified Discrete Diffusion for Simultaneous Vision-Language Generation
Minghui Hu,
Chuanxia Zheng,
Heliang Zheng,
Tat-Jen Cham,
Chaoyue Wang,
Zuopeng Yang,
Dacheng Tao,
P.N.Suganthan
ICLR, 2023  
project page
/
arXiv
/
PDF
We construct a unified discrete diffusion model for simultaneous vision-language generation.
|
|
Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation
Minghui Hu,
Yujie Wang,
Tat-Jen Cham,
Jianfei Yang,
P.N.Suganthan
CVPR, 2022  
arXiv
/
PDF
Instead of AutoRegresive Transformers, we use Discrete Diffusion Model to obtain a better global context for image generation.
|
Yep it's another Jon Barron website.
Last updated Feb. 2023.
|
|