Follow
Bin Zhu
Bin Zhu
Assistant Professor, Singapore Management University
Verified email at smu.edu.sg - Homepage
Title
Cited by
Cited by
Year
R2GAN: Cross-modal recipe retrieval with generative adversarial network
B Zhu, CW Ngo, J Chen, Y Hao
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
1452019
CookGAN: Causality based Text-to-Image Synthesis
B Zhu, CW Ngo
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2020
802020
A study of multi-task and region-wise deep learning for food ingredient recognition
J Chen, B Zhu, CW Ngo, TS Chua, YG Jiang
IEEE Transactions on Image Processing 30, 1514-1526, 2020
772020
Epic-kitchens visor benchmark: Video segmentations and object relations
A Darkhalil, D Shan, B Zhu, J Ma, A Kar, R Higgins, S Fidler, D Fouhey, ...
Advances in Neural Information Processing Systems 35, 13745-13758, 2022
752022
Person-level action recognition in complex events via tsd-tsm networks
Y Hao, ZN Liu, H Zhang, B Zhu, J Chen, YG Jiang, CW Ngo
Proceedings of the 28th ACM International Conference on Multimedia, 4699-4702, 2020
122020
Learning from web recipe-image pairs for food recognition: Problem, baselines and performance
B Zhu, CW Ngo, WK Chan
IEEE Transactions on Multimedia 24, 1175-1185, 2021
112021
Cross-domain cross-modal food transfer
B Zhu, CW Ngo, J Chen
Proceedings of the 28th ACM International Conference on Multimedia, 3762-3770, 2020
102020
CgT-GAN: CLIP-guided Text GAN for Image Captioning
J Yu, H Li, Y Hao, B Zhu, T Xu, X He
Proceedings of the 31st ACM International Conference on Multimedia, 2252-2263, 2023
82023
Mix-dann and dynamic-modal-distillation for video domain adaptation
Y Yin, B Zhu, J Chen, L Cheng, YG Jiang
Proceedings of the 30th ACM International Conference on Multimedia, 3224-3233, 2022
82022
Unsupervised video hashing with multi-granularity contextualization and multi-structure preservation
Y Hao, J Duan, H Zhang, B Zhu, P Zhou, X He
Proceedings of the 30th ACM International Conference on Multimedia, 3754-3763, 2022
72022
Foodlmm: A versatile food assistant using large multi-modal model
Y Yin, H Qi, B Zhu, J Chen, YG Jiang, CW Ngo
arXiv preprint arXiv:2312.14991, 2023
62023
Learning to match anchor-target video pairs with dual attentional holographic networks
Y Hao, CW Ngo, B Zhu
IEEE Transactions on Image Processing 30, 8130-8143, 2021
52021
Pyramid fusion dark channel prior for single image dehazing
Q Liang, B Zhu, CW Ngo
arXiv preprint arXiv:2105.10192, 2021
52021
Cross-lingual adaptation for recipe retrieval with mixup
B Zhu, CW Ngo, J Chen, WK Chan
Proceedings of the 2022 International Conference on Multimedia Retrieval …, 2022
42022
Text-driven Video Prediction
X Song, J Chen, B Zhu, Y Jiang
ACM Transactions on Multimedia Computing, Communications, and Applications …, 2024
32024
From Canteen Food to Daily Meals: Generalizing Food Recognition to More Practical Scenarios
G Liu, Y Jiao, J Chen, B Zhu, YG Jiang
IEEE Transactions on Multimedia, 2024
32024
CAR: consolidation, augmentation and regulation for recipe retrieval
F Song, B Zhu, Y Hao, S Wang, X He
arXiv preprint arXiv:2312.04763, 2023
22023
RoDE: Linear Rectified Mixture of Diverse Experts for Food Large Multi-Modal Models
P Jiao, X Wu, B Zhu, J Chen, CW Ngo, Y Jiang
arXiv preprint arXiv:2407.12730, 2024
12024
Model Inversion Attacks Through Target-Specific Conditional Diffusion Models
O Li, Y Hao, Z Wang, B Zhu, S Wang, Z Zhang, F Feng
arXiv preprint arXiv:2407.11424, 2024
12024
Video Editing for Video Retrieval
B Zhu, K Flanagan, A Fragomeni, M Wray, D Damen
arXiv preprint arXiv:2402.02335, 2024
12024
The system can't perform the operation now. Try again later.
Articles 1–20