Zheng Ge
Megvii Technology
YOLOX: Exceeding YOLO series in 2021
Z Ge, S Liu, F Wang, Z Li, J Sun
arXiv preprint arXiv:2107.08430, 2021
OTA: Optimal transport assignment for object detection
Z Ge, S Liu, Z Li, O Yoshie, J Sun
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021
BEVDepth: Acquisition of reliable depth for multi-view 3D object detection
Y Li, Z Ge, G Yu, J Yang, Z Wang, Y Shi, J Sun, Z Li
Proceedings of the AAAI conference on artificial intelligence, 2022
NMS by representative region: Towards crowded pedestrian detection by proposal pairing
X Huang, Z Ge, Z Jie, O Yoshie
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020
BEVStereo: Enhancing depth estimation in multi-view 3D object detection with dynamic temporal stereo
Y Li, H Bao, Z Ge, J Yang, J Sun, Z Li
Proceedings of the AAAI conference on artificial intelligence, 2022
DreamLLM: Synergistic multimodal comprehension and creation
R Dong, C Han, Y Peng, Z Qi, Z Ge, J Yang, L Zhao, J Sun, H Zhou, H Wei, ...
arXiv preprint arXiv:2309.11499, 2023
Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining
Z Qi, R Dong, G Fan, Z Ge, X Zhang, K Ma, L Yi
International Conference on Machine Learning (ICML), 2023
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?
R Dong, Z Qi, L Zhang, J Zhang, J Sun, Z Ge, L Yi, K Ma
International Conference on Learning Representations (ICLR), 2023
Dense teacher: Dense pseudo-labels for semi-supervised object detection
H Zhou, Z Ge, S Liu, W Mao, Z Li, H Yu, J Sun
Proceedings of the European conference on computer vision (ECCV), 2022
Implicit identity leakage: The stumbling block to improving deepfake detection generalization
S Dong, J Wang, R Ji, J Liang, H Fan, Z Ge
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
STS: Surround-view temporal stereo for multi-view 3D detection
Z Wang, C Min, Z Ge, Y Li, Z Li, H Yang, D Huang
arXiv preprint arXiv:2208.10145, 2022
LLA: Loss-aware label assignment for dense pedestrian detection
Z Ge, J Wang, X Huang, S Liu, O Yoshie
Neurocomputing 462, 272-281, 2021
Exploring recurrent long-term temporal fusion for multi-view 3D perception
C Han, J Yang, J Sun, Z Ge, R Dong, H Zhou, W Mao, Y Peng, X Zhang
IEEE Robotics and Automation Letters, 2024
PS-RCNN: Detecting secondary human instances in a crowd via primary object suppression
Z Ge, Z Jie, X Huang, R Xu, O Yoshie
2020 IEEE international conference on multimedia and expo (ICME), 1-6, 2020
ChatSpot: Bootstrapping multimodal LLMs via precise referring instruction tuning
L Zhao, E Yu, Z Ge, J Yang, H Wei, H Zhou, J Sun, Y Peng, R Dong, ...
arXiv preprint arXiv:2307.09474, 2023
Vary: Scaling up the vision vocabulary for large vision-language models
H Wei, L Kong, J Chen, L Zhao, Z Ge, J Yang, J Sun, C Han, X Zhang
arXiv preprint arXiv:2312.06109, 2023
MatrixVT: Efficient multi-camera to BEV transformation for 3D perception
H Zhou, Z Ge, Z Li, X Zhang
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Delving deep into the imbalance of positive proposals in two-stage object detection
Z Ge, Z Jie, X Huang, C Li, O Yoshie
Neurocomputing 425, 107-116, 2021
Align-DETR: Improving DETR with simple IoU-aware BCE loss
Z Cai, S Liu, G Wang, Z Ge, X Zhang, D Huang
arXiv preprint arXiv:2304.07527, 2023
Small Language Model Meets with Reinforced Vision Vocabulary
H Wei, L Kong, J Chen, L Zhao, Z Ge, E Yu, J Sun, C Han, X Zhang
arXiv preprint arXiv:2401.12503, 2024