Qwen2 technical report A Yang, B Yang, B Hui, B Zheng, B Yu, C Zhou, C Li, C Li, D Liu, F Huang, ... arXiv preprint arXiv:2407.10671, 2024 | 135 | 2024 |
Incorporating BERT into parallel sequence decoding with adapters J Guo, Z Zhang, L Xu, HR Wei, B Chen, E Chen Advances in Neural Information Processing Systems 33, 10843-10854, 2020 | 70 | 2020 |
PolyLM: An open source polyglot large language model X Wei, H Wei, H Lin, T Li, P Zhang, X Ren, M Li, Y Wan, Z Cao, B Xie, ... arXiv preprint arXiv:2307.06018, 2023 | 45 | 2023 |
Online distilling from checkpoints for neural machine translation HR Wei, S Huang, R Wang, X Dai, J Chen Proceedings of the 2019 Conference of the North American Chapter of the …, 2019 | 39 | 2019 |
Generating diverse translation by manipulating multi-head attention Z Sun, S Huang, HR Wei, X Dai, J Chen Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 8976-8983, 2020 | 31 | 2020 |
Iterative domain-repaired back-translation HR Wei, Z Zhang, B Chen, W Luo arXiv preprint arXiv:2010.02473, 2020 | 28 | 2020 |
Non-parametric online learning from human feedback for neural machine translation D Wang, H Wei, Z Zhang, S Huang, J Xie, J Chen Proceedings of the AAAI Conference on Artificial Intelligence 36 (10), 11431 …, 2022 | 24 | 2022 |
Continual learning for neural machine translation Y Cao, HR Wei, B Chen, X Wan Proceedings of the 2021 Conference of the North American Chapter of the …, 2021 | 21 | 2021 |
GRET: Global representation enhanced transformer R Weng, H Wei, S Huang, H Yu, L Bing, W Luo, J Chen Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 9258-9265, 2020 | 11 | 2020 |
Qwen2 technical report, 2024 A Yang, B Yang, B Hui, B Zheng, B Yu, C Zhou, C Li, C Li, D Liu, F Huang, ... URL https://arxiv.org/abs/2407.10671, 0 | 7 | |
Competency-aware neural machine translation: Can machine translation know its own translation quality? P Zhang, B Yang, H Wei, D Liu, K Fan, L Si, J Xie arXiv preprint arXiv:2211.13865, 2022 | 1 | 2022 |