Google's neural machine translation system: Bridging the gap between human and machine translation Y Wu, M Schuster, Z Chen, QV Le, M Norouzi, W Macherey, M Krikun, ... arXiv preprint arXiv:1609.08144, 2016 | 8035 | 2016 |
Natural tts synthesis by conditioning wavenet on mel spectrogram predictions J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ... 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 2743 | 2018 |
Conformer: Convolution-augmented transformer for speech recognition A Gulati, J Qin, CC Chiu, N Parmar, Y Zhang, J Yu, W Han, S Wang, ... arXiv preprint arXiv:2005.08100, 2020 | 2140 | 2020 |
Google’s multilingual neural machine translation system: Enabling zero-shot translation M Johnson, M Schuster, QV Le, M Krikun, Y Wu, Z Chen, N Thorat, ... Transactions of the Association for Computational Linguistics 5, 339-351, 2017 | 2127 | 2017 |
Tacotron: Towards end-to-end speech synthesis Y Wang, RJ Skerry-Ryan, D Stanton, Y Wu, RJ Weiss, N Jaitly, Z Yang, ... arXiv preprint arXiv:1703.10135, 2017 | 2115* | 2017 |
State-of-the-art speech recognition with sequence-to-sequence models CC Chiu, TN Sainath, Y Wu, R Prabhavalkar, P Nguyen, Z Chen, ... 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 1333 | 2018 |
Exploring the limits of language modeling R Jozefowicz, O Vinyals, M Schuster, N Shazeer, Y Wu arXiv preprint arXiv:1602.02410, 2016 | 1291 | 2016 |
Gpipe: Efficient training of giant neural networks using pipeline parallelism Y Huang, Y Cheng, A Bapna, O Firat, D Chen, M Chen, HJ Lee, J Ngiam, ... Advances in neural information processing systems 32, 2019 | 1269 | 2019 |
Transfer learning from speaker verification to multispeaker text-to-speech synthesis Y Jia, Y Zhang, R Weiss, Q Wang, J Shen, F Ren, P Nguyen, R Pang, ... Advances in neural information processing systems 31, 2018 | 811 | 2018 |
Development and implementation of high-throughput SNP genotyping in barley TJ Close, PR Bhat, S Lonardi, Y Wu, N Rostoks, L Ramsay, A Druka, ... BMC genomics 10 (1), 1-13, 2009 | 705 | 2009 |
Streaming end-to-end speech recognition for mobile devices Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 639 | 2019 |
Coca: Contrastive captioners are image-text foundation models J Yu, Z Wang, V Vasudevan, L Yeung, M Seyedhosseini, Y Wu arXiv preprint arXiv:2205.01917, 2022 | 626 | 2022 |
Libritts: A corpus derived from librispeech for text-to-speech H Zen, V Dang, R Clark, Y Zhang, RJ Weiss, Y Jia, Z Chen, Y Wu arXiv preprint arXiv:1904.02882, 2019 | 608 | 2019 |
Efficient and accurate construction of genetic linkage maps from the minimum spanning tree of a graph Y Wu, PR Bhat, TJ Close, S Lonardi PLoS genetics 4 (10), e1000212, 2008 | 556 | 2008 |
The best of both worlds: Combining recent advances in neural machine translation MX Chen, O Firat, A Bapna, M Johnson, W Macherey, G Foster, L Jones, ... arXiv preprint arXiv:1804.09849, 2018 | 497 | 2018 |
Scaling autoregressive models for content-rich text-to-image generation J Yu, Y Xu, JY Koh, T Luong, G Baid, Z Wang, V Vasudevan, A Ku, Y Yang, ... arXiv preprint arXiv:2206.10789 2 (3), 5, 2022 | 441 | 2022 |
Sequence-to-sequence models can directly translate foreign speech RJ Weiss, J Chorowski, N Jaitly, Y Wu, Z Chen arXiv preprint arXiv:1703.08581, 2017 | 370 | 2017 |
Massively multilingual neural machine translation in the wild: Findings and challenges N Arivazhagan, A Bapna, O Firat, D Lepikhin, M Johnson, M Krikun, ... arXiv preprint arXiv:1907.05019, 2019 | 336 | 2019 |
Palm 2 technical report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023 | 307 | 2023 |
Pushing the limits of semi-supervised learning for automatic speech recognition Y Zhang, J Qin, DS Park, W Han, CC Chiu, R Pang, QV Le, Y Wu arXiv preprint arXiv:2010.10504, 2020 | 283 | 2020 |