Symbolic Knowledge Distillation: from General Language Models to Commonsense Models P West, C Bhagavatula, J Hessel, JD Hwang, L Jiang, RL Bras, X Lu, ... NAACL 2022, 2021 | 292 | 2021 |
Faith and Fate: Limits of Transformers on Compositionality N Dziri, X Lu, M Sclar, XL Li, L Jiang, BY Lin, S Welleck, P West, ... NeurIPS 2023, 2024 | 276 | 2024 |
Can Machines Learn Morality? The Delphi Experiment L Jiang, JD Hwang, C Bhagavatula, RL Bras, J Liang, J Dodge, ... Accepted in Principle to Nature Machine Intelligence, 2021 | 232* | 2021 |
Quizbot: A Dialogue-Based Adaptive Learning System for Factual Knowledge S Ruan, L Jiang, J Xu, BJK Tham, Z Qiu, Y Zhu, EL Murnane, E Brunskill, ... CHI 2019, 2019 | 224 | 2019 |
Quark: Controllable Text Generation with Reinforced Unlearning X Lu, S Welleck, J Hessel, L Jiang, L Qin, P West, P Ammanabrolu, Y Choi NeurIPS 2022, 2022 | 171 | 2022 |
NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics X Lu, S Welleck, P West, L Jiang, J Kasai, D Khashabi, RL Bras, L Qin, ... NAACL 2022, 2021 | 153 | 2021 |
Soda: Million-Scale Dialogue Distillation with Social Commonsense Contextualization H Kim, J Hessel, L Jiang, P West, X Lu, Y Yu, P Zhou, RL Bras, M Alikhani, ... EMNLP 2023, 2022 | 121 | 2022 |
Bookbuddy: Turning Digital Materials into Interactive Foreign Language Lessons through a Voice Chatbot S Ruan, A Willis, Q Xu, GM Davis, L Jiang, E Brunskill, JA Landay Proceedings of the Sixth (2019) ACM Conference on Learning @ Scale, 1-4, 2019 | 111 | 2019 |
ProsocialDialog: A Prosocial Backbone for Conversational Agents H Kim, Y Yu, L Jiang, X Lu, D Khashabi, G Kim, Y Choi, M Sap EMNLP 2022, 2022 | 101 | 2022 |
Englishbot: An AI-Powered Conversational System for Second Language Learning S Ruan, L Jiang, Q Xu, Z Liu, GM Davis, E Brunskill, JA Landay IUI 2021, 2021 | 77 | 2021 |
Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties T Sorensen, L Jiang, JD Hwang, S Levine, V Pyatkin, P West, N Dziri, ... AAAI 2024, 2024 | 66* | 2024 |
A Roadmap to Pluralistic Alignment T Sorensen, J Moore, J Fisher, M Gordon, N Mireshghallah, CM Rytting, ... ICML 2024, 2024 | 64* | 2024 |
Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement L Qiu, L Jiang, X Lu, M Sclar, V Pyatkin, C Bhagavatula, B Wang, Y Kim, ... ICLR 2024, 2023 | 57* | 2023 |
Aligning to Social Norms and Values in Interactive Narratives P Ammanabrolu, L Jiang, M Sap, H Hajishirzi, Y Choi NAACL 2022, 2022 | 38 | 2022 |
ClarifyDelphi: Reinforced Clarification Questions with Defeasibility Rewards for Social and Moral Situations V Pyatkin, JD Hwang, V Srikumar, X Lu, L Jiang, Y Choi, C Bhagavatula ACL 2023, 2022 | 37* | 2022 |
"I'm Not Mad": Commonsense Implications of Negation and Contradiction L Jiang, A Bosselut, C Bhagavatula, Y Choi NAACL 2021, 2021 | 35 | 2021 |
WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs S Han, K Rao, A Ettinger, L Jiang, BY Lin, N Lambert, Y Choi, N Dziri NeurIPS D&B 2024, 2024 | 23 | 2024 |
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models L Jiang, K Rao, S Han, A Ettinger, F Brahman, S Kumar, N Mireshghallah, ... NeurIPS 2024, 2024 | 16* | 2024 |
The Generative AI Paradox: “What It Can Create, It May Not Understand” P West, X Lu, N Dziri, F Brahman, L Li, JD Hwang, L Jiang, J Fisher, ... ICLR 2024, 2023 | 14 | 2023 |
CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge YY Chiu, L Jiang, M Antoniak, CY Park, SS Li, M Bhatia, S Ravi, ... arXiv preprint arXiv:2404.06664, 2024 | 11 | 2024 |