Follow
Liwei Jiang
Liwei Jiang
PhD Student, Paul G. Allen School of Computer Science & Engineering, University of Washington
Verified email at cs.washington.edu - Homepage
Title
Cited by
Cited by
Year
Symbolic Knowledge Distillation: from General Language Models to Commonsense Models
P West, C Bhagavatula, J Hessel, JD Hwang, L Jiang, RL Bras, X Lu, ...
NAACL 2022, 2021
2922021
Faith and Fate: Limits of Transformers on Compositionality
N Dziri, X Lu, M Sclar, XL Li, L Jiang, BY Lin, S Welleck, P West, ...
NeurIPS 2023, 2024
2762024
Can Machines Learn Morality? The Delphi Experiment
L Jiang, JD Hwang, C Bhagavatula, RL Bras, J Liang, J Dodge, ...
Accepted in Principle to Nature Machine Intelligence, 2021
232*2021
Quizbot: A Dialogue-Based Adaptive Learning System for Factual Knowledge
S Ruan, L Jiang, J Xu, BJK Tham, Z Qiu, Y Zhu, EL Murnane, E Brunskill, ...
CHI 2019, 2019
2242019
Quark: Controllable Text Generation with Reinforced Unlearning
X Lu, S Welleck, J Hessel, L Jiang, L Qin, P West, P Ammanabrolu, Y Choi
NeurIPS 2022, 2022
1712022
Neurologic A* esque Decoding: Constrained Text Generation with Lookahead Heuristics
X Lu, S Welleck, P West, L Jiang, J Kasai, D Khashabi, RL Bras, L Qin, ...
NAACL 2022, 2021
1532021
Soda: Million-Scale Dialogue Distillation with Social Commonsense Contextualization
H Kim, J Hessel, L Jiang, P West, X Lu, Y Yu, P Zhou, RL Bras, M Alikhani, ...
EMNLP 2023, 2022
1212022
Bookbuddy: Turning Digital Materials into Interactive Foreign Language Lessons through a Voice Chatbot
S Ruan, A Willis, Q Xu, GM Davis, L Jiang, E Brunskill, JA Landay
Proceedings of the sixth (2019) ACM conference on learning@ scale, 1-4, 2019
1112019
ProsocialDialog: A Prosocial Backbone for Conversational Agents
H Kim, Y Yu, L Jiang, X Lu, D Khashabi, G Kim, Y Choi, M Sap
EMNLP 2022, 2022
1012022
Englishbot: An AI-Powered Conversational System for Second Language Learning
S Ruan, L Jiang, Q Xu, Z Liu, GM Davis, E Brunskill, JA Landay
IUI 2021, 2021
772021
Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties
T Sorensen, L Jiang, JD Hwang, S Levine, V Pyatkin, P West, N Dziri, ...
AAAI 2024, 2024
66*2024
A Roadmap to Pluralistic Alignment
T Sorensen, J Moore, J Fisher, M Gordon, N Mireshghallah, CM Rytting, ...
ICML 2024, 2024
64*2024
Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement
L Qiu, L Jiang, X Lu, M Sclar, V Pyatkin, C Bhagavatula, B Wang, Y Kim, ...
ICLR 2024, 2023
57*2023
Aligning to Social Norms and Values in Interactive Narratives
P Ammanabrolu, L Jiang, M Sap, H Hajishirzi, Y Choi
NAACL 2022, 2022
382022
ClarifyDelphi: Reinforced Clarification Questions with Defeasibility Rewards for Social and Moral Situations
V Pyatkin, JD Hwang, V Srikumar, X Lu, L Jiang, Y Choi, C Bhagavatula
ACL 2023, 2022
37*2022
"I'm Not Mad": Commonsense Implications of Negation and Contradiction
L Jiang, A Bosselut, C Bhagavatula, Y Choi
NAACL 2021, 2021
352021
WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs
S Han, K Rao, A Ettinger, L Jiang, BY Lin, N Lambert, Y Choi, N Dziri
NeurIPS D&B 2024, 2024
232024
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models
L Jiang, K Rao, S Han, A Ettinger, F Brahman, S Kumar, N Mireshghallah, ...
NeurIPS 2024, 2024
16*2024
The Generative AI Paradox:“What It Can Create, It May Not Understand”
P West, X Lu, N Dziri, F Brahman, L Li, JD Hwang, L Jiang, J Fisher, ...
ICLR 2024, 2023
142023
CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs'(Lack of) Multicultural Knowledge
YY Chiu, L Jiang, M Antoniak, CY Park, SS Li, M Bhatia, S Ravi, ...
arXiv preprint arXiv:2404.06664, 2024
112024
The system can't perform the operation now. Try again later.
Articles 1–20