Language · Intelligence · Modeling · Evaluation

Building trustworthy AI — by people who actually check.

LIME Lab is part of the USC NLP Group, led by Prof. Jieyu Zhao. We work on two sides of AI models and agents — modeling them to be more capable, aligned, and reliable in the real world, and designing evaluations that surface where they fall short, with a strong human-centered thread throughout.

Read our papers Group Wiki We're hiring →

LIME moments — AI-generated lab portrait — Seed Dream 4.0 · prompted by Taiwei Shi

Latest

2026-05 paper

ProMediate accepted to ACL 2026 Findings.
2026-04 paper

Data attribution paper for filtering unsafe training data accepted to ICML 2026.
2026-03 paper

ICLR 2026 accepted: CoAct-1, Doxing via the Lens, and TrustGen.
2026-03 release

ERL code is now available — play with it.
2026-03 paper

New paper out: Video-Based Reward Modeling for Computer-Use Agents.
2026-03 team

Welcomed Jiaqi Deng to the group.
2026-03 paper

Two papers at EACL: "What's Missing in Vision-Language Models?" (Causal Order Reasoning) and GRAVITY.
2026-03 team

Reopened the open office hour. 🍋‍🟩
2026-02 paper

New paper out: Experiential Reinforcement Learning.
2026-02 team

Ziyi passed her PhD qualifying exam.

All news

Spotlight

Lifelong Agent Workshop @ ICLR 2026

Experiential Reinforcement Learning

Treat past episodes as a retrievable experience bank rather than gradient updates only — letting the policy reason about what worked last time before deciding what to try next.

Taiwei Shi, Sihao Chen, Bowen Jiang, Linxin Song, Longqi Yang, Jieyu Zhao

📄 Paper 💻 Code

ICML 2026

Detecting and Filtering Unsafe Training Data via Data Attribution with Denoised Representation

Find which training examples are pushing a model toward unsafe behavior, by attributing model outputs back through denoised representations — and filter them before they do damage.

Yijun Pan, Taiwei Shi, Jieyu Zhao, Jiaqi W. Ma

📄 Paper

ACL 2026 Findings

ProMediate: A Socio-cognitive Framework for Evaluating Proactive Agents in Multi-party Negotiation

Most agent benchmarks are single-turn task completion. ProMediate puts agents in the middle of multi-party negotiations and asks whether they can read the room, mediate, and act proactively.

Ziyi Liu, Bahar Sarrafzadeh, Pei Zhou, Longqi Yang, Jieyu Zhao, Ashish Sharma

📄 Paper

ACL 2026

WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback

Align LLMs with the messy, in-the-moment feedback users actually leave during real interactions — not the curated preference pairs that look clean on paper.

Taiwei Shi, Zhuoer Wang, Longqi Yang, Ying-Chun Lin, Zexue He, Mengting Wan, Pei Zhou, Sujay Jauhar, Sihao Chen, Shan Xia, Hongfei Zhang, Jieyu Zhao, Xiaofeng Xu, Xia Song, Jennifer Neville

📄 Paper

ICLR 2026

CoAct-1: Computer-using Agents with Coding as Actions

A computer-using agent that doesn't just point and click — it writes code as a first-class action, finishing tasks faster and with higher success than GUI-only agents.

Linxin Song, Yutong Dai, Viraj Prabhu, Jieyu Zhang, Taiwei Shi, Li Li, Junnan Li, Silvio Savarese, Zeyuan Chen, Jieyu Zhao, Ran Xu, Caiming Xiong

📄 Paper 💻 Code 🌐 Website 📰 VentureBeat

Research

All papers →

We study AI systems beyond static benchmarks: how they adapt, act, fail, and become trustworthy in real-world settings.

Learning & Alignment

We study how models adapt through post-training, reinforcement finetuning, user feedback, personalization, and curriculum design.

Explore

Agents & Computer Use

We build and evaluate agents that use computers, tools, and domain knowledge to plan, act, recover, and collaborate.

Explore

Evaluation & Measurement

We design evaluations that reveal what models actually know, understand, and fail to do across reasoning, multimodal, cultural, and agentic tasks.

Explore

Trustworthiness & Safety

We audit failures that matter in real use: bias, hallucination, privacy leakage, unsafe data influence, and brittle model judgments.

Explore

Awards

100 Brilliant Women in AI Ethics

2025
EMNLP SAC Highlight Award Huanxin Sheng

Analyzing Uncertainty of LLM-as-a-Judge: Interval Evaluations with Conformal Prediction

2025
Capital One Student Fellowship Ziyi Liu

2025
Capital One Student Fellowship Priyanka Dey

2025
EMNLP Outstanding AC Award

2024
USC–Capital One Research Award

2024
WiSE and EMNLP Travel Grants Ziyi Liu

2024
Amazon Responsible AI Funding

2023
Gabilan Assistant Professor Fellowship

2023
SoCal NLP Symposium Best Poster Award

Gender Bias in Coreference Resolution: Evaluation and Debiasing Methods

2018
EMNLP 2017 Best Long Paper Award

Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints

2017

Featured In

Research Sponsors