Designing Staged Evaluation Workflows for LLMs: Integrating Domain Experts, Lay Users, and Model-Generated Evaluation Criteria
Published in Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI '26), Barcelona, Spain, 2026