Exploring Human Perceptions of AI Responses: Insights from a Mixed-Methods Study on Risk Mitigation in Generative Models

By: Heloisa Candello, Muneeza Azmat, Uma Sushmitha Gunturi, Raya Horesh, Rogerio Abreu de Paula, Heloisa Pimentel, Marcelo Carpinette Grave, Aminat Adebiyi, Tiago Machado, Maysa Malfiza Garcia de Macedo

Published: 2025-12-01

View on arXiv →
#cs.AI

Abstract

This study investigates human perception and evaluation of AI-generated responses modified by a mitigator model to reduce harm, focusing on mitigation performance, transparency, and metrics to bridge the socio-technical gap in AI evaluation.

FEEDBACK

Projects

No projects yet

Exploring Human Perceptions of AI Responses: Insights from a Mixed-Methods Study on Risk Mitigation in Generative Models | ArXiv Intelligence