Journal Search Engine

Download PDF Export Citation Korean Bibliography
ISSN : 1226-4822(Print)
The Sociolinguistic Journal of Korea Vol.33 No.4 pp.43-80
DOI : http://dx.doi.org/10.14353/sjk.2025.33.4.02

AI responses to unethical directive speech acts: The case of indirect and evasive strategies

Jaehee Kim*, Hansaem Kim**
Yonsei University

Abstract

This study examines how Large Language Models (LLMs) recognize and refuse unethical directive speech acts by analyzing their responses to indirect and evasive user requests. Based on the Cross-Cultural Speech Act Realization Project (CCSARP), directive prompts were constructed by varying degrees of indirectness to evaluate the models’ pragmatic inference abilities. The study was conducted in two stages. First, a high rate of information leakage was observed for indirect directives using ChatGPT-4o (February 2025 version). Second, newer models—GPT-5, Claude Sonnet 3.7 and 4, and Gemini 2.5 Flash—were tested across four categories of unethical directives through multiturn dialogues. Logistic regression with Benjamini–Hochberg FDR correction revealed that although newer models displayed improved refusal performance overall, they remained vulnerable to highly indirect and non-conventional directives, particularly those related to discrimination and harmful behaviors. These results suggest that current AI safety systems rely heavily on surface-level keyword filtering, indicating the need for models to better learn diverse directive strategies and expressions in Korean. Moving beyond technology-centered safety evaluation, this study experimentally analyzes AI pragmatic response mechanisms and proposes directions for fostering ethical communication in future human–AI interactions.

비윤리적 지시화행에 대한 인공지능의 응대 양상
: 우회적 지시 전략을 중심으로

초록

Vol. 40 No. 4 (2022.12)

Journal Abbreviation j. Korea Saf. Manag. Sci.
Frequency Published four times annually in March, June, September, and December
Doi Prefix 10.14353/sjk.
Year of Launching 1993
Publisher The Sociolinguistic Society of Korea
Indexed/Tracked/Covered By

Editorial Office Contact Information

- Tel: +81-10-3894-3164
- E-mail: schang@hoseo.edu