Chapter

Explainable and Transparent AI Architectures

Deepak Gupta, Institute of Technology and Management, Gwalior, India
Abduraimova Nigora, Termez University of Economics and Service, Termez, Uzbekistan
Gulkhayo Gulkhayo
Ergashev Nuriddin, Karshi State Technical University, Karshi, Uzbekistan
Shokhzod Karimov, Tashkent State University of Economics, Uzbekistan
Mamatkhujaev Otabek, Alfraganus University, Tashkent, Uzbekistan
Seitnazarov Kuanishbay, Nukus State Pedagogical Institute, Uzbekistan

Advances in Computational Intelligence and Robotics book series, 2026

Abstract

Explainability and transparency have emerged as foundational pillars in the secure deployment of artificial intelligence (AI) systems, especially large language models (LLMs). This chapter examines the evolving landscape of explainable AI (XAI) architectures through the lens of cybersecurity, adversarial robustness, and regulatory compliance. The authors survey core XAI methodologies—including LIME, SHAP, mechanistic interpretability, attention attribution, and causal tracing—evaluating their effectiveness against adversarial threats such as jailbreaking, prompt injection, data poisoning, and hallucination exploitation. The dual nature of XAI is critically examined: while transparency mechanisms bolster defense and trust, they simultaneously introduce novel attack surfaces that adversaries can exploit to subvert explanation systems.

Topics

Adversarial Robustness in Machine Learning
Explainable Artificial Intelligence (XAI)
Ethics and Social Impacts of AI

Identifiers

DOI: 10.4018/979-8-3373-8252-4.ch011
