Language Model Interpretability - Explainable AI Methods

Exploring explainable AI methods for interpreting and explaining the decisions made by language models to enhance transparency and trustworthiness

Authors

  • Srihari Maruthi University of New Haven, West Haven, CT, United States Author
  • Sarath Babu Dodda Central Michigan University, MI, United States Author
  • Ramswaroop Reddy Yellu Independent Researcher, USA Author
  • Praveen Thuniki Independent Researcher & Program Analyst, Georgia, United States Author
  • Surendranadha Reddy Byrapu Reddy Sr. Data Architect at Lincoln Financial Group, Greensboro, NC, United States Author

Keywords:

Language models, Explainable AI, Interpretability, Transparency, Trustworthiness

Abstract

Language models have achieved remarkable success in various natural language processing tasks, but their complex inner workings often lack transparency, leading to concerns about their reliability and ethical implications. Explainable AI (XAI) methods aim to address this issue by providing insights into how language models make decisions. This paper presents a comprehensive review of XAI methods for interpreting and explaining the decisions made by language models. We discuss key approaches such as attention mechanisms, saliency maps, and model-agnostic techniques, highlighting their strengths and limitations. Additionally, we explore the implications of XAI for enhancing the transparency and trustworthiness of language models in real-world applications.

Downloads

Download data is not yet available.

Downloads

Published

2022-12-31

How to Cite

[1]
S. Maruthi, S. Babu Dodda, R. Reddy Yellu, P. Thuniki, and S. Reddy Byrapu Reddy, “Language Model Interpretability - Explainable AI Methods: Exploring explainable AI methods for interpreting and explaining the decisions made by language models to enhance transparency and trustworthiness”, Australian Journal of Machine Learning Research & Applications, vol. 2, no. 2, pp. 1–9, Dec. 2022, Accessed: Jul. 04, 2024. [Online]. Available: https://sydneyacademics.com/index.php/ajmlra/article/view/19

Similar Articles

1-10 of 13

You may also start an advanced similarity search for this article.