What does a Text Classifier Learn about Morality? An Explainable Method for Cross-Domain Comparison of Moral Rhetoric - code
What does a Text Classifier Learn about Morality? An Explainable Method for Cross-Domain Comparison of Moral Rhetoric - code
Description
Code for the paper "What does a Text Classifier Learn about Morality? An Explainable Method for Cross-Domain Comparison of Moral Rhetoric", published at ACL '23. This code implements Tomea, an Explainable AI method for investigating the difference in how language models represent morality across domains. Given a pair of datasets and models trained on the datasets, Tomea generates 10 m-distances and one d-distance to measure the difference between the datasets, based on the SHAP method. We make pairwise comparisons of seven models trained on the MFTC datasets (available at this DOI: 10.4121/646b20e3-e24f-452d-938a-bcb6ce30913c).
- MIT
 
Reference papers
Mentions
- 1.Author(s): Luana Bulla, Aldo Gangemi, Misael MongiovìPublished in Lecture Notes in Computer Science, Value Engineering in Artificial Intelligence by Springer Nature Switzerland in 2024, page: 98-11310.1007/978-3-031-58202-8_7
 - 2.Author(s): Andrea Agiollo, Luciano Cavalcante Siebert, Pradeep Kumar Murukannaiah, Andrea OmiciniPublished in Lecture Notes in Computer Science, Explainable and Transparent AI and Multi-Agent Systems by Springer Nature Switzerland in 2023, page: 97-11510.1007/978-3-031-40878-6_6
 
- 1.Author(s): Vjosa Preniqi, Iacopo Ghinassi, Julia Ive, Charalampos Saitis, Kyriaki KalimeriPublished in Proceedings of the 2024 International Conference on Information Technology for Social Good by ACM in 2024, page: 433-44210.1145/3677525.3678694
 - 2.Author(s): Michiel van der Meer, Piek Vossen, Catholijn Jonker, Pradeep MurukannaiahPublished in Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing by Association for Computational Linguistics in 2023, page: 15986-1600810.18653/v1/2023.emnlp-main.992
 
- 1.Author(s): Luana Bulla, Stefano De Giorgis, Misael Mongiovì, Aldo GangemiPublished in Computers in Human Behavior Reports by Elsevier BV in 2025, page: 10060910.1016/j.chbr.2025.100609
 - 2.Author(s): Andrea Agiollo, Luciano Cavalcante Siebert, Pradeep K. Murukannaiah, Andrea OmiciniPublished in Autonomous Agents and Multi-Agent Systems by Springer Science and Business Media LLC in 202410.1007/s10458-024-09663-8
 - 3.Author(s): Taylor Sorensen, Liwei Jiang, Jena D. Hwang, Sydney Levine, Valentina Pyatkin, Peter West, Nouha Dziri, Ximing Lu, Kavel Rao, Chandra Bhagavatula, Maarten Sap, John Tasioulas, Yejin ChoiPublished in Proceedings of the AAAI Conference on Artificial Intelligence by Association for the Advancement of Artificial Intelligence (AAAI) in 2024, page: 19937-1994710.1609/aaai.v38i18.29970
 - 4.Author(s): Aida Ramezani, Jennifer E. Stellar, Matthew Feinberg, Yang XuPublished in Open Mind by MIT Press in 2024, page: 1153-116910.1162/opmi_a_00164
 - 5.Author(s): Sergio Muñoz, Carlos Á. IglesiasPublished in Applied Sciences by MDPI AG in 2023, page: 1169510.3390/app132111695
 
- 1.Author(s): Suhaib Abdurahman, Mohammad Atari, Farzan Karimi-Malekabadi, Mona J. Xue, Jackson Trager, Peter S. Park, Preni Golazizian, Ali Omrani, Morteza DehghaniPublished by Center for Open Science in 202310.31234/osf.io/d695y
 - 2.Author(s): Taylor Sorensen, Liwei Jiang, Jena Hwang, Sydney Levine, Valentina Pyatkin, Peter West, Nouha Dziri, Ximing Lu, Kavel Rao, Chandra Bhagavatula, Maarten Sap, John Tasioulas, Yejin ChoiPublished by arXiv in 202310.48550/arxiv.2309.00779
 - 3.Author(s): Suhaib Abdurahman, Mohammad Atari, Farzan Karimi-Malekabadi, Mona J. Xue, Jackson Trager, Peter S. Park, Preni Golazizian, Ali Omrani, Morteza DehghaniPublished by Center for Open Science in 202310.31219/osf.io/tg79n