TY - CONF T1 - Deconstructing Attention: Investigating Design Principles for Effective Language Modeling. JO - IJCNLP-AACL (long papers) UR - https://aclanthology.org/volumes/2025.ijcnlp-long/ PY - 2025/01/01 AU - Xue H AU - Moosavi NS AU - Aletras N ED - Inui K ED - Sakti S ED - Wang H ED - Wong DF ED - Bhattacharyya P ED - Banerjee B ED - Ekbal A ED - Chakraborty T ED - Singh DP PB - The Asian Federation of Natural Language Processing and The Association for Computational Linguistics SN - 979-8-89176-298-5 SP - 708 EP - 727 Y2 - 2026/03/07 ER -