Anurag Acharya

Data Scientist

Biography

Anurag Acharya is a data scientist at Pacific Northwest National Laboratory (PNNL). He is an active researcher in the overlapping fields of artificial intelligence (AI) and natural language processing, with a key focus on evaluating and analyzing large language models (LLMs) in their application for scientific domains and in their correctness and bias. He has a record of using AI to detect misinformation and disinformation surrounding major world events, from the White Helmets operations in Syria to the COVID-19 pandemic. During his work in EXPERT, Acharya led the development of the first ever expert-crafted evaluation benchmark for LLMs for the nuclear nonproliferation domain. His current works include MegaAI, which focuses on using LLMs for molecular chemistry and to detect and classify vulnerabilities in code to protect critical cyber infrastructure, PolicyAI, which develops a new AI-driven capability to help make faster decisions in the National Environmental Policy Act (NEPA) process by streamlining environmental reviews, and ACCELERATE, an effort to analyze and predict the degradation of catalysts for sustainable conversion of alternate feedstocks to fuels and chemicals. In addition to these works, Acharya’s research interest is in understanding and mitigating biases in AI systems, and working towards building ethical AI. In addition to Department of Energy agencies, his research works have been funded by the Defense Advanced Research Projects Agency, the Air Force Research Laboratory, and IBM.

Disciplines and Skills

Artificial Intelligence
AI Safety and Trustworthiness
AI Ethics
Natural Language Processing
Large Language Models
Generative AI
Computational Linguistics
Computational Social Sciences

Education

PhD in computer science, Florida International University
MS in computer science, Florida International University
BEng in computer engineering, Tribhuvan University, Nepal
BA in English and political science, Tribhuvan University, Nepal

Affiliations and Professional Service

Professional Membership

Association for the Advancement of Artificial Intelligence
Association for Computational Linguistics
Association for Computing Machinery

Program and Organizing Committee

Organizing Committee, Social Development through NLP-driven Interdisciplinary Collaborations (SocioNLP) Workshop, 2024
Reviewer, Association for Computational Linguistics (ACL) Rolling Review, 2024
Program Committee, Workshop on Responsible Language Models (ReLM), 2024
Program Committee, Seventh International Workshop on Narrative Extraction from Texts, 2024
Ethics Reviewer, Conference on Neural Information Processing Systems, 2023
Session Chair, Ninth Annual Conference on Advances in Cognitive Systems Conference, 2021
Organizing Committee, Communicating Science Workshop for Graduate Students, 2021

Review Committee (Journals)

Natural Language Engineering, 2023 – Present
IEEE Transactions on Artificial Intelligence, 2023 – Present
Humanities & Social Sciences Communications, 2023 – Present
International Journal of Data Science and Analytics, 2023 – Present

Awards and Recognitions

Best Paper Award, Advanced Engineering and ICT-Convergence Proceedings, Transfer Learned Mobilenets with Shrinking Hyperparameters for Classifying Covid-19 Based on X-ray Images, 2021

Publications

Saldanha E.G., A. Acharya, M. Ocal, J. Eshun, M.F. Glenski, and S. Volkova. 2024. "Detecting and Summarizing Narratives in the Information Environment: A Case Study of Misinformation and Disinformation Campaigns." In Detecting Online Propaganda and Misinformation, edited by Mark Last, Marina Litvak, Miao Lin. PNNL-SA-171527. doi:10.1142/13556
Yarlott, W.V.H., A. Acharya, D. Castro-Estrada, D. Gomez, and M.A. Finlayson. 2024. “GOLEM: GOld standard for Learning and Evaluation of Motifs.” The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (COLING-LREC). Torino, Italy.
Munikoti, S., A. Acharya, S. Wagle, and Y. S. Horawalavithana. 2024. “ATLANTIC: Structure-Aware Retrieval-Augmented Language Model for Interdisciplinary Science.” Workshop on AI to Accelerate Science and Engineering, The Thirty-Eighth Annual AAAI Conference on Artificial Intelligence. Vancouver, Canada.
Wagle, S., S. Munikoti, A. Acharya, S. Smith, and Y. S. Horawalavithana. 2024. “Empirical evaluation of Uncertainty Quantification in Retrieval-Augmented Language Models for Science.” Workshop on Scientific Document Understanding, The Thirty-Eighth Annual AAAI Conference on Artificial Intelligence. Vancouver, Canada.
Yarlott, W.V.H., A. Ochoa, A. Acharya, L. Bobrow, D. Castro-Estrada, D. Gomez, J. Zheng, D. McDonald, C. Miller, and M.A. Finlayson. 2021. “Finding Trolls Under Bridges: Preliminary Work on a Motif Detector.” Advances in Cognitive Systems. Virtual Conference
Yarlott, W.V.H., A. Ochoa, A. Acharya, L. Bobrow, D. Castro-Estrada, D. Gomez, J. Zheng, D. McDonald, C. Miller, and M.A. Finlayson. 2021. “AI models for detecting motifs in a text collection” Literature & Culture and/as Intelligent Systems. Stuttgart, Germany.
Acharya, A., K. Talamadupula, and M.A. Finlayson. 2021. “Towards an Atlas of Cultural Commonsense for Machine Reasoning.” Workshop on Common Sense Knowledge Graphs, The Thirty-Fifth AAAI Conference on Artificial Intelligence. Virtual Conference.
KC, K., A. Acharya, A. Acharya, and S. Shrestha. 2021. “Transfer Learned Mobilenets with shrinking hyperparameters for classifying Covid-19 based on X-ray images.” Advanced Engineering and ICT-Convergence Proceedings. Vol 4, No. 2. Bangkok, Thailand.v