Maksim E. Eren

Scientist

Los Alamos National Laboratory

Biography

Maksim E. Eren is an early-career scientist in the Computational Intelligence & Modeling (A-1) group at Los Alamos National Laboratory (LANL) and a LANL Center for National Security and International Studies (CNSIS) Fellow. He is an alumnus of the Scholarship for Service CyberCorps program. Maksim graduated Summa Cum Laude with a Bachelor’s degree in Computer Science from the University of Maryland, Baltimore County (UMBC) in 2020 and earned his Master’s degree from the same institution in 2022. In 2024, he received his Ph.D. from UMBC, focusing on tensor decomposition methods for malware characterization.

Maksim’s research interests span an interdisciplinary set of topics in artificial intelligence (AI) and applied data science. He is particularly interested in leveraging AI to address challenges across diverse domains, including biology and cybersecurity. Maksim’s work in AI and data science includes tensor decomposition, pattern extraction, natural language processing (NLP), malware characterization, anomaly detection, text mining, large language models (LLMs), knowledge graphs (KGs), high-performance computing (HPC), and data privacy. In addition to research, Maksim actively develops high-performance software and efficient machine learning (ML) pipelines optimized for extra-large datasets and real-world applications. At LANL, Maksim was a member of the 2021 R&D 100 winning project SmartTensors AI, where he released fast tensor decomposition and anomaly detection software, contributed to the design and development of various other tensor decomposition libraries, and developed state-of-the-art text mining tools.

CV
Portfolio Headshot

Interests

Artificial Intelligence
Data Science
Tensor Decomposition
Cybersecurity
Natural Language Processing
High Performance Computing
Knowledge Representation
Pattern Extraction

Education

PhD in Computer Science, 2024
University of Maryland, Baltimore County (UMBC)
MS in Computer Science, 2022
University of Maryland, Baltimore County (UMBC)
BS in Computer Science, 2020
University of Maryland, Baltimore County (UMBC)
AA in Computer Science, 2018
Montgomery College (MC)

Featured Publications

Maksim E. Eren, Eric Michalak, Brian Cook, Johnny Seales Jr

March 2026 Under review in ACM Symposium on Document Engineering 2026 (DocEng ’26), 2026

Prompt Programming for Cultural Bias and Alignment of Large Language Models

Culture shapes reasoning, values, prioritization, and strategic decision-making, yet large language models (LLMs) often exhibit cultural biases that misalign with target populations. As LLMs are increasingly used for strategic decision-making, policy support, and document engineering tasks such as summarization, categorization, and compliance-oriented auditing, improving cultural alignment is important for ensuring that downstream analyses and recommendations reflect target-population value profiles rather than default model priors. Previous work introduced a survey-grounded cultural alignment framework and showed that culture-specific prompting can reduce misalignment, but it primarily evaluated proprietary models and relied on manual prompt engineering. In this paper, we validate and extend that framework by reproducing its social-science survey-based projection and distance metrics on open-weight LLMs, testing whether the same cultural skew and benefits of culture conditioning persist outside closed LLM systems. Building on this foundation, we introduce the use of prompt programming with DSPy for this problem (treating prompts as modular, optimizable programs) to systematically tune cultural conditioning by optimizing against cultural-distance objectives. In our experiments, we show that prompt optimization often improves upon cultural prompt engineering, suggesting prompt compilation with DSPy can provide a more stable and transferable route to culturally aligned LLM responses.

Preprint PDF Details

Maksim E. Eren, Dorianis M. Perez

November 2025 Self-published. November 2025

Rethinking Science in the Age of Artificial Intelligence

Artificial intelligence (AI) is reshaping how research is conceived, conducted, and communicated across fields from chemistry to biomedicine. This commentary examines how AI is transforming the research workflow. AI systems now help researchers manage the information deluge, filtering the literature, surfacing cross-disciplinary links for ideas and collaborations, generating hypotheses, and designing and executing experiments. These developments mark a shift from AI as a mere computational tool to AI as an active collaborator in science. Yet this transformation demands thoughtful integration and governance. We argue that at this time AI must augment but not replace human judgment in academic workflows such as peer review, ethical evaluation, and validation of results. This paper calls for the deliberate adoption of AI within the scientific practice through policies that promote transparency, reproducibility, and accountability.

Preprint PDF Details

Afia Anjum, Maksim E. Eren, Ismael Boureima, Boian S. Alexandrov, Manish Bhattarai

August 2024 In IEEE Conference on Machine Learning and Applications (ICMLA 2024) with Best Paper Runner Up Award, 2024

Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs

In recent years, Large Language Models (LLMs) have demonstrated remarkable capabilities across a wide range of natural language processing (NLP) tasks, such as question-answering, sentiment analysis, text summarization, and machine translation. However, the ever-growing complexity of LLMs demands immense computational resources, hindering the broader research and application of these models. To address this, various parameter-efficient fine-tuning strategies, such as Low-Rank Approximation (LoRA) and Adapters, have been developed. Despite their potential, these methods often face limitations in compressibility. Specifically, LoRA struggles to scale effectively with the increasing number of trainable parameters in modern large-scale LLMs. Additionally, Low-Rank Economic Tensor-Train Adaptation (LoRETTA), which utilizes tensor train decomposition, has not yet achieved the level of compression necessary for fine-tuning very large-scale models with limited resources. This paper introduces Tensor Train Low-Rank Approximation (TT-LoRA), a novel parameter-efficient fine-tuning (PEFT) approach that extends LoRETTA with optimized tensor train (TT) decomposition integration. By eliminating Adapters and traditional LoRA-based structures, TT-LoRA achieves greater model compression without compromising downstream task performance, along with reduced inference latency and computational overhead. We conduct an exhaustive parameter search to establish benchmarks that highlight the trade-off between model compression and performance. Our results demonstrate significant compression of LLMs while maintaining comparable performance to larger models, facilitating their deployment on resource-constrained platforms.

DOI Preprint PDF Details

News

AI tensor network-based computational framework cracks a 100-year-old physics challenge

THOR AI breaks curse of dimensionality in statistical mechanics calculations.

Last updated on Sep 19, 2025

Using AI to develop enhanced cybersecurity measures

New research helps identify an unprecedented number of malware families.

Last updated on Sep 19, 2025

Not too big - Machine learning tames huge datasets

Using the Summit supercomputer, Los Alamos algorithm breaks the exabyte barrier.

Last updated on Apr 14, 2025

Our paper that sets a new world record

The paper sets a new world record by simultaneously classifying an unprecedented number of malware families under extreme class imbalance, surpassing prior work by a factor of 29.

Last updated on Sep 27, 2023

R&D 100 winner of the day - SmartTensors AI Platform

The SmartTensors AI Platform, developed at Los Alamos National Laboratory, is a scalable, unsupervised machine-learning software suite capable of identifying and extracting essential hidden features and efficiently compressing information in massive datasets.

Last updated on Sep 28, 2023

Recent Publications

Maksim E. Eren, Eric Michalak, Brian Cook, Johnny Seales Jr (2026). Prompt Programming for Cultural Bias and Alignment of Large Language Models. Under review in ACM Symposium on Document Engineering 2026 (DocEng ’26), 2026.

Preprint PDF Details

Maksim E. Eren, Dorianis M. Perez (2026). Rethinking Scientific Practice in the Age of Artificial Intelligence. In ACM AI Letters, 2026.

DOI Preprint PDF Details

Maksim E. Eren, Dorianis M. Perez (2025). Rethinking Science in the Age of Artificial Intelligence. Self-published. November 2025.

Preprint PDF Details

Ryan Barron, Maksim E. Eren, Valentin Stanev, Cynthia Matuszek, Boian S. Alexandrov (2025). Topic Modeling and Link-Prediction for Material Property Discovery. In 25th ACM Symposium on Document Engineering (DocEng), 2025.

DOI Preprint PDF Details

Ryan Barron, Maksim E. Eren, Olga M. Serafimova, Cynthia Matuszek, Boian S. Alexandrov (2025). Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization. In 20th International Conference on Artificial Intelligence and Law (ICAIL), 2025.

DOI Preprint PDF Details

Manish Bhattarai, Ryan Barron, Maksim E. Eren, Minh Vu, Vesselin Grantcharov, Ismael Boureima, Valentin Stanev, Cynthia Matuszek, Vladimir Valtchinov, Kim Rasmussen, Boian Alexandrov (2025). HEAL: Hierarchical Embedding Alignment Loss for Improved Retrieval and Representation Learning. In 13th International Conference on Learning Representations, Workshop on Scaling Self-Improving Foundation Models without Human Supervision (ICLR 2025 SSI-FM).

DOI Preprint PDF Details

Maksim E. Eren, Boian S. Alexandrov, Charles Nicholas (2024). Classifying Malware Using Tensor Decomposition. Chapter in Springer Nature book Malware: Handbook of Prevention and Detection, 2024.

DOI Preprint PDF Details

Ryan Barron, Vesselin Grantcharov, Selma Wanna, Maksim E. Eren, Manish Bhattarai, Nicholas Solovyev, George Tompkins, Charles Nicholas, Kim O. Rasmussen, Cynthia Matuszek, Boian S. Alexandrov (2024). Domain-Specific Retrieval-Augmented Generation Using Vector Stores, Knowledge Graphs, and Tensor Factorization. In IEEE Conference on Machine Learning and Applications, Special Session on Machine Learning for Natural Language Processing (ICMLA 2024).

DOI Preprint PDF Details

Maksim E. Eren (2024). Advanced Semi-supervised Tensor Decomposition Methods for Malware Characterization. Ph.D. Dissertation in Computer Science at the University of Maryland, Baltimore County Department of Computer Science and Electrical Engineering.

Dissertation PDF Code Details

Afia Anjum, Maksim E. Eren, Ismael Boureima, Boian S. Alexandrov, Manish Bhattarai (2024). Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs. In IEEE Conference on Machine Learning and Applications (ICMLA 2024) with Best Paper Runner Up Award, 2024.

DOI Preprint PDF Details

Software

lanl/THOR

The THOR Project (Tensors for High-dimensional Object Representation) aims to advance the state-of-the-art in tensor calculations, manipulation, and research. We strive to provide a high-performance tensor library for various scientific applications, containing ready-to-use utilities and applications in Fortran, MATLAB, and Python.

lanl/T-ELF

Tensor Extraction of Latent Features (T-ELF) is one of the machine learning software packages developed as part of the R&D 100 winning SmartTensors AI project at Los Alamos National Laboratory (LANL). T-ELF presents an array of customizable software solutions crafted for analysis of datasets.

pyCP_ALS

pyCP_ALS is a Python implementation of the CP-ALS algorithm that was originally introduced in the MATLAB Tensor Toolbox.
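
For readers unfamiliar with CP-ALS, the idea can be sketched in a few lines of NumPy. This is a minimal illustration for a dense 3-way tensor and is not the pyCP_ALS API: the function names `khatri_rao`, `cp_als`, and `reconstruct`, the random initialization, and the fixed iteration count are all assumptions made for this example. Each step solves a linear least-squares problem for one factor matrix while the other two are held fixed.

```python
import numpy as np

def khatri_rao(B, C):
    """Column-wise Kronecker product of B (J x R) and C (K x R), shape (J*K, R)."""
    R = B.shape[1]
    return np.einsum("jr,kr->jkr", B, C).reshape(-1, R)

def reconstruct(factors):
    """Rebuild the dense tensor from CP factors A, B, C."""
    A, B, C = factors
    return np.einsum("ir,jr,kr->ijk", A, B, C)

def cp_als(X, rank, n_iters=100, seed=0):
    """Hypothetical helper: rank-`rank` CP decomposition of a 3-way array via ALS."""
    rng = np.random.default_rng(seed)
    I, J, K = X.shape
    A = rng.random((I, rank))
    B = rng.random((J, rank))
    C = rng.random((K, rank))
    # Mode-n unfoldings whose column ordering matches khatri_rao above.
    X1 = X.reshape(I, J * K)
    X2 = np.moveaxis(X, 1, 0).reshape(J, I * K)
    X3 = np.moveaxis(X, 2, 0).reshape(K, I * J)
    for _ in range(n_iters):
        # Update one factor at a time by solving its least-squares subproblem.
        A = X1 @ khatri_rao(B, C) @ np.linalg.pinv((B.T @ B) * (C.T @ C))
        B = X2 @ khatri_rao(A, C) @ np.linalg.pinv((A.T @ A) * (C.T @ C))
        C = X3 @ khatri_rao(A, B) @ np.linalg.pinv((A.T @ A) * (B.T @ B))
    return A, B, C
```

Calling `cp_als(X, rank=2)` on a NumPy array `X` of shape `(I, J, K)` returns three factor matrices, and `reconstruct` gives the corresponding low-rank approximation. A production implementation would add a convergence check, factor normalization, and sparse-tensor support.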

RFoT

Random Forest of Tensors (RFoT) is a novel ensemble semi-supervised classification algorithm based on tensor decomposition. We show the capabilities of RFoT when classifying Windows Portable Executable (PE) malware and benign-ware.

lanl/pyDNTNK

pyDNTNK is a software package for applying non-negative hierarchical tensor decompositions, such as Tensor Train and Hierarchical Tucker decompositions, in a distributed fashion to large datasets. It is built on top of pyDNMFk.
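
As background on the Tensor Train format, the sketch below shows the standard TT-SVD procedure in NumPy. It is an assumption-laden illustration only: it is neither distributed nor non-negative, so it does not reproduce what pyDNTNK does, and the names `tt_svd` and `tt_to_full` are hypothetical. It shows how a d-way tensor is split into a chain of 3-way cores by repeated reshaping and truncated SVD.

```python
import numpy as np

def tt_svd(X, max_rank):
    """Plain (unconstrained) TT-SVD: returns cores of shape (r_prev, n, r_next)."""
    dims = X.shape
    cores = []
    r_prev = 1
    M = X
    for n in dims[:-1]:
        # Fold the previous rank and the current mode into the rows.
        M = M.reshape(r_prev * n, -1)
        U, S, Vt = np.linalg.svd(M, full_matrices=False)
        r = min(max_rank, len(S))
        cores.append(U[:, :r].reshape(r_prev, n, r))
        # Carry the remainder (singular values times right vectors) forward.
        M = S[:r, None] * Vt[:r]
        r_prev = r
    cores.append(M.reshape(r_prev, dims[-1], 1))
    return cores

def tt_to_full(cores):
    """Contract the chain of cores back into a dense tensor."""
    out = cores[0]
    for G in cores[1:]:
        out = np.tensordot(out, G, axes=([-1], [0]))
    return out.reshape([G.shape[1] for G in cores])
```

With `max_rank` large enough, `tt_to_full(tt_svd(X, max_rank))` recovers `X` up to floating-point error; smaller values trade accuracy for compression.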

lanl/pyQBTNs

pyQBTNs is a Python library for Boolean matrix and tensor factorization using D-Wave quantum annealers.

lanl/pyCP_APR

pyCP_APR is a Python library for tensor decomposition and anomaly detection, developed as part of the R&D 100 award-winning SmartTensors project. It is designed for the fast analysis of large datasets by accelerating computation speed using GPUs.

lanl/pyDNMFk

pyDNMFk is a software package for applying non-negative matrix factorization in a distributed fashion to large datasets. It has the ability to minimize the difference between reconstructed data and the original data through various norms (Frobenius, KL-divergence).
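
To make the Frobenius-norm objective concrete, here is a minimal single-machine sketch of non-negative matrix factorization using the classic multiplicative update rules. It is not the pyDNMFk API and does not cover its distributed execution; the function name `nmf`, the random initialization, and the small constant `eps` are assumptions for illustration. The updates keep both factors non-negative while reducing the reconstruction error `||X - W H||_F`.

```python
import numpy as np

def nmf(X, k, n_iters=200, seed=0, eps=1e-9):
    """Hypothetical helper: factor a non-negative X (m x n) into W (m x k) and H (k x n)."""
    rng = np.random.default_rng(seed)
    m, n = X.shape
    W = rng.random((m, k)) + eps
    H = rng.random((k, n)) + eps
    for _ in range(n_iters):
        # Multiplicative updates for the Frobenius objective; eps guards against
        # division by zero (an implementation detail assumed here).
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H
```

For a non-negative array `X`, `W, H = nmf(X, k=3)` returns factors whose product `W @ H` approximates `X`. Swapping in the KL-divergence objective would change only the two update lines.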

lanl/pyDRESCALk

pyDRESCALk is a software package for applying non-negative RESCAL decomposition in a distributed fashion to large datasets. It can be utilized for decomposing relational datasets.