Maksim E. Eren

Maksim E. Eren

Scientist

Los Alamos National Laboratory

Biography

Maksim E. Eren is an early career scientist in A-4, Los Alamos National Laboratory (LANL) Advanced Research in Cyber Systems division. He is an alumnus of the Scholarship for Service CyberCorps program. Maksim graduated Summa Cum Laude with a Bachelor’s degree in Computer Science from the University of Maryland Baltimore County (UMBC) in 2020 and earned his Master’s degree from the same institution in 2022. In 2024, he received his Ph.D. from UMBC, focusing on tensor decomposition methods for malware characterization.

Maksim’s interdisciplinary research interests lie at the intersection of machine learning and cybersecurity, with a focus on tensor decomposition. His tensor decomposition-based research projects encompass large-scale malware detection and characterization, cyber anomaly detection, data privacy, biology, text mining, large language models, knowledge graphs, and high-performance computing. Maksim has developed and published state-of-the-art solutions in anomaly detection and malware characterization. He has also worked on various other machine learning research projects, including detecting malicious hidden code, adversarial analysis of malware classifiers, and federated learning. At LANL, Maksim was a member of the 2021 R&D 100 winning project SmartTensors AI, where he has released a fast tensor decomposition and anomaly detection software, contributed to the design and development of various other tensor decomposition libraries, and developed state-of-the-art text mining tools.

Interests

  • Machine Learning
  • Data Science
  • Tensor Decomposition
  • Cybersecurity
  • Natural Language Processing
  • High Performance Computing
  • Knowledge Representation
  • Pattern Extraction

Education

  • PhD in Computer Science, 2024

    University of Maryland, Baltimore County (UMBC)

  • MS in Computer Science, 2022

    University of Maryland, Baltimore County (UMBC)

  • BS in Computer Science, 2020

    University of Maryland, Baltimore County (UMBC)

  • AA in Computer Science, 2018

    Montgomery College (MC)

Recent Publications

Quickly discover relevant content by filtering publications.
(2024). Classifying Malware Using Tensor Decomposition. Chapter in Springer Nature book Malware; Handbook of Prevention and Detection, 2024.

DOI Preprint PDF Details

(2024). Domain-Specific Retrieval-Augmented Generation Using Vector Stores, Knowledge Graphs, and Tensor Factorization. In IEEE Conference on Machine Learning and Applications, Special Session on Machine Learning for Natural Language Processing (ICMLA 2024).

Preprint PDF Details

(2024). Advanced Semi-supervised Tensor Decomposition Methods for Malware Characterization. Ph.D. Dissertation in Computer Science at the University of Maryland, Baltimore County Department of Computer Science and Electrical Engineering.

Dissertation PDF Code Details

(2024). Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs. In IEEE Conference on Machine Learning and Applications (ICMLA 2024) with Best Paper Award, 2024.

Preprint PDF Details

(2024). Binary Bleed: Fast Distributed and Parallel Method for Automatic Model Selection. In the IEEE High Performance Extreme Computing (HPEC) Conference, 2024.

Preprint PDF Details

(2024). TopicTag: Automatic Annotation of NMF Topic Models Using Chain of Thought and Prompt Tuning with LLMs. In ACM Symposium on Document Engineering 2024 (DocEng ’24), 2024.

DOI Preprint PDF Details

(2024). Cyber-Security Knowledge Graph Generation by Hierarchical Nonnegative Matrix Factorization. In IEEE 12th International Symposium on Digital Forensics and Security (ISDFS), 2024.

DOI Preprint PDF Details

(2024). Catch'em all: Classification of Rare, Prominent, and Novel Malware Families. In IEEE 12th International Symposium on Digital Forensics and Security (ISDFS), 2024.

DOI Preprint PDF Details

(2023). Electrical Grid Anomaly Detection via Tensor Decomposition. In IEEE Military Communications Conference, Articial Intelligence for Cyber Workshop (MILCOM), 2023.

DOI Preprint PDF Details

(2023). Interactive Distillation of Large Single-Topic Corpora of Scientific Papers. In IEEE Conference on Machine Learning and Applications (ICMLA 2023), 2023.

DOI Preprint PDF Details

Software

*
lanl/T-ELF

lanl/T-ELF

Tensor Extraction of Latent Features (T-ELF) is one of the machine learning software packages developed as part of the R&D 100 winning SmartTensors AI project at Los Alamos National Laboratory (LANL). T-ELF presents an array of customizable software solutions crafted for analysis of datasets.

pyCP_ALS

pyCP_ALS

pyCP_ALS is the Python implementation of CP-ALS algorithm that was originally introduced in the MATLAB Tensor Toolbox.

RFoT

RFoT

Random Forest of Tensors (RFoT) is a novel ensemble semi-supervised classification algorithm based on tensor decomposition. We show the capabilities of RFoT when classifying Windows Portable Executable (PE) malware and benign-ware.

lanl/pyDNTNK

lanl/pyDNTNK

pyDNTNK is a software package for applying non-negative Hierarchical Tensor decompositions such as Tensor train and Hierarchical Tucker decompositons in a distributed fashion to large datasets. It is built on top of pyDNMFk.

lanl/pyQBTNs

lanl/pyQBTNs

pyQBTNs is a Python library for boolean matrix and tensor factorization using D-Wave quantum annealers.

lanl/pyCP_APR

lanl/pyCP_APR

pyCP_APR is a Python library for tensor decomposition and anomaly detection that is developed as part of the R&D 100 award wining SmartTensors project. It is designed for the fast analysis of large datasets by accelerating computation speed using GPUs.

lanl/pyDNMFk

lanl/pyDNMFk

pyDNMFk is a software package for applying non-negative matrix factorization in a distributed fashion to large datasets. It has the ability to minimize the difference between reconstructed data and the original data through various norms (Frobenious, KL-divergence).

lanl/pyDRESCALk

lanl/pyDRESCALk

pyDRESCALk is a software package for applying non-negative RESCAL decomposition in a distributed fashion to large datasets. It can be utilized for decomposing relational datasets.

Recent Talks