Skip to content

Machine Learning Model

Machine Learning (ML) encompasses a family of statistical and computational methods that build predictive models from data, without being explicitly programmed for the target task 1. On the Mat3ra platform, ML is applied in two main areas: universal machine-learned force fields (MLFFs) for atomistic simulations, and property prediction models that map materials descriptors to target properties.

Supported Approaches

Universal Machine-Learned Force Fields

Universal MLFFs are pre-trained interatomic potentials that predict energies, forces, and stresses for arbitrary compositions across the periodic table. They enable molecular dynamics and structural relaxation at near-DFT accuracy with orders-of-magnitude lower computational cost.

The platform supports running Python-based MLFFs through the general Python workflow infrastructure. Models that can be installed via pip and invoked through the Atomic Simulation Environment (ASE) are compatible with the platform. See the MatterSim tutorial for a step-by-step example covering bank workflows, custom workflows, and GPU execution.

Specific models with pre-built workflow templates include:

Model Architecture Coverage Reference
MatterSim Graph neural network Periodic table Yang et al. (2024) 2

Running other Python-based models

Any Python-based MLFF (e.g. MACE, CHGNet, SevenNet) can be run using the general Python workflow template. Add the model's package to requirements.txt and write the inference script in script.py. See Section 3 of the MatterSim tutorial for details.

Custom Training with DeePMD

For training custom neural network potentials from ab-initio molecular dynamics data, the platform supports end-to-end workflows combining Quantum ESPRESSO (Car–Parrinello MD), DeePMD-kit (training), and LAMMPS (production MD). See the DeePMD tutorial.

Statistical / Scikit-Learn Models

The platform also supports traditional ML workflows (regression, classification, clustering) through the Python ML infrastructure. These models use tabulated materials descriptors to predict target properties such as band gaps. Available tutorials:

Parameters

The list of parameters affecting ML models is presented in this page.

Properties

The classification of properties in the context of ML is discussed in a separate section of the documentation.

Units

Machine Learning-specific unit types are introduced here.

Structured Representation

This page contains an example structured representation for the ML model.

Example Workflow

The structure of an example Machine Learning workflow is reviewed in this page.

Accuracy

Important considerations when evaluating the accuracy of a machine-learned model are discussed in this page.


  1. Wikipedia Machine Learning 

  2. H. Yang et al., "MatterSim: A Deep Learning Atomistic Model Across Elements, Temperatures and Pressures," arXiv:2405.04967 (2024). arXiv