Program (final)

05.06.2025

The first author is the presenter.

DLCP2026 program (final).

Section 1. Machine Learning in Fundamental Physics

Нобелевская премия по физике 2024: почему её получили не физики, или Как физические принципы легли в основу методов машинного обучения

С.А.Доленко
НИИЯФ МГУ имени М.В.Ломоносова

Приглашенный доклад

Решение Нобелевского комитета присудить премию по физике 2024 года Джону Хопфилду (John Hopfield) и Джеффри Хинтону (Geoffrey Hinton) вызвало горячие споры. Имена этих учёных широко известны в сообществе, занимающемся машинным обучением и искусственным интеллектом, но почти незнакомы физикам. Оба они – люди с широким кругом научных интересов и работ в разных областях знания, но… они не физики! Как же могло получиться, что самая престижная в мире премия по физике была отдана людям, работающим хотя и в естественных, но в других науках? Является ли эта премия просто следствием общего всплеска общественного интереса к методам искусственного интеллекта и машинного обучения, или это фактическое признание не слишком известного факта, что в основе многих методов машинного обучения лежат открытия и инструменты исследования не только из нейробиологии и когнитивных наук, но и из физики? Попробуем разобраться.

3. Применение графовых нейронных сетей для сегментации хитов в глубоководном нейтринном телескопе Baikal-GVD

//Лев Осадчий (1), Иван Харук (2,3)
(1) физический факультет МГУ им. М. В. Ломоносова, (2) ИЯИ РАН, (3) МФТИ //

Исследуется применение графовых нейронных сетей для сегментации сигнальных хитов на фоне оптического шума в данных телескопа Baikal-GVD. В работе сравниваются архитектуры на основе графовых сверток, механизмов внимания и парадигмы Message Passing. Рассматриваются особенности адаптации моделей к данным детектора, в частности — использование финальных полносвязных слоев для сохранения различимости признаков между узлами. На основе анализа метрик определены наиболее эффективные подходы и функции агрегации для выделения полезного сигнала.

14. Kohn–Sham Spectral Embedding on Sparse Graphs at the Nishimori Temperature for Image Classification

//Vasiliy Stanislavovich Usatyuk(1,2), Denis Alekseevich Sapozhnikov(1), Sergei Ivanovich Egorov(2)
(1) T8 LLC, Moscow, Russia, (2) SWSU University, Kursk, Russia //

We present a physics-inspired spectral embedding framework that compresses high-dimensional image classifiers by mapping deep CNN features onto random-bond Ising models (RBIM) defined on sparse quasi-cyclic LDPC graphs at the Nishimori temperature [1,2]. Leveraging a Kohn–Sham-like self-consistency principle within an effective density functional theory (DFT) formulation [3], we construct a regularized graph Laplacian whose spectral structure captures global feature correlations. To avoid costly dense eigendecomposition, we exploit the quasi-circulant graph topology via a fast Fourier transform (FFT) to obtain a coarse spectral profile, then apply localized first-order corrections through Rayleigh subspace refinement around the absolute energy minima. Crucially, the global graph topology is optimized by resolving the trapping-set ontology—hierarchies [4] of small subgraphs that act as spurious attractors in the Bethe free-energy landscape [5]. We quantify these instabilities with a spectral-valley energy penalty derived from negative Bethe–Hessian modes and characterize landscape complexity via multiscale fractal analysis. These geometric invariants guide a graph-surgery procedure that suppresses deep energy valleys. Evaluated on the 1000-class ImageNet-1K dataset, our method achieves 88.83% top‑1 accuracy, outperforming strong baselines from the ResNet, DenseNet, ConvNeXt, ViT-Huge and Hierarchical ViT Swin Large families—while using only a lightweight EfficientNet‑B4 CNN feature extractor and an FFT+Rayleigh refinement backbone. The framework unifies concepts from statistical mechanics, topological coding theory, and fractal geometry into a compact, training‑ and inference‑efficient paradigm for large‑scale image classification, feature clustering. [1] Alaa Saade, Florent Krzakala, and Lenka Zdeborova. Spectral density of the non-backtracking operator on random graphs. EPL, 107(5):50005, 2014. [2] Lorenzo Dall'Amico, Romain Couillet, and Nicolas Tremblay. 2021. Nishimori meets Bethe: a spectral method for node classification in sparse weighted graphs. Journal of Statistical Mechanics: Theory and Experiment 2021, 9 (Sep 2021), Article 093405. https://doi.org/10.1088/1742-5468/ac21d3 [3] Penz, M., & van Leeuwen, R. 2021. Density-functional theory on graphs. The Journal of chemical physics, 155(24), 244111. https://doi.org/10.1063/5.0074249 [4] B. Vasić, S. K. Chilappagari, D. V. Nguyen and S. K. Planjery, “Trapping set ontology,” 2009 47th Annual Allerton Conference on Communication, Control, and Computing (Allerton), Monticello, IL, USA, 2009, pp. 1-7, doi: 10.1109/ALLERTON.2009.5394825 [5] Usatyuk, V.S., Sapozhnikov, D.A. & Egorov, S.I. Natural Image Classification via Quasi-Cyclic Graph Ensembles and Random-Bond Ising Models at the Nishimori Temperature. Moscow Univ. Phys. 80 (Suppl 3), S1042–S1056 (2025). https://doi.org/10.3103/S0027134925702947

24. Machine-Learning-Driven Particle Identification with the FARICH Detector under Realistic Operating Conditions

P. Rogozhin(1), M. Chadeeva(2), T. Uglov(1)
(1) HSE University, (The Lebedev Physical Institute of the Russian Academy of Sciences, (2) The Lebedev Physical Institute of the Russian Academy of Sciences

We present a machine-learning-based study of particle identification performance for the Focusing Aerogel Ring Imaging Cherenkov detector subsystem proposed for the future charm superfactory experiment. A dedicated Cherenkov-ring reconstruction pipeline is implemented in the detector simulation to account for realistic geometric distortions, detector response effects, and photosensor noise. The reconstructed FARICH observables are then used as input features for boosted-decision-tree classifiers optimized for particle separation under different assumptions on the photosensor dark count rate, including a conservative scenario with noise levels up to 1 MHz/mm². The performance of the ML-based PID approach is first evaluated using single-particle samples generated within the Aurora simulation framework. Its impact on physics analysis is then validated with the decay channel (D^0 \to K^- \mu^+ \nu_\mu), where reliable (\pi/\mu) separation is essential for background suppression. The developed classifier demonstrates high efficiency and robustness in realistic detector conditions, reducing the systematic uncertainty and background contribution associated with (\pi/\mu) misidentification to below 0.5%. These results show that machine-learning-based FARICH reconstruction and classification can provide a powerful and stable PID solution for precision charm-physics measurements.

31. Реконструкция ориентации многоканального изображающего детектора с помощью нейросетевой оценки отношения

//Р.Е. Сараев(1,2), С.А. Шаракин(1)
(1) НИИЯФ МГУ, (2) Физический факультет МГУ //

Наземные детекторы, входящие в состав системы PAIPS (Pulsating Aurora Imaging Photometers System), требуют обширного набора калибровок для проведения точной фотометрии, одной из которых является определение ориентации детектора. В данной работе реконструкция ориентации осуществляется с помощью байесовского вывода. В принципе, возможен и подход с использованием явной функции правдоподобия: простой метод предполагает использование одной или нескольких идентифицированных звезд. Однако, в детекторах PAIPS с низким угловым разрешением идентификация звезд затруднена, а ожидаемые звездные сигналы часто ненадежны из-за неопределенностей в спектральной чувствительности и процедуре выравнивания чувствительностей пикселей. Учет этих факторов вводит чрезмерное количество вспомогательных параметров (nuisance parameters), что делает сэмплирование целевого апостериорного распределения вычислительно невозможным. Для решения этой проблемы мы предлагаем метод с использованием неявной функции правдоподобия на основе нейросетевой оценки отношения (Neural Ratio Estimation, NRE). Данный подход позволяет стабильно восстанавливать ориентацию с приемлемой точностью, требуя лишь базовой предобработки данных. Метод может быть расширен для восстановления дополнительных параметров детектора, таких как фокусное расстояние и параметры дисторсии.

21. Feature-Based and Deep Learning Methods for Astronomical Light Curve Classification

Valentin Baymakanov(1), Khakim Khamitov(1), Alina Raimova(1), Anastasia Isaeva(2), Roman Taran(2), Mohammad Zedan(1), Matwey Kornilov(1,3,4), Mariia Demianenko(5,6), Konstantin Malanchev(7), Mikhail Hushchyn(1)
(1) HSE University, (2) Southern Federal University HSE University, (3)Lomonosov Moscow State University, (4)Sternberg astronomical institute, (5)Max-Planck-Institut für Astronomie, (6) Heidelberg University, (7) Carnegie Mellon University

Modern astronomical surveys produce vast amounts of time-domain data, making automated analysis of astronomical light curves increasingly important. Light curves describe the brightness variations of celestial objects over time and are widely used for the classification of variable stars, supernovae, and other transient phenomena. Their analysis is challenging due to irregular sampling, missing observations, and measurement uncertainties. In this work, we investigate machine learning and deep learning approaches for light curve classification. We employ datasets containing diverse classes of variable stars and transient events. For each photometric filter, statistical descriptors are extracted to form compact and interpretable feature embeddings. These embeddings are used to train machine learning models of varying complexity, while recurrent and Transformer-based architectures are explored for directly modeling temporal dependencies in multi-band observations. The proposed framework is evaluated on the task of multi-class classification of stellar types and achieves strong performance. The results demonstrate that statistical feature embeddings provide an efficient and interpretable representation of light curves, while LSTM and Transformer architectures effectively capture their temporal dynamics.

17. Lorentz-Equivariant Geometric Algebra Transformers for Top-Quark Event Reconstruction

Abasov E.(1), Dudko L.(1), Iudin E.(1), Markina A.(1), Perfilov M.(1), Volkov P.(1), Vorotnikov G.(1), Zaborenko A.(1)
(1) Skobeltsyn Institute of Nuclear Physics, Lomonosov Moscow State University

We develop a Lorentz-equivariant Geometric Algebra Transformer (LGaTr) as the encoder of a foundational model for top-quark physics. Each hard event is encoded as a multivector in the spacetime algebra Cl(1,3): tokens carry their raw 4-momentum in the grade-1 channel and invariant flavour/charge tags in grade-0, while the geometric product makes higher grades — decay-plane bivectors, 3-body trivectors, and the CP-odd pseudoscalar — emerge automatically. Since every grade transforms under the same rotor sandwich, the encoder is Lorentz-equivariant by construction, so invariant masses and angles are exact rather than learned. By the Cayley–Menger collapse GA generates no new invariant scalars beyond the 1-bit CP-odd pseudoscalar sign; its value is structural — enforcing equivariance and reducing multi-step covariant constructions such as top-spin correlations to a single operation. On a `pp → tWb` proof-of-concept the GATr-lite encoder reaches higher AUC compared to a hand-crafted-feature baseline with ~3 times fewer parameters.

66. Высокопроизводительное хранилище по запросу для поддержки инференса – преимущество подхода дизаггрегированной компонуемой инфраструктуры.

А.А. Московский
группа компаний РСК.

Приглашенный доклад

27. Application of machine learning for the analysis of four-jet final states in the CEPC experiment

//A.Staritsyna(1), M.Chadeeva(2)
(1)Moscow Institute of Physics and Technology, (2) Lebedev Physical Institute of the Russian Academy of Sciences //

An important part of the CEPC program is the precision measurement of the Higgs boson parameters. The purpose of this work is to study the process e-e+ → ZH at sqrt(s) = 240 GeV with the four-jet final states from decays Z → qqbar and H → bbbar. The four-jet final states have a high branching ratio but suffer from jet-merging problem and large background contribution. The CEPC detector simulation and event reconstruction using Delphes package have been performed for signal and different background processes corresponding to 5.6 ab-1 of integrated luminosity. Several approaches to signal from background separation have been studied: a set of cuts on kinematic variables, a classifier trained on the mixed dataset before cuts and a classifier trained on the mixed dataset after cuts. The set of cuts was based on 14 kinematic variables, the input data of the model include 84 kinematic variables. For high signal efficiency, the cut-based approach provides rejection of leptonic and semileptonic backgrounds to the few percent level, but cannot effectively suppress the four-jet backgrounds. Using the PyTorch framework, a Transformer model was trained for binary classification on a dataset consisting of 1,000,000 signal events and 1,000,000 events from each background channel. The best signal/background separation was achieved by combining loose kinematic cuts with the trained Transformer model.

36. Towards Foundational Models for HEP: Learning Universal Top-Quark Event Representations

//Zaborenko A.(1), Abasov E.(1), Dudko L.(1), Markina A.(1), Perfilov M.(1), Volkov P.(1), Vorotnikov G.(1)
(1) SINP MSU //

We present a framework for universal foundational models for top-quark physics, following the pre-train-then-fine-tune paradigm established in computer vision and natural language processing. Modern HEP experiments produce datasets of unprecedented scale and diversity, yet conventional ML pipelines require task-specific engineering for every analysis. Our goal is to learn general event representations that transfer across downstream tasks without hand-crafted features. The pre-training corpus spans five process groups (0–4 top quarks), covering both Standard Model and BSM final states, totalling approximately 8 million events generated with MadGraph5 and CompHEP. We compare two encoder architectures with progressively stronger inductive biases. An MLP baseline flattens each event into a fixed 130-dimensional vector, with permutation structure and Lorentz symmetry learned entirely from data. A Detection Transformer (DETR) treats each event as an unordered set of particle tokens with type-wise permutation-equivariant attention and dedicated per-particle regression, classification, bipartite-assignment, and high-level reconstruction heads. Across all evaluated tasks, DETR substantially outperforms the MLP baseline, with the performance gap widening on complex topologies and rare decay classes. These results establish the transformer-based set encoder as a strong backbone for HEP foundational models and motivate future work on fine-tuning pre-trained representations for specific physics analyses.

49. Deep learning methods for gamma event selection in TAIGA-IACT image analysis in stereo-mode

Е.О. Гресь(1,2), А.П. Крюков(2), П.А. Волчугов(2), А.П. Демичев(2), Ю.Ю. Дубенская(2), Д.П. Журов(1,2), С.П. Поляков(2), Е.Б. Постников(2), А.Ю. Разумов(2)
(1) НИИПФ ИГУ, Иркутск, (2) НИИЯФ МГУ, Москва

In astrophysical complex TAIGA there are five Imaging Atmospheric Cherenkov telescopes (IACTs), located at a distance of 300-500 m from each other, due to which extensive air showers (EAS) generated by cosmic and gamma radiation are recorded with multiple telescopes (stere-mode). The analysis of EAS images from two or more Cherenkov telescopes together is of particular interest, since for this approach the reconstruction of physical parameters of the primary gamma ray is much more accurate than in mono-observations. However, the gamma-ray event reconstruction strongly depends on the quality of the hadron background suppression of the classifier, therefore, the gamma event selection stage is one of the important one in the further analysis of IACT data. In this work we propose and study several neural network methods for gamma event selection in joint TAIGA-IACT image analysis. The first neural network method we suggested involves a single-stage analysis of IACT images from two telescopes, while the second method considers the creation and study of image essential features extracted by autoencoders that contain the EAS physical parameters, including the type of primary particle.

50. Methodology for Processing Open Data for Machine Learning Models in BSM Searches

Volkov P(1), Abasov E.(1), Dudko L.(1), Markina A.(1), Vorotnikov G.(1), Perfilov M.(1), Zaborenko A.(1), Savkova N.(1), Gorin D.E.(2), Vasilevskii O.S.(2)
(1) SINP MSU, (2) Physical Faculty MSU

The open data published by CERN allow us to access a portion of the LHC dataset, but they come in raw form. To build ML models for beyond‑the‑Standard‑Model searches, one must apply a variety of processing techniques, such as jet energy corrections (JEC/JER), PDF reweighting, trigger selection, event weighting, and others. In this talk, we will demonstrate how these tools are used in a specific analysis with a Lepton+jets final state.

54. On neural inverse problem of celestial mechanics

Shorokhov Sergey
RUDN University named after Patrice Lumumba Moscow

The inverse problem of celestial mechanics seeks to determine the potential forces that constrain a particle to move along trajectories belonging to a prescribed family of curves. Since paths within the same family may be traversed at different speeds and energy levels, the classical formulation requires ensuring the consistency of an overdetermined system of first-order partial differential equations for two unknowns: the potential function and the energy function (the latter being a first integral along each trajectory). We propose a data-driven analog of this problem by constructing and training second-order Neural Ordinary Differential Equations (Neural ODEs) directly from discrete trajectory observations. The neural network maps 3D spatial coordinates to a scalar potential value. Given an initial position and velocity, along with sparse spatiotemporal measurements of position and velocity, our objective is to learn a potential field such that a particle released from the initial state reproduces the observed trajectory with high fidelity. The problem is solved within the Neural ODE framework by deriving an extended adjoint system and integrating it backward in time. We present a stable and efficient training algorithm for the potential network, demonstrating how modern differentiable programming can bridge classical mechanics and machine learning for dynamical system identification.

52. Extraction of the light component of cosmic rays from TAIGA-IACT data

//П.А. Волчугов(1), А.П. Крюков(1), Е.О. Гресь(1,2), А.П. Демичев(1), Ю.Ю. Дубенская(1), Д.П. Журов(1,2), С.П. Поляков(1), Е.Б. Постников(1), А.Ю. Разумов(1)
(1) НИИЯФ МГУ, Москва, (2) НИИПФ ИГУ, Иркутск //

The TAIGA astrophysical complex [1], located in the Tunka Valley (Republic of Buryatia), covers an area of about 1.1 km2 and is designed to study gamma rays with energies above 3-4 TeV and cosmic rays with energies above 200 TeV. The complex includes the TAIGA-IACT facility, which currently consists of four atmospheric Cherenkov telescopes. TAIGA-IACT makes it possible to distinguish extensive air showers (EAS) induced by primary gamma rays from hadronic EAS with high accuracy, and therefore TAIGA-IACT data are used to solve gamma-ray astronomy problems. In this work, we propose a technique for using TAIGA-IACT data to extract the light component of cosmic rays.
1. L. A. Kuzmichev et al. Cosmic ray study at the astrophysical complex taiga: Results and plans. Physics of Atomic Nuclei. — 2021 — Vol. 84, no. 6 — P. 966–974.

7. Neural Network-Based Shower Core Reconstruction in the HiSCORE Experiment using Autoencoder-Derived Essential Features

//Ю.Ю. Дубенская(1), А.П. Крюков(1), П.А. Волчугов(1), Е.О. Гресь(1,2), А.П. Демичев(1), Д.П. Журов(1,2), С.П. Поляков(1), Е.Б. Постников(1), А.Ю. Разумов(1)
(1) НИИЯФ МГУ, Москва, (2) НИИПФ ИГУ, Иркутск //

The goal of this work is to investigate how to prepare and process HiSCORE data so as to minimize losses, while also bringing it into a form suitable for combining with data from other detector types of the TAIGA project. Our main idea is to use a neural network that reconstructs the core position using essential features (i.e., a compact representation of the data) as input. The essential features are not related to the dimensions and format of the data, so the essential features for different types of detectors can be easily combined. As essential features we propose to use a latent space of an autoencoder (AE) trained on HiSCORE data. We demonstrate that the latent space of a properly trained autoencoder can be used to reconstruct the core position of the EAS, and that adding additional pre-computed angle parameters to the input further improves the reconstruction accuracy. The standard deviation of the core position reconstruction error for the proposed method is 15-17 meters, which is comparable with the results obtained by other methods.

1. Invertible Neural Networks and the Possibility of their Applications to Cosmic Rays Data Analysis

//А.П. Крюков(1), П.А. Волчугов(1), Е.О. Гресь(1,2), А.П. Демичев(1), Ю.Ю. Дубенская(1), Д.П. Журов(1,2), С.П. Поляков(1), Е.Б. Постников(1), А.Ю. Разумов(1)
(1) НИИЯФ МГУ, Москва, (2) НИИПФ ИГУ, Иркутск //

14

Section 2. Machine Learning in Natural Sciences

13. Метод автоматизированной обработки научных публикаций

//Хвостова Мария Олеговна (1), Артамонов Алексей Анатольевич(2), Пряхина Дарья Игоревна (3)
(1) Филиал МГУ в г. Дубне, (2) НИЯУ МИФИ, (3) ОИЯИ //

В работе представлен метод автоматизированной обработки полнотекстовых научных публикаций в формате PDF, который выполняет анализ основных типов информации – таблиц, рисунков, текста. Таблицы и рисунки структурируются на уровне распознавания названий, извлечения и упорядочивания ячеек. Это позволяет преобразовать визуальные объекты в машиночитаемый формат для дальнейшего индексирования, а сохранение соответствующих областей документа-источника в виде изображений даёт возможность верифицировать извлечённые данные вручную. Текст анализируется по предикатно-аргументной модели. Аргументы, выполняют разные синтаксические роли – агент, пациенс, локатив, темпоратив, мера, инструмент, результат – и несут информацию об объектах, задействованных в эксперименте (материалы, оборудование и установки), условиях проведения эксперимента (место, время, параметры), количественных характеристиках. Предикаты устанавливают отношения между аргументами, благодаря чему формируются структурированные смысловые единицы, которые извлекаются с помощью набора продукционных правил. Атрибутирование источника осуществляется при извлечении ключевых библиографических сведений – авторов, названия, журнала и даты публикации, цифрового идентификатора . В работе используются: свёрточные нейронные сети для оптического распознавания символов и детекции структурных объектов на страницах публикации; трансформерная модель для морфо-синтаксического анализа предложений; языковая модель для извлечения неграмматичных данных; лингвистический анализатор на основе продукционных правил для грамматичных данных. Метод устойчив к вариативности визуально-структурной модели документа, не требует обучения на размеченных данных и может быть адаптирован для различных научно-технических дисциплин. Результаты ориентированы на практическое применение в наукометрических системах и базах знаний.

28. A Physics-Informed Neural Network Framework for Multidimensional Heat Transfer: Benchmarking Against Finite-Difference Methods

Kiura Stanley Matheka (1), Dmitry Efremenko (2), Fedor Buzaev (2)
(1) Russian State Social University, (2) HSE Univeversity

We develop a physics-informed neural network framework for steady-state and transient heat transfer in one, two, and three dimensions. The governing heat equation, together with initial and boundary conditions, is embedded directly into the loss function, allowing the network to learn a continuous temperature field without labeled data. Spatial and temporal derivatives are evaluated by automatic differentiation at collocation points, resulting in a mesh-free residual-minimization method. The novelty of the framework lies in the combination of several design choices that improve the stability and accuracy of PINN training for heat-transfer problems. Smooth activation functions are used to preserve the differentiability required for evaluating the PDE residual, while adaptive loss weighting balances the contributions of the governing equation, boundary conditions, and initial conditions. Dirichlet and Neumann boundary conditions are imposed through analytic mask functions, which reduces the need for the network to learn boundary constraints from penalty terms. Training is performed in two stages, with Adam used for the initial optimization and L-BFGS applied for final refinement. For transient problems, a warm-start strategy is introduced, where the steady-state solution is used to initialize the time-dependent model, improving convergence and temporal accuracy. The proposed method is benchmarked against finite-difference reference solutions. We analyze accuracy, convergence behavior, and residual distributions near steep thermal gradients and material interfaces. The results provide practical design guidelines for applying PINNs to multidimensional heat transfer problems. Keywords: physics-informed neural networks; heat transfer; automatic differentiation; residual minimization; finite-difference benchmarks; mesh-free methods; scientific machine learning.

22. An Approach to Solving Model Problems in Computational Fluid Dynamics Based on Physics-Informed Neural Networks

Tarasov Alexey Alexandrovich(1), Stepanova Margarita Mikhailovna(1)
(1) Saint-Petersburg State University

In recent years, physics-informed neural networks (PINNs) have frequently been used to solve applied problems in mathematical physics. PINNs incorporate governing equations and boundary conditions directly into the model architecture through automatic differentiation. This approach makes it possible to solve differential equations without constructing a computational mesh, which is particularly advantageous for multidimensional and time-dependent problems. In the long term, the considered method may be used to model nonlinear heat-transfer processes in cooling systems with complex geometries, including problems related to nuclear power engineering. The main difficulties in using PINNs are associated with the nature of the method itself: direct minimization of the equation residual does not guarantee convergence to a physically meaningful result. Training may proceed slowly or even converge to a trivial solution. To prevent this, special techniques are employed; however, their effectiveness strongly depends on the particular problem being solved, while the practice of applying such techniques remains poorly systematized. This work examines and implements four such techniques — Fourier feature embedding, causal training, loss balancing, and time window marching — on problems of increasing complexity. The numerical experiments show that the contributions of individual training techniques are highly nonuniform. The most stable improvement is primarily obtained with Fourier feature embedding, whereas the influence of the other techniques is weak or even negative and requires careful interpretation. Particular attention is given to unsteady regimes of fluid flow in a channel, including the case with an obstacle, where PINNs exhibit limitations that cannot be reduced merely to hyperparameter tuning. The neural network solution is validated by comparison with a reference solution obtained using the finite element method.

34. Detection of Slowly Developing Anomalies in Engineering Systems Based on Matrix Profile Family Algorithms and Multivariate Time Series Representation Methods

Kalita Anastasiia(1), Sergey Ivanov(1), Vasilii Borisov(1), Alexander Manzhurov(1), Stanislav Polyakov(1), Mikhail Ronkin(1), Mikhail Hushchyn(2), Pavel Solovev(3)
(1) Ural Federal University, (2) HSE University, (3) DATARK LLC

This paper investigates the applicability of Matrix Profile family algorithms for detecting slowly developing anomalies in multivariate telemetry time series of engineering systems. Six representation schemes are considered – per-channel aggregation, principal component analysis (PCA), uniform manifold approximation and projection (UMAP), an autoencoder (AE), a variational autoencoder (VAE), and contrastive learning – in combination with the scalable time-series ordered-search Matrix Profile (STOMP) and discords aware Matrix Profile (DAMP) algorithms, as well as the native multivariate extensions mSTOMP (multivariate STOMP) and Multidim MP (Multidimensional Matrix Profile). The experiments were carried out on real telemetry data from a data center heating, ventilation, and air conditioning (HVAC) system (60 channels, 30-minute sampling interval). The primary metric is the anomaly Detection Delay under the constraint of false positive rate (FPR) ≤ 0.05. The best result among the semi-supervised methods was achieved by STOMP | Contrastive with feature selection: Matthews Correlation Coefficient (MCC) = 0.27 ± 0.10, Detection Delay = 4960 ± 134 min – a result comparable to supervised methods (support vector classifier: MCC = 0.29 ± 0.03). Five of the eight deep-learning methods failed to detect a single anomalous segment in the semi-supervised setting.

33. Large Language Models for Tabular Data Classification

//Timofei Izhitskii(1), Kim Minjung(1), Egor Filatov(1), Denis Matveev(1), Andrey Larionov(1), Mikhail Hushchyn(1)
(1) HSE University //

Large Language Models (LLMs) are increasingly being explored as tools for processing structured data. This study evaluates the effectiveness of LLMs in tabular data classification and compares their performance with classical machine learning algorithms and specialized neural architectures. Experiments were conducted on benchmark datasets containing numerical and categorical features. The robustness of the models was assessed under varying levels of missing data, and the possibility of training a single language model on multiple datasets simultaneously was investigated. The results demonstrate that LLMs achieve competitive classification performance on tabular data, although classical machine learning methods generally provide higher accuracy. All evaluated models exhibit a gradual decline in performance as the proportion of missing values increases. At the same time, language models offer an important advantage over traditional machine learning approaches: a single model can be trained on several datasets and subsequently applied to multiple classification tasks. The findings confirm the potential of LLMs for structured data analysis and highlight their ability to serve as a unified framework for solving diverse classification problems. Although classical machine learning methods remain highly effective for individual tabular tasks, language models open new opportunities for developing versatile multi-task classification systems.

32. Сверточные нейронные сети для разработки углеродных наносенсоров с управляемыми люминесцентными свойствами

Г. Н. Чугреева(1), К. А. Лаптинский(2), Т. А. Доленко(1)
(1) МГУ имени М.В.Ломоносова, Физический факультет, (2) МГУ имени М.В.Ломоносова, НИИЯФ им.Д.В.Скобельцына

Углеродные точки (УТ) - наночастицы с интенсивной и чувствительной к окружению фотолюминесценцией (ФЛ), обладающие широкими перспективами для использованbя в качестве оптических наносенсоров параметров среды. Большинство работ определяют ФЛ по заранее известным условиям синтеза наночастиц, однако задача по поиску параметров синтеза, при которых отклик ФЛ наиболее чувствителен к определенному параметру среды, остаётся нерешённой. В данной работе было показано, что нейронные сети позволяют подобрать параметры синтеза УТ (соотношение этилендиамина и лимонной кислоты (CA:ED), температура и время) для максимальной чувствительности к ионам Cu²⁺ и Co²⁺. Было синтезировано 74 образца УТ при варьировании параметров синтеза; для каждого приготовлено 100 растворов с линейно меняющейся концентрацией металлов (всего 7400 спектров). Двумерная свёрточная сеть определяла концентрацию исследуемых ионов по матрицам возбуждения-испускания ФЛ, после чего полносвязная сеть по полученным средним абсолютным ошибкам находила оптимальные условия синтеза наночастиц. Было выяснено, что наиболее чувствительные УТ для Cu²⁺ получаются при соотношении CA:ED = 1:2, 120 °C, 240 мин; для Co²⁺ - при том же соотношении, 160 °C, 240 мин. Таким образом, сочетание свёрточной и полносвязной архитектур нейронных сетей успешно решает задачу «синтез–свойства–применение» для создании углеродных наносенсоров с высокой чувствительностью к катионам в водных растворах. Исследование выполнено за счёт гранта Российского научного фонда № 22-12-00138-П, https://rscf.ru/project/22-12-00138/. Работа Г.Н. Чугреевой поддержана Фондом развития теоретической физики и математики «БАЗИС» (договор № 23-2-2-22-1)

35. OPTIMIZATION OF THE OPERATING TEMPERATURE RANGE OF SEMICONDUCTOR GAS SENSORS USING MACHINE LEARNING

Isaev I.V.(1,2,3), Chernov K.N.(4), Dolenko S.A.(1), Krivetskiy V.V.(2, 5)
(1) D.V. Skobeltsyn Institute of Nuclear Physics, M.V. Lomonosov Moscow State University, (2) Scientific-Manufacturing Complex Technological Centre, (3) MIREA – Russian Technological University, (4) Physics Department, M.V. Lomonosov Moscow State University, (5) Chemistry Department, M.V. Lomonosov Moscow State University

This study addresses the application of semiconductor gas sensors to detect gases and volatile organic compounds for air quality monitoring in urban and industrial areas. To provide selectivity in detecting specific gases, as well as high temporal resolution of the sensors, nonlinear temperature operating conditions, the so-called heating dynamics, were used. Since the physicochemical models describing the interaction processes between gases and sensors are highly complex, machine learning methods based on experimental data were used to process the sensor responses. Heating dynamics utilizing a wide temperature range provide more information, which may increase sensitivity and selectivity of the sensor. However, using wide temperature ranges leads to increased measurement time, increased power consumption, and accelerated sensor degradation. Also, the corresponding increase in the number of measurement instances and therefore of the feature space dimensionality leads to a decrease in the stability and performance of machine learning models. Therefore, for each target gas there is a need to determine not only the optimal heating dynamics, but also the optimal temperature range covered by this dynamics. This study was devoted to finding optimal operating temperature ranges based on the performance of machine learning models, ensuring maximum selectivity and response rate for target gases. The study was carried out at the expense of the grant No. 22-19-00703-P from the Russian Science Foundation, https://rscf.ru/en/project/22-19-00703/.

38. Сравнение глубоких архитектур при различных уровнях предобработки фМРТ в задаче бинарной классификации мозговой активности

А.С.Макаров(1), И.М.Гаджиев(1), С.А.Доленко(2)
(1) Физический факультет МГУ имени М.В.Ломоносова, (2) НИИЯФ МГУ

Данная работа является продолжением предыдущих исследований, в которых рассматривалась возможность решения задачи распознавания когнитивных состояний на основе анализа нейрофизиологических данных о мозговой активности с помощью классических методов машинного обучения. В настоящей работе в качестве методов машинного обучения рассматриваются глубокие нейросетевые архитектуры. В отличие от предыдущего исследования, где решались также задачи многоклассовой классификации (различение состояний мозга при решении 6 типов задач и состояния покоя), в настоящей работе мы фокусируемся только на более простой задаче бинарной классификации: определение того, решает ли испытуемый в данный момент когнитивную задачу или находится в состоянии покоя. Для получения нейрофизиологических данных использовался метод функциональной магнитно-резонансной томографии (фМРТ). Были рассмотрены три глубокие архитектуры: CNNLSTM (3D свёрточная нейронная сеть (СНС) с двунаправленной рекуррентной обработкой типа LSTM), CNNSimpleAttention (3D СНС с механизмом временного внимания) и CNNAttentionZ (оригинальная архитектура с перестановкой осей координат, позволяющая 3D СНС обрабатывать время как дополнительное пространственное измерение). Все модели обучались на коротких временных окнах из 5 разностных 3D изображений (2,5 секунды) с разделением по испытуемым (23 на обучение, 8 на тестирование). Поскольку выбор уровня предобработки данных фМРТ может существенно влиять на качество классификации, были рассмотрены три варианта: сырые данные (Raw), данные только с коррекцией артефактов движения (MC) и данные, прошедшие полный стандартный конвейер предобработки (Full), включающий восемь этапов обработки. По результатам работы были сделаны выводы о выборе архитектуры и целесообразности использования полного цикла предобработки или её отсутствия. Показано, что в ряде случаев минимальная предобработка может быть достаточной, тогда как применение полного конвейера не всегда даёт преимущество. Полученные результаты определяют направления для дальнейшего поиска оптимальных сочетаний архитектур и схем предобработки при решении задач классификации типов мозговой активности.

41. Transfer Learning via Source-Target Domain Data Mixing for Solving Multi-Parameter Inverse Problems in Optical Spectroscopy

Mardanov A.P.(1), Guskov A.A. (1, 2), Isaev I.V. (2), Laptinskiy K.A. (2), Dolenko T.A. (1, 2), Dolenko S.A. (2)
(1) Faculty of Physics, M.V. Lomonosov Moscow State University, Moscow, Russia (2) D.V. Skobeltsyn Institute of Nuclear Physics, M.V. Lomonosov Moscow State University, Moscow, Russia

This study is devoted to solving a multi-parameter inverse problem of photoluminescence spectroscopy – determining the concentrations of metal ions in multicomponent aqueous solutions from the excitation-emission matrices of carbon dots introduced into the solution. The problem is complicated by the lack of an adequate physico-mathematical model capable of explicitly describing the relationship between the photoluminescence spectra of carbon dots and the concentrations of individual components in a multicomponent solution. In this regard, we use machine learning methods based on experimental data to solve this complex inverse problem of optical spectroscopy. In practical applications, the analysis of the composition of various liquid media may sometimes require expanding the set of monitored components, for example, upon the appearance of a new substance not present in the initial system. In such cases, building machine learning models from scratch to determine the concentrations of components of the expanded system requires collecting a new representative experimental dataset, which entails significant time and financial costs. In this context, transfer learning methods are of particular interest, as they allow using the knowledge accumulated during solving the source problem to build models for the new target task. In this study, transfer learning techniques are applied to transition from a 6-parameter photoluminescence spectroscopy inverse problem (source domain, 6P) to a 7-parameter problem (target domain, 7P), which differs by the inclusion of an additional component in the analyzed system. The goal of this study is to investigate the effectiveness of different transfer learning strategies for the transition from the source 6-parameter to the target 7-parameter photoluminescence spectroscopy problem. Particular attention is paid to different source-target domain data mixing strategies implemented at different stages of neural network training. The investigated strategies include adaptation of a model pretrained on the source domain to the target domain (6P → 7P, fine-tuning), multi-stage transfer learning involving intermediate training of the model on the combined source and target domain dataset (6P → 6P+7P → 7P), dynamic adjustment of the contribution of source and target domain data during model optimization (dynamic source-target weighting), and target-aware early stopping, where target domain data is used to select the optimal state of the model at intermediate stages of training. The effectiveness of the proposed strategies is evaluated based on their performance on an independent test set of the target 7-parameter task. This study has been conducted at the expense of the Russian Science Foundation grant No. 24-11-00266; https://rscf.ru/en/project/24-11-00266/.

42. One Model for All Columns: Unified Learning for Tabular Data

//Kirill Katsuba(1), Roman Degtiarev(1), Nikita Sevriukov(1), Mikhail Hushchyn(1)
(1) HSE University //

Modeling tabular datasets requires learning the statistical dependencies between variables rather than solving a single prediction task. A model that captures the joint structure of the data can be used for a variety of applications, including missing-value imputation, conditional generation, and prediction of arbitrary features from the remaining columns. For example, such a model should be able to predict any column of a dataset given the values of all other columns. Conventional machine-learning approaches are typically target-specific. Methods such as gradient boosting require training a separate model for each prediction target, making them less flexible when multiple prediction tasks must be solved within the same dataset. In this work, we investigate whether a single model can learn all dependencies present in tabular data and be reused to predict arbitrary columns without additional training. We study neural and generative approaches for modeling conditional feature distributions and propose architectural modifications aimed at improving the reconstruction of heterogeneous tabular data. Experiments on five benchmark datasets demonstrate that the proposed approach effectively captures inter-feature relationships and achieves strong performance in conditional prediction tasks. The results highlight the potential of unified tabular modeling as a flexible alternative to target-specific machine-learning pipelines.

43. Преодоление доменного сдвига в спектрах КР природных вод: подход на основе доменной адаптации без учителя для многокомпонентной регрессии

Л.С.Утегенова(1), С.А.Буриков(1), А.М.Вервальд(1,2), И.В.Пластинин(1,2), Т.А.Доленко(1,2), С.А.Доленко(2)
(1) Московский государственный университет имени М.В.Ломоносова, физический факультет, (2) Научно-исследовательский институт ядерной физики имени Д.В.Скобельцына Московского государственного университета имени М. В.Ломоносова

Антропогенное загрязнение природных вод ионами металлов остаётся одной из актуальных проблем экологического мониторинга. Высокая токсичность этих соединений и их способность к накоплению в экосистемах требуют создания систем оперативного контроля, способных заменить трудоёмкие лабораторные методики, не приспособленные для массового экспресс-анализа. В этом контексте спектроскопия комбинационного рассеяния (КР) света рассматривается как перспективная основа для дистанционного и неразрушающего анализа. Однако её применение сдерживается сложностью интерпретации многокомпонентных спектров. Поэтому актуальным направлением развития спектроскопических методов анализа водных сред становится интеграция этих методов с технологиями машинного обучения. Ключевым препятствием на пути внедрения нейросетевых моделей в практику экологического мониторинга является дефицит размеченных спектральных данных для реальных водных объектов. Формирование такой базы требует параллельного проведения дорогостоящего химического анализа каждого образца, что для природных вод с их пространственно-временной изменчивостью становится практически нереализуемым. В результате модели, обученные на лабораторных растворах с известными концентрациями, при переносе на реальные пробы демонстрируют резкое ухудшение точности из-за различий в распределениях данных, - так называемого доменного сдвига, обусловленного присутствием органических примесей, флуоресцентного фона и вариациями физико-химических условий среды. В работе исследуется перенос знаний с базы КР-спектров модельных растворов (исходный домен) на неразмеченные спектры природной воды Москвы-реки (целевой домен). Для преодоления указанного ограничения в настоящей работе исследуется применение доменной адаптации без учителя (Unsupervised Domain Adaptation) — подхода, позволяющего перенести знания, полученные на размеченном исходном домене, на неразмеченный целевой домен. Для адаптации в сеть вводится доменный дискриминатор, соединённый с экстрактором признаков через слой обратного градиента. В ходе состязательного обучения экстрактор учится формировать признаки, неразличимые для дискриминатора, что позволяет переносить знания на целевой домен без использования меток. Таким образом, сотни неразмеченных целевых спектров задействованы только для уменьшения доменного сдвига, тогда как прямое определение концентраций осуществляется без их участия; успешность этой процедуры подтверждена визуализацией в виде t‑SNE-проекций, показывающих значительное перекрытие распределений. Применение разработанного подхода позволило достичь снижения средней абсолютной ошибки определения концентраций ионов в растворе по сравнению с неадаптированной моделью на 70%. Полученные результаты подтверждают, что доменная адаптация без учителя является эффективным инструментом для преодоления нехватки размеченных данных в задачах спектроскопического анализа природных вод. Разработанная методология может быть распространена на другие типы водных объектов и другие спектроскопические задачи. Исследование выполнено за счёт гранта Российского научного фонда № 24-11-00266, https://rscf.ru/en/project/24-11-00266/. Работа Л.Утегеновой поддержана фондом теоретической физики и математики «Базис» (Договор № 25-2-1-136-1).

46. Использование алгоритмов машинного обучения для доменной адаптации при переходе между данными космических аппаратов

Пятикоп А.В.(1), Гаджиев И.М. (1), Баринов О.Г. (2), Доленко С.А. (2)
(1) Физический факультет, Московский государственный университет имени М.В. Ломоносова (МГУ), (2) Научно-исследовательский институт ядерной физики имени Д.В. Скобельцына МГУ (НИИЯФ МГУ)

Геомагнитные возмущения являются одним из важнейших факторов космической погоды: они способны нарушать работу радиосвязи, систем навигации, линий электропередач, что делает задачу их прогнозирования практически значимой. Прогнозирование геомагнитного индекса Dst основывается на параметрах солнечного ветра (СВ) и межпланетного магнитного поля (ММП), измеряемых космическими аппаратами (КА) ACE и DSCOVR в точке Лагранжа L1 между Землёй и Солнцем. Однако при прямом переносе обучения с данных одного КА на данные другого возникает проблема ухудшения качества прогнозирования из-за различий в конструктивных особенностях аппаратов и статистических свойствах измерений, а также их положения в пространстве. Для решения этой проблемы в работе исследуются методы доменной адаптации, направленные на построение отображения данных из исходного домена в целевой, позволяющие обеспечить совместное использование данных из доменов обоих КА. В качестве входных данных используются компоненты ММП, параметры СВ (скорость, плотность, температура) и значения индекса Dst, представленные в виде отрезков многомерного временного ряда с различными задержками для отражения предыстории каждого из параметров. Для прогнозирования применяются линейные модели с регуляризацией, градиентный бустинг (в реализации LightGBM) и полносвязные нейронные сети. В качестве методов доменной адаптации рассматриваются линейные модели, градиентный бустинг, метод проекции на латентные структуры, фильтр Калмана и доменно-состязательные нейронные сети. Исследуются два практически значимых сценария доменной адаптации: DSCOVR→ACE, позволяющий использовать накопленный исторический архив данных ACE за счёт адаптации в домен ACE более новых измерений DSCOVR, и ACE→DSCOVR – для обеспечения непрерывности прогноза при возможном отказе КА ACE. Проводится сравнительный анализ эффективности рассматриваемых методов адаптации между собой и с базовыми моделями, обученными без адаптации, с целью выбора оптимального подхода для каждого из сценариев.

47. Нейросетевое определение концентраций ионов в природной воде с использованием стратегий параметрически эффективной адаптации

М.К.Шалеев (1), И.В.Исаев (2), С.А.Доленко (2)
(1) Физический факультет Московского государственного университета имени М.В. Ломоносова, Москва, (2) Научно-исследовательский институт ядерной физики имени Д.В.Скобельцына Московского государственного университета имени М.В. Ломоносова, Москва

Оперативное определение состава природных вод является важной задачей экологического мониторинга. Особый интерес представляет оценка концентраций ионных компонентов водных растворов, включая катионы Zn²⁺, Cu²⁺, Li⁺, Fe³⁺, Ni²⁺, NH₄⁺ и анионы SO₄²⁻, NO₃⁻. Традиционные химико-аналитические методы позволяют получать точные результаты, однако требуют пробоподготовки, расхода реагентов и значительного времени проведения анализа. Поэтому в качестве перспективной альтернативы рассматриваются методы оптической спектроскопии, позволяющие проводить экспресс-диагностику многокомпонентных водных растворов. В данной работе рассматривается обратная задача спектроскопического определения концентраций ионов в природной воде по данным спектроскопии комбинационного рассеяния света (КР), инфракрасного поглощения (ИК) и оптического поглощения (ОП). Используется комплексирование трёх физических методов: спектральные данные КР, ИК и ОП одновременно подаются на вход нейросетевой модели. Сложность рассматриваемой задачи связана с её высокой размерностью, нелинейным влиянием компонентов раствора на форму спектров и чувствительностью спектральных данных к примесям, содержащимся в природной воде. Примеси, в том числе органического происхождения, специфичны для каждого источника, весьма вариабельны и могут существенно изменять спектры. Основная цель работы состоит в адаптации нейросетевых моделей к данным природной воды при ограниченном числе обучающих примеров. Для этого используется перенос обучения: модель предварительно обучается на базовом наборе спектров растворов в дистиллированной воде, а затем адаптируется к целевым данным, соответствующим природной воде. Для каждого определяемого иона строится отдельная нейросетевая модель регрессии с одним выходом. В работе исследуются несколько стратегий адаптации. В качестве базового варианта рассматривается полное дообучение модели на целевом наборе. Наряду с ним применяются параметрически эффективные подходы: заморозка части слоёв нейронной сети, BitFit и LoRA. Эти методы позволяют изменять не все параметры нейронной сети, а только их небольшую часть, или вводить компактную поправку к уже обученной модели. Результаты показывают, что применение переноса обучения понижает погрешность определения концентраций ионов в природной воде по сравнению с обучением модели только на ограниченном целевом наборе. Параметрически эффективные стратегии дополнительно снижают требования к объёму экспериментальных данных и позволяют адаптировать модель к новым условиям без полного переобучения. Таким образом, сочетание комплексирования методов оптической спектроскопии и стратегий переноса обучения является перспективным направлением для решения обратных задач анализа природных вод. Исследование выполнено за счёт гранта Российского научного фонда № 24-11-00266, https://rscf.ru/en/project/24-11-00266/.

56. Оптимизация генераторов ветропарка

Греков П.П., Рамазян Т.А., Фомичева Н.В.
НИУ ВШЭ

Оптимальная расстановка генераторов ветропарка - задача многомерной невыпуклой оптимизации черного ящика, в которой годовая выработка энергии является целевой функцией, и зависит от вычислительно затратной симуляции аэродинамических следов турбин. В настоящей работе исследуются четыре метода оптимизации нулевого порядка совместного выбора координат турбин и углов рыскания на площадке с неоднородным набегающим потоком при ограничении на минимальное расстояние. Предложенный метод суррогатной оптимизации WU-GO, который учитывает случайность эксперимента напрямую, стабильно выдает наибольший прирост, особенно с ростом количества турбин.

57. Оптимизация для инженерного проектирования

Фомичева Н.В., Чжоу Ц., Рамазян Т.А.
НИУ ВШЭ

Инженерное проектирование представляет собой задачу оптимизации с ограничениями черного ящика - каждая возможная конфигурация должна удовлетворять жёстким физическим ограничениям, а испытание конфигураций обходится дорого. Мы сравниваем взаимодействие машинного обучения и механического проектирования на основе четырех классических задач оптимизации с ограничениями: проектированию редуктора, пружины растяжения-сжатия, сосуда под давлением, и сварной балки. В рамках единого подхода, основанном на Байесовской оптимизации, мы сравниваем несколько стратегий выбора кандидатов, и предлагаем схему на основе динамики Ланжевена. Оценивая все четыре задачи по общим показателям — сходимости наилучшей допустимой стоимости, доле допустимых решений и времени до первого допустимого решения, — мы показываем, как алгоритмы семплирования позволяют получать лучшие решения по сравнению со стандартными подходами поиска оптимума.

45. Вычисление моментов как способ параметризации 2D-карт фотолюминесценции при решении обратной задачи спектроскопии

Г.А.Куприянов(1, 2), И.В.Исаев(2), К.А.Лаптинский(1, 2), С.А.Доленко(1, 2), Т.А.Доленко(1)
(1) Физический факультет Московского государственного университета имени М.В. Ломоносова, (2) Научно-исследовательский институт ядерной физики им. Д.В.Скобельцына

Фотолюминесцентные наносенсоры на основе углеродных точек являются перспективным инструментом для экспрессного контроля ионного состава водных сред. При решении многопараметрической обратной задачи спектроскопии важную роль играет представление 2D-карт фотолюминесценции (ФЛ), поскольку исходная матрица возбуждения–испускания содержит большое число взаимно коррелированных признаков. В связи с этим актуальна разработка способов параметризации, позволяющих уменьшить размерность входного пространства без потери информации, существенной для определения концентраций ионов. В работе исследовано представление 2D-карт фотолюминесценции через разложение по базисам Тейлора, Лежандра, Фурье и Цернике. Такой подход можно рассматривать как вычисление набора моментов карты ФЛ, причем число признаков регулируется максимальным порядком разложения. Использование моментов Цернике ранее показало эффективность при анализе 2D-карт ФЛ природных вод и биологических объектов [1, 2]; в настоящей работе проведено сравнение нескольких функциональных базисов применительно к обратной задаче определения концентраций ионов металлов. Исследование выполнено на 7-параметрической базе данных, содержащей 7813 2D-карт ФЛ водных растворов углеродных точек с катионами Ni2+, Cu2+, Co2+, Pb2+, Al3+, Cr3+ и анионом NO3-. Полученные коэффициенты разложения использовались как входные признаки для моделей машинного обучения: линейной регрессии, случайного леса, градиентного бустинга, многослойного персептрона; в качестве референсного подхода рассматривалась сверточная нейронная сеть, обучаемая непосредственно на 2D-картах ФЛ. Показано, что параметризация 2D-карт ФЛ на основе функциональных базисов позволяет достигать точности на уровне сверточной нейронной сети при использовании существенно меньшего числа входных признаков: достаточно 78 коэффициентов разложения и простой модели линейной регрессии. Наиболее эффективными оказались базисы Лежандра и Фурье, что связано с их ортогональностью и определением на всей области 2D-карты ФЛ. Таким образом, разложение 2D-карт фотолюминесценции по функциональным базисам является эффективным способом сжатия и структурирования спектральной информации при решении обратных задач фотолюминесцентной спектроскопии. Предложенный подход особенно полезен для задач с ограниченным объемом экспериментальных данных, поскольку позволяет уменьшить размерность входного пространства, сохранить физически значимые особенности формы 2D-карт ФЛ и повысить устойчивость моделей машинного обучения. Исследование выполнено за счет гранта Российского научного фонда № 22-12-00138-П, https://rscf.ru/project/22-12-00138/. Г.А.Куприянов является стипендиатом Фонда развития теоретической физики и математики «Базис». [1] Wang X., Li B.Q., Zhai H.L. et al. An efficient approach to the quantitative analysis of humic acid in natural water using Zernike moments // Food Chemistry. 2016. V. 190. P. 1033–1039. https://doi.org/10.1016/j.foodchem.2015.06.074 [2] Chen Y., Chen T., Duan W. et al. Rapid measurement of brown tide algae using Zernike moments and ensemble learning based on excitation-emission matrix fluorescence // Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy. 2023. V. 294. Art. 122547. https://doi.org/10.1016/j.saa.2023.122547

4. Топовые вычислительные системы для суперкомпьютинга и ИИ

//Валерий Егоршев, КОГНИТАР //

В обсуждении мы пройдем сверху вниз по трем этажам храма промышленных вычислений:
3. Сравним суперкомпьютеры из списка ТОП-500 с планом и прогрессом строительства ИИ-дата-центров в мире, обсудим производительность в гигаваттах и парадокс Джевонса.
2. Этажом ниже посмотрим аппаратную магию - вычислительную начинку ИИ-дата-центров.
1. Внизу увидим создание источников магии - фаундри-строительство в Западном полушарии.

15

Section 3. Machine Learning for Environmental Sciences

2. Convolutional Neural Networks and Bayesian Classification for Risk Assessment of High-Latitude Critical Infrastructure

//Vorobev A.V.(1), Lapin A.N.(2), Vorobeva G.R.(1)
(1) Geophysical Center of the Russian Academy of Sciences, (2) Ufa University of Science and Technology //

Geomagnetic activity at high latitudes poses significant risks to the operation of certain technological systems within the critical infrastructure of the Russian Arctic zone. In the absence or scarcity of instrumental data in hard-to-reach regions, the use of participants#list_of_paticipantsnatural indicators of space weather, including polar aurorae, becomes highly relevant. In this work, we develop an approach based on convolutional neural networks and a Bayesian classifier to estimate the posterior probabilities of extreme geomagnetically induced currents and additional inclinometric measurement errors arising from the highly variable magnetic environment of the Arctic region. Using an analysis of synchronous auroral observations over 7 years (2015–2021), training datasets are constructed and corresponding statistical relationships, consistent with the physical mechanisms of the processes under study, are established. Application of ensemble models to the acquired datasets reveals that the highest risks of extreme measurement errors in azimuthal and zenith angles in the Arctic region occur during the observation of bright, well-defined aurorae in the southern part of the sky (4.9% and 14.7%, respectively). The results of our research robustly demonstrate the feasibility of using visual observations of aurorae in the optical range as a natural indicator of geomagnetic conditions to support decision-making in the operation of technological facilities in high-latitude infrastructure.

16. Автоматизация обнаружения нерки в данных визуальной оптической съемки с борта БПЛА в районе озера Начикинское (Камчатка) с применением искусственных нейронных сетей

//Шаталов Дмитрий Дмитриевич (1), Криницкий Михаил Алексеевич (1,2), Шубкин Сергей Викторович (3)
(1) МФТИ, (2) ИО РАН, (3) Камчатский филиал ГНЦ РФ ФГБНУ «ВНИРО» (“КамчатНИРО”) //

Мониторинг нерестового хода тихоокеанских лососей, в частности нерки (Oncorhynchus nerka), представляет собой фундаментальную задачу для оценки динамики популяций и обеспечения рационального природопользования. Традиционные визуальные методы учета, включая наземные маршрутные обследования и аэровизуальные наблюдения, характеризуются значительной трудоемкостью и существенной субъективностью оценок операторов. Развитие технологий дистанционного зондирования с применением беспилотных летательных аппаратов (БПЛА) открыло возможность получения ортофотопланов сверхвысокого разрешения, покрывающих протяженные нерестилища в труднодоступных районах [1]. Озеро Начикинское, являясь крупным нерестовым водоемом Камчатского края, характеризуется высокой плотностью производителей нерки в период массового нереста, что порождает существенный объем визуальных данных и делает их ручную обработку практически нереализуемой. Настоящее исследование посвящено разработке и апробации метода автоматического обнаружения особей нерки на ортофотопланах с применением нейросетевых моделей семейства YOLO как одного из перспективных подходов для последующего анализа поведения. Эмпирические данные были получены в ходе аэрофотосъемки побережья и мелководных участков нерестилищ озера Начикинское с применением БПЛА Геоскан 201М. Полученные изображения были обработаны в Agisoft Metashape Professional с построением ортофотоплана высокого пространственного разрешения, после чего он был разбит на тайлы фиксированного размера и размечен с выделением особей нерки ограничивающими прямоугольниками. Для автоматизированного обнаружения использовались модели семейства YOLO, выбор которых обусловлен необходимостью совместить приемлемую точность распознавания с высокой скоростью обработки больших объемов аэровизуальных данных. Близкая постановка задачи компьютерного зрения для учета производителей нерки по данным БПЛА уже показала практическую применимость подобных подходов [2]. Апробация модели показала, что основными факторами, ограничивающими точность обнаружения, остаются неоднородность условий съемки и несбалансированность обучающей выборки, связанная с неравномерным распределением рыбы в пределах нерестилищ. Полученные результаты позволяют рассматривать YOLO-детекторы как основу для ускоренного автоматизированного подсчета нерки и как задел для следующего этапа исследования — мультиобъектного трекинга особей на путях нерестовой миграции. [1] - Мазникова О.А. и др. “Беспилотные летательные аппараты как один из инструментов оценки подходов тихоокеанских лососей в Камчатском крае” (2025) [2] - Потапов А.С., Фадеев Е.С., Малых К.М. “Применение методов компьютерного зрения для аэровизуального учёта производителей нерки с борта БПЛА в бассейне оз. Азабачье в 2024 г.” (2025)

23. Structured Training Ablation and GSD-Adaptive Orthophoto Inference for UAV-Based Pavement Defect Detection with YOLO26

//Ahsan Mustafa (1,2), Mikhail Yu. Kataev (3), Dmitry S. Efremenko (1)
(1) HSE University, Moscow, (2) National University of Science and Technology MISIS, Moscow, (3) Tomsk State University of Control Systems and Radioelectronics, Tomsk //

Automated detection of pavement surface defects from unmanned aerial vehicle (UAV) imagery can substantially reduce the cost and labour of road condition surveys. Two questions that remain largely unanswered in current literature are which training configuration choices actually drive accuracy gains, and how well a model trained on low-altitude UAV images generalises to the higher-altitude orthophotos used in real-world deployment. This paper addresses both. We train YOLO26 on the UAV-PDD2023 benchmark dataset (2,439 images, six defect classes) and run a controlled five-configuration ablation study that independently varies hyperparameter settings, preprocessing, and data augmentation. The best combination reaches mAP50 = 0.859 and mAP50–95 = 0.631 on the held-out test split, a 57% and 133% improvement over the default baseline respectively. Raising input resolution from 640 px to 1280 px is the single largest contributor; preprocessing and augmentation each add com plementary gains. We then deploy the best model on three real-world GeoTIFF orthophotos through a ground-sampling-distance (GSD) adaptive tiling pipeline. The pipeline selects tile dimensions to match the ground coverage seen during training, applies cross-tile non-maximum suppression, and converts bounding boxes to WGS84 coordinates. Despite a measurable domain gap, the pipeline identifies all six defect classes across scenes spanning up to 322 × 843 m, with 2,201 detections on the largest mosaic. Exported GeoJSON and annotated GeoTIFF outputs are validated in QGIS against source rasters.

29. Integrated Study of Seismoacoustic Facies Types Using Deep Learning Methods in the Romanche Fracture Zone

//Popov M.S.(1,2), Borisov M.A. (2), Borisov D.G.(1)
(1) Shirshov's Institute of Oceanology RAS, (2) MIPT //

This paper presents the development of a methodology and software for automated identification and classification of seafloor seismoacoustic facies types using high-resolution data acquired during the 63rd cruise of the R/V Akademik Ioffe (2023) to the Central Atlantic, organized by the Shirshov Institute of Oceanology, Russian Academy of Sciences. The classification targets are rare anomalous seafloor morphostructures identified by the SES-2000 Deep parametric sub-bottom profiler, including contourite drifts and moats, transform faults, mid-ocean ridges, sediment waves, and basin plains. Given the geological rarity of the target features (anomalies constituting less than 3–5% of profile lengths), the key approach employs self-supervised learning using MoCo v3 on the complete unlabeled data archive (~20,000 nautical miles), whereas supervised detector training relies on a labeled dataset of no fewer than 500 annotated features. Detection is performed using the YOLO-v8 architecture, with the architecture selection verified through a comparative experiment (RT-DETR, Swin Transformer, Mask R-CNN). Annotation follows an 11-class taxonomy (A; B1–B5; C1–C5) developed by the staff of the Laboratory of Paleoceanology at the Shirshov Institute of Oceanology. The research was carried out within the state assignment of Ministry of Science and Higher Education of the Russian Federation for IO RAS (theme No. FMWE-2024-0020).

26. Автоматическая предварительная разметка судовых снимков в задаче обнаружения плавающего морского мусора

И. А. Герасимов(1), М. А. Борисов(1), М. П. Погожева(2, 3, 4), В. А. Спирина(3), М. А. Криницкий(1, 4)
(1) Московский физико-технический институт (национальный исследовательский университет), (2) Московский государственный университет имени М.В. Ломоносова, Географический факультет, Москва, Россия, (3) Государственный океанографический институт им. Н.Н. Зубова, Москва, Россия, (4) Институт океанологии им. П. П. Ширшова РАН

Автоматическое наблюдение плавающего морского мусора по данным судовой фотосъёмки требует обработки большого количества снимков высокого разрешения, в которых объекты интереса — фрагменты пластика, плавающая древесина — занимают малую часть площади кадра, а подавляющее большинство кадров не содержит целевых объектов совсем. Традиционные модели обнаружения, обучаемые с учителем, требуют размеченных выборок, формирование которых затруднено ввиду вариативности условий съёмки и редкости наблюдаемых объектов. В настоящей работе представлена автоматическая система предварительной разметки на основе модели OWLv2, осуществляющей детекцию объектов по текстовым описаниям без предварительного обучения на целевом домене: изображения разбиваются на перекрывающиеся фрагменты для сохранения чувствительности к малоразмерным объектам, а первичные обнаружения проходят фильтры для подавления ложных срабатываний. Результаты автоматически разделяются на кадры, содержащие и не содержащие объекты интереса, что исключает пустые снимки из очереди ручного просмотра и сокращает нагрузку на экспертов. В перспективе планируется на основе предварительной разметки, верифицированной экспертами, сформировать обучающую выборку достаточного объёма для тренировки специализированных моделей обнаружения и провести повторную разметку массива данных с помощью обученной модели, адаптированной к условиям данной задачи.

20. Реализация локальных моделей машинного обучения для аппроксимации коротковолновой радиации у поверхности земли в облачных условиях по геофизическим предикторам для Московского региона

Петров Н.А.(1), Чубарова Н.Е.(1), Варенцов М.И.(1,2)
(1) Московский государственный университет имени М. В. Ломоносова, Москва, (2) Научно-исследовательский вычислительный центр МГУ имени М.В. Ломоносова, Москва, 119991, Россия

Атмосферная радиация в коротковолновом (солнечном) спектре у поверхности земли является важной частью радиационного и теплового балансов, играет значимую роль в формировании приземной температуры воздуха. При этом компоненты коротковолновой радиации сильно изменяются в условиях неоднородного поля облачного покрова. На территории метеорологической обсерватории МГУ им. М.В. Ломоносова (МО МГУ) с 2021 г. функционирует комплекс высокоточных радиационных приборов RAD-MSU (BSRN), который проводит надёжные измерения плотностей потоков компонент коротковолновой и длинноволновой радиации [Чубарова и др., 2022; Piskunova et al., 2024]. Однако, во временных рядах измерений комплекса периодически возникают пропущенные данные, связанные с плановыми работами. В связи с этой проблемой, в настоящее время актуальна задача по заполнению имеющихся пропусков в данных измерений по известным геофизическим предикторам. Таким образом, целью данной работы является реализация локальных регрессионных моделей для аппроксимации плотностей потоков компонент коротковолновой радиации (прямая, рассеянная, суммарная) в облачных условиях «классическими» алгоритмами машинного обучения для МО МГУ. Обучение моделей проведено на данных измерений RAD-MSU (BSRN) c августа 2021 г. по июнь 2024 г. с часовым усреднением. При этом рассматривались случаи с высотами Солнца более 10°, баллом облачности по наблюдениям больше 0 и продолжительностью солнечного сияния менее 60 мин. в часе, что характерно для условий облачного неба. В ходе работ по реализации моделей проведён отбор наиболее важных геофизических предикторов отдельно для прямой, рассеянной и суммарной солнечной радиации. В работе проведены реализация и тестирование различных регрессионных моделей машинного обучения, включающих в себя линейную регрессию, алгоритм k-ближайших соседей, метод случайного леса (Random forest regression), а также алгоритм градиентного бустинга (Catboost) [https://catboost.ai/]. Статистическая оценка воспроизведения целевых переменных проводилась с помощью метода случайной генерации подвыборок «Bootstrap». Проведённая работа необходима для физически корректного заполнения пропусков в данных измерений RAD-MSU, а также важна для понимания роли различных геофизических факторов в формировании коротковолновой радиации.
Работа выполнена при поддержке Российского научного фонда (проект №25-27-00014).
1. Чубарова Н.Е., Розенталь В.А., Жданова Е.Ю., Полюхов А.А. Новый радиационный комплекс Метеорологической обсерватории МГУ стандарта BSRN: методические аспекты и первые результаты измерений. - Оптика атмосферы и океана, 2022, Т. 35, № 8, С. 670–678. DOI: 10.15372/AOO20220811
2. Piskunova D., Chubarova N., Poliukhov A., Zdanova E. Radiative Regime According to the New RAD-MSU (BSRN) Complex in Moscow: The Roles of Aerosol, Surface Albedo, and Sunshine Duration. - Atmosphere, 2024, vol. 15, No. 2, P. 144. DOI: 10.3390/atmos15020144;
3. Алгоритм CatBoost [Электронный ресурс]. URL: https://catboost.ai/ (дата обращения: 20.12.2025).

19. Self-Supervised Eddy Embeddings for Irminger Ring Analysis in the Labrador Sea

Mikhail Kalinin(1), Mikhail Krinitsky(1,2)
(1) P.P.Shirshov Institute of Oceanology, (2) Moscow Institute of Physics and Technology

Deep convection in the Labrador Sea is a key component of the Atlantic Meridional Overturning Circulation (AMOC), and its variability is strongly modulated by mesoscale eddies, particularly Irminger Rings (IRs). In our previous work, we presented a convolutional neural network for IR segmentation in the high-resolution NNATL12 ocean model output (1994–2015). In this study, we developed a self-supervised contrastive learning approach applied to over 53,000 eddy snapshots identified by that segmentation model. With our neural network trained on the data from 1994 till 2015, we map each eddy snapshot described by surface relative vorticity and sea surface height (SSH) fields to a compact vector embedding space, where similar eddies cluster together. Despite using only surface signatures as input, the resulting embeddings capture information about both surface dynamics and upper-ocean heat and salt content, learned through a weakly supervised regression objective. This demonstrates that the surface expression of Irminger Rings encodes their subsurface structure, making the approach suitable for satellite-observable applications. We found that without explicit supervision, the resulting embeddings capture similarity regarding key physical characteristics — vorticity, heat content, salinity, and size. Once the eddies are embedded, one may group them by type, retrieve by similarity, and analyze at scale to reveal relationships between eddy properties and their environment. This finding opens the door to systematic studies of different eddy families influence on deep convection. The approach is easily transferable to other eddy-rich regions and ocean model configurations.

12. Ускорение численного моделирования в задаче рассеяния атмосферных примесей в городской застройке методами глубокого обучения

И. А. Герасимов(1), М. А. Криницкий(1, 2), Е. В. Мортиков(3,4), А. В. Дебольский(3,5)
(1) Московский физико-технический институт (национальный исследовательский университет), (2) Институт океанологии им. П. П. Ширшова РАН, (3) Научно-исследовательский вычислительный центр Московского государственного университета имени М.В. Ломоносова, (4) Московский центр фундаментальной и прикладной математики, Россия, (5) Институт физики атмосферы им. А.М. Обухова РАН

Прогнозирование пространственного распределения концентрации мелкодисперсных частиц (PM 2.5) в городской застройке методом крупных вихрей (LES) требует продолжительных расчётов для достижения стационарного режима, что затрудняет его применение для оперативного анализа качества воздуха и проектирования городских зон. Полуэмпирические гауссовы модели рассеяния демонстрируют низкую точность в условиях сложной городской геометрии, где взаимодействие потока с застройкой порождает рециркуляционные зоны и каньонные эффекты. В настоящей работе предложена модель на основе свёрточной архитектуры типа кодер-декодер, обучаемая на результатах LES и аппроксимирующая стационарное поле концентрации. Модель принимает многоканальное представление пространственных и физических характеристик расчётной области и формирует поля концентрации на пяти высотных уровнях. Ключевым решением является послойное внедрение ветровой информации в декодер, что позволяет учитывать направление и скорость ветра на различных пространственных масштабах. Функция потерь включает физически мотивированный штраф за ненулевую аппроксимацию внутри контуров зданий, подавляющий нефизичные решения. Обучающая выборка (~31 000 образцов) получена при варьировании ветра, конфигурации застройки и расположения источников эмиссии. Модель достигает коэффициента детерминации R² = 0,97 при практически нулевом систематическом смещении на валидационной выборке по всем высотным уровням. В перспективе планируется расширение на произвольные конфигурации застройки и неустойчивую стратификацию атмосферы.

11. АНАЛИЗ СТРУКТУРЫ ДАННЫХ СТРАТОСФЕРНОГО ПОЛЯРНОГО ВИХРЯ С ПРИМЕНЕНИЕМ САМОКОНТРОЛИРУЕМОГО ОБУЧЕНИЯ

E. А. Демидова(1), М. А. Криницкий(1,2)
(1) Московский физико-технический институт (национальный исследовательский университет), (2) Институт океанологии им. П. П. Ширшова РАН

Стратосферный полярный вихрь (СПВ) представляет собой крупномасштабную циклоническую циркуляцию, формирующуюся в осенне-зимний период над полярными регионами вследствие возрастания меридионального градиента температуры. Динамика СПВ, характеризующаяся сильной межгодовой изменчивостью, играет фундаментальную роль в климатической системе, выступая основным регулятором переноса тепла, импульса и химических веществ в стратосфере над полярными областями и за их пределами. Сложность и нелинейный характер указанных процессов ограничивают применимость традиционных численных моделей для долгосрочного прогнозирования состояния вихря. В настоящем исследовании предложен подход к выявлению характерных состояний СПВ на основе машинного обучения, заключающийся в последовательном снижении размерности исходных данных с последующей кластеризацией полученных низкоразмерных скрытых представлений. На первом этапе были протестированы методы снижения размерности, включая симметричный сверточный автокодировщик, его вариационную модификацию, а также метод контрастного обучения MoCo. В качестве признакового описания использовались поля потенциальной завихренности (PV) и геопотенциальной высоты (HGT) на уровне 10 гПа над Арктикой в зимний период по данным реанализа JRA-55 за 1958 - 2017 годы. На втором этапе полученные векторы скрытых представлений были подвергнуты плотностной и иерархической кластеризации. Полученные значения коэффициента силуэта (0,034–0,081), а также результаты ансамблевого анализа, который скорее указывает на существование единого кластера, наряду с экспертным пониманием существования выделенных состояний вихря, вместе указывают на определяющую роль подхода выучивания векторных малоразмерных представлений в задаче исследования структуры данных: в зависимости от предпосылок, положенных в основу метода нейросетевого метода снижения размерности, получаемые представления могут быть применимы в различных конечных задачах - от прогнозирования до выявления структуры данных.

37. Cloud Base Height Estimation from Stereo Images Using Deep Feature Matching and Temporal Filtering

Ivan Ustimov(1),Oleg Postylyakov(2), Alexei Chulichkov (1), Michail Krinitskiy(3,4), Michail Borisov(4), Ekaterina Belkova(5), Vasily Lulukin(2)
(1) Moscow State University of Lomonosov, (2) Institute of Atmospheric Physics Russian Academy of Sciences, (3) Moscow Institute of Physics and Technology, (4) Institute of Oceanology Russian Academy of Sciences, (5) Institute of Oceanology Russian Academy of Sciences

Continuous monitoring of cloud base height (CBH) is essential for meteorology, aviation safety, refining measurement models during optical measurements, and climate modelling. Traditional laser ceilometers provide precise but pointwise measurements and are expensive to deploy at scale. We present a passive stereo-vision pipeline that estimates CBH from a pair of ground-based sky-facing images. The processed images represent 30°×30° angular sections captured by fisheye-type cameras. The cameras are installed at the ZNS of the A.M. Obukhov Institute of Atmospheric Physics, Russian Academy of Sciences, with a stereo baseline of 45 m. The report presents the first results of the developed algorithm and its comparison with laser ceilometer measurements. The core of the pipeline is LoFTR (Local Feature TRansformer), a detector-free deep learning architecture based on Vision Transformers that produces dense, sub-pixel correspondences between the two views directly from image pairs without a separate keypoint detection stage. The disparity of each matched pair is converted to height via an equidistant-projection stereo formula derived for the fisheye geometry. Clusters of consistent disparities corresponding to distinct cloud layers are extracted using HDBSCAN, a hierarchical density-based clustering algorithm that automatically rejects outlier correspondences from clear-sky regions or lens artefacts. To enforce physical consistency across time, we introduce a three-stage spatial filter (DBSCAN sub-clustering, convex-hull density check, and neighbour-consistency verification of disparities in image space) and a one-dimensional adaptive Kalman filter with innovation-dependent measurement noise covariance. The adaptive measurement scheme automatically down-weights likely outliers without hard rejection thresholds, suppressing physically unrealistic frame-to-frame jumps of up to 500 m per 10 s. The pipeline is validated against a co-located DOL-2 laser ceilometer (semiconductor pulsed laser, 10–3000 m range, ±(5 + 0.05H) m accuracy) using a 5-hour daytime sequence acquired on May 23, 2025. Raw per-frame estimates exhibit poor global Pearson correlation (r ≈ 0.04) with the ceilometer due to wind-driven decoupling of cloud layers observed by the two instruments. After temporal filtering and splitting the series into 30-minute windows with individual time-shift alignment, the mean correlation rises to r = 0.61, and 72% of windows (13 out of 18) achieve r > 0.5, with a maximum of r = 0.94 under favourable atmospheric conditions. The Kalman filter reduces the median frame-to-frame height jump by a factor of 4.3 and raises the 5-hour autocorrelation of the height series from 0.11 to 0.90. The discrepancy between measurements by the two instruments can be explained by several reasons. Calibration errors of mutual camera orientation likely contribute to systematic bias. Part of the difference may also arise from the fundamentally different principles underlying CBH determination: the stereoscopic method can detect cloud layers that are effectively transparent to the laser ceilometer.

44. Texture Based Explainable Machine Learning for Automated Glacial Crevasse Detection in GPR Radargrams

Ripul Ghosh(1,2), Ashish Lawaniya(1), Anuj Kumar Mishra(1,2), Sergey Popov(3,4), Marina Kashkevich(3_, Margarita Stepanova(5), Satish Kumar(1,2)
(1)Centre of Excellence for Intelligent Sensors and Systems (iSenS), CSIR-Central Scientific Instruments Organisation, India
(2)Academy of Scientific and Innovative Research (AcSIR), India
(3)Institute of Earth Sciences, Saint Petersburg State University, Russia
(4)Gramberg All-Russian Scientific Research Institute for Geology and Mineral Resources of the World Ocean (VNIIOkeangeologia), Russia
(5)The Faculty of Physics, Saint Petersburg State University, Russia

Glacial crevasses across Antarctica’s ice sheets pose risks to human activities and transportation in polar regions. Reliable crevasse detection is essential for safe scientific expeditions and surveys. Though ground penetrating radar (GPR) is widely used for crevasse detection, manual interpretation of radargrams remains time-intensive and dependent on operator expertise. Recently, advances in computer vision and artificial intelligence have enabled automated detection of these crevasses. Although deep learning approaches have shown promising performance, lightweight machine learning frameworks have received relatively less attention for automated crevasse detection. This work integrates an explainable machine learning framework for the automated crevasse detection in GPR images using the texture-based feature extraction techniques. A dataset consisting of 503 real and synthetic radargram image frames representing hyperbolic subsurface reflections and background radar signatures was prepared. For the improvement of the spatial localization and increase sample diversity, the frames were obtained through a sliding window 512 × 512 pixel with an overlap of 50% in both dimensions. Three handcrafted texture feature extraction approaches namely Gabor filtering, histogram of oriented gradients (HOG) and local binary pattern (LBP) were explored to characterise the radargram images. Gabor filtering was used to capture multi-scale texture orientation with the frequency responses that closely align and represent the wave propagation characteristics in the radar images. HOG descriptors were used to represent local edge orientation patterns associated with hyperbolic boundaries. LBP descriptors were extracted for the characterisation of local texture micro-patterns and intensity transitions which are present in radar reflections. Further, these extracted feature representations were evaluated using the four different lightweight machine learning classifiers such as shallow artificial neural network (ANN), support vector machine (SVM), random forest (RF) and extreme gradient boosting (XGBoost). To improve interpretability and understand feature contributions, SHapley Additive exPlanations (SHAP) was integrated as an explainable artificial intelligence (XAI) framework. SHAP analysis enabled the identification of the different dominant texture features which influence the classification decisions. This interpretability component helps in the improvement of the model transparency. Thus, supports trustworthiness in safety-critical applications where automated decisions directly influence operational planning.

48. Параметризация субмезомасштабного вертикального потока плавучести с помощью свёрточной нейронной сети на основе крупномасштабных характеристик перемешанного слоя

Резвов Вадим Юрьевич(1,2), Криницкий Михаил Алексеевич(1,2)
(1) Московский физико-технический институт (2) Институт океанологии им. П.П. Ширшова РАН

Разработка параметризаций субмезомасштабного вертикального потока плавучести (SVBF) остаётся одной из ключевых задач для климатических моделей океана. В данной работе предлагается нейросетевой подход, позволяющий выразить осреднённый по перемешанному слою SVBF через ограниченный набор крупномасштабных характеристик течения и стратификации. В качестве предикторов используются поля, осреднённые по глубине перемешанного слоя: градиент плавучести, завихренность, дивергенция, скорость деформации, глубина перемешанного слоя, глубина турбулентного пограничного слоя и турбулентная кинетическая энергия. Исходные данные предоставлены идеализированным численным экспериментом высокого разрешения, в котором воспроизводится ансамбль субмезомасштабных процессов в домене. Для восстановления пространственной структуры осреднённого SVBF используется свёрточная U‑Net-подобная архитектура. Обучение проводится на расширенном временном ряде, охватывающем различные фазы эволюции поля течения и стратификации. Модель демонстрирует значимое преимущество над опорным решением и хорошо воспроизводит крупномасштабную структуру потока. Анализ важности признаков показывает, что основной вклад в аппроксимацию вносят завихренность, турбулентная кинетическая энергия и глубина перемешанного слоя. Полученные результаты свидетельствуют о перспективности использования свёрточных нейронных сетей для построения физически обоснованных параметризаций субмезомасштабного вертикального потока плавучести, пригодных для внедрения в глобальные модели циркуляции океана.

53. Improving Wind-Wave Estimation from Shipborne Radar Data with CNNs, Synthetic Pretraining, and Weather Forecasts

Alexander Suslov (1), Mikhail Krinitskiy (1, 2), Alexander Gavrikov (1), Mikhail Borisov (2), Natalia Tilinina (1), Vadim Rezvov (2)
(1) Shirshov Institute of Oceanology, Russian Academy of Sciences, Moscow, (2) Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Oblast

Shipborne navigation radars are essential for safe maritime navigation, enabling the detection of vessels and obstacles. Reflections from the sea surface, known as Bragg scattering, generate sea clutter, which is typically filtered out as noise. Under sufficiently rough sea conditions, with wind speeds exceeding 3 m/s and significant wave heights exceeding 0.5 m, this clutter becomes visible in unfiltered radar images and can be used to retrieve wave parameters. Traditional methods for estimating wave parameters rely on 3D Fourier analysis and complex calibration procedures, resulting in high computational costs. Deep learning methods, particularly convolutional neural networks, can overcome these limitations by processing noisy radar images directly, without requiring Fourier transforms or long temporal image sequences. In this study, we present SeaVision, a CNN-based package for estimating wind-wave characteristics from shipborne radar data. We trained a ResNet-52 model to estimate significant wave height and wave period using buoy measurements as ground truth. To improve estimation accuracy, the framework combines single-snapshot radar imagery with auxiliary meteorological information, including real-time onboard sensor observations and Global Forecast System operational forecasts. We also employ a synthetic pretraining strategy: the neural network is first pretrained on synthetically generated radar images, which improves its generalization ability. The proposed approach requires only a single radar snapshot to estimate wave characteristics, whereas traditional 3D Fourier-based methods require more than 12 minutes of continuous radar observations.

55. Cloud Base Height Assessment via Stereo-Based Neural Network Feature Matching

E.A. Belkova¹, M.A. Borisov², M.A. Krinitskii¹,²
¹ Shirshov Institute of Oceanology, Russian Academy of Sciences, ² Moscow Institute of Physics and Technology (National Research University)

Cloud base height (CBH) is a key meteorological parameter linked to atmospheric boundary-layer depth, required for aviation and maritime safety. Conventional instruments such as lidars and radiosondes are accurate but costly and poorly suited to unstable platforms, while visual estimation remains highly subjective. We present a stereo-photogrammetric method for CBH estimation from synchronous fisheye images of the sky hemisphere, acquired by two ground-based cameras with a fixed baseline but imprecisely known mutual orientation. Camera geometry is recovered using the Sun's position: a transformation matrix is computed from 40,062 solar position points using outlier-robust fitting, correctly recovering the true camera rotation. Cloud features in synchronous image pairs are matched using a graph neural network - SuperGlue, and CBH is estimated from the resulting parallax using stereo geometry. These estimates are then compared with co-located instrumental measurements. The validation shows that a general-purpose neural feature matcher trained primarily on structured architectural scenes has limited robustness when applied to diffuse, low-contrast, and rapidly changing cloud imagery. In many cases, the number and spatial distribution of reliable matches are insufficient for stable parallax estimation. These results indicate that reliable stereo-based CBH retrieval from sky images requires further refinement of the matching stage, either through cloud-specific fine-tuning of SuperGlue, alternative dense-matching approaches, or improved preprocessing of cloud structures. The study provides a complete computational pipeline for CBH estimation from stereo sky imagery and identifies the key limitations that must be addressed for future deployment on mobile and maritime observing platforms.

59. Retrieval of Vertical Aerosol Extinction Profiles in the Lower Troposphere from MAX-DOAS Spectral Measurements Using a Deep Learning Transformer Architecture

//D.R. Shamsutdinov(1), O.V. Postylyakov(1), A.N. Borovsky(1)
(1) A.M. Obukhov Institute of Atmospheric Physics RAS //

Retrieval of Vertical Aerosol Extinction Profiles in the Lower Troposphere from MAX-DOAS Spectral Measurements Using a Deep Learning Transformer Architecture Keywords: MAX-DOAS, aerosol, extinction coefficient, vertical profiles, inverse problems, deep learning, Transformer, atmospheric remote sensing, CAMS, AERONET. This paper addresses the problem of retrieving vertical profiles of the aerosol volume extinction coefficient in the lower troposphere using ground-based multi-angle spectral measurements of scattered solar radiation (the MAX-DOAS method). Information on the vertical distribution of aerosols is retrieved from the differential slant column densities (DSCD) of the oxygen collisional dimer O$_4$, measured at various viewing angles. Since the concentration of O$_4$ in the atmosphere is proportional to the square of the known molecular oxygen O$_2$ density, it serves as a sensitive indicator of aerosol extinction along the light path. In this study, we propose a profile retrieval method based on a Transformer architecture adapted for the joint processing of heterogeneous input data: sequences of O$_4$ slant column densities measured at five elevation angles, and geometric information regarding the Sun's position (zenith and azimuth angles). The model was trained on a synthetic dataset generated by a radiative transfer model. As a priori vertical aerosol profiles, reanalysis data from the CAMS system model (PM$_{10}$, SILAM model, ads.atmosphere.copernicus.eu/datasets/cams-europe-air-quality-reanalyses) for the period 2018–2023 were used for the location of the Zvenigorod Scientific Station (ZSS) of the A.M. Obukhov Institute of Atmospheric Physics, Russian Academy of Sciences. The conversion of PM$_{10}$ concentrations into an extinction coefficient profile at a wavelength of 440 nm was performed by calibration against independent aerosol optical depth (AOD) measurements from the AERONET network, taken at ZSS in 2006–2021, using a least-squares fit to the percentiles of the combined sample. To improve the robustness and generalization capability of the model, a two-stage data augmentation strategy was applied: variation of timestamps (shifting the date and time) to expand the range of solar zenith and azimuth angles, and the addition of additive Gaussian noise, the level of which was estimated from real field measurements. The model architecture features a dual-branch structure. The “angular branch” processes the sequence of O$_4$ DSCDs using a self-attention mechanism (a TransformerEncoder with positional encoding), which allows the network to independently identify the spectral-angular dependencies of the signal. The “context branch,” based on a multilayer perceptron, encodes the geometric parameters of the observations. After feature fusion, the regression part of the network outputs a profile across 46 altitude levels from 0 to 5 km. The architecture hyperparameters were optimized using the Optuna framework. Testing results on an independent dataset for 2023 demonstrated high accuracy in profile retrieval: the coefficient of determination was $R^2 \approx 0.94$ for the near-surface layer of 0–1 km and $\approx 0.85$ for the 1–2 km layer, with a gradual decrease in sensitivity at altitudes above 2 km, which is consistent with the physics of the MAX-DOAS method. The difference in metrics between the training and validation sets does not exceed 0.05, indicating the absence of overfitting. Additionally, validation was performed on experimental O$_4$ DSCD data collected by an Ocean Optics USB4000 spectrometer at ZSS during 2011–2019. For 124 clear-sky days, $\alpha_{\text{aer}}(h)$ profiles were retrieved and integrated AOD values were calculated, which were then compared with simultaneous CIMEL photometer (AERONET) measurements. The coefficient of determination was $R^2 = 0.750$ for AERONET Level 1.5 data (automated quality control: exclusion of observations contaminated by clouds in the direction of the Sun) and $R^2 = 0.822$ for Level 2.0 quality-assured data (with manual post-calibration and exclusion of observations contaminated by clouds across the entire sky). Thus, the model trained exclusively on synthetic data based on the CAMS model successfully generalizes to real-world field measurements. It should be noted that the proposed neural network framework is versatile: it was additionally tested on the task of retrieving NO$_2$ profiles, which are characterized by a complex non-Gaussian multi-modal structure. In this task, the neural network approach (ResNet) outperformed the classical linear statistical estimation method in accuracy by 20–200%, confirming its effectiveness for gases with complex spatial distributions. This work was supported by the State Task of the A.M. Obukhov Institute of Atmospheric Physics, Russian Academy of Sciences (topic registration number 1022041800069-7-1.5.9). References [1] Frieß U., Beirle S., Alvarado Bonilla L. et al. Intercomparison of MAX-DOAS vertical profile retrieval algorithms: studies using synthetic data // Atmos. Meas. Tech. 2019. V.12. P. 2155–2181. DOI: 10.5194/amt-12-2155-2019. [2] Postlyakov O.V., Borovsky A.N., Shamsutdinov D.R., Chulichkov A.I. On the comparison of methods for retrieving the nitrogen dioxide profile in the atmosphere from multi-angle spectral ground-based measurements // Current Problems in Remote Sensing of the Earth from Space. (Submitted for publication).

61. Comparing Direct and Autoregressive Neural Forecasting Strategies for Black Sea Ocean State Prediction at Sub-Seasonal Horizons

M. Borisov(1), S.Vostrikova(1), M. Vasiliev(1), M. Krinitskiy(1,2)
(1) Moscow Institute of Physics and Technology , (2) P.P. Shirshov Institute of Oceanology of the Russian Academy of Sciences.

We present a comparative study of deep learning strategies for multi-variable Black Sea ocean state forecasting at 30–60 day lead times. The proposed pipeline follows a modular encoder–predictor–decoder architecture: a MoCo v2 contrastively pre-trained encoder (ResNet-50 or ViT-Small) maps daily GLORYS12 snapshots [1] to a compact latent representation; a sequence model (LSTM or Transformer) predicts future latent states from a 180-day input window; a spatial decoder reconstructs six ocean fields — sea surface temperature, salinity, mixed layer depth, bottom temperature, and surface currents. Three forecasting strategies are evaluated across both architectures: direct prediction at a 30-day lead, direct prediction at a 60-day lead, and autoregressive rollout. All models are trained on GLORYS12 reanalysis (1993–2022) with a strict temporal split; evaluation covers 2023–2024. Skill is assessed against reanalysis ground truth, NOAA OISST [5], independent Argo float profiles [6], and NASA SMAP sea surface salinity. Results reveal a pronounced strategy–architecture interaction. The ResNet direct-60 model outperforms the persistence baseline by a wide margin for temperature and salinity, while the ViT model under the same strategy degrades severely — indicating an architecture-specific vulnerability at longer direct horizons. Autoregressive rollout underperforms climatology for temperature in both architectures due to error accumulation. MoCo pre-training [2] provides consistent improvement for ResNet, whereas ViT [3] achieves competitive skill without it. As a downstream demonstration, predicted surface current fields drive Lagrangian particle tracking in the GNOME framework [7] for an oil spill scenario in the Black Sea.
[1] Jean-Michel, L., et al. “The Copernicus Global 1/12° Oceanic and Sea Ice GLORYS12 Reanalysis.” Frontiers in Earth Science, 9, 698876, 2021.
[2] He, K., et al. “Momentum Contrast for Unsupervised Visual Representation Learning.” CVPR, pp. 9729–9738, 2020.
[3] Dosovitskiy, A., et al. “An Image is Worth 16×16 Words: Transformers for Image Recognition at Scale.” ICLR, 2021.
[4] Vaswani, A., et al. “Attention Is All You Need.” NeurIPS, vol. 30, 2017.
[5] Reynolds, R. W., et al. “Daily High-Resolution-Blended Analyses for Sea Surface Temperature.” Journal of Climate, 20(22), 5473–5496, 2007.
[6] Argo. “Argo float data and metadata from Global Data Assembly Centre.” SEANOE, 2000. doi:10.17882/42182.
[7] Zelenke, B., et al. GNOME Technical Documentation. NOAA Tech. Memo. NOS OR&R 40, 2012.

6. О предподготовке входных данных для спайковых нейронных сетей

//Вячеслав Анатольевич Ильин (1,2.3), Михаил Юрьевич Христиченко (1)
(1) НИЦ «Курчатовский институт»(Москва), (2) Университет ИТМО (Санкт-Петербург), (3) НИВЦ МГУ (Москва) //

В докладе обсуждается проблема отображения численных входных данных в формат спайков, подаваемых на входные нейроны спайковой нейронной сети. В искусственных нейронных сетях входные числовые данные подаются непосредственно на входные нейроны, после той или иной численной предподготовки, например, нормировки на интервал (0,1). Для спайковых нейронных сетей входные данные, подаваемые на входные нейроны, должны быть записаны в спайковом формате, что серьезно меняет математическое содержание процедуры предподготовки входных данных. - числовые входные данные , но на входной нейрон должен подаваться спайк, или последовательность спайков. При этом такое преобразование должно обеспечивать взаимно-однозначное соответствие между числовым и спайковым форматами. В противном случае разные входные числовые данные могут отображаться в совпадающие спайковые данные, что, очевидно, приведет к одному и тому же отклику спайковой нейронной сети. Конечно, для искусственных нейронных сетей используются и другие числовые форматы, в том числе и дискретные - например для изображений часто используется маска из 256 градаций серого цвета. Однако для всех форматов числовых входных данных требование взаимно-однозначного соответствия с их спайковыми образами остается обязательным.

В докладе поставленная выше проблема обсуждается на примере кодирования данных эталонного набора в машинном обучении «Ирисы Фишера» (Iris flower data set). В этом датасете 150 4-х мерных векторов, компоненты которых имеют положительные числовые значения. В качестве спайковой нейронной сети мы используем резервуар из 10-ти нейронов, в каждый из которых входят и выходят по 4 синаптические связи. Входные данные из датасета «Ирисы Фишера», после преобразования в спайковый формат, подаются на четыре нейрона, и выбор таких нейронов фиксирован для всех входных данных.

В нашей статье [1] был предложен метод кодирования входных данных для спайковой нейронной сети резервуарного типа, в которой реализован механизм локальной спайковой пластичности STDP (Spike-Timing-Dependent Plasticity) для динамического изменения весов синаптических связей в сети в целом при подаче входных данных. Было показано, что в результате в такой сети возникают устойчивые направленные подграфы (комплексы путей), совокупность которых взаимно-однозначно соответствуют множеству входных данных. Предложенный метод кодирования исследовался на примере STDP нейронной сети из 10-ти нейронов с 3-мя входящими и 3-мя выходящими синаптическими связями, а также с простейшей моделью входных данных - 4 бинарных пикселя, всего 16 входных бинарных векторов. В этой модели требование взаимно-однозначного преобразования входных данных в спайковый формат было удовлетворено на очевидном уровне.

В докладе предложен подход к построению взаимно-однозначного преобразования числовых входных данных из датасета «Ирисы Фишера» в спайковый формат, в котором они подаются на описанную выше STDP спайковую нейронную сеть с 4-мя входными и выходными синаптическими связями. Взаимно-однозначное соответствие такого преобразования (предобработка входных числовых данных) проверялось через вычисление расстояний между комплексами путей - образами входных данных во внутренних терминах нейронной сети.

[1] V.A. Ilyin, Ya.P. Ivina, M.Yu. Khristichenko, A.V. Serenko and R.B. Rybka. Moscow University Physics Bulletin, 2024, Vol. 79, Suppl. 2, pp. S630–S638. DOI: 10.3103/S0027134924702084

16

Table of Contents