dlcp2023:abstracts
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
dlcp2023:abstracts [05/03/2025 17:31] – removed - external edit (Unknown date) 127.0.0.1 | dlcp2023:abstracts [05/03/2025 17:33] (current) – ↷ Links adapted because of a move operation 156.59.198.135 | ||
---|---|---|---|
Line 1: | Line 1: | ||
+ | ====== Book of Abstracts ====== | ||
+ | |||
+ | {{ dlcp2023: | ||
+ | ===== Plenary reports ===== | ||
+ | |||
+ | ==== Генеративный ИИ для больших моделей и цифровых двойников ==== | ||
+ | |||
+ | __A.Boukhanovsky__, | ||
+ | |||
+ | ==== Methodology for the use of neural networks in the data analysis of the collider experiments ==== | ||
+ | __L.V.Dudko__, | ||
+ | |||
+ | Most of the modern data analysis in the collider experiments uses multivariate methods. The application of neural networks in such analysis has different steps. Possible optimizations for each step are discussed in the talk. The general recipes for the steps can move application of neural networks from state of the art stage to .deterministic procedure. | ||
+ | |||
+ | ==== Machine Learning in Gamma Astronomy ==== | ||
+ | |||
+ | __A.Kryukov__, | ||
+ | A.Demichev, Lomonosov Moscow State University, Moscow | ||
+ | |||
+ | |||
+ | Imaging Atmospheric Cherenkov Telescope (IACT) capture images of extensive air showers (EAS) generated by gamma rays and cosmic rays (charged particles) as they interact with the atmosphere. The much more frequent events associated with charged particles form the main background in the search for sources of gamma radiation, and therefore the sensitivity of IACTs to a large extent depends on the ability to distinguish between these two types of events. A conclusion about the properties of a primary high-energy particle can be drawn from the images in the telescope camera of the EAS which it initiated. In addition to classification (gamma or charged particle), its properties, such as energy and direction of arrival, can be estimated. | ||
+ | Recently, along with the previously developed special methods of appropriate data processing to identify events with gamma rays and determine their parameters, methods based on deep learning have been successfully applied. These techniques, as applied to the analysis of IACT data, are the main subject of this survey report. After very short background information on Cherenkov telescopes and methods for analyzing their data without using deep learning, we discuss the existing deep learning methods for analyzing IACT data, both for classifying the types of detected particles and background rejection, as well as for determining EAS parameters. Showers initiated by gamma rays accounts for an insignificant fraction of the total number of observed EASs, while the background of showers induced by cosmic rays (charged particles) predominates. Therefore, the problem of effective background rejection by deep learning methods is extremely important and is discussed in details. Obtaining the parameters of the reconstructed gamma-ray events, such as the energy and direction of arrival, is the central task of this entire area of research, since it allows one to obtain information about the physical processes in the sources of these gamma rays and, as a result, about the important physical processes in the Universe. We also consider the possibility of using generative neural networks for fast modeling of images of gamma and proton events in IACT cameras, which can significantly speed up and improve the efficiency of experimental data processing. In addition, we shortly discuss two specialized free software packages developed for analyzing IACT data using deep learning methods (available on git-sites). The CTLearn software provides a backend for training neural networks to reconstruct IACT events using TensorFlow. The goal of the GammaLearn project is to find and build optimal neural networks for classifying gamma/ | ||
+ | |||
+ | |||
+ | ==== Методы глубокого обучения для задач построения «цифровых двойников» технологических процессов ==== | ||
+ | |||
+ | __М.И. Петровский__, | ||
+ | И.С. Лазухин, | ||
+ | |||
+ | «Цифровой двойник» технологического процесса представляет собой комплекс математических моделей, | ||
+ | ==== Высокопроизводительные компьютерные системы в задачах машинного обучения ==== | ||
+ | //Sponsor report// | ||
+ | |||
+ | __A.Moskovsky__, | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | ===== Track 1. Machine Learning in Fundamental Physics | ||
+ | |||
+ | ==== 23. Generating Synthetic Images of Gamma-Ray Events for Imaging Atmospheric Cherenkov Telescopes Using Conditional Generative Adversarial Networks ==== | ||
+ | __Ju.Dubenskaya__, | ||
+ | A.Demichev, SINP MSU (Moscow), E.Gres, IGU (Irkutsk), A.Kryukov, SINP MSU (Moscow), S.Polyakov, SINP MSU (Moscow), D.Zhurov, IGU (Irkutsk), A.Vlaskina, SINP MSU (Moscow) | ||
+ | |||
+ | In recent years, machine learning techniques have seen huge adoption in astronomy applications. In this work we discuss the generation of realistic synthetic images of gamma-ray events similar to the ones taken by Imaging Atmospheric Cherenkov Telescopes (IACTs) using the generative model called conditional generative adversarial network (cGAN). The big advantage of the cGAN technique is the much faster generation of new images compared to the standard Monte Carlo simulation. But in order to use cGAN-generated images in a real IACT experiment, we need to ensure that these images are statistically indistinguishable from those generated by the Monte Carlo method. In this work we present the results of a study on comparing the parameters of cGAN-generated image samples with the parameters of image samples obtained using Monte Carlo simulation. The comparison is made by the so-called Hillas parameters, which form a set of geometric features of the event image, widely used in gamma-ray astronomy. Our study shows that the key point is the proper preparation of the training set for the neural network. The properly trained cGAN not only generates individual images well, but also well reproduces the Hillas parameters of the entire sample of generated images. Thus, machine learning simulations are a good alternative to time-consuming Monte Carlo simulations, | ||
+ | |||
+ | ==== 8. Generation of the ground detector readings of the Telescope Array experiment and the search for anomalies using neural networks ==== | ||
+ | |||
+ | __R.Fitagdinov__, | ||
+ | I.Kharuk | ||
+ | |||
+ | Основной целью данной работы было разработать модели для генерации показаний поверхностного детектора с наибольшей амплитудой и для эксперимента Telescope Array с помощью нейронных сетей. Данные, | ||
+ | Были получены визуально похожие данные. Была написана функция поиска аномалий, | ||
+ | |||
+ | ==== 40. Neural network approach to impact parameter estimation in high-energy collisions using the microchannel plate detector data ==== | ||
+ | __K.Galaktionov__ (SPbSU, St.-Petersburg), | ||
+ | V.Roudnev, F.Valiev | ||
+ | |||
+ | Evaluation of the impact parameter (which is the distance between the trajectories of the colliding particles) in a single event in high-energy ion collisions is an important problem in data analysis in particle physics, since the impact parameter is crucial for understanding the dynamics of these collisions and is essential for extracting information about the properties of nuclear matter. In this work, we present the study of a neural network approach in estimating both the value of the impact parameter and the class of collisions (head-on or peripheral collisions). We have modeled and investigated the response of the detectors based on microchannel plates proposed in [1], using the $\mbox{Au}+\mbox{Au}$ collision dataset, at energies $\sqrt{s_{ NN}}=11\mbox{ GeV}$ obtained by the QGSM MC event generator. | ||
+ | We have used the spatial distribution of particles and their time-of-flight data as event features, and we have shown that adding the information about time-of-flight of particles to such event characteristics as the number of registered hits and the average angle of hits on the detectors improves the quality of the impact parameter evaluation. Moreover, by comparing two detector systems with different pseudorapidity acceptance $(3.5 \eta 5.8 \mbox{ and } 4.4 \eta 5.8)$, we have shown that wider interval significantly improves the results. | ||
+ | In the course of studies on this data set, the proposed algorithm was able to successfully classify more than 98\% of $\mbox{Au}+\mbox{Au}$ head-on collision events with an impact parameter of less than 5 fm and can be further useful as a fast trigger system. We also discuss further developments and improvements for possible applications of this technique in future experimental setups. | ||
+ | 1] A.A. Baldin, G.A. Feofilov, P. Har’yuzov, | ||
+ | |||
+ | ==== 28. The selection of rare gamma event from IACT images with deep learning methods ==== | ||
+ | |||
+ | __E.Gres (IGU, Irkutsk)__ \\ | ||
+ | A.Demichev (SINP MSU, Moscow), Ju.Dubenskaya (SINP MSU, Moscow), A.Kryukov (SINP MSU, Moscow), S.Polyakov (SINP MSU, Moscow), D.Zhurov (IGU, Irkutsk), A.Vlaskina (SINP MSU, Moscow) | ||
+ | |||
+ | Imaging Atmospheric Cherenkov Telescopes (IACT) of gamma ray observatory TAIGA detect the Extesnive Air Showers (EASs), originating from the cosmic or gamma rays interactions with the atmosphere. Thereby telescopes obtain images of the EASs. The ability to determine the gamma rays from hadronic cosmic ray background in images is one of the main features of this type of detectors. However, in actual IACT observations simultaneous observation of the background and the source of gamma ray are needed. This observation mode (called as Wobbling) modifies the images of events, which affects the quality of selection by neural networks. | ||
+ | Thus, in this work the results of the application of convolutional neural networks (CNN) in image classification task on MC images of TAIGA-IACTs are presented. The Wobbling mode are considered together with the image adaptation for adequate analysis by the CNN. Also taking into account all necessary image modifications the estimation of the quality selection by the CNN for the rare gamma events selection in MC simulation are given. | ||
+ | |||
+ | ==== 6. Preliminary results of neural network models in HiSCORE experiment ==== | ||
+ | |||
+ | __A.Kryukov (SINP MSU, Moscow),__ \\ | ||
+ | A.Vlaskina (SINP MSU, Moscow), A.Demichev (SINP MSU, Moscow), Ju.Dubenskaya (SINP MSU, Moscow), E.Gres (IGU, Irkutsk), S.Polyakov (SINP MSU, Moscow), D.Zhurov (IGU, Irkutsk) | ||
+ | |||
+ | The paper presents preliminary results on determining the direction of the EAS axis in experiments representing an array of Cherenkov detectors. An example of such a facility is the HiSCORE facility deployed near Baikal lake as part of the TAIGA experiment. Two approaches have been considered. One is based on the representation of the time of registration of signal arrival by stations in the form of an image, which is processed using a convolutional network. Another approach is to allocate a subset of the same number of triggered stations in each events, which includes data on the location of stations relative to each other and the relative time of signal registration. The analysis is performed using a fully connected deep network. It was shown in the work that both approaches give approximately the same accuracy. In the future, we propose to optimize the architecture of both networks and the process of their training to improve the accuracy of predicting EAS parameters. | ||
+ | |||
+ | The work was supported by RSF, grant no.22-21-00442. The work was done using the data of UNU " | ||
+ | |||
+ | ==== 27. The use of conditional variational autoencoders for simulation of EASs images from IACTs ==== | ||
+ | |||
+ | __A.Kryukov__ (SINP MSU, Moscow) \\ | ||
+ | S.Polyakov (SINP MSU, Moscow), A.Demichev (SINP MSU, Moscow), Ju.Dubenskaya (SINP MSU, Moscow), E.Gres (IGU, Irkutsk), D.Zhurov (IGU, Irkutsk), A.Vlaskina (SINP MSU, Moscow) | ||
+ | |||
+ | Imaging atmospheric Cherenkov telescopes (IACTs) are used to record the images of extensive area showers (EASs) caused by high-energy particles colliding with the upper atmosphere. Individual images can be used to distinguish between the types of primary particles and estimate the parameters of the events. The algorithms typically use the data from the events with known parameters simulated using Monte Carlo method. In this paper, we investigate the possible alternative approach of simulating IACT images by using conditional variational autoencoders. Both the characteristics of the individual images and their distributions are compared to those of the images generated by Monte Carlo simulation. | ||
+ | |||
+ | ==== 15. Универсальный метод разделения широких атмосферных ливней по первичной массе с помощью машинного обучения для черенковского телескопа типа СФЕРА ==== | ||
+ | |||
+ | __V.Latypova__ (SINP MSU, Moscow), \\ | ||
+ | Азра К.Ж., Бонвеч Е.А. , Галкин В.И., Зива М.Д., Иванов В.А. , Подгрудков Д.А., Роганова Т.М., Чернов Д.В, Энтина Е.Л. | ||
+ | |||
+ | Важной задачей физики высоких энергий является нахождение массового состава космических лучей в диапазоне энергий 1-100 ПэВ. Целью эксперимента СФЕРА-2 является определение массового состава первичного космического излучения. Чем лучше происходит разделение событий, | ||
+ | В настоящей работе разработан эффективный способ разделения первичных ядер, образовавших широкие атмосферные ливни, на основе смоделированных событий аппарата СФЕРА с помощью методов машинного обучения. Поскольку для моделирования искусственных событий используется модель ядро-ядерного взаимодействия, | ||
+ | Методами машинного обучения решена задача регрессии. Разделение событий на три группы ядер – протонов (p), азота (N), железа (Fe) с помощью нейронной сети происходит более точно, чем при использовании классических методов. | ||
+ | |||
+ | |||
+ | ==== 7. Using Neural Networks for Reconstructing Particle Arrival Angles in the Baikal-GVD Neutrino Telescope ==== | ||
+ | |||
+ | __A.Leonov__ (MIPT, Moscow region), \\ | ||
+ | O.Kalashev | ||
+ | |||
+ | |||
+ | The problem of reconstructing the angles of arrival of neutrinos in the Baikal-GVD experiment using neural networks is being studied. We use Monte Carlo simulation data for single-cluster arrival events of atmospheric neutrinos with energies from 10 GeV to 100 TeV. The performance of the networks was compared with the standard reconstruction based on median angular resolutions. As a result, it was shown that neural networks can cope more accurately than standard reconstruction when restoring small polar arrival angles. | ||
+ | |||
+ | ==== 9. Application of machine learning methods in Baikal-GVD: background noise rejection and selection of neutrino-induced events==== | ||
+ | |||
+ | __A.Matseiko__ (MIPT, Moscow region, INR RAS, Moscow) | ||
+ | |||
+ | Baikal-GVD is a large (~1 km3) underwater neutrino telescope located in Lake Baikal, Russia. In the talk, we present two machine learning techniques developed for its data analysis. First, we introduce a neural network for an efficient rejection of noise hits, emerging due to natural water luminescence. On Monte-Carlo simulated data, it reaches up to 99% signal purity (precision) and 96% survival efficiency (recall). Second, we develop a neural network for distinguishing muon- and neutrino-induced events. By choosing appropriate classification threshold, we preserve 60% of neutrino-induced events, while muon-induced events are suppressed by a factor of 10-6. Both of the developed neural networks employ the causal structure of events and surpass the precision of standard algorithmic approaches. | ||
+ | ==== 36. Novelty Detection Neural Networks for Model-Independent New Physics Search ==== | ||
+ | |||
+ | __A.Zaborenko__ (MSU, Moscow) \\ | ||
+ | L.Dudko | ||
+ | |||
+ | Recent advancements in model-independent approaches in High Energy Physics have encountered challenges due to the limited effectiveness of unsupervised algorithms when compared to their supervised counterparts. In this paper, we present a novel approach utilizing a one-class Deep Neural Network (DNN) to achieve accuracy levels comparable to supervised learning methods. Our proposed novelty detection algorithm uses a multilayer perceptron to learn and distinguish a specific class from simulated noise signals. By training on a single class, our algorithm constructs a hyperplane similar to one-class Support Vector Machines (SVMs) but with enhanced accuracy and significantly reduced training and inference times. | ||
+ | This research contributes to the advancement of model-independent techniques for uncovering New Physics phenomena, showcasing the potential of one-class DNNs as a viable alternative to traditional supervised learning approaches. The obtained results demonstrate the effectiveness of our proposed algorithm, paving the way for improved anomaly detection and exploration of uncharted territories in High Energy Physics. | ||
+ | ===== Track 2. Modern Machine Learning Methods | ||
+ | |||
+ | ==== 14. Reconstruction Methods for a Partial Differential Equation: Application to Physical and Engineering Problems ==== | ||
+ | |||
+ | __N.Bykov__ (ITMO Universiti, St-Petersburg), | ||
+ | A.A.Hvatov, T.A.Andreeva, | ||
+ | |||
+ | Two approaches for restoring a process model in the form of a partial differential equation based on experimental data are developed. The first approach is based on the evolutionary algorithm, and the second one on the algorithm for best subset selection. Different techniques for numerical differentiation of noisy data are proposed for solving physical and engineering problems. A full-scale experiment of heating a viscous liquid by a merged heat source was carried out as the benchmark for verification of developed algorithms. The efficiency of the algorithms for restoring the structure of the equation, determining the coefficients of derivatives and the possibility of detecting the change in the object operation mode are shown. Additionally, | ||
+ | |||
+ | This research is financially supported by The Russian Science Foundation, Agreement N 21-11-00296, | ||
+ | ==== 45. Decomposition of Spectral Contour into Gaussian Bands using Improved Modification of Gender Genetic Algorithm ==== | ||
+ | |||
+ | __S.Dolenko__ (SINP MSU, Moscow), \\ | ||
+ | G.A.Kupriyanov (1), I.V.Isaev (2,3), I.V.Plastinin (2), T.A.Dolenko (1,2) \\ | ||
+ | (1) Faculty of Physics, Lomonosov Moscow State University, Moscow, \\ | ||
+ | (2) SINP MSU, Moscow | ||
+ | |||
+ | One of the methods for analysis of complex spectral contours (especially for spectra of liquid objects) is their decomposition into a limited number of spectral bands with physically reasonable shapes (Gaussian, Lorentzian, Voigt etc.). Consequent analysis of the dependencies of the parameters of these bands on some external conditions in which the spectra are obtained may reveal some regularities bearing information about the physical processes taking place in the object. | ||
+ | |||
+ | The problem with the required decomposition is that such decomposition in presence of noise in spectra is an incorrect inverse problem. Therefore, this problem is often solved by advanced optimization methods less subject to be stuck in local minima, such as genetic algorithms (GA). | ||
+ | |||
+ | In the conventional version of GA, all individuals are similar regarding the probabilities and implementation of the main genetic operators (crossover and mutation) and the procedure of selection. In their preceding studies [1, 2], the authors tested gender GA (GGA), where the individuals of the two genders differ by the probability of mutation (higher for males) and by the procedures of selection for crossover (with the number of crossovers limited for females). In this study, we introduce additional differences between the genders in the procedures of selection and mutation. The improved modification of GGA is tested by comparison of the efficiency of gradient descent, conventional GA and two versions of GGA in solving the problems of decomposition of the Raman valence band of liquid water into Gaussian bands. | ||
+ | |||
+ | This study has been funded by the SINP MSU state budget topic 6.1 (01201255512). | ||
+ | |||
+ | References \\ | ||
+ | 1. G.A.Kupriyanov, | ||
+ | 2. G.Kuptiyanov, | ||
+ | |||
+ | ==== 3. Robust equation discovery as a machine learning method ==== | ||
+ | |||
+ | __A.Hvatov__ (ITMO University), | ||
+ | M.Maslyaev, R.Titov, D. Aminev, J.Schvartsberg, | ||
+ | |||
+ | In this presentation, | ||
+ | |||
+ | ==== 20. Modification of soft connectives in machine learning models ==== | ||
+ | //Poster// | ||
+ | |||
+ | __V.Kalnitsky__ (SPbSU, St.-Petersburg) | ||
+ | |||
+ | The problem of limited accuracy of machine learning models using soft logical connectives is investigated. Such connectives have shown their effectiveness in models with fuzzy initial data. On the one hand, the fundamental disadvantage of soft connectives is their non-associativity. On the other hand, the disadvantages of the currently used soft connectives include the loss of monotonicity and the inability to control several factors simultaneously. We have proposed an approximation of the signum function by a smooth spline. We are controlling the difference between the soft connective and the associative connective. It was shown that the spline approximation is able to reduce the influence of all negative factors and is more flexible in setting. Moreover, the constructed spline model allows numerous modifications depending on the factor that requires the most attention for different tasks. | ||
+ | |||
+ | ==== 26. The importance of the number of overfits in time series forecasting by some optimizers and loss functions in neural networks. ==== | ||
+ | |||
+ | __Z.Kurdoshev__ (Tomsk State University, Tomsk), \\ | ||
+ | Е.Пчелинцев | ||
+ | |||
+ | Time series forecasting is one of the most important problems today, and the issue of forecasting has not lost its relevance. Prediction accuracy is difficult even for recurrent neural networks from simple neural networks. Loss functions of a series of predictions depend on the number of retraining and optimizers, their efficiency and scale in recurrent neural networks. Various loss functions and modern optimizers are used for the study. Empirical studies are carried out using LSTM, RNN and GRU networks. Each neural network examines the dependence of time series on overfitting using Adam, AdaGrad, and stochastic gradient descent. The results of the study are presented in the form of graphs. | ||
+ | |||
+ | ==== 21. Training by relaxation of Boltzmann machine with multispin interactions==== | ||
+ | |||
+ | __I.Lobanov__ (ITMO University, St.-Petersburg) | ||
+ | |||
+ | Boltzmann machine (BM) is an energy based generative model capable of solving unsupervised and supervised machine learning problems. BM have been successfully used for facial recognition, | ||
+ | |||
+ | We proposed to implement the training of the biases as energy minimization of a carefully designed spin system, where the biases are implemented as macrospins included in the device [I. Lobanov. Spin Boltzmann machine. Nanosystems: | ||
+ | |||
+ | A universal BM should be capable of learning correlations in data, that is connections between the units should be optimized. We propose to control the connections using multispin interactions between two units and one or two macrospins representing weights. In the approach the training data is provided to the device by applying external magnetic field, and all the parameters of BM can be trained by relaxation. To test the approach we simulated a generalized Heisenberg model including symmetric two-, three- and four- spins exchanges, uniaxial anisotropy and the Zeeman term. We call the model spin Boltzmann machine (SBM). Simulation is done using an semi-implicit second-order integrator implemented in a hand made GPU accelerated software. The efficiency of SBM is demonstrated on classification of handwritten digits from MNIST database. The significance of multispin exchanges contribution to total energy was demonstrated in [Hoffmann M., Blugel S. Systematic derivation of realistic spin models for beyond-Heisenberg solids. ¨ Phys. Rev. B, 2020, 101, P. 024418], however design of such material on purpose is an open problem. In the work we outline constraints on the multispin exchange parameters, which can guide further search of materials for experimental realization of SBM. | ||
+ | |||
+ | ==== 17. Hyper-parameter tuning of neural network for high-dimensional problems in the case of Helmholtz equation ==== | ||
+ | |||
+ | __D.Poliakov__, | ||
+ | M.Stepanova | ||
+ | |||
+ | In this work we study the effectiveness of common hyper-parameter optimization (HPO) methods for physics-informed neural network (PINN) with application to multidimensional Helmholtz problem. The network was built upon PyTorch framework without the use of special PINN-oriented libraries. We investigate the effect of hyper-parameters on NN model performance and conduct automatic hyper-parameter optimization using different combinations of search algorithms and trial schedulers. | ||
+ | We chose Ray Tune — an open-source HPO framework that provides unified interface for many HPO packages — as the HPO tool in our work. We consider two search algorithms: random search and Bayesian method based on tree-structured Parzen estimator (TPE, in two implementations: | ||
+ | |||
+ | ===== Track 3. Machine Learning in Natural Sciences ===== | ||
+ | |||
+ | ==== 12. Estimating cloud base height from all-sky imagery using artificial neural networks ==== | ||
+ | __M.Borisov__ (MIPT, Moscow region), \\ | ||
+ | M.Krinitskiy, | ||
+ | |||
+ | Cloud Base Height (CBH) is one of most important meteorological parameters. CBH strongly correlates with planetary boundary layer depth. Existing methods for assessing CBH in practice either involve the use of complex and expensive equipment, such as lidars, airplanes, meteorological pilot balloons, or have high uncertainty introduced by an expert in the process of visual assessment. In addition, most of the instrumental methods designed for a stable platform for installing equipment are difficult to apply under conditions of waving. | ||
+ | In this study, we propose a new way to estimate CBH using two optical wide-angle cameras mounted at a distance which may vary from 15m. to 30m. In our approach, we use the phenomenon of parallax, namely the property of observing a point of a cloud at angles that differ due to the displacement of an observation device. Optical images of the visible sky hemisphere are acquired synchronously during field observations from two cameras of the SAIL Cloud v.2 cloud characteristics assessment device developed in Sea-Atmosphere Interaction Laboratory (SAIL) of Shirshov Institute of Oceanology of the Russian Academy of Sciences. | ||
+ | In order to exploit the parallax phenomenon for closely located objects (clouds), we adjust the transformation of one of the images of each pair to compensate for inaccurate camera installation. To do this, we calculate the correction of the position and orientation of the cameras using distant objects, such as the Sun or the Moon. | ||
+ | To find the key points of clouds that are observed from cameras at different angles, the graph artificial neural network SuperGlue is used. With the help of SuperGlue, we identify key points and extract their geometric features, according to which we further match key points on synchronous images. Based on the location of key points on synchronous aligned images, we calculate the angle at which they are visible in two cameras, which allows us to estimate the distance from the camera installation base to the clouds. | ||
+ | As a result of this study, we developed a new algorithm for calculating CBH. We compared the results of CBH estimates with the values of the ERA-5 reanalysis for marine expeditions AI57, AI58, AI61 of the Shirshov Institute of Oceanology of the Russian Academy of Sciences. The closest consistency with the reanalysis data is observed for cirrus and cumulus clouds. | ||
+ | |||
+ | |||
+ | ==== 46. Classification Approach to Prediction of Geomagnetic Disturbances ==== | ||
+ | __I.Gadzhiev__ (1,2), \\ | ||
+ | I.Isaev (1), O.Barinov (1), S.Dolenko (1), I.Myagkova(1) \\ | ||
+ | (1) D.V.Skobeltsyn Institute of Nuclear Physics, M.V.Lomonosov Moscow State University, Moscow, Russia \\ | ||
+ | (2) Faculty of Physics, M.V.Lomonosov Moscow State University, Moscow, Russia | ||
+ | |||
+ | Magnetic storms can cause disruptions in the operation of radio communications, | ||
+ | Geomagnetic disturbances are usually described with the help of planetary index Kp, which is provided at 3-hours interval. The approach used in this study implies classifying geomagnetic disturbances according to the level of the Kp index. To do so, the whole range of the index values is divided into several intervals according to the degree of disturbance. | ||
+ | The input data are time series of parameters of solar wind and interplanetary magnetic field, measured onboard spacecraft at L1 Lagrange point between the Sun and the Earth, and of the value of Kp index itself. To account for the “memory” of the time series, delay embedding of all the parameters is used – for each of the parameters its several preceding values are taken into account. Additional pre-processing of the parameters is performed by calculating moving averages and other statistical indicators of the time series. | ||
+ | To perform classification, | ||
+ | It is demonstrated that the suggested approach outperforms the trivial inertial model for all values of prediction horizon from 3 to 24 hours (with 3 hours step). The most efficient pre-processing methods are described, as well as the best machine learning models. | ||
+ | |||
+ | The study has been conducted at the expense of the grant of the Russian Science Foundation | ||
+ | No. 23-21-00237, | ||
+ | |||
+ | ==== 50. CLIENT-SERVER APPLICATION FOR AUTOMATED ESTIMATION OF THE MATERIAL COMPOSITION OF BOTTOM SEDIMENTS IN THE >0.1 MM FRACTION FROM MICROPHOTOGRAPHY USING MODERN DEEP LEARNING METHODS ==== | ||
+ | |||
+ | __V.Golikov__(MIPT, | ||
+ | Krinitskiy M.A., Borisov D.G. | ||
+ | |||
+ | The information on the past climates deep-water environments is preserved in natural archives, such as marine sediments covering the sea-floor. The study of sediment composition in coarse fraction (variety of mineral and biogenic grains over 0.063 mm in size) is widely used to pry climate clues out of the sediment record. | ||
+ | At present, specialists use a binocular microscope to visually classify grains from a small portion of a sediment sample. This time-consuming technique requires the observer to possess geological expertise. In previous work, we proposed an algorithm for automatic unsupervised detection of particles and their clustering. In the current work, we present qualitative improvements to the algorithm, which now employs the state-of-the-art clustering method, SPICE. This method allowed us to eliminate over-clustering and limit the number of clusters to three, making the results more suitable for interpretation. We trained the algorithm and interpreted its results. The resulting model can be used as a classifier, enabling the calculation of particle distribution by clusters, analysis of grain-size distribution, | ||
+ | |||
+ | ==== 49. Transfer Learning for Neural Network Solution of an Inverse Problem in Optical Spectroscopy. ==== | ||
+ | __A.Guzkov__ (1), \\ | ||
+ | I.Isaev (2), S.Burikov (1), T.Dolenko (1), K.Laptinskiy (2), S.Dolenko (2) \\ | ||
+ | 1. Faculty of Physics Lomonosov Moscow State University \\ | ||
+ | 2. SINP MSU, Moscow \\ | ||
+ | |||
+ | This work is devoted to the inverse problems of optical spectroscopy, | ||
+ | Therefore, in this paper, it was proposed to use the transfer learning approach to improve the quality of the neural network solution. The neural networks were pre-trained on a basic dataset (about 3700 patterns) containing the spectra of solutions prepared with distilled water. Then fine tuning and testing of the networks were carried out on specific data sets (200 - 400 samples) containing the spectra of solutions prepared in river water taken from different rivers. | ||
+ | |||
+ | This study has been performed at the expense of the grant of the Russian Science Foundation no. 19-11-00333, | ||
+ | |||
+ | |||
+ | ==== 48. The study of the integration of physical methods in the neural network solution of the inverse problem of exploration geophysics with variable physical properties of the medium. ==== | ||
+ | |||
+ | __I.Isaev__ (1, 3), \\ | ||
+ | I.Obornev (SINP MSU, Moscow), E.Obornev (2), E.Rodionov (2), M.Shimelevich (2), S.Dolenko (1) \\ | ||
+ | 1. D.V.Skobeltsyn Institute of Nuclear Physics, M.V.Lomonosov Moscow State University, Moscow \\ | ||
+ | 2. Sergo Ordjonikidze Russian State University for Geological Prospecting, | ||
+ | 3. KIRE RAS | ||
+ | |||
+ | The subject area of this study is exploration geophysics, which requires solving specific inverse problems - reconstructing the spatial distribution of the medium properties in the thickness of the earth from the geophysical fields that are measured on its surface. We consider the inverse problems of gravimetry, magnetometry and magnetotelluric sounding, and their integration, | ||
+ | Possibility of the integration of various geophysical methods requires the determined parameters for each of the methods to be the same. This may be achieved by the spatial statement of the problem, in which the task is to determine the boundaries of geophysical objects. In our previous studies, we consider the parameterization scheme where the inverse problem was to determine the lower boundary of several geological layers. Each layer was characterized by variable values of the depth of the lower boundary along the section, and by fixed values of density, magnetization, | ||
+ | The present study is a continuation of work in this direction, and here we consider a parameterization scheme with variable properties of the medium along each layer, and variable properties over a data set. | ||
+ | |||
+ | This study has been performed at the expense of the grant of the Russian Science Foundation no. 19-11-00333, | ||
+ | |||
+ | References \\ | ||
+ | 1. I.V. Isaev, I.E. Obornev, E.A. Obornev, E.A. Rodionov, M.I. Shimelevich, | ||
+ | |||
+ | ==== 13. Estimating significant wave height from X-band navigation radar using convolutional neural networks ==== | ||
+ | __M.Krinitsky__ (Shirshov Institute of Oceanology, RAS, Moscow), \\ | ||
+ | V. Golikov, N.Anikin, A.Suslov, A.Gavrikov, N.Tilinina | ||
+ | |||
+ | Marine radars are vital for safe navigation at sea, detecting vessels and obstacles. Sea clutter, caused by Bragg scattering, is filtered out as noise, but becomes detectable on radar images when wind speed and wave height exceed certain thresholds. Wind-induced wave parameters can be determined using these images, but traditional spectral methods for obtaining wave characteristics face limitations and difficulties in improving accuracy. | ||
+ | Machine learning techniques offer advantages in image processing tasks, being more robust and able to handle noisier data. Convolutional neural networks (CNNs) have gained popularity for their efficiency in image recognition problems, extracting features automatically and exhibiting robustness to noise. CNNs have shown success in geosciences, | ||
+ | Recently, CNNs have been applied for real-time significant wave height estimation using ocean images from various sources. Techniques for calculating wave period and direction of radar sea clutter images have also been developed using CNNs. In recent studies, an approach was demonstrated highlighting the potential of integrating machine learning techniques like CNNs for improved wave measurement accuracy in marine radar applications. | ||
+ | In our study, we present the usage of convolutional neural networks (CNNs) for restoring wave characteristics from shipborne radar data and compares the results with those obtained from Spotter buoy. Using CNNs together with traditional spectral analysis methods helps to improve the accuracy of wave characteristics reconstruction and increase computational efficiency. | ||
+ | |||
+ | ==== 38. Machine learning techniques for anomaly detection in high-frequency time series of wind speed and greenhouse gas concentration measurements ==== | ||
+ | |||
+ | __A.Kasatkin__ (Shirshov Institute of Oceanology, RAS), \\ | ||
+ | M.Krinitsky | ||
+ | |||
+ | Fluxes of greenhouse gases (GHG) may be assessed in situ using eddy covariance method through processing high-frequency measurements of gas concentration and wind speed acquired at a certain sites, e.g., carbon measurements test areas of the pilot project of the Ministry of Education and Science of Russia. The measurements commonly come with noise, anomalies and gaps of various nature. These anomalies result in biased GHG flux estimates. There are a number of empirical and heuristic approaches for filtering the noise and anomalies, as well as for gap filling. These approaches are characterized by a lot of tuning parameters that are commonly adjusted by an expert, which is a limiting factor for large-scale deployment of GHG monitoring stations. | ||
+ | In this study, we propose an alternative approach for anomaly detection in high-frequency measurements of GHG concentration and wind speed. Our approach is based on machine learning techniques. This approach is characterized by lower number of tuning parameters. The goal of our study is to develop a fully-automated data preprocessing routine based on machine learning algorithms. | ||
+ | We collected the dataset of high-frequency GHG concentration and wind speed measurements from one of the carbon measurements test areas. In order to compare anomaly detection algorithms, we labeled anomalies in a subset of this dataset. We present two approaches for anomaly detection, namely (a) identification of outliers based on the error magnitude in time series statistical forecast performed by a machine learning (ML) algorithm; (b) classification of anomalies by a ML model trained on labeled dataset of outliers we mentioned above. We compare the approaches and algorithms based on F1-score metric assessed with respect to expert-labeled subset of anomalies in GHG concentration and wind speed time series. | ||
+ | Within the forecast-error based approach, we trained several ML models: ARIMA autoregression method, Catboost model for autoregression, | ||
+ | We demonstrate that ML models for forecasting deliver high quality of time series prediction within autoregression approach. We also show that the anomaly identification method based on autoregression approach employing additional features delivers the best quality with F1-score = 0.75. | ||
+ | |||
+ | ==== 42. Identifying cetacean mammals in high-resolution optical imagery using anomaly detection approach employing Machine Learning models ==== | ||
+ | |||
+ | __I.Khabutdinov__ (Shirshov Institute of Oceanology, RAS), \\ | ||
+ | M.Krinitskiy, | ||
+ | |||
+ | Cetacean mammal populations, | ||
+ | |||
+ | To address this issue, we propose the use of machine learning (ML) approaches, specifically anomaly detection using ML models. We examine a dataset collected during accounting marine expeditions of Shirshov Institute of Oceanology of Russian Academy of Sciences (IORAS) in the Black Sea from 2018 to 2019. The dataset consists of 3730 high-resolution photographs, | ||
+ | |||
+ | Our study compares classical and naive anomaly detection methods with reconstruction-based approaches that discriminate anomalies based on the magnitude of reconstruction errors. Within this latter approach, we utilize various artificial neural networks, such as Convolutional Autoencoders (CAE) and U-Net, for image reconstruction. Overall, our research aims to streamline the process of counting and monitoring dolphin populations in high-resolution imagery using advanced ML techniques. | ||
+ | |||
+ | |||
+ | |||
+ | ==== 41. Recognition of skin lesions by images ==== | ||
+ | |||
+ | __M.Ledovskikh__ (SPbSU, St.-Petersburg), | ||
+ | V. Gorikhovskii | ||
+ | |||
+ | The growing number of skin cancer patients has revealed the need for diagnostic expert systems that will help detect lesions with high accuracy. The identification of dangerous diseases associated with skin lesions, especially malignant neoplasms, requires the identification of pigmented skin lesions. Image recognition techniques and computer classification capabilities can improve the accuracy of skin cancer detection. In this work, a program was developed for segmentation of affected and unaffected skin areas. The resulting masks were used to train machine learning models. Based on the results, expert analytical systems were proposed that will help doctors diagnose skin diseases and distinguish between skin lesions both cancerous and non-cancerous. | ||
+ | |||
+ | ==== 44. Определение особых точек кривой накопления флуоресценции полимеразной цепной реакции методами машинного обучения без учителя ==== | ||
+ | |||
+ | __A.Orekhov__ (SPbSU, St-.-Petersburg), | ||
+ | М.А.Потехина | ||
+ | |||
+ | Метод полимеразной цепной реакции (ПЦР) – это циклический процесс, | ||
+ | |||
+ | На начальных циклах прибор регистрирует фоновый уровень флуоресценции, | ||
+ | |||
+ | Существует большое количество математических моделей, | ||
+ | |||
+ | Для решения этой задачи можно использовать методы машинного обучения без учителя. Если рассматривать амплификацию как квазидетерминированный дискретный случайный процесс, | ||
+ | |||
+ | |||
+ | ==== 10. Application of machine learning methods to numerical simulation of hypersonic flow ==== | ||
+ | |||
+ | __S.Pavlov__ (SPbSU, St.-Petersburg), | ||
+ | V.Istomin | ||
+ | |||
+ | The present study is devoted to applying of machine learning methods for numerical simulation of hypersonic five-component air-mixture flow past a sphere. Exact methods of transport coefficient calculation such as the Chapman-Enskog method are very computationally inefficient for numerical simulation. At the same time, approximate formulae for transport coefficients sometimes do not provide the necessary accuracy. Due to these facts, different machine learning methods for calculation of transport coefficients are considered. Comparison with experimental data is given. The results are analyzed in terms of calculation speed and computation accuracy. | ||
+ | |||
+ | ==== 24. A technique for the total ozone columns retrieval using spectral measurements of the IKFS-2 instrument. ==== | ||
+ | |||
+ | __A.Polyakov__ (SPbSU, St.-Petersburg), | ||
+ | Virolainen Y.A., Timofeev Yu.M., Nerobelov G.M., Kozlov D.A. | ||
+ | |||
+ | Atmospheric ozone plays an important role in the Earth' | ||
+ | We present a technique for estimating the total ozone columns (TOCs) from outgoing IR radiation spectra based on the artificial neural networks (ANNs). The technique was applied to the spectral measurements of the IKFS-2 instrument aboard the Russian meteorological satellite Meteor M N2 in 2015-2020, using data from the OMI instrument for the ANN training. A detailed validation of the TOC retrievals was carried out based on comparison with ground-based measurements presented by the WOUDC and Eubrewnet networks and satellite measurements of the OMI (Aura), TROPOMI (Sentinel 5p), and IASI (MetOp) instruments. The latitudinal-seasonal behavior of the differences in the results of different measurements was analyzed, which allowed to estimate the corresponding dependences of the measurement errors. TOCs distribution for the period 2015 to 2020 is shown. The average differences between the IKFS-2 data and independent TOC measurements are less than 2%, the standard deviations of the differences (SDDs) vary from 2 to 4%. At the same time, both the analysis of the ANN approximation errors of the OMI data and the comparison of the IKFS-2 results with independent data demonstrate an increase in discrepancies towards the poles. In the spring-winter period, SDDs reach up to 8% in the Southern and up to 6% in the Northern Hemispheres. | ||
+ | The technique developed made it possible to derive a global distribution of TOCs for the 2015-2020 period, regardless of solar illumination and presence of clouds. In addition, this technique can be used to process the spectral data of the IKFS-2 instruments on the Meteor M N2 series of satellites. | ||
+ | The study was carried out in the "Ozone Layer and Upper Atmosphere Research” laboratory of St. Petersburg State University (agreement with the Ministry of Education and Science of the Russian Federation No. 075-15-2021-583). | ||
+ | |||
+ | ==== 24. Повышение точности нейросетевой оценки значимой высоты ветрового волнения по данным судового навигационного радара за счет предварительного обучения на синтетических данных ==== | ||
+ | |||
+ | __V.Rezvov__ (Shirshov Institute of Oceanology RAS, Moscow), \\ | ||
+ | M.Krinitskiy, | ||
+ | |||
+ | Судовые навигационные радары (СНР) являются важным инструментом морской навигации. Обычно отраженный от поверхности воды сигнал, | ||
+ | |||
+ | ==== Decoding fluorescence excitation-emission matrices of carbon dots aqueous solutions with convolutional neural networks to create multimodal nanosensor of metal ions ==== | ||
+ | //Poster// | ||
+ | |||
+ | __O.Sarmanova__ (SINP MSU, Moscow), \\ | ||
+ | G.N.ChugreevaK.A.Laptinskiy, | ||
+ | |||
+ | In the study the problem of creating a multimodal fluorescent carbon dots-based nanosensor to simultaneously measure concentrations of Cu2+, Ni2+, Cr3+, NO3- ions in liquid media, as well as its pH index has been solved. The application of convolutional neural networks to analyze 1000 complex fluorescence excitation-emission matrices of carbon dots in the presence of the studied ions enabled their identification with mean absolute errors that satisfy the needs of monitoring the composition of waste and technological medium. As a result of further extension of the dataset representativeness with variational autoencoders, | ||
+ | |||
+ | This study has been conducted at the expense of the grant of the Russian Science Foundation № 22-12-00138, | ||
+ | |||
+ | ==== 34. SMAP sea surface salinity improvement in the Arctic region using machine learning approaches ==== | ||
+ | |||
+ | __A.Savin__ (MIPT, Moscow region, Shirshov Institute of Oceanology RAS, Moscow) \\ | ||
+ | M. Krinitskiy, A. Osadchiev | ||
+ | |||
+ | Sea salinity is one of the fundamental characteristics of physicochemical processes occurring in the ocean, and its consideration plays a substantial role in the description of the climate. Modern sea surface salinity (SSS) retrieval algorithms using satellite data have been developed and verified with high accuracy for the most typical regions of the World Ocean and show significantly lower quality when used in Arctic. In this study, the quality of standard algorithms is improved by using machine learning (ML) approaches. Different models of ML are examined: from classical ones that apply vector features used by the standard Soil Moisture Active Passive (SMAP) satellite salinity algorithms, to deep artificial neural networks that combine the mentioned vector features with two-dimensional fields taken from the ERA5 reanalysis. The models are validated using in-situ data collected during expeditions of the Shirshov Institute of Oceanology RAS in the Barents, Kara, Laptev, and East Siberian seas from 2015 to 2021. As a result of the study, the SMAP SSS standard product is improved in the examined region, and the most significant features for SSS retrieval are determined. The obtained results enable research of the Arctic region using improved sea surface salinity maps. | ||
+ | |||
+ | ==== 33. Determination of the charge of molecular fragments by machine learning methods ==== | ||
+ | |||
+ | __A.Shevchenko__, | ||
+ | A.Chuvakov | ||
+ | |||
+ | Machine learning is currently used to solve a variety of scientific and industrial problems, including the screening of new drugs, design of new materials, management of artificial intelligence, | ||
+ | Our report is devoted to a narrower, but no less important task for chemists and cosmochemists - to determine the charge of a molecular fragment according to its various representations. We selected data, one of the most important components of machine learning, from information on the structure of monoligand coordination compounds from the Cambridge Structural Database [1], which contains more than one million entities. The separation of molecular fragments from the structure was carried out using the ToposPro software package [2], which implements a wide range of geometric and topological algorithms for solving such problems. | ||
+ | As a result, we collected information on 18.3 thousand molecular fragments from 38.6 thousand crystal structures. Data markup, that is, adding information about the charge of fragments to the database, turned out to be the most time-consuming operation. We determined the charge values from the literature data and calculated them based on chemical considerations, | ||
+ | |||
+ | The study is supported by a grant from the Russian Science Foundation no. 23-23-00387 https:// | ||
+ | |||
+ | [1] Allen F.H. Acta Cryst. Sect. A 58 2002, 380. https:// | ||
+ | [2] Blatov V.A., et al. Cryst. Growth Des. 2014, 14, 7, 3576. | ||
+ | |||
+ | ==== 18. АВТОМАТИЧЕСКОЕ ДЕТЕКТИРОВАНИЕ АКУСТИЧЕСКИХ СИГНАЛОВ БЕЛУХ И АФАЛИН ==== | ||
+ | |||
+ | __А.Тышко__ (Shirshov Institute of Oceanology, RAS, Moscow), \\ | ||
+ | Шатравин А.В., Криницкий М.А. | ||
+ | |||
+ | Акустический мониторинг морских млекопитающих является одним из основных методов оценки численности популяций животных в заданной акватории и позволяет выявлять характерные паттерны миграций животных, | ||
+ | |||
+ | ==== 37. The role of artificial intelligence in the preparation of modern scientific and pedagogical staff. The experience of the course " | ||
+ | |||
+ | __A.Vasiliev__ (MSU, AI, Moscow), \\ | ||
+ | D. Vasina, S. Zapunidi, A. Ivchenko, | ||
+ | |||
+ | Today, there is a lot of talk about artificial intelligence and its great importance both in industry and in science. | ||
+ | |||
+ | The demand for specialists who know how to use the capabilities of intelligent systems in their work is invariably high both in scientific laboratories and in the research departments of commercial companies. | ||
+ | |||
+ | Therefore, one of the tasks of modern education is to train such specialists even within the walls of the university, so that while receiving basic education, the student could also simultaneously study work with artificial intelligence, | ||
+ | |||
+ | The report will be devoted to the experience of conducting a practical course on machine learning, which is held on the basis of the Faculty of Physics of Moscow State University. | ||
+ | |||
+ | The course " | ||
+ | |||
+ | Our course materials are in open source to support the easiest iteration, use and modification. | ||
+ | |||
+ | The authors of the course hope that the training of modern scientific and pedagogical personnel accumulated in this course will seem interesting to a professional audience and will significantly speed up the development of educational courses on machine learning in educational institutions interested in this. | ||
+ | |||
+ | |||
+ | ==== 31. Machine learning for diagnostics of space weather effects in the Arctic region ==== | ||
+ | |||
+ | __A.Vorobev__ (Geophysical Center RAS) | ||
+ | |||
+ | Despite the variety of existing approaches to monitoring space weather and geophysical parameters in the auroral oval region, the problem of effective prediction and diagnostics of auroras as a special state of the upper ionosphere at high latitudes remains practically open. And another significant problem here is also concerned with diagnostics of geomagnetically induced currents (GICs) in extended grounded technological systems, which are driven by telluric electric fields induced by the rapid changes of the geomagnetic field. | ||
+ | The paper is concerned with research and analyzes of possibility of local diagnostics of auroras presence based on the intellectual analysis of geomagnetic data from ground-based sources. The significance of the indicative variables and their statistical relationships are assessed. So, for example, the application of Bayesian inference to the data of the Lovozero geophysical station for 2012–2020 had showed that the dependence of aposteriori probability of observing auroras in the optical range on the state of geomagnetic parameters is logarithmic, | ||
+ | Also the paper is concerned with research on the approach to diagnostics of GIC in the power transmission lines in northwestern Russia based on data from IMAGE magnetometers. Based on the results of the statistical and correlation analysis of the objective function (the level of the GIC recorded at the Vykhodnoy transformer station) and geomagnetic data recorded by the nearby IMAGE magnetometers, | ||
+ | ==== 30. Search for Meteors in the Mini-EUSO Orbital Telescope Data with Neural Networks ==== | ||
+ | |||
+ | __M.Zotov__ (SINP MSU, Moscow) | ||
+ | |||
+ | Since autumn 2019, the Mini-EUSO (``UV Atmosfera'' | ||
+ | |||
+ | |||
+ | |||