Accurate indoor localization using radio signals exchanged between base stations (BSs) and mobile agents (such as mobile phones and other radio-equipped devices) remains a challenging issue. This capability is essential for various critical applications, including search-and-rescue operations and autonomous navigation.
Simultaneous Localization and Mapping (SLAM) involves constructing a feature map of an environment while simultaneously estimating the location of an agent within that map. A promising approach in this domain is multipath-based SLAM, which leverages the specular reflections of radio signals from flat surfaces to estimate the positions of both a mobile agent and the environmental features (e.g., reflecting surfaces). Current model-based techniques for multipath-based SLAM employ high-dimensional Bayesian estimation methods using factor graphs [1]. However, these techniques rely on parametric models, making them vulnerable to performance degradation when the assumptions about feature models or statistical characteristics do not align with actual data. To address this limitation, the parametric structure of the factor graph can be augmented with neural networks, enabling the model to adapt to data-driven insights and correct for model mismatches [2], [3].