UAV-Assisted MEC for Disaster Response: Stackelberg Game-Based Resource Optimization arxiv.org/abs/2504.07119 .SP

GIGA: Generalizable Sparse Image-driven Gaussian Avatars arxiv.org/abs/2504.07144 .IV

Driving a high-quality and photorealistic full-body human avatar, from only a few RGB cameras, is a challenging problem that has become increasingly relevant with emerging virtual reality technologies. To democratize such technology, a promising solution may be a generalizable method that takes sparse multi-view images of an unseen person and then generates photoreal free-view renderings of that identity. However, the current state of the art is not scalable to very large datasets and thus lacks diversity and photorealism. To address this problem, we propose a novel, generalizable full-body model for rendering photoreal humans in free viewpoint, as driven by sparse multi-view video. For the first time in the literature, our model can scale up training to thousands of subjects while maintaining high photorealism. At the core, we introduce a MultiHeadUNet architecture, which takes sparse multi-view images in texture space as input and predicts Gaussian primitives represented as 2D texels on top of a human body mesh. Importantly, we represent sparse-view image information, body shape, and the Gaussian parameters in 2D so that we can design a deep and scalable architecture entirely based on 2D convolutions and attention mechanisms. At test time, our method synthesizes an articulated 3D Gaussian-based avatar from as few as four input views and a tracked body template for unseen identities. Our method excels over prior works by a significant margin in terms of cross-subject generalization capability as well as photorealism.
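
Not the authors' MultiHeadUNet, but a minimal sketch of the general idea the abstract describes: a small 2D convolutional head that maps texture-space features to per-texel Gaussian parameters (offset, rotation, scale, opacity, color) anchored on a body mesh. Channel counts, activations, and the 14-parameter split are assumptions for illustration.

```python
# Hypothetical sketch (not the paper's architecture): a 2D conv head that turns
# texture-space features into one Gaussian primitive per texel. The parameter
# split (3 offset + 4 quaternion + 3 scale + 1 opacity + 3 color) is assumed.
import torch
import torch.nn as nn

class TexelGaussianHead(nn.Module):
    def __init__(self, in_ch: int = 64):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv2d(in_ch, 128, 3, padding=1), nn.ReLU(),
            nn.Conv2d(128, 128, 3, padding=1), nn.ReLU(),
        )
        self.out = nn.Conv2d(128, 14, 1)

    def forward(self, feat):                     # feat: (B, C, H, W) texture-space features
        p = self.out(self.backbone(feat))
        offset   = p[:, 0:3]                     # displacement from the mesh anchor point
        rotation = nn.functional.normalize(p[:, 3:7], dim=1)  # unit quaternion
        scale    = torch.exp(p[:, 7:10])         # positive anisotropic scales
        opacity  = torch.sigmoid(p[:, 10:11])
        color    = torch.sigmoid(p[:, 11:14])
        return offset, rotation, scale, opacity, color

head = TexelGaussianHead()
feats = torch.randn(1, 64, 256, 256)             # fused multi-view features in UV space
gaussians = head(feats)                          # one Gaussian per texel of the body UV map
```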

Examining Joint Demosaicing and Denoising for Single-, Quad-, and Nona-Bayer Patterns arxiv.org/abs/2504.07145 .IV

Camera sensors have color filters arranged in a mosaic layout, traditionally following the Bayer pattern. Demosaicing is a critical step that camera hardware applies to obtain a full-channel RGB image. Many smartphones now have multiple sensors with different patterns, such as Quad-Bayer or Nona-Bayer. Most modern deep network-based models perform joint demosaicing and denoising, with the current strategy being to train a separate network per pattern. Relying on individual models per pattern incurs additional memory overhead and makes it challenging to switch quickly between cameras. In this work, we are interested in analyzing strategies for joint demosaicing and denoising for the three main mosaic layouts (1x1 Single-Bayer, 2x2 Quad-Bayer, and 3x3 Nona-Bayer). We found that concatenating a three-channel mosaic embedding to the input image and training with a unified demosaicing architecture yields results that outperform existing Quad-Bayer and Nona-Bayer models and are comparable to Single-Bayer models. Additionally, we describe a maskout strategy that enhances model performance and facilitates dead pixel correction -- a step often overlooked by existing AI-based demosaicing models. As part of this effort, we captured a new demosaicing dataset of 638 RAW images containing challenging scenes, with patches annotated for training, validation, and testing.
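
One way to read the "three-channel mosaic embedding" is as a per-pixel R/G/B indicator map obtained by tiling the base RGGB pattern at block sizes 1, 2, and 3, then concatenating it with the raw mosaic so a single network knows which layout it was given. A sketch of that reading follows; the paper's exact encoding may differ.

```python
# Hypothetical sketch of a 3-channel mosaic embedding for Single-, Quad- and
# Nona-Bayer CFAs: a per-pixel one-hot map of the filter color, concatenated
# to the 1-channel raw mosaic so one unified network can see the layout.
import numpy as np

def mosaic_embedding(h, w, block=1):
    """block=1 -> Bayer, 2 -> Quad-Bayer, 3 -> Nona-Bayer (RGGB base)."""
    base = np.array([[0, 1],
                     [1, 2]])                                  # 0=R, 1=G, 2=B in an RGGB tile
    tile = np.kron(base, np.ones((block, block), dtype=int))   # expand to a 2b x 2b tile
    reps = (h // tile.shape[0] + 1, w // tile.shape[1] + 1)
    cfa = np.tile(tile, reps)[:h, :w]
    onehot = np.stack([(cfa == c).astype(np.float32) for c in range(3)], axis=0)
    return onehot                                               # shape (3, h, w)

raw = np.random.rand(1, 12, 12).astype(np.float32)              # stand-in raw mosaic
net_input = np.concatenate([raw, mosaic_embedding(12, 12, block=3)], axis=0)
print(net_input.shape)                                          # (4, 12, 12)
```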

VideoSPatS: Video SPatiotemporal Splines for Disentangled Occlusion, Appearance and Motion Modeling and Editing arxiv.org/abs/2504.07146 .IV

We present an implicit video representation for occlusions, appearance, and motion disentanglement from monocular videos, which we call Video SPatiotemporal Splines (VideoSPatS). Unlike previous methods that map time and coordinates to deformation and canonical colors, our VideoSPatS maps input coordinates into Spatial and Color Spline deformation fields $D_s$ and $D_c$, which disentangle motion and appearance in videos. With spline-based parametrization, our method naturally generates temporally consistent flow and guarantees long-term temporal consistency, which is crucial for convincing video editing. Using multiple prediction branches, our VideoSPatS model also performs layer separation between the latent video and the selected occluder. By disentangling occlusions, appearance, and motion, our method enables better spatiotemporal modeling and editing of diverse videos, including in-the-wild talking head videos with challenging occlusions, shadows, and specularities while maintaining an appropriate canonical space for editing. We also present general video modeling results on the DAVIS and CoDeF datasets, as well as our own talking head video dataset collected from open-source web videos. Extensive ablations show the combination of $D_s$ and $D_c$ under neural splines can overcome motion and appearance ambiguities, paving the way for more advanced video editing models.
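
The temporal-consistency argument rests on the spline parametrization itself: any quantity expressed as a spline over time is smooth by construction. Below is a generic uniform cubic B-spline blend of a few control latents over normalized time, as a sketch of that mechanism; the shapes and the choice of uniform cubic B-splines are assumptions, not the paper's exact formulation of $D_s$ and $D_c$.

```python
# Hypothetical sketch: temporally smooth latent codes via a uniform cubic
# B-spline over normalized time, the kind of parametrization that makes
# spline-based deformation fields temporally consistent by construction.
import numpy as np

def cubic_bspline_eval(ctrl, t):
    """ctrl: (K, D) control latents, t in [0, 1]; returns the (D,) blended latent."""
    K = ctrl.shape[0]
    x = t * (K - 3)                          # position along the K-3 spline spans
    i = min(int(np.floor(x)), K - 4)         # index of the leftmost active control point
    u = x - i
    b = np.array([(1 - u) ** 3,
                  3 * u ** 3 - 6 * u ** 2 + 4,
                  -3 * u ** 3 + 3 * u ** 2 + 3 * u + 1,
                  u ** 3]) / 6.0             # standard uniform cubic B-spline basis
    return b @ ctrl[i:i + 4]

ctrl = np.random.randn(8, 16)                # 8 control latents of dimension 16
codes = np.stack([cubic_bspline_eval(ctrl, t) for t in np.linspace(0, 1, 60)])
print(codes.shape)                           # (60, 16): one smooth latent per frame
```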

Q-Agent: Quality-Driven Chain-of-Thought Image Restoration Agent through Robust Multimodal Large Language Model arxiv.org/abs/2504.07148 .IV

Image restoration (IR) often faces various complex and unknown degradations in real-world scenarios, such as noise, blurring, compression artifacts, and low resolution. Training specific models for specific degradations may lead to poor generalization. To handle multiple degradations simultaneously, All-in-One models may sacrifice performance on certain types of degradation and still struggle with degradations unseen during training. Existing IR agents rely on multimodal large language models (MLLMs) and a time-consuming rolling-back selection strategy that neglects image quality. As a result, they may misinterpret degradations and incur high time and computational costs by conducting unnecessary IR tasks in a redundant order. To address these issues, we propose a Quality-Driven agent (Q-Agent) via Chain-of-Thought (CoT) restoration. Specifically, our Q-Agent consists of robust degradation perception and quality-driven greedy restoration. The former module first fine-tunes the MLLM and uses CoT to decompose multi-degradation perception into single-degradation perception tasks, enhancing the perception ability of the MLLM. The latter employs objective image quality assessment (IQA) metrics to determine the optimal restoration sequence and executes the corresponding restoration algorithms. Experimental results demonstrate that our Q-Agent achieves superior IR performance compared to existing All-in-One models.
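
The "quality-driven greedy restoration" step can be sketched as: at each round, try every remaining restoration tool, score the candidates with a no-reference IQA metric, keep the best, and stop when nothing improves the score. The `restorers` toolbox, the `iqa` scorer, and the stopping rule below are hypothetical stand-ins, not the paper's components.

```python
# Hypothetical sketch of quality-driven greedy restoration ordering: at each
# step, apply every remaining restorer, keep the one that maximizes an IQA
# score, and stop when no remaining tool improves it.
def greedy_restore(image, restorers, iqa):
    """restorers: dict name -> callable(image) -> image; iqa: callable(image) -> float."""
    remaining = dict(restorers)
    best_score = iqa(image)
    schedule = []
    while remaining:
        name, candidate, score = None, None, best_score
        for n, fn in remaining.items():
            out = fn(image)
            s = iqa(out)
            if s > score:
                name, candidate, score = n, out, s
        if name is None:                 # no remaining tool improves quality -> stop
            break
        image, best_score = candidate, score
        schedule.append(name)            # records the discovered restoration order
        remaining.pop(name)
    return image, schedule

# usage sketch: greedy_restore(img, {"denoise": denoise_fn, "deblur": deblur_fn}, nr_iqa_score)
```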

Can Carbon-Aware Electric Load Shifting Reduce Emissions? An Equilibrium-Based Analysis arxiv.org/abs/2504.07248 .SY .SY

An increasing number of electric loads, such as hydrogen producers or data centers, can be characterized as carbon-sensitive, meaning that they are willing to adapt the timing and/or location of their electricity usage in order to minimize carbon footprints. However, the emission reduction efforts of these carbon-sensitive loads rely on carbon intensity information such as average carbon emissions, and it is unclear whether load shifting based on these signals effectively reduces carbon emissions. To address this open question, we investigate the impact of carbon-sensitive consumers using equilibrium analysis. Specifically, we expand the commonly used equilibrium model for electricity market clearing to include carbon-sensitive consumers that adapt their consumption based on an average carbon intensity signal. This analysis represents an idealized situation for carbon-sensitive loads, where their carbon preferences are reflected directly in the market clearing, and contrasts with current practice, where carbon intensity signals only become known to consumers a posteriori (i.e., after the market has already been cleared). We include both illustrative examples and larger numerical simulations, including benchmarking with other methods, to illuminate the contributions and limitations of carbon-sensitive loads in power system emission reductions.
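
As a toy illustration of the signal in question (invented numbers, not the paper's equilibrium model): dispatch a clean and a dirty generator by merit order, compute the hourly average carbon intensity a consumer would observe, and shift flexible demand toward the lowest-intensity hours; whether that actually reduces emissions depends on which unit is marginal, which the average signal does not reveal.

```python
# Toy illustration (invented numbers): merit-order dispatch of a clean and a
# dirty generator, the hourly *average* carbon intensity a load would observe,
# and a naive shift of flexible demand to the lowest-average-intensity hour.
import numpy as np

clean_cap, dirty_cap = 60.0, 100.0          # MW capacity of each unit
clean_co2, dirty_co2 = 0.0, 0.8             # tCO2 per MWh
demand = np.array([50, 70, 90, 110, 80, 60], dtype=float)   # MW per hour

def dispatch(d):
    clean = np.minimum(d, clean_cap)        # clean unit is dispatched first
    dirty = d - clean                       # dirty unit covers the remainder
    emis = clean * clean_co2 + dirty * dirty_co2
    avg_intensity = np.divide(emis, d, out=np.zeros_like(d), where=d > 0)
    return emis, avg_intensity

emis0, intensity = dispatch(demand)

# Shift 20 MWh of flexible load from the highest- to the lowest-intensity hour.
shifted = demand.copy()
shifted[np.argmax(intensity)] -= 20
shifted[np.argmin(intensity)] += 20
emis1, _ = dispatch(shifted)

print("emissions before:", emis0.sum(), "after shift:", emis1.sum())
```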

MoEDiff-SR: Mixture of Experts-Guided Diffusion Model for Region-Adaptive MRI Super-Resolution arxiv.org/abs/2504.07308 .IV .CV

Magnetic Resonance Imaging (MRI) at lower field strengths (e.g., 3T) suffers from limited spatial resolution, making it challenging to capture fine anatomical details essential for clinical diagnosis and neuroimaging research. To overcome this limitation, we propose MoEDiff-SR, a Mixture of Experts (MoE)-guided diffusion model for region-adaptive MRI Super-Resolution (SR). Unlike conventional diffusion-based SR models that apply a uniform denoising process across the entire image, MoEDiff-SR dynamically selects specialized denoising experts at a fine-grained token level, ensuring region-specific adaptation and enhanced SR performance. Specifically, our approach first employs a Transformer-based feature extractor to compute multi-scale patch embeddings, capturing both global structural information and local texture details. The extracted feature embeddings are then fed into an MoE gating network, which assigns adaptive weights to multiple diffusion-based denoisers, each specializing in different brain MRI characteristics, such as centrum semiovale, sulcal and gyral cortex, and grey-white matter junction. The final output is produced by aggregating the denoised results from these specialized experts according to dynamically assigned gating probabilities. Experimental results demonstrate that MoEDiff-SR outperforms existing state-of-the-art methods in terms of quantitative image quality metrics, perceptual fidelity, and computational efficiency. Difference maps from each expert further highlight their distinct specializations, confirming the effective region-specific denoising capability and the interpretability of expert contributions. Additionally, clinical evaluation validates its superior diagnostic capability in identifying subtle pathological features, emphasizing its practical relevance in clinical neuroimaging. Our code is available at https://github.com/ZWang78/MoEDiff-SR.
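
A minimal sketch of the gating-and-aggregation step described above: a small gating network turns a token embedding into softmax weights over experts, and the output is the weighted sum of the expert predictions. The experts here are plain conv layers standing in for the diffusion-based denoisers, and all sizes are placeholders.

```python
# Hypothetical sketch of MoE gating over expert denoisers: token features ->
# softmax gate -> weighted sum of expert outputs. The experts below are simple
# conv layers standing in for the diffusion-based denoisers in the abstract.
import torch
import torch.nn as nn

class MoEGate(nn.Module):
    def __init__(self, feat_dim: int, n_experts: int):
        super().__init__()
        self.gate = nn.Linear(feat_dim, n_experts)
        self.experts = nn.ModuleList(
            nn.Conv2d(1, 1, 3, padding=1) for _ in range(n_experts)
        )

    def forward(self, lr_image, token_feat):
        # token_feat: (B, feat_dim) embedding of the patch/token to be restored
        w = torch.softmax(self.gate(token_feat), dim=-1)                 # (B, E) gating probs
        outs = torch.stack([e(lr_image) for e in self.experts], dim=1)   # (B, E, 1, H, W)
        sr = (w[:, :, None, None, None] * outs).sum(dim=1)               # gated aggregation
        return sr, w

moe = MoEGate(feat_dim=32, n_experts=3)
sr, weights = moe(torch.randn(2, 1, 64, 64), torch.randn(2, 32))
```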

Going beyond explainability in multi-modal stroke outcome prediction models arxiv.org/abs/2504.06299 .IV .AP .CV .LG

Aim: This study aims to enhance the interpretability and explainability of multi-modal prediction models integrating imaging and tabular patient data. Methods: We adapt the xAI methods Grad-CAM and Occlusion to multi-modal, partly interpretable deep transformation models (dTMs). dTMs combine statistical and deep learning approaches to simultaneously achieve state-of-the-art prediction performance and interpretable parameter estimates, such as odds ratios for tabular features. Based on brain imaging and tabular data from 407 stroke patients, we trained dTMs to predict functional outcome three months after stroke. We evaluated the models using different discriminatory metrics. The adapted xAI methods were used to generate explanation maps for the identification of relevant image features and for error analysis. Results: The dTMs achieve state-of-the-art prediction performance, with area under the curve (AUC) values close to 0.8. The most important tabular predictors of functional outcome are functional independence before stroke and NIHSS on admission, a neurological score indicating stroke severity. Explanation maps calculated from brain imaging dTMs for functional outcome highlighted critical brain regions such as the frontal lobe, which is known to be linked to age, which in turn increases the risk of unfavorable outcomes. Similarity plots of the explanation maps revealed distinct patterns that give insight into stroke pathophysiology, support the development of novel predictors of stroke outcome, and enable the identification of false predictions. Conclusion: By adapting methods for explanation maps to dTMs, we enhanced the explainability of multi-modal and partly interpretable prediction models. The resulting explanation maps facilitate error analysis and support hypothesis generation regarding the significance of specific image regions in outcome prediction.
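
Occlusion, one of the two adapted xAI methods, is simple enough to sketch generically: slide a masking patch over the image, re-run the model, and record how the predicted probability drops; large drops mark regions the prediction relies on. The `predict` callable below is a placeholder for the dTM forward pass, not the study's code.

```python
# Generic occlusion-sensitivity sketch (model-agnostic): mask one patch at a
# time and record the change in the predicted probability. `predict` is a
# stand-in for a multi-modal model's forward pass on (image, tabular) input.
import numpy as np

def occlusion_map(image, tabular, predict, patch=16, fill=0.0):
    h, w = image.shape
    base = predict(image, tabular)                  # probability on the intact image
    heat = np.zeros((h // patch, w // patch))
    for i in range(0, h - patch + 1, patch):
        for j in range(0, w - patch + 1, patch):
            occluded = image.copy()
            occluded[i:i + patch, j:j + patch] = fill
            heat[i // patch, j // patch] = base - predict(occluded, tabular)
    return heat        # positive entries: occluding this patch lowered the score

# usage sketch: heat = occlusion_map(img2d, tab_vec, lambda im, tb: model(im, tb))
```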

Subjective Visual Quality Assessment for High-Fidelity Learning-Based Image Compression arxiv.org/abs/2504.06301 .IV .CV

Learning-based image compression methods have recently emerged as promising alternatives to traditional codecs, offering improved rate-distortion performance and perceptual quality. JPEG AI represents the latest standardized framework in this domain, leveraging deep neural networks for high-fidelity image reconstruction. In this study, we present a comprehensive subjective visual quality assessment of JPEG AI-compressed images using the JPEG AIC-3 methodology, which quantifies perceptual differences in terms of Just Noticeable Difference (JND) units. We generated a dataset of 50 compressed images with fine-grained distortion levels from five diverse sources. A large-scale crowdsourced experiment collected 96,200 triplet responses from 459 participants. We reconstructed JND-based quality scales using a unified model based on boosted and plain triplet comparisons. Additionally, we evaluated the alignment of multiple objective image quality metrics with human perception in the high-fidelity range. The CVVDP metric achieved the overall highest performance; however, most metrics, including CVVDP, were overly optimistic in predicting the quality of JPEG AI-compressed images. These findings emphasize the necessity of rigorous subjective evaluations in the development and benchmarking of modern image codecs, particularly in the high-fidelity range. Another technical contribution is the introduction of the well-known Meng-Rosenthal-Rubin statistical test to the field of Quality of Experience research. This test can reliably assess the significance of differences in the performance of quality metrics, measured as the correlation between metric scores and the ground truth. The complete dataset, including all subjective scores, is publicly available at https://github.com/jpeg-aic/dataset-JPEG-AI-SDR25.
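
For readers unfamiliar with it, the Meng-Rosenthal-Rubin test compares two dependent correlations that share one variable, here two metrics correlated with the same subjective scores. The sketch below follows the commonly cited 1992 formulation; it is an assumption that the paper uses exactly this variant, so check the original reference before relying on it.

```python
# Sketch of the Meng-Rosenthal-Rubin (1992) test for two dependent correlations
# r1 = corr(metric1, ground_truth) and r2 = corr(metric2, ground_truth), where
# r12 = corr(metric1, metric2). Follows the commonly cited formulation; verify
# against the original reference before relying on it.
import numpy as np
from scipy import stats

def mrr_test(r1, r2, r12, n):
    z1, z2 = np.arctanh(r1), np.arctanh(r2)        # Fisher z-transforms
    r2bar = (r1 ** 2 + r2 ** 2) / 2.0
    f = min((1.0 - r12) / (2.0 * (1.0 - r2bar)), 1.0)
    h = (1.0 - f * r2bar) / (1.0 - r2bar)
    z = (z1 - z2) * np.sqrt((n - 3) / (2.0 * (1.0 - r12) * h))
    return z, 2 * stats.norm.sf(abs(z))            # test statistic, two-sided p-value

# invented example values, not results from the paper
z, p = mrr_test(r1=0.91, r2=0.86, r12=0.80, n=459)
print(z, p)
```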

Restoring Feasibility in Power Grid Optimization: A Counterfactual ML Approach arxiv.org/abs/2504.06369 .SY .SY

Electric power grids are essential components of modern life, delivering reliable power to end-users while adhering to a multitude of engineering constraints and requirements. In grid operations, the Optimal Power Flow (OPF) problem plays a key role in determining cost-effective generator dispatch that satisfies load demands and operational limits. However, due to stressed operating conditions, volatile demand profiles, and increased generation from intermittent energy sources, this optimization problem may become infeasible, posing risks such as voltage instability and line overloads. This study proposes a framework that combines machine learning with counterfactual explanations to automatically diagnose and restore feasibility in the OPF problem. Our method provides transparent and actionable insights by methodically identifying infeasible conditions and suggesting minimal demand response actions. We evaluate the proposed approach on the IEEE 30-bus and 300-bus systems, demonstrating its capability to recover feasibility with high success rates while generating diverse corrective options appropriate for real-time decision-making. These preliminary findings illustrate the potential of combining classical optimization with explainable AI techniques to enhance grid reliability and resilience.
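
The "minimal demand response to restore feasibility" idea can be previewed with a tiny DC-style linear program on made-up numbers: minimize total load curtailment subject to generator and line limits. This is a generic sketch, not the paper's counterfactual-ML pipeline, which learns to identify and explain such actions.

```python
# Generic sketch (not the paper's method): find the minimal load curtailment
# that restores feasibility of a toy 2-bus dispatch with a generator limit and
# a tie-line limit, via a linear program. All numbers are invented.
import numpy as np
from scipy.optimize import linprog

load = np.array([120.0, 80.0])      # MW demand at bus 1 and bus 2
gen_max = np.array([150.0, 30.0])   # MW generator limits at each bus
line_max = 60.0                     # MW limit on the line from bus 1 to bus 2

# Variables x = [g1, g2, c1, c2]: generation and load curtailment per bus.
c = np.array([0.0, 0.0, 1.0, 1.0])                     # minimize total curtailment
A_eq = np.array([[1.0, 1.0, 1.0, 1.0]])                # g1 + g2 + c1 + c2 = total load
b_eq = np.array([load.sum()])
# Flow on line 1->2 is load2 - c2 - g2; enforce |flow| <= line_max.
A_ub = np.array([[0.0, -1.0, 0.0, -1.0],               #  load2 - g2 - c2 <= line_max
                 [0.0,  1.0, 0.0,  1.0]])              #  g2 + c2 - load2 <= line_max
b_ub = np.array([line_max - load[1], line_max + load[1]])

bounds = [(0, gen_max[0]), (0, gen_max[1]), (0, load[0]), (0, load[1])]
res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq, bounds=bounds)
print("minimal curtailment (MW):", res.x[2] + res.x[3])   # 20 MW in this toy case
```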

A Metropolis-Adjusted Langevin Algorithm for Sampling Jeffreys Prior arxiv.org/abs/2504.06372 .SY .ME .ML .SY

Inference and estimation are fundamental aspects of statistics, system identification and machine learning. For most inference problems, prior knowledge is available on the system to be modeled, and Bayesian analysis is a natural framework to impose such prior information in the form of a prior distribution. However, in many situations, coming up with a fully specified prior distribution is not easy, as prior knowledge might be too vague, so practitioners prefer to use a prior distribution that is as `ignorant' or `uninformative' as possible, in the sense of not imposing subjective beliefs, while still supporting reliable statistical analysis. Jeffreys prior is an appealing uninformative prior because it offers two important benefits: (i) it is invariant under any re-parameterization of the model, and (ii) it encodes the intrinsic geometric structure of the parameter space through the Fisher information matrix, which in turn enhances the diversity of parameter samples. Despite these benefits, drawing samples from Jeffreys prior is a challenging task. In this paper, we propose a general sampling scheme using the Metropolis-Adjusted Langevin Algorithm that enables sampling of parameter values from Jeffreys prior, and we provide numerical illustrations of our approach through several examples.
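
For the simplest concrete case, the Jeffreys prior of a Bernoulli parameter is proportional to θ^(-1/2)(1-θ)^(-1/2), i.e. Beta(1/2, 1/2), and a basic MALA chain targeting it looks like the sketch below. The step size and the one-dimensional example are mine; the paper's scheme addresses general models through the Fisher information matrix.

```python
# MALA sketch targeting the Bernoulli Jeffreys prior pi(t) ~ t^(-1/2)(1-t)^(-1/2)
# on (0, 1). Proposal: t' = t + (eps^2/2) * grad log pi(t) + eps * N(0, 1),
# accepted with the usual Metropolis-Hastings correction.
import numpy as np

rng = np.random.default_rng(0)

def log_pi(t):
    return -0.5 * (np.log(t) + np.log(1.0 - t)) if 0.0 < t < 1.0 else -np.inf

def grad_log_pi(t):
    return -0.5 / t + 0.5 / (1.0 - t)

def mala(n_steps=20000, eps=0.05, t=0.5):
    samples = []
    for _ in range(n_steps):
        mean_fwd = t + 0.5 * eps ** 2 * grad_log_pi(t)
        prop = mean_fwd + eps * rng.standard_normal()
        if 0.0 < prop < 1.0:                       # outside (0,1) the target is zero
            mean_bwd = prop + 0.5 * eps ** 2 * grad_log_pi(prop)
            log_alpha = (log_pi(prop) - log_pi(t)
                         - (t - mean_bwd) ** 2 / (2 * eps ** 2)
                         + (prop - mean_fwd) ** 2 / (2 * eps ** 2))
            if np.log(rng.uniform()) < log_alpha:
                t = prop
        samples.append(t)
    return np.array(samples)

s = mala()
print(s.mean())   # should be near 0.5, the mean of Beta(1/2, 1/2)
```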

Review, Definition and Challenges of Electrical Energy Hubs arxiv.org/abs/2504.06373 .SY .SY

To transition towards a carbon-neutral power system, considerable amounts of renewable energy generation capacity are being installed in the North Sea area. Consequently, projects aggregating many gigawatts of power generation capacity and transmitting renewable energy to the main load centers are being developed. Given the electrical challenges arising from concentrating bulk power capacity in a compact geographical area with several connections to the main grid, and the lack of a robust definition identifying the type of system under study, this paper proposes a general technical definition of such projects, introducing the term Electrical Energy Hub (EEH). The concept, purpose, and functionalities of EEHs are introduced in the text, emphasizing the importance of a clear technical definition for future planning procedures, grid codes, regulations, and support schemes for EEHs and multiterminal HVDC (MTDC) grids in general. Furthermore, the unique electrical challenges associated with integrating EEHs into the power system are discussed. Three research areas of concern are identified, namely control, planning, and protection. Through this analysis, insights are provided into the effective implementation of multi-GW scale EEH projects and their integration into the power grid through multiple interconnections. Finally, a list of ongoing and planned grid development projects is evaluated to assess whether they fall within the EEH category.

A Scalable Automatic Model Generation Tool for Cyber-Physical Network Topologies and Data Flows for Large-Scale Synthetic Power Grid Models arxiv.org/abs/2504.06396 .SY .SY

Power grids and their cyber infrastructure are classified as Critical Energy Infrastructure/Information (CEII) and are not publicly accessible. While realistic synthetic test cases for power systems have been developed in recent years, they often lack corresponding cyber network models. This work extends synthetic grid models by incorporating cyber-physical representations. To address the growing need for realistic and scalable models that integrate both cyber and physical layers in electric power systems, this paper presents the Scalable Automatic Model Generation Tool (SAM-GT). This tool enables the creation of large-scale cyber-physical topologies for power system models. The resulting cyber-physical network models include power system switches, routers, and firewalls while accounting for data flows and industrial communication protocols. Case studies demonstrate the tool's application to synthetic grid models of 500, 2,000, and 10,000 buses, considering three distinct network topologies. Results from these case studies include network metrics on critical nodes, hops, and generation times, showcasing the effectiveness, adaptability, and scalability of SAM-GT.
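
A rough illustration of the kind of object such a tool produces (node roles, the star layout, and the switch-firewall-router chain below are assumptions, not SAM-GT's actual output format): a networkx graph linking each substation to a control center, from which hop counts and similar network metrics can be read.

```python
# Hypothetical illustration (not SAM-GT's output format): a star cyber topology
# in which every substation reaches the control center through a local
# switch -> firewall -> router chain, plus basic hop-count metrics.
import networkx as nx

def build_star_topology(n_substations):
    g = nx.Graph()
    g.add_node("control_center", role="scada")
    for i in range(n_substations):
        sw, fw, rt = f"switch_{i}", f"firewall_{i}", f"router_{i}"
        g.add_node(f"substation_{i}", role="relay_network")
        g.add_edge(f"substation_{i}", sw)
        g.add_edge(sw, fw)
        g.add_edge(fw, rt)
        g.add_edge(rt, "control_center")
    return g

g = build_star_topology(5)
hops = [nx.shortest_path_length(g, f"substation_{i}", "control_center") for i in range(5)]
print(g.number_of_nodes(), g.number_of_edges(), max(hops))   # 21 nodes, 20 edges, 4 hops
```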

Panoptic: True Joint mmWave Communication and Sensing with Compressive Sidelobe Forming arxiv.org/abs/2504.06400 .SP

The integration of communication and sensing functions within mmWave systems has gained attention due to the potential for enhanced passive sensing and improved communication reliability. State-of-the-art techniques separate these two functions in frequency, hardware, or time, i.e., sending known preambles for channel sensing or unknown symbols for communications. In this paper, we introduce Panoptic, a novel system architecture for integrated communication and sensing that shares the same hardware, frequency, and time resources. Panoptic jointly detects unknown symbols and channel components from data-modulated signals. The core idea is a new beam manipulation technique, which we call compressive sidelobe forming, that maintains a directional mainlobe toward the intended communication nodes while acquiring unique spatial information through pseudorandom sidelobe perturbations. We implemented Panoptic on 60 GHz mmWave radios and conducted extensive over-the-air experiments. Our results show that Panoptic achieves a reflector angular localization error of less than 2° while at the same time supporting mmWave data communication with a negligible BER penalty compared with conventional communication-only mmWave systems.
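
The beam-manipulation idea can be previewed on a toy uniform linear array: take conventional steering weights, add small pseudorandom phase perturbations, and observe that the mainlobe gain barely changes while the sidelobes vary from probe to probe, which is the spatial randomness a compressive recovery stage can exploit. Array size and perturbation scale below are arbitrary, and this is not Panoptic's actual waveform design.

```python
# Toy sketch of the sidelobe-perturbation idea on an 8-element uniform linear
# array: conventional steering weights plus small pseudorandom phase dither
# keep the mainlobe nearly intact while randomizing the sidelobes.
import numpy as np

rng = np.random.default_rng(1)
n, d = 8, 0.5                                   # elements, spacing in wavelengths
steer_deg = 20.0                                # intended communication direction
angles = np.deg2rad(np.linspace(-90, 90, 721))
k = np.arange(n)

def array_gain(weights, theta):
    sv = np.exp(1j * 2 * np.pi * d * k[:, None] * np.sin(theta)[None, :])
    return np.abs(weights.conj() @ sv) / n      # normalized beam pattern

w0 = np.exp(1j * 2 * np.pi * d * k * np.sin(np.deg2rad(steer_deg)))   # steering weights
perturb = np.exp(1j * rng.uniform(-0.3, 0.3, size=n))                 # +/-0.3 rad dither
w1 = w0 * perturb

g0 = array_gain(w0, angles)
g1 = array_gain(w1, angles)
i_main = np.argmax(g0)
print("mainlobe gain, nominal vs perturbed:", round(g0[i_main], 3), round(g1[i_main], 3))
```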

Retuve: Automated Multi-Modality Analysis of Hip Dysplasia with Open Source AI arxiv.org/abs/2504.06422 .IV .CV

Developmental dysplasia of the hip (DDH) poses significant diagnostic challenges, hindering timely intervention. Current screening methodologies lack standardization, and AI-driven studies suffer from reproducibility issues due to limited data and code availability. To address these limitations, we introduce Retuve, an open-source framework for multi-modality DDH analysis, encompassing both ultrasound (US) and X-ray imaging. Retuve provides a complete and reproducible workflow, offering open datasets comprising expert-annotated US and X-ray images, pre-trained models with training code and weights, and a user-friendly Python Application Programming Interface (API). The framework integrates segmentation and landmark detection models, enabling automated measurement of key diagnostic parameters such as the alpha angle and acetabular index. By adhering to open-source principles, Retuve promotes transparency, collaboration, and accessibility in DDH research. This initiative has the potential to democratize DDH screening, facilitate early diagnosis, and ultimately improve patient outcomes by enabling widespread screening and early intervention. The GitHub repository/code can be found here: https://github.com/radoss-org/retuve
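
Not Retuve's API, but a sketch of the kind of measurement the framework automates once landmarks are available: the alpha angle as the angle between a baseline and a bony-roof line, each defined by two landmark points. The landmark coordinates below are made up.

```python
# Generic geometry sketch (not Retuve's API): an alpha-angle-style measurement
# as the angle between two lines defined by landmark points, e.g. output of a
# landmark-detection model. The coordinates here are invented.
import numpy as np

def line_angle_deg(p1, p2, q1, q2):
    """Angle in degrees between line p1->p2 and line q1->q2."""
    u = np.asarray(p2, float) - np.asarray(p1, float)
    v = np.asarray(q2, float) - np.asarray(q1, float)
    cosang = abs(u @ v) / (np.linalg.norm(u) * np.linalg.norm(v))
    return float(np.degrees(np.arccos(np.clip(cosang, -1.0, 1.0))))

alpha = line_angle_deg((10, 5), (10, 80),      # baseline (vertical in this toy frame)
                       (10, 60), (45, 85))     # bony roof line
print(round(alpha, 1))                          # ~54.5 degrees in this toy example
```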
