Enhancing Brain Tumor Classification Using TrAdaBoost and Multi-Classifier Deep Learning Approaches arxiv.org/abs/2411.00875 .IV .AI .CV

Brain tumors pose a serious health threat due to their rapid growth and potential for metastasis. While medical imaging has advanced significantly, accurately identifying and characterizing these tumors remains a challenge. This study addresses this challenge by leveraging the innovative TrAdaBoost methodology to enhance the Brain Tumor Segmentation (BraTS2020) dataset, aiming to improve the efficiency and accuracy of brain tumor classification. Our approach combines state-of-the-art deep learning algorithms, including the Vision Transformer (ViT), Capsule Neural Network (CapsNet), and convolutional neural networks (CNNs) such as ResNet-152 and VGG16. By integrating these models within a multi-classifier framework, we harness the strengths of each approach to achieve more robust and reliable tumor classification. A novel decision template is employed to synergistically combine outputs from different algorithms, further enhancing classification accuracy. To augment the training process, we incorporate a secondary dataset, "Brain Tumor MRI Dataset," as a source domain, providing additional data for model training and improving generalization capabilities. Our findings demonstrate a high accuracy rate in classifying tumor versus non-tumor images, signifying the effectiveness of our approach in the medical imaging domain. This study highlights the potential of advanced machine learning techniques to contribute significantly to the early and accurate diagnosis of brain tumors, ultimately improving patient outcomes.

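As a rough illustration of the decision-template fusion mentioned in this abstract, the sketch below combines per-class probability outputs from several base classifiers (e.g., ViT, CapsNet, ResNet-152, VGG16) using Kuncheva-style decision templates. The function names, shapes, and toy data are assumptions for illustration; the paper's actual template construction and its TrAdaBoost transfer step are not reproduced here.

```python
import numpy as np

# Decision-template fusion (Kuncheva-style), assuming each base classifier
# (e.g., ViT, CapsNet, ResNet-152, VGG16) outputs a row of per-class
# probabilities. Shapes and the toy data are illustrative placeholders.

def build_decision_templates(profiles, labels, n_classes):
    """profiles: (n_samples, n_classifiers, n_classes) soft outputs on a
    validation set; labels: (n_samples,). Returns one template per class."""
    templates = np.zeros((n_classes, profiles.shape[1], n_classes))
    for c in range(n_classes):
        templates[c] = profiles[labels == c].mean(axis=0)
    return templates

def fuse(profile, templates):
    """profile: (n_classifiers, n_classes) decision profile of one test image.
    Picks the class whose template is closest in squared Euclidean distance."""
    dists = ((templates - profile) ** 2).sum(axis=(1, 2))
    return int(np.argmin(dists))

# Toy usage: 4 classifiers, 2 classes (tumor vs. non-tumor).
rng = np.random.default_rng(0)
val_profiles = rng.dirichlet(np.ones(2), size=(200, 4))   # (200, 4, 2)
val_labels = rng.integers(0, 2, size=200)
dt = build_decision_templates(val_profiles, val_labels, n_classes=2)
test_profile = rng.dirichlet(np.ones(2), size=4)          # (4, 2)
print("fused prediction:", fuse(test_profile, dt))
```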

Topology-Aware Graph Augmentation for Predicting Clinical Trajectories in Neurocognitive Disorders arxiv.org/abs/2411.00888 -bio.NC .IV .CV .LG

Brain networks/graphs derived from resting-state functional MRI (fMRI) help study the underlying pathophysiology of neurocognitive disorders by measuring neuronal activities in the brain. Some studies utilize learning-based methods for brain network analysis, but typically suffer from low model generalizability caused by scarce labeled fMRI data. As a notable self-supervised strategy, graph contrastive learning helps leverage auxiliary unlabeled data. However, existing methods generally perturb graph nodes/edges arbitrarily to generate augmented graphs, without considering the essential topology information of brain networks. To this end, we propose a topology-aware graph augmentation (TGA) framework, comprising a pretext model to train a generalizable encoder on large-scale unlabeled fMRI cohorts and a task-specific model to perform downstream tasks on a small target dataset. In the pretext model, we design two novel topology-aware graph augmentation strategies: (1) hub-preserving node dropping that prioritizes preserving brain hub regions according to node importance, and (2) weight-dependent edge removing that focuses on keeping important functional connectivities based on edge weights. Experiments on 1,688 fMRI scans suggest that TGA outperforms several state-of-the-art methods.

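A minimal NumPy sketch of the two augmentations named above, hub-preserving node dropping and weight-dependent edge removing, applied to a weighted connectivity matrix. Using node strength as the importance measure, the drop/removal ratios, and the toy 90-region network are assumptions; the paper's exact sampling scheme may differ.

```python
import numpy as np

# Hub-preserving node dropping and weight-dependent edge removing on a weighted
# functional-connectivity matrix. Node strength as the importance measure and
# the drop/removal ratios are assumptions for illustration.

def hub_preserving_node_drop(adj, drop_ratio=0.1, rng=None):
    """Drop nodes with probability inversely related to node strength, so hub
    regions (high total connectivity) are preferentially preserved."""
    rng = np.random.default_rng() if rng is None else rng
    strength = np.abs(adj).sum(axis=1)
    p_keep = strength / strength.sum()
    p_drop = (1 - p_keep) / (1 - p_keep).sum()
    drop = rng.choice(adj.shape[0], size=int(drop_ratio * adj.shape[0]),
                      replace=False, p=p_drop)
    keep = np.setdiff1d(np.arange(adj.shape[0]), drop)
    return adj[np.ix_(keep, keep)], keep

def weight_dependent_edge_remove(adj, remove_ratio=0.2):
    """Remove the weakest fraction of edges, keeping strong connectivities."""
    aug = adj.copy()
    iu, ju = np.triu_indices_from(adj, k=1)
    order = np.argsort(np.abs(adj[iu, ju]))           # weakest edges first
    sel = order[:int(remove_ratio * len(order))]
    aug[iu[sel], ju[sel]] = aug[ju[sel], iu[sel]] = 0.0
    return aug

# Toy usage on a random symmetric 90-region "brain network".
rng = np.random.default_rng(0)
A = rng.normal(size=(90, 90)); A = (A + A.T) / 2; np.fill_diagonal(A, 0)
view1, kept = hub_preserving_node_drop(A, rng=rng)
view2 = weight_dependent_edge_remove(A)
```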

Deep Learning Predicts Mammographic Breast Density in Clinical Breast Ultrasound Images arxiv.org/abs/2411.00891 .IV .AI .CV

Background: Mammographic breast density, as defined by the American College of Radiology's Breast Imaging Reporting and Data System (BI-RADS), is one of the strongest risk factors for breast cancer, but is derived from mammographic images. Breast ultrasound (BUS) is an alternative breast cancer screening modality, particularly useful for early detection in low-resource, rural contexts. The purpose of this study was to explore an artificial intelligence (AI) model to predict the BI-RADS mammographic breast density category from clinical, handheld BUS imaging. Methods: All data are sourced from the Hawaii and Pacific Islands Mammography Registry. We compared deep learning methods applied to BUS imaging with machine learning models based on image statistics alone. The use of AI-derived BUS density as a risk factor for breast cancer was then compared to clinical BI-RADS breast density while adjusting for age. The BUS data were split by individual into 70/20/10% groups for training, validation, and testing. Results: 405,120 clinical BUS images from 14,066 women were selected for inclusion in this study, resulting in 9,846 women for training (302,574 images), 2,813 for validation (11,223 images), and 1,406 for testing (4,042 images). On the held-out testing set, the strongest AI model achieves an AUROC of 0.854 in predicting BI-RADS mammographic breast density from BUS imaging and outperforms all shallow machine learning methods based on image statistics. In cancer risk prediction, age-adjusted AI BUS breast density predicted 5-year breast cancer risk with an AUROC of 0.633, compared to an AUROC of 0.637 for age-adjusted clinical breast density. Conclusions: BI-RADS mammographic breast density can be estimated from BUS imaging with high accuracy using a deep learning model. Furthermore, we demonstrate that AI-derived BUS breast density is predictive of 5-year breast cancer risk in our population.

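To make the age-adjusted comparison concrete, here is a small scikit-learn sketch that fits one logistic model on (age, AI-derived density) and one on (age, clinical BI-RADS density) and reports AUROC for a binary 5-year cancer outcome. All data below are synthetic placeholders; the registry data, the deep learning density model, and the reported AUROC values are not reproduced.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score

# Age-adjusted comparison on synthetic placeholder data: one logistic model on
# (age, AI-derived BUS density), one on (age, clinical BI-RADS density), each
# scored by AUROC for a binary 5-year cancer outcome.

rng = np.random.default_rng(0)
n = 5000
age = rng.uniform(40, 75, n)
density_clinical = rng.integers(1, 5, n)                 # BI-RADS a-d coded 1-4
density_ai = density_clinical + rng.normal(0, 0.7, n)    # noisy AI estimate
risk = 1 / (1 + np.exp(-(-7 + 0.05 * age + 0.3 * density_clinical)))
cancer = rng.binomial(1, risk)                           # synthetic outcomes

for name, feats in [("AI BUS density", np.c_[age, density_ai]),
                    ("clinical density", np.c_[age, density_clinical])]:
    model = LogisticRegression().fit(feats, cancer)
    auc = roc_auc_score(cancer, model.predict_proba(feats)[:, 1])
    print(f"age-adjusted {name}: AUROC {auc:.3f}")
```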

Blind Time-of-Flight Imaging: Sparse Deconvolution on the Continuum with Unknown Kernels arxiv.org/abs/2411.00893 .IV .SP .IT .CV .IT

In recent years, computational Time-of-Flight (ToF) imaging has emerged as an exciting and novel imaging modality that offers new and powerful interpretations of natural scenes, with applications extending to 3D, light-in-flight, and non-line-of-sight imaging. Mathematically, ToF imaging relies on algorithmic super-resolution, as the back-scattered sparse light echoes lie on a finer time resolution than what digital devices can capture. Traditional methods necessitate knowledge of the emitted light pulses or kernels and employ sparse deconvolution to recover scenes. Unlike previous approaches, this paper introduces a novel, blind ToF imaging technique that does not require kernel calibration and recovers sparse spikes on a continuum, rather than a discrete grid. By studying the shared characteristics of various ToF modalities, we capitalize on the fact that most physical pulses approximately satisfy the Strang-Fix conditions from approximation theory. This leads to a new mathematical formulation for sparse super-resolution. Our recovery approach uses an optimization method that pivots on an alternating minimization strategy. We benchmark our blind ToF method against traditional kernel calibration methods, which serve as the baseline. Extensive hardware experiments across different ToF modalities demonstrate the algorithmic advantages, flexibility, and empirical robustness of our approach. We show that our work facilitates super-resolution in scenarios where distinguishing between closely spaced objects is challenging, while maintaining performance comparable to known-kernel settings. Examples of light-in-flight imaging and light-sweep videos highlight the practical benefits of our blind super-resolution method in enhancing the understanding of natural scenes.

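The sketch below caricatures the alternating-minimization idea on a discrete grid: update the sparse spike train with the kernel fixed (a few ISTA steps), then update the kernel by least squares with the spikes fixed. The paper's method recovers spikes on the continuum using Strang-Fix kernels, so this grid-based version is only an assumption-laden illustration of the alternation, not the actual algorithm.

```python
import numpy as np

# Alternate between (a) a few ISTA steps for the sparse spikes with the kernel
# fixed and (b) a least-squares kernel update with the spikes fixed. The true
# method works on the continuum; this discrete-grid version only shows the
# alternation. Kernel scale is pinned by normalization (scale ambiguity).

def conv_mat(kernel, n):
    """Full linear convolution with `kernel` as a matrix on a length-n signal."""
    C = np.zeros((len(kernel) + n - 1, n))
    for j in range(n):
        C[j:j + len(kernel), j] = kernel
    return C

def blind_deconv(y, n_x, kernel_len, lam=0.05, outer=50, inner=20):
    rng = np.random.default_rng(0)
    kernel = rng.standard_normal(kernel_len)
    kernel /= np.linalg.norm(kernel)
    x = np.zeros(n_x)
    for _ in range(outer):
        C = conv_mat(kernel, n_x)
        step = 1.0 / np.linalg.norm(C, 2) ** 2
        for _ in range(inner):                     # (a) sparse-spike update
            g = x - step * C.T @ (C @ x - y)
            x = np.sign(g) * np.maximum(np.abs(g) - lam * step, 0.0)
        X = conv_mat(x, kernel_len)                # (b) kernel update
        kernel = np.linalg.lstsq(X, y, rcond=None)[0]
        kernel /= np.linalg.norm(kernel) + 1e-12
    return x, kernel

# Toy usage: two spikes blurred by an unknown Gaussian-like pulse plus noise.
t = np.arange(15)
true_k = np.exp(-0.5 * ((t - 7) / 2.0) ** 2)
true_x = np.zeros(60); true_x[[20, 24]] = [1.0, 0.8]
y = np.convolve(true_x, true_k) + 0.01 * np.random.default_rng(1).standard_normal(74)
x_hat, k_hat = blind_deconv(y, n_x=60, kernel_len=15)
```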

Intensity Field Decomposition for Tissue-Guided Neural Tomography arxiv.org/abs/2411.00900 .IV .CV

Cone-beam computed tomography (CBCT) typically requires hundreds of X-ray projections, which raises concerns about radiation exposure. While sparse-view reconstruction reduces the exposure by using fewer projections, it struggles to achieve satisfactory image quality. To address this challenge, this article introduces a novel sparse-view CBCT reconstruction method, which empowers the neural field with human tissue regularization. Our approach, termed tissue-guided neural tomography (TNT), is motivated by the distinct intensity differences between bone and soft tissue in CBCT. Intuitively, separating these components may aid the learning process of the neural field. More precisely, TNT comprises a heterogeneous quadruple network and the corresponding training strategy. The network represents the intensity field as a combination of soft and hard tissue components, along with their respective textures. We train the network with guidance from estimated tissue projections, enabling efficient learning of the desired patterns for the network heads. Extensive experiments demonstrate that the proposed method significantly improves the sparse-view CBCT reconstruction with a limited number of projections ranging from 10 to 60. Our method achieves comparable reconstruction quality with fewer projections and faster convergence compared to state-of-the-art neural rendering based methods.

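A rough PyTorch sketch of the intensity-field decomposition described above: a coordinate network whose four heads predict soft-tissue and hard-tissue components plus their textures, summed into a single intensity. The positional encoding, layer sizes, and the way the heads are combined are assumptions; the paper's heterogeneous quadruple network and its tissue-projection guidance may differ.

```python
import math
import torch
import torch.nn as nn

# A coordinate network whose four heads predict soft-tissue and hard-tissue
# (bone) components plus their textures, summed into one intensity value.
# The positional encoding, layer sizes, and combination rule are assumptions.

class TissueDecomposedField(nn.Module):
    def __init__(self, hidden=128, n_freqs=6):
        super().__init__()
        self.n_freqs = n_freqs
        in_dim = 3 * 2 * n_freqs
        self.trunk = nn.Sequential(nn.Linear(in_dim, hidden), nn.ReLU(),
                                   nn.Linear(hidden, hidden), nn.ReLU())
        # four heads: soft base, hard base, soft texture, hard texture
        self.heads = nn.ModuleList([nn.Linear(hidden, 1) for _ in range(4)])

    def encode(self, xyz):
        freqs = 2.0 ** torch.arange(self.n_freqs, device=xyz.device) * math.pi
        ang = xyz[..., None] * freqs                       # (N, 3, n_freqs)
        return torch.cat([ang.sin(), ang.cos()], dim=-1).flatten(-2)

    def forward(self, xyz):
        h = self.trunk(self.encode(xyz))
        soft, hard, soft_tex, hard_tex = (torch.relu(head(h)) for head in self.heads)
        intensity = soft + soft_tex + hard + hard_tex      # total attenuation
        return intensity, {"soft": soft + soft_tex, "hard": hard + hard_tex}

# Toy query at random 3D points inside the unit-cube volume.
field = TissueDecomposedField()
mu, parts = field(torch.rand(1024, 3))
print(mu.shape, parts["soft"].shape)
```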

Zero-Shot Self-Consistency Learning for Seismic Irregular Spatial Sampling Reconstruction arxiv.org/abs/2411.00911 .geo-ph .IV .CV .LG

Seismic exploration is currently the most important method for understanding subsurface structures. However, due to surface conditions, seismic receivers may not be uniformly distributed along the measurement line, making the entire exploration work difficult to carry out. Previous deep learning methods for reconstructing seismic data often relied on additional datasets for training. While some existing methods do not require extra data, they lack constraints on the reconstructed data, leading to unstable reconstruction performance. In this paper, we propose a zero-shot self-consistency learning strategy and employ an extremely lightweight network for seismic data reconstruction. Our method does not require additional datasets and utilizes the correlations among different parts of the data to design a self-consistency learning loss function, driving a network with only 90,609 learnable parameters. We applied this method to the USGS National Petroleum Reserve-Alaska public dataset, and the results indicate that our approach achieves good reconstruction quality. Additionally, our method also demonstrates a certain degree of noise suppression, which is highly beneficial for large and complex seismic exploration tasks.

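In the spirit of the zero-shot self-consistency idea above, the following PyTorch sketch hides a random subset of the known traces at each step and trains a small network to recover them from the remaining ones, using only the irregularly sampled gather itself. The tiny CNN, masking ratio, and loss are assumptions and do not reproduce the paper's 90,609-parameter architecture or its exact consistency loss.

```python
import torch
import torch.nn as nn

# At each step, hide a random subset of the *known* traces and train the
# network to recover them from the rest, using only the gather itself.
# The tiny CNN, masking ratio, and loss are placeholder assumptions.

class TinyReconNet(nn.Module):
    def __init__(self, ch=16):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(2, ch, 3, padding=1), nn.ReLU(),
            nn.Conv2d(ch, ch, 3, padding=1), nn.ReLU(),
            nn.Conv2d(ch, 1, 3, padding=1))

    def forward(self, data, mask):
        # input channels: masked data + the mask itself
        return self.net(torch.cat([data * mask, mask], dim=1))

def self_consistency_step(net, opt, data, known_mask, hide_frac=0.3):
    """data: (1, 1, time, traces); known_mask marks receivers that exist."""
    hide = (torch.rand_like(known_mask) < hide_frac) & (known_mask > 0)
    pred = net(data, known_mask * (~hide).float())
    loss = ((pred - data) ** 2 * hide.float()).sum() / hide.float().sum().clamp(min=1)
    opt.zero_grad(); loss.backward(); opt.step()
    return loss.item()

# Toy usage: 256 time samples, 64 receiver positions, ~40% of traces missing.
data = torch.randn(1, 1, 256, 64)
known = (torch.rand(1, 1, 1, 64) > 0.4).float().expand_as(data).contiguous()
net = TinyReconNet()
opt = torch.optim.Adam(net.parameters(), lr=1e-3)
for _ in range(5):
    self_consistency_step(net, opt, data, known)
```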

Device-Directed Speech Detection for Follow-up Conversations Using Large Language Models arxiv.org/abs/2411.00023 .AS .AI .CL .SD

Enhancing Brain Source Reconstruction through Physics-Informed 3D Neural Networks arxiv.org/abs/2411.00143 .IV .LG

CRB Optimization using a Parametric Scattering Model for Extended Targets in ISAC Systems arxiv.org/abs/2411.00145 .SP

Optimizing Energy Management and Sizing of Photovoltaic Batteries for a Household in Granada, Spain: A Novel Approach Considering Time Resolution arxiv.org/abs/2411.00159 .SY .SY

A Novel Acoustic Wearable for Assessment of Tendon Health and Loading Condition arxiv.org/abs/2411.00184 -bio.TO .SP

Learning Optimal Interaction Weights in Multi-Agents Systems arxiv.org/abs/2411.00223 .SY .SY

A New Switched Reluctance Motor with Embedded Permanent Magnets for Transportation Electrification arxiv.org/abs/2411.00224 .SY .SY

A Novel Breast Ultrasound Image Augmentation Method Using Advanced Neural Style Transfer: An Efficient and Explainable Approach arxiv.org/abs/2411.00254 .IV .CV .LG

Enhancing Image Resolution: A Simulation Study and Sensitivity Analysis of System Parameters for Resourcesat-3S/3SA arxiv.org/abs/2410.23319 .IV

Resourcesat-3S/3SA, an upcoming Indian satellite, is designed with Aft and Fore payloads capturing stereo images at look angles of -5° and 26°, respectively. Operating at 632.6 km altitude, it features a panchromatic (PAN) band offering a Ground Sampling Distance (GSD) of 1.25 meters and a 60 km swath. To balance swath width and resolution, an Instantaneous Geometric Field of View (IGFOV) of 2.5 meters is maintained while ensuring a 1.25-meter GSD both along and across track. Along-track sampling is achieved through precise timing, while across-track accuracy is ensured by using two staggered pixel arrays. Signal-to-Noise Ratio (SNR) is enhanced through Time Delay and Integration (TDI), employing two five-stage subarrays spaced 80 μm apart along the track, with a 4 μm (0.5 pixel) stagger in the across-track direction to achieve 1.25-meter resolution. To further boost resolution, the satellite employs super-resolution (SR), combining multiple low-resolution captures using sub-pixel shifts to produce high-resolution images. This technique, effective when images contain aliased high-frequency details, reconstructs full-resolution imagery using phase information from multiple observations, and has been successfully applied in remote sensing missions like SPOT-5, SkySat, and DubaiSat-1. A Monte Carlo simulation explores the factors influencing the resolution in Resourcesat-3S/3SA, with sensitivity analysis highlighting key impacts. The simulation methodology is broadly applicable to other remote sensing missions, optimizing SR for enhanced image clarity and resolution in future satellite systems.

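As a toy illustration of the staggered-array, sub-pixel-shift idea behind the super-resolution scheme described above, the NumPy sketch below interleaves two low-resolution samplings of a synthetic scene, offset by half a coarse pixel, onto a grid with twice the sampling density. The scene, the 2x factor, and the simple interleaving are assumptions; a real processing chain must also handle registration error, noise, and the detector geometry that the Monte Carlo sensitivity analysis examines.

```python
import numpy as np

# Two low-resolution samplings of a synthetic fine-grid scene, offset by half
# a coarse pixel (one fine pixel), interleaved onto a 2x denser grid. The
# scene, the 2x factor, and the crude fill are illustrative assumptions.

rng = np.random.default_rng(0)
hr = rng.random((256, 256))                      # synthetic fine-grid scene

def downsample(img, phase):
    """Average 2x2 blocks starting at an integer fine-pixel phase."""
    img = np.roll(img, shift=(-phase, -phase), axis=(0, 1))
    return img.reshape(128, 2, 128, 2).mean(axis=(1, 3))

lr_a = downsample(hr, phase=0)                   # first detector array
lr_b = downsample(hr, phase=1)                   # array staggered by 0.5 pixel

# Interleave the two captures onto the finer along/across-track grid
# (quincunx sampling); remaining positions get a crude nearest-neighbour fill,
# where a real chain would interpolate or use further looks.
sr = np.zeros((256, 256))
sr[0::2, 0::2] = lr_a
sr[1::2, 1::2] = lr_b
sr[0::2, 1::2] = lr_a
sr[1::2, 0::2] = lr_b

print("SR grid:", sr.shape, "LR grid:", lr_a.shape)
```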