Quantum Cognition Machine Learning for Forecasting Chromosomal Instability arxiv.org/abs/2506.03199

Quantum Cognition Machine Learning for Forecasting Chromosomal Instability

The accurate prediction of chromosomal instability from the morphology of circulating tumor cells (CTCs) enables real-time detection of CTCs with high metastatic potential in the context of liquid biopsy diagnostics. However, it presents a significant challenge due to the high dimensionality and complexity of single-cell digital pathology data. Here, we introduce the application of Quantum Cognition Machine Learning (QCML), a quantum-inspired computational framework, to estimate morphology-predicted chromosomal instability in CTCs from patients with metastatic breast cancer. QCML leverages quantum mechanical principles to represent data as state vectors in a Hilbert space, enabling context-aware feature modeling, dimensionality reduction, and enhanced generalization without requiring curated feature selection. QCML outperforms conventional machine learning methods when tested on out of sample verification CTCs, achieving higher accuracy in identifying predicted large-scale state transitions (pLST) status from CTC-derived morphology features. These preliminary findings support the application of QCML as a novel machine learning tool with superior performance in high-dimensional, low-sample-size biomedical contexts. QCML enables the simulation of cognition-like learning for the identification of biologically meaningful prediction of chromosomal instability from CTC morphology, offering a novel tool for CTC classification in liquid biopsy.

arXiv.org

Predicting Postoperative Stroke in Elderly SICU Patients: An Interpretable Machine Learning Model Using MIMIC Data arxiv.org/abs/2506.03209

Predicting Postoperative Stroke in Elderly SICU Patients: An Interpretable Machine Learning Model Using MIMIC Data

Postoperative stroke remains a critical complication in elderly surgical intensive care unit (SICU) patients, contributing to prolonged hospitalization, elevated healthcare costs, and increased mortality. Accurate early risk stratification is essential to enable timely intervention and improve clinical outcomes. We constructed a combined cohort of 19,085 elderly SICU admissions from the MIMIC-III and MIMIC-IV databases and developed an interpretable machine learning (ML) framework to predict in-hospital stroke using clinical data from the first 24 hours of Intensive Care Unit (ICU) stay. The preprocessing pipeline included removal of high-missingness features, iterative Singular Value Decomposition (SVD) imputation, z-score normalization, one-hot encoding, and class imbalance correction via the Adaptive Synthetic Sampling (ADASYN) algorithm. A two-stage feature selection process-combining Recursive Feature Elimination with Cross-Validation (RFECV) and SHapley Additive exPlanations (SHAP)-reduced the initial 80 variables to 20 clinically informative predictors. Among eight ML models evaluated, CatBoost achieved the best performance with an AUROC of 0.8868 (95% CI: 0.8802--0.8937). SHAP analysis and ablation studies identified prior cerebrovascular disease, serum creatinine, and systolic blood pressure as the most influential risk factors. Our results highlight the potential of interpretable ML approaches to support early detection of postoperative stroke and inform decision-making in perioperative critical care.

arXiv.org

A Pre-trained Framework for Multilingual Brain Decoding Using Non-invasive Recordings arxiv.org/abs/2506.03214

A Pre-trained Framework for Multilingual Brain Decoding Using Non-invasive Recordings

Brain-computer interfaces (BCIs) with speech decoding from brain recordings have broad application potential in fields such as clinical rehabilitation and cognitive neuroscience. However, current decoding methods remain limited to single-language, single-subject, and single neuroimaging modality settings, restricting their clinical applicability and generalizability. Here we propose a joint multilingual, multi-subject and multimodal decoding framework. It maps diverse brain recordings into a unified semantic space defined by a pre-trained multilingual model (PMM), enabling decoding across multiple languages, multiple subjects and multiple neuroimaging modalities. The proposed framework is validated using non-invasive brain recordings from 159 participants across four languages. Experimental results show that it exhibits strong generalization across multilingual, multi-subject, and multimodal settings. More importantly, the proposed framework can promote linguistic fairness, which is vital for underrepresented languages in BCI applications. The unified semantic space enables cross-lingual mapping enhancement, allowing the framework to boost the decoding performance of underrepresented languages, thereby promoting linguistic fairness. Overall, the proposed framework establishes a new potential paradigm for brain decoding, opening new paths for broader applications of BCI.

arXiv.org

UniSite: The First Cross-Structure Dataset and Learning Framework for End-to-End Ligand Binding Site Detection arxiv.org/abs/2506.03237

UniSite: The First Cross-Structure Dataset and Learning Framework for End-to-End Ligand Binding Site Detection

The detection of ligand binding sites for proteins is a fundamental step in Structure-Based Drug Design. Despite notable advances in recent years, existing methods, datasets, and evaluation metrics are confronted with several key challenges: (1) current datasets and methods are centered on individual protein-ligand complexes and neglect that diverse binding sites may exist across multiple complexes of the same protein, introducing significant statistical bias; (2) ligand binding site detection is typically modeled as a discontinuous workflow, employing binary segmentation and subsequent clustering algorithms; (3) traditional evaluation metrics do not adequately reflect the actual performance of different binding site prediction methods. To address these issues, we first introduce UniSite-DS, the first UniProt (Unique Protein)-centric ligand binding site dataset, which contains 4.81 times more multi-site data and 2.08 times more overall data compared to the previously most widely used datasets. We then propose UniSite, the first end-to-end ligand binding site detection framework supervised by set prediction loss with bijective matching. In addition, we introduce Average Precision based on Intersection over Union (IoU) as a more accurate evaluation metric for ligand binding site prediction. Extensive experiments on UniSite-DS and several representative benchmark datasets demonstrate that IoU-based Average Precision provides a more accurate reflection of prediction quality, and that UniSite outperforms current state-of-the-art methods in ligand binding site detection. The dataset and codes will be made publicly available at https://github.com/quanlin-wu/unisite.

arXiv.org

Learning to cluster neuronal function arxiv.org/abs/2506.03293

Learning to cluster neuronal function

Deep neural networks trained to predict neural activity from visual input and behaviour have shown great potential to serve as digital twins of the visual cortex. Per-neuron embeddings derived from these models could potentially be used to map the functional landscape or identify cell types. However, state-of-the-art predictive models of mouse V1 do not generate functional embeddings that exhibit clear clustering patterns which would correspond to cell types. This raises the question whether the lack of clustered structure is due to limitations of current models or a true feature of the functional organization of mouse V1. In this work, we introduce DECEMber -- Deep Embedding Clustering via Expectation Maximization-based refinement -- an explicit inductive bias into predictive models that enhances clustering by adding an auxiliary $t$-distribution-inspired loss function that enforces structured organization among per-neuron embeddings. We jointly optimize both neuronal feature embeddings and clustering parameters, updating cluster centers and scale matrices using the EM-algorithm. We demonstrate that these modifications improve cluster consistency while preserving high predictive performance and surpassing standard clustering methods in terms of stability. Moreover, DECEMber generalizes well across species (mice, primates) and visual areas (retina, V1, V4). The code is available at https://github.com/Nisone2000/sensorium/tree/neuroips_version.

arXiv.org

Climate benefits of afforestation and reforestation with varying species mixtures and densities in the north-western boreal lands arxiv.org/abs/2506.03300

Climate benefits of afforestation and reforestation with varying species mixtures and densities in the north-western boreal lands

The boreal forest plays a crucial role as a global carbon sink. This study uses two 250-year simulations of Canada's Taiga Plains, an area targeted by the 2 Billion Trees Program to evaluate afforestation and reforestation strategies that vary by species mix, planting density, and surface albedo. Medium density stands, 600 to 1400 trees per hectare, composed of mixed species with approximately 25 to 40 percent deciduous trees sequestered 15 to 30 percent more net ecosystem carbon than conifer monocultures. These benefits stem from a combination of rapid early growth, long-term carbon retention, and enhanced resilience to disturbance. Replanting understocked stands with such mixtures increased long-term carbon storage by 18 to 30 percent relative to prevailing scenarios. When surface albedo was considered, pure evergreen or deciduous stands showed a reduction in climate benefit by 6 to 20 percent, while mixed stands maintained net cooling and achieved the highest sequestration rates, approximately 4.6 to 4.7 tons of carbon dioxide equivalent per hectare per year. Scenarios involving partial harvesting followed by replanting sustained or improved ecosystem carbon stocks, about 300 to 340 tons of carbon per hectare, and productivity, roughly 1.6 to 2.0 tons of carbon per hectare per year, without increasing ecological risk. Overall, integrating fast-growing deciduous species with long-lived conifers at moderate planting densities enhances the climate mitigation potential of boreal afforestation and reforestation efforts and offers guidance for reforestation policy in similar high latitude ecosystems.

arXiv.org

Pattern formation within phenotype-structured chemotactic populations arxiv.org/abs/2506.03389

Pattern formation within phenotype-structured chemotactic populations

Populations can become spatially organised through chemotaxis autoattraction, wherein population members release their own chemoattractant. Standard models of this process usually assume phenotypic homogeneity, but recent studies have shed illumination on the inherent heterogeneity within populations: in terms of chemotactic behaviour, trait heterogeneity can range from the sensitivity to attractant gradients to the rate at which attractants are produced. We propose a framework that accounts for this heterogeneity, extending the standard Keller-Segel model to a non-local formulation in which the population is continuously structured across some phenotype state space. Focussing on autoattraction, we allow both the chemotactic sensitivity and the rate of attractant secretion to vary across the population and suppose members can switch between different phenotype states. We extend classical Turing-type linear stability analyses to determine the impact of phenotypic structuring on pattern formation, showing that the rate of switching influences both the critical condition for self-organisation and subsequent pattern dynamics. Scenarios in which the chemotactic sensitivity and attractant secretion are positively or negatively correlated are used to highlight the significance of these results.

arXiv.org

Physiology-Informed Generative Multi-Task Network for Contrast-Free CT Perfusion arxiv.org/abs/2505.22673

PSBench: a large-scale benchmark for estimating the accuracy of protein complex structural models arxiv.org/abs/2505.22674

PSBench: a large-scale benchmark for estimating the accuracy of protein complex structural models

Predicting protein complex structures is essential for protein function analysis, protein design, and drug discovery. While AI methods like AlphaFold can predict accurate structural models for many protein complexes, reliably estimating the quality of these predicted models (estimation of model accuracy, or EMA) for model ranking and selection remains a major challenge. A key barrier to developing effective machine learning-based EMA methods is the lack of large, diverse, and well-annotated datasets for training and evaluation. To address this gap, we introduce PSBench, a benchmark suite comprising four large-scale, labeled datasets generated during the 15th and 16th community-wide Critical Assessment of Protein Structure Prediction (CASP15 and CASP16). PSBench includes over one million structural models covering a wide range of protein sequence lengths, complex stoichiometries, functional classes, and modeling difficulties. Each model is annotated with multiple complementary quality scores at the global, local, and interface levels. PSBench also provides multiple evaluation metrics and baseline EMA methods to facilitate rigorous comparisons. To demonstrate PSBench's utility, we trained and evaluated GATE, a graph transformer-based EMA method, on the CASP15 data. GATE was blindly tested in CASP16 (2024), where it ranked among the top-performing EMA methods. These results highlight PSBench as a valuable resource for advancing EMA research in protein complex modeling. PSBench is publicly available at: https://github.com/BioinfoMachineLearning/PSBench.

arXiv.org

Exploring Holography in Neuro-Vascular Dynamics arxiv.org/abs/2505.22680

Exploring Holography in Neuro-Vascular Dynamics

The holonomic brain theory, originally formulated to account for the need of non-local memory encoding in cognitive systems, could gain new theoretical traction when integrated with holographic principles from physics, most notably the AdS/CFT correspondence. Recent findings in neuroscience suggest that conformal field theories (CFTs), emerging at critical points across spatiotemporal scales in neural dynamics, are essential for brain function. Concurrently, black-brane geometries, long studied in gravitational physics, can find unexpected analogues in the interplay of active matter dynamics and the brain s neuroanatomical organization. Motivated by these parallels, we posit a generalized holographic framework and interrogate its validity through the fluid/gravity duality; a correspondence linking hydrodynamic equations to gravitational spacetime metrics. In this work, we explore the holographic principles at the Navier-Stokes regime, demonstrating that holography can model key neurophysiological mechanisms: cerebral autoregulation (the brain s hemodynamic self-stabilization) and neurovascular coupling (the dynamic neuron-bloodflow interplay). This work bridges holography, active matter physics, and neuroscience, proposing a unified framework to decode the brain s multiscale organization, its resilience to perturbations, and its computational capabilities. By grounding neurovascular physiology in gravitational duals, we open pathways to reinterpret brain function through the lens of emergent spacetime geometry.

arXiv.org

ConnectomeDiffuser: Generative AI Enables Brain Network Construction from Diffusion Tensor Imaging arxiv.org/abs/2505.22683

ConnectomeDiffuser: Generative AI Enables Brain Network Construction from Diffusion Tensor Imaging

Brain network analysis plays a crucial role in diagnosing and monitoring neurodegenerative disorders such as Alzheimer's disease (AD). Existing approaches for constructing structural brain networks from diffusion tensor imaging (DTI) often rely on specialized toolkits that suffer from inherent limitations: operator subjectivity, labor-intensive workflows, and restricted capacity to capture complex topological features and disease-specific biomarkers. To overcome these challenges and advance computational neuroimaging instrumentation, ConnectomeDiffuser is proposed as a novel diffusion-based framework for automated end-to-end brain network construction from DTI. The proposed model combines three key components: (1) a Template Network that extracts topological features from 3D DTI scans using Riemannian geometric principles, (2) a diffusion model that generates comprehensive brain networks with enhanced topological fidelity, and (3) a Graph Convolutional Network classifier that incorporates disease-specific markers to improve diagnostic accuracy. ConnectomeDiffuser demonstrates superior performance by capturing a broader range of structural connectivity and pathology-related information, enabling more sensitive analysis of individual variations in brain networks. Experimental validation on datasets representing two distinct neurodegenerative conditions demonstrates significant performance improvements over other brain network methods. This work contributes to the advancement of instrumentation in the context of neurological disorders, providing clinicians and researchers with a robust, generalizable measurement framework that facilitates more accurate diagnosis, deeper mechanistic understanding, and improved therapeutic monitoring of neurodegenerative diseases such as AD.

arXiv.org

Investigating the effectiveness of multimodal data in forecasting SARS-COV-2 case surges arxiv.org/abs/2505.22688

Investigating the effectiveness of multimodal data in forecasting SARS-COV-2 case surges

The COVID-19 pandemic response relied heavily on statistical and machine learning models to predict key outcomes such as case prevalence and fatality rates. These predictions were instrumental in enabling timely public health interventions that helped break transmission cycles. While most existing models are grounded in traditional epidemiological data, the potential of alternative datasets, such as those derived from genomic information and human behavior, remains underexplored. In the current study, we investigated the usefulness of diverse modalities of feature sets in predicting case surges. Our results highlight the relative effectiveness of biological (e.g., mutations), public health (e.g., case counts, policy interventions) and human behavioral features (e.g., mobility and social media conversations) in predicting country-level case surges. Importantly, we uncover considerable heterogeneity in predictive performance across countries and feature modalities, suggesting that surge prediction models may need to be tailored to specific national contexts and pandemic phases. Overall, our work highlights the value of integrating alternative data sources into existing disease surveillance frameworks to enhance the prediction of pandemic dynamics.

arXiv.org

Early Assessment of Artificial Lower Extremity Sensory Response Times and Proprioceptive Acuity via Sensory Cortex Electrical Stimulation arxiv.org/abs/2505.22691

Early Assessment of Artificial Lower Extremity Sensory Response Times and Proprioceptive Acuity via Sensory Cortex Electrical Stimulation

Bi-directional brain computer interfaces (BD-BCIs) may restore brain-controlled walking and artificial leg sensation after spinal cord injury. Current BD-BCIs provide only simplistic "tingling" feedback, which lacks proprioceptive information to perceive critical gait events (leg swing, double support). This information must also be perceived adequately fast to facilitate timely motor responses. Here, we investigated utilizing primary sensory cortex (S1) direct cortical electrical stimulation (DCES) to deliver leg proprioceptive information and measured response times to artificial leg sensations. Subjects with subdural electrocorticogram electrodes over S1 leg areas participated in two tasks: (1) Proprioceptive acuity: subjects identified the difference between DCES-induced percepts emulating various leg swing speeds; (2) Sensory response: measuring subjects' reaction time to DCES-induced leg sensations, with DCES-hand, visual and auditory control conditions. Three subjects were recruited. Only one completed the proprioceptive assessment, achieving 80%, 70%, 60%, and 53% accuracy in discriminating between fast/slow, fast/medium, medium/slow, and same speeds, respectively (p-value=1.9x10$^{-5}$). Response times for leg/hand percepts were 1007$\pm$413/599$\pm$171 ms, visual leg/hand responses were 528$\pm$137/384$\pm$84 ms, and auditory leg/hand responses were 393$\pm$106/352$\pm$93 ms, respectively. These results suggest proprioceptive information can be delivered artificially, but perception may be significantly delayed. Future work should address improving acuity, reducing response times, and expanding sensory modalities.

arXiv.org

Self-orthogonalizing attractor neural networks emerging from the free energy principle arxiv.org/abs/2505.22749

Self-orthogonalizing attractor neural networks emerging from the free energy principle

Attractor dynamics are a hallmark of many complex systems, including the brain. Understanding how such self-organizing dynamics emerge from first principles is crucial for advancing our understanding of neuronal computations and the design of artificial intelligence systems. Here we formalize how attractor networks emerge from the free energy principle applied to a universal partitioning of random dynamical systems. Our approach obviates the need for explicitly imposed learning and inference rules and identifies emergent, but efficient and biologically plausible inference and learning dynamics for such self-organizing systems. These result in a collective, multi-level Bayesian active inference process. Attractors on the free energy landscape encode prior beliefs; inference integrates sensory data into posterior beliefs; and learning fine-tunes couplings to minimize long-term surprise. Analytically and via simulations, we establish that the proposed networks favor approximately orthogonalized attractor representations, a consequence of simultaneously optimizing predictive accuracy and model complexity. These attractors efficiently span the input subspace, enhancing generalization and the mutual information between hidden causes and observable effects. Furthermore, while random data presentation leads to symmetric and sparse couplings, sequential data fosters asymmetric couplings and non-equilibrium steady-state dynamics, offering a natural extension to conventional Boltzmann Machines. Our findings offer a unifying theory of self-organizing attractor networks, providing novel insights for AI and neuroscience.

arXiv.org

The global communication pathways of the human brain transcend the cortical-subcortical-cerebellar division arxiv.org/abs/2505.22893

The global communication pathways of the human brain transcend the cortical-subcortical-cerebellar division

Neural communication across the cortex, subcortex, and cerebellum is orchestrated by the structural connectome, forming the indispensable anatomical framework for capabilities spanning from elementary motor actions to higher cognitive functions. Yet, despite this importance, the core organizational rules that govern this connectivity remain insufficiently understood. Here we show, for the first time, how the integrated cortical, subcortical, and cerebellar brain areas shape the structural architecture of the whole brain. We find dense structural clusters, which differ in composition and arrangement, vertically transverse the canonical cortical, subcortical, and cerebellar boundaries. These clusters are centralized by a global rich club of predominantly subcortical, alongside cortical hub regions. Congruently, we find that subcortical hubs are not only the most widely connected brain areas but are also leading overall structural integration. Nearly all larger subcortical structures encompass these hub regions, but they also exhibit brain regions with fewer but more specialized connections, pointing toward functional heterogeneity in these structures themselves. Our findings move beyond traditional cortico-centric analysis, offering an initial and global perspective for understanding overall structural connectivity.

arXiv.org

An open-source Modular Online Psychophysics Platform (MOPP) arxiv.org/abs/2505.23137

An open-source Modular Online Psychophysics Platform (MOPP)

In recent years, there is a growing need and opportunity to use online platforms for psychophysics research. Online experiments make it possible to evaluate large and diverse populations remotely and quickly, complementing laboratory-based research. However, developing and running online psychophysics experiments poses several challenges: i) a high barrier-to-entry for researchers who often need to learn complex code-based platforms, ii) an uncontrolled experimental environment, and iii) questionable credibility of the participants. Here, we introduce an open-source Modular Online Psychophysics Platform (MOPP) to address these challenges. Through the simple web-based interface of MOPP, researchers can build modular experiments, share them with others, and copy or modify tasks from each others environments. MOPP provides built-in features to calibrate for viewing distance and to measure visual acuity. It also includes email-based and IP-based authentication, and reCAPTCHA verification. We developed five example psychophysics tasks, that come preloaded in the environment, and ran a pilot experiment which was hosted on the AWS (Amazon Web Services) cloud. Pilot data collected for these tasks yielded similar results to those reported in laboratory settings. MOPP can thus help researchers collect large psychophysics datasets online, with reduced turnaround time, and in a standardized manner.

arXiv.org
Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.