Learn how to leverage generative AI to support and improve your productivity at work. The next cohort will take place online on April 28 and 30, 2026, in French.
We use cookies to analyze the browsing and usage of our website and to personalize your experience. You can disable these technologies at any time, but this may limit certain functionalities of the site. Read our Privacy Policy for more information.
Setting cookies
You can enable and disable the types of cookies you wish to accept. However certain choices you make could affect the services offered on our sites (e.g. suggestions, personalised ads, etc.).
Essential cookies
These cookies are necessary for the operation of the site and cannot be deactivated. (Still active)
Analytics cookies
Do you accept the use of cookies to measure the audience of our sites?
Multimedia Player
Do you accept the use of cookies to display and allow you to watch the video content hosted by our partners (YouTube, etc.)?
Publications
Metabolic Control and Frequency of Clinical Monitoring Among Canadian Children With Phenylalanine Hydroxylase Deficiency: A Retrospective Cohort Study
We describe a publicly available multimodal dataset of annotated Positron Emission Tomography/Computed Tomography (PET/CT) studies for head … (see more)and neck cancer research. The dataset includes 1123 FDG-PET/CT studies from patients with histologically confirmed head and neck cancer, acquired from 10 international medical centers. All examinations consisted of co-registered PET/CT scans with varying acquisition protocols, reflecting real-world clinical diversity across institutions. Primary gross tumor volumes (GTVp) and involved lymph nodes (GTVn) were manually segmented by experienced radiation oncologists and radiologists following standardized guidelines and quality control measures. We provide anonymized NifTi files of all studies, along with expert-annotated segmentation masks, radiotherapy dose distribution for a subset of patients, and comprehensive clinical metadata. This metadata includes TNM staging, HPV status, demographics (age and gender), long-term follow-up outcomes, survival times, censoring indicators, and treatment information. We demonstrate how this dataset can be used for three key clinical tasks: automated tumor segmentation, recurrence-free survival prediction, and HPV status classification, providing benchmark results using state-of-the-art deep learning models, including UNet, SegResNet, and multimodal prognostic frameworks.
Hierarchical reinforcement learning (RL) has the potential to enable effective decision-making over long timescales. Existing approaches, wh… (see more)ile promising, have yet to realize the benefits of large-scale training. In this work, we identify and solve several key challenges in scaling hierarchical RL to high-throughput environments. We propose Scalable Option Learning (SOL), a highly scalable hierarchical RL algorithm which achieves a 25x higher throughput compared to existing hierarchical methods. We train our hierarchical agents using 20 billion frames of experience on the complex game of NetHack, significantly surpassing flat agents and demonstrating positive scaling trends. We also validate our algorithm on MiniHack and Mujoco environments, showcasing its general applicability. Our code is open sourced at github.com/facebookresearch/sol.
1
Accurately capturing genetic ancestry is critical for ensuring reproducibility and fairness in genomic st… (see more)udies and downstream health research. This study aims to address the prediction of ancestry from genetic data using deep learning, with a focus on generalizability across datasets with diverse populations and on explainability to improve model transparency. We adapt the Diet Network, a deep learning architecture proven effective in handling high-dimensional data, to learn population ancestry from single nucleotide polymorphisms (SNPs) data using the populational Thousand Genomes Project dataset. Our results highlight the model’s ability to generalize to diverse populations in the CARTaGENE and Montreal Heart Institute biobanks and that predictions remain robust to high levels of missing SNPs. We show that, despite the lack of North African populations in the training dataset, the model learns latent representations that reflect meaningful population structure for North African individuals in the biobanks. To improve model transparency, we apply Saliency Maps, DeepLift, GradientShap and Integrated Gradients attribution techniques and evaluate their performance in identifying SNPs leveraged by the model. Using DeepLift, we show that model’s predictions are driven by population-specific signals consistent with those identified by traditional population genetics metrics. This work presents a generalizable and interpretable deep learning framework for genetic ancestry inference in large-scale biobanks with genetic data. By enabling more widespread genomic ancestry characterization in these cohorts, this study contributes practical tools for integrating genetic data into downstream biomedical applications, supporting more inclusive and equitable healthcare solutions.
1.1
Unraveling the lifespan trajectories of human brain development is critical for understanding brain health and … (see more)disease. Recent research demonstrates that electroencephalography signals are composed of periodic and aperiodic components reflecting distinct physiological substrates. This dissociation raises the possibility that they follow different developmental tendencies. Here, we delineate the lifespan trajectories of aperiodic and periodic neural oscillations using a large international cohort (N=1,563, ages 5–95, resting state, eyes closed). We reveal two fundamental developmental patterns: a Monotonic decrease in aperiodic activity and a Growth-and-Decline pattern for periodic activity. Both components have inflections around age 20 and transition to a stable senescent phase around age 40. Spatially, anterior regions mainly exhibit aperiodic activity, while periodic activity concentrate on posterior regions and these patterns remain stable throughout life. Crucially, multimodal analysis shows these trajectories map onto distinct biological substrates. The periodic component’s Growth and Decline trajectory aligns with GABAergic function and myelination. In contrast, the monotonically decreasing trajectory of aperiodic activity mirrors fundamental biomarkers of biological aging, such as DNA methylation and telomere length. Transforming age to a logarithmic scale simplifies these nonlinear trajectories into a linear decreasing and a piecewise concave linear model for aperiodic and periodic components. This form provides a robust and parsimonious framework for quantifying maturation and identifying neurological deviations.
We delineate distinct lifespan trajectories of aperiodic and periodic neural activity in a large-scale international cohort (N=1,563, ages 5–95). Aperiodic activity undergoes a Monotonic Decrease with age. In contrast, periodic activity follows a Growth-then-Decline trajectory, peaking in early adulthood.
Both trajectories feature a critical transition around age 20 and stabilize into a protracted senescent phase from approximately 40 onward.
These neural trajectories map onto distinct biological substrates: periodic activity tracks integrative functions (myelination, GABAergic, and aperiodic decline mirrors fundamental aging processes (DNA methylation).
A stable pattern observed throughout the lifespan is the spatial segregation of neural activity, where aperiodic signals are dominant in anterior regions and periodic signals are concentrated in posterior ones.
Logarithmically transforming age linearized the developmental trajectories, yielding a monotonic decline for the aperiodic component and a concave piecewise for the periodic one. This process establishes robust linear norms for the personalized assessment of brain dysfunction.
Artificial Intelligence models are increasingly used in healthcare, yet global performance metrics can mask variations in reliability across… (see more) individual patients or subgroups with shared attributes, called
patient profiles
. This study introduces MED3pa, a method that identifies when models are less reliable, allowing clinicians to better assess model limitations.
We propose a framework that estimates predictive confidence using three combined approaches: Individualized (IPC), Aggregated (APC), and Mixed Predictive Confidence (MPC). IPC estimates confidence for each patient, APC assesses it across profiles, and MPC combines both. We evaluate our method on four datasets: one simulated, two public, and one private clinical dataset. Metrics by Declaration Rate (MDR) curves show how performance changes when retaining only the most confident predictions, while interpretable decision trees reveal profiles with higher or lower model confidence.
We demonstrate our method in internal, temporal, and external validation settings, as well as through a clinical example. In internal validation, limiting predictions to the 93% most confident cases improved sensitivity by 14.3% and the AUC by 5.1%. In the clinical example, MED3pa identified a patient profile with high misclassification risk, demonstrating its potential for safer deployment.
By identifying low-confidence predictions, our framework improves model reliability in clinical settings. It can be integrated into decision support systems to help clinicians make more informed decisions. Confidence thresholds help balance model performance with the proportion of patients for whom predictions are considered reliable.
Better leveraging confidence in model predictions could improve reliability and trustworthiness, supporting safer and more effective use in healthcare.