Publications

Contributions of network structure, chemoarchitecture and diagnostic categories to transitions between cognitive topographies
Andrea I. Luppi
S. Parker Singleton
Justine Y. Hansen
Keith W. Jamison
Amy Kuceyeski
Richard F. Betzel
Bratislav Misic
The mechanisms linking the brain’s network structure to cognitively relevant activation patterns remain largely unknown. Here, by leveragi… (see more)ng principles of network control, we show how the architecture of the human connectome shapes transitions between 123 experimentally defined cognitive activation maps (cognitive topographies) from the NeuroSynth meta-analytic database. Specifically, we systematically integrated large-scale multimodal neuroimaging data from functional magnetic resonance imaging, diffusion tractography, cortical morphometry and positron emission tomography to simulate how anatomically guided transitions between cognitive states can be reshaped by neurotransmitter engagement or by changes in cortical thickness. Our model incorporates neurotransmitter-receptor density maps (18 receptors and transporters) and maps of cortical thickness pertaining to a wide range of mental health, neurodegenerative, psychiatric and neurodevelopmental diagnostic categories (17,000 patients and 22,000 controls). The results provide a comprehensive look-up table charting how brain network organization and chemoarchitecture interact to manifest different cognitive topographies, and establish a principled foundation for the systematic identification of ways to promote selective transitions between cognitive topographies.
Critical dynamics in spontaneous EEG predict anesthetic-induced loss of consciousness and perturbational complexity
Charlotte Maschke
Jordan O'Byrne
Michele Angelo Colombo
Melanie Boly
Olivia Gosseries
Steven Laureys
Mario Rosanova
Stefanie Blain-Moraes
Consciousness has been proposed to be supported by electrophysiological patterns poised at criticality, a dynamical regime which exhibits ad… (see more)aptive computational properties, maximally complex patterns and divergent sensitivity to perturbation. Here, we investigate dynamical properties of the resting-state electroencephalogram (EEG) of healthy subjects undergoing general anesthesia with propofol, xenon or ketamine. Importantly, all participants were unresponsive under anesthesia, while consciousness was retained only during ketamine anesthesia (in the form of vivid dreams), enabling an experimental dissociation between unresponsiveness and unconsciousness. For each condition, we measure (i) avalanche criticality, (ii) chaoticity, and (iii) criticality-related metrics, revealing that states of unconsciousness are characterized by a distancing from both avalanche criticality and the edge of chaos. We then ask whether these same dynamical properties are predictive of the perturbational complexity index (PCI), a TMS-based measure that has shown remarkably high sensitivity in detecting consciousness independently of behavior. We successfully predict individual subjects’ PCI values with considerably high accuracy from resting-state EEG dynamical properties alone. Our results establish a firm link between perturbational complexity and criticality, and provide further evidence that criticality is a necessary condition for the emergence of consciousness.
Learning Hybrid Interpretable Models: Theory, Taxonomy, and Methods
Julien Ferry
Ulrich Matchi Aïvodji
A hybrid model involves the cooperation of an interpretable model and a complex black box. At inference, any input of the hybrid model is as… (see more)signed to either its interpretable or complex component based on a gating mechanism. The advantages of such models over classical ones are two-fold: 1) They grant users precise control over the level of transparency of the system and 2) They can potentially perform better than a standalone black box since redirecting some of the inputs to an interpretable model implicitly acts as regularization. Still, despite their high potential, hybrid models remain under-studied in the interpretability/explainability literature. In this paper, we remedy this fact by presenting a thorough investigation of such models from three perspectives: Theory, Taxonomy, and Methods. First, we explore the theory behind the generalization of hybrid models from the Probably-Approximately-Correct (PAC) perspective. A consequence of our PAC guarantee is the existence of a sweet spot for the optimal transparency of the system. When such a sweet spot is attained, a hybrid model can potentially perform better than a standalone black box. Secondly, we provide a general taxonomy for the different ways of training hybrid models: the Post-Black-Box and Pre-Black-Box paradigms. These approaches differ in the order in which the interpretable and complex components are trained. We show where the state-of-the-art hybrid models Hybrid-Rule-Set and Companion-Rule-List fall in this taxonomy. Thirdly, we implement the two paradigms in a single method: HybridCORELS, which extends the CORELS algorithm to hybrid modeling. By leveraging CORELS, HybridCORELS provides a certificate of optimality of its interpretable component and precise control over transparency. We finally show empirically that HybridCORELS is competitive with existing hybrid models, and performs just as well as a standalone black box (or even better) while being partly transparent.
The effect of gestational age on short- and long-term complications following primary esophageal atresia repair
Mathias Johansen
Samuel Wasserman
Jean Martin Laberge
Sam J. Daniel
Thomas Engelhardt
Are self-explanations from Large Language Models faithful?
From Representational Harms to Quality-of-Service Harms: A Case Study on Llama 2 Safety Safeguards
Emmanuel Ma
Futian Andrew Wei
Jackie CK Cheung
Investigating Failures to Generalize for Coreference Resolution Models
A.R. Olteanu
Kaheer Suleman
Adam Trischler
Jackie CK Cheung
Coreference resolution models are often evaluated on multiple datasets. Datasets vary, however, in how coreference is realized -- i.e., how … (see more)the theoretical concept of coreference is operationalized in the dataset -- due to factors such as the choice of corpora and annotation guidelines. We investigate the extent to which errors of current coreference resolution models are associated with existing differences in operationalization across datasets (OntoNotes, PreCo, and Winogrande). Specifically, we distinguish between and break down model performance into categories corresponding to several types of coreference, including coreferring generic mentions, compound modifiers, and copula predicates, among others. This break down helps us investigate how state-of-the-art models might vary in their ability to generalize across different coreference types. In our experiments, for example, models trained on OntoNotes perform poorly on generic mentions and copula predicates in PreCo. Our findings help calibrate expectations of current coreference resolution models; and, future work can explicitly account for those types of coreference that are empirically associated with poor generalization when developing models.
Knowledge Distillation for Federated Learning: a Practical Guide
Federated Learning (FL) enables the training of Deep Learning models without centrally collecting possibly sensitive raw data. This paves th… (see more)e way for stronger privacy guarantees when building predictive models. The most used algorithms for FL are parameter-averaging based schemes (e.g., Federated Averaging) that, however, have well known limits: (i) Clients must implement the same model architecture; (ii) Transmitting model weights and model updates implies high communication cost, which scales up with the number of model parameters; (iii) In presence of non-IID data distributions, parameter-averaging aggregation schemes perform poorly due to client model drifts. Federated adaptations of regular Knowledge Distillation (KD) can solve and/or mitigate the weaknesses of parameter-averaging FL algorithms while possibly introducing other trade-offs. In this article, we provide a review of KD-based algorithms tailored for specific FL issues.
Multi Teacher Privileged Knowledge Distillation for Multimodal Expression Recognition
Muhammad Haseeb Aslam
Alessandro Lameiras Koerich
Eric Granger
Human emotion is a complex phenomenon conveyed and perceived through facial expressions, vocal tones, body language, and physiological signa… (see more)ls. Multimodal emotion recognition systems can perform well because they can learn complementary and redundant semantic information from diverse sensors. In real-world scenarios, only a subset of the modalities employed for training may be available at test time. Learning privileged information allows a model to exploit data from additional modalities that are only available during training. SOTA methods for PKD have been proposed to distill information from a teacher model (with privileged modalities) to a student model (without privileged modalities). However, such PKD methods utilize point-to-point matching and do not explicitly capture the relational information. Recently, methods have been proposed to distill the structural information. However, PKD methods based on structural similarity are primarily confined to learning from a single joint teacher representation, which limits their robustness, accuracy, and ability to learn from diverse multimodal sources. In this paper, a multi-teacher PKD (MT-PKDOT) method with self-distillation is introduced to align diverse teacher representations before distilling them to the student. MT-PKDOT employs a structural similarity KD mechanism based on a regularized optimal transport (OT) for distillation. The proposed MT-PKDOT method was validated on the Affwild2 and Biovid datasets. Results indicate that our proposed method can outperform SOTA PKD methods. It improves the visual-only baseline on Biovid data by 5.5%. On the Affwild2 dataset, the proposed method improves 3% and 5% over the visual-only baseline for valence and arousal respectively. Allowing the student to learn from multiple diverse sources is shown to increase the accuracy and implicitly avoids negative transfer to the student model.
Neural differential equations for temperature control in buildings under demand response programs
Noise covariance estimation in multi-task high-dimensional linear models
Kai Tan
Gabriel Romon
Lune P Bellec
Satellite Sunroof: High-res Digital Surface Models and Roof Segmentation for Global Solar Mapping
Vishal Batchu
A. Wilson
Betty Peng
Carl D. Elkin
Umangi Jain
Christopher Van Arsdale
Varun Gulshan
The transition to renewable energy, particularly solar, is key to mitigating climate change. Google's Solar API aids this transition by esti… (see more)mating solar potential from aerial imagery, but its impact is constrained by geographical coverage. This paper proposes expanding the API's reach using satellite imagery, enabling global solar potential assessment. We tackle challenges involved in building a Digital Surface Model (DSM) and roof instance segmentation from lower resolution and single oblique views using deep learning models. Our models, trained on aligned satellite and aerial datasets, produce 25cm DSMs and roof segments. With ~1m DSM MAE on buildings, ~5deg roof pitch error and ~56% IOU on roof segmentation, they significantly enhance the Solar API's potential to promote solar adoption.