Publications

Symmetry Breaking and Equivariant Neural Networks

Sékou-Oumar Kaba

Siamak Ravanbakhsh

Using symmetry as an inductive bias in deep learning has been proven to be a principled approach for sample-efficient model design. However, the relationship between symmetry and the imperative for equivariance in neural networks is not always obvious. Here, we analyze a key limitation that arises in equivariant functions: their incapacity to break symmetry at the level of individual data samples. In response, we introduce a novel notion of 'relaxed equivariance' that circumvents this limitation. We further demonstrate how to incorporate this relaxation into equivariant multilayer perceptrons (E-MLPs), offering an alternative to the noise-injection method. The relevance of symmetry breaking is then discussed in various application domains: physics, graph representation learning, combinatorial optimization and equivariant decoding.
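The limitation described in this abstract can be checked numerically. The sketch below is not the authors' construction: it assumes a generic permutation-equivariant linear layer (the names `equivariant_layer`, `a` and `b` are hypothetical) and shows that an input fixed by every permutation is mapped to an output fixed by every permutation, so the layer cannot break the symmetry of that sample.

```python
import numpy as np

def equivariant_layer(x, a=2.0, b=-0.5):
    """Permutation-equivariant linear map: f(x)_i = a * x_i + b * mean(x)."""
    return a * x + b * x.mean()

rng = np.random.default_rng(0)
perm = rng.permutation(4)

# Equivariance: permuting the input permutes the output the same way.
x = rng.normal(size=4)
assert np.allclose(equivariant_layer(x[perm]), equivariant_layer(x)[perm])

# A symmetric input (all entries equal) is fixed by every permutation,
# so its output must also be fixed: the entries cannot be told apart.
x_sym = np.ones(4)
y_sym = equivariant_layer(x_sym)
output_is_symmetric = bool(np.allclose(y_sym, y_sym[0]))
```

Any scheme that must return distinct values for the entries of `x_sym` therefore needs something beyond strict equivariance, which is the gap the abstract's relaxed notion is meant to address.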

2023-11-27

NeurIPS.cc/2023/Workshop/NeurReps (oral presentation)

On the Information Geometry of Vision Transformers

Sonia Joseph

Kumar Krishna Agrawal

Arna Ghosh

Blake Aaron Richards

2023-11-27

NeurIPS.cc/2023/Workshop/NeurReps (poster)

On the Varied Faces of Overparameterization in Supervised and Self-Supervised Learning

Matteo Gamba

Arna Ghosh

Kumar Krishna Agrawal

Blake Aaron Richards

Hossein Azizpour

Mårten Björkman

The quality of the representations learned by neural networks depends on several factors, including the loss function, learning algorithm, and model architecture. In this work, we use information geometric measures to assess the representation quality in a principled manner. We demonstrate that the sensitivity of learned representations to input perturbations, measured by the spectral norm of the feature Jacobian, provides valuable information about downstream generalization. On the other hand, measuring the coefficient of spectral decay observed in the eigenspectrum of feature covariance provides insights into the global representation geometry. First, we empirically establish an equivalence between these notions of representation quality and show that they are inversely correlated. Second, our analysis reveals the varying roles that overparameterization plays in improving generalization. Unlike supervised learning, we observe that increasing model width leads to higher discriminability and less smoothness in the self-supervised regime. Furthermore, we report that there is no observable double descent phenomenon in SSL with non-contrastive objectives for commonly used parameterization regimes, which opens up new opportunities for tight asymptotic analysis. Taken together, our results provide a loss-aware characterization of the different role of overparameterization in supervised and self-supervised learning.
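The two measures named in this abstract can be illustrated with a minimal NumPy sketch. It is an assumption-laden illustration, not the authors' code: the function names are hypothetical, the Jacobian is estimated by central finite differences, and the decay coefficient is taken as the negative slope of a log-log fit of eigenvalue against rank.

```python
import numpy as np

def jacobian_spectral_norm(f, x, eps=1e-6):
    """Largest singular value of the Jacobian of f at x (central differences)."""
    x = np.asarray(x, dtype=float)
    cols = []
    for i in range(x.size):
        step = np.zeros_like(x)
        step[i] = eps
        cols.append((f(x + step) - f(x - step)) / (2 * eps))
    jac = np.stack(cols, axis=1)  # shape: (output_dim, input_dim)
    return np.linalg.norm(jac, 2)

def covariance_eigenvalues(features):
    """Eigenvalues of the feature covariance, sorted in descending order."""
    cov = np.cov(features, rowvar=False)  # rows are samples
    return np.sort(np.linalg.eigvalsh(cov))[::-1]

def spectral_decay_coefficient(eigvals):
    """Fit eigvals[i] ~ (i + 1) ** (-alpha) and return alpha."""
    eigvals = np.asarray(eigvals, dtype=float)
    eigvals = eigvals[eigvals > 1e-12]  # drop numerically zero modes
    ranks = np.arange(1, eigvals.size + 1)
    slope, _ = np.polyfit(np.log(ranks), np.log(eigvals), 1)
    return -slope
```

In use, `f` would be the feature map of a trained network evaluated at a data point, and `features` a matrix with one row per sample; both are placeholders here.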

2023-11-27

NeurIPS.cc/2023/Workshop/NeurReps (poster)

1351. Predictors of Loss of Infectivity Among Healthcare Workers with Primary and Recurrent SARS-CoV-2 infection: An Observational Cohort Study

Stefka Dzieciolowska

Yves Longtin

Hugues Charest

Tonya Roy

Judith Fafard

Inès Levade

Jean Longtin

Leighanne Parkes

Jasmin Villeneuve

Patrice Savard

J. Corbeil

Gaston De Serres

Background: Factors associated with loss of infectivity in healthcare workers (HCWs) with COVID-19 are poorly understood. Understanding predictive factors could help optimize return-to-work criteria and minimize absenteeism. Methods: Prospective observational cohort study of HCWs with COVID-19 conducted between February 20, 2022 and March 6, 2023 in 20 institutions in Montreal, Canada, with clinical/laboratory follow-up on days 5, 7 and 10 of infection. Infectivity was determined by viral culture (Vero E6 cells) on nasopharyngeal swabs. Predictors of loss of infectivity were investigated by univariate and multivariate logistic regression. Results: Overall, 121 participants (79.3% female, mean age 40 years) were recruited. Most (n=107, 88.4%) had received ≥3 vaccines and 20 (16.5%) had a history of prior COVID-19. The proportion of HCWs with a positive viral culture decreased from 71.9% on day 5 of infection to 18.2% on day 10. The proportion of HCWs with a positive RT-PCR decreased from 93.3% (112/120) on day 5 (median Ct value, 23.4 [IQR, 20.6-27.9]) to 61.2% (74/120) on day 10 (median Ct value, 32.5 [IQR, 28.5 to undetectable]). Rapid antigen detection test (RADT) positivity decreased from 81.5% on day 5 to 34.2% on day 10. Participants with recurrent COVID-19 had a lower likelihood of infectivity at each visit (OR on day 5, 0.14; 95% CI, 0.05-0.40; p<0.001; OR on day 7, 0.04; 95% CI, 0.01-0.33; p=0.003) and none were infective on day 10 (p=0.02). At each visit, recurrent cases had higher median RT-PCR Ct values than primary infections (p<0.03) and were more likely to have a negative RADT result (p<0.01). By multivariate analysis, ongoing infectivity was associated with an RT-PCR Ct value <23 (adjusted OR [aOR] on day 5, 22.75; p<0.001; aOR on day 7, 182.30; p<0.001; and aOR on day 10, 24.71; p=0.02). A history of previous COVID-19 was associated with a lower probability of infectivity on day 5 (aOR, 0.005; p=0.003). By contrast, symptom improvement (including fever) and RADT result were not independent predictors of loss of infectivity. Conclusion: A lower RT-PCR Ct value is associated with ongoing infectivity, whereas COVID-19 reinfection is a predictor of loss of infectivity. These findings could help optimize return-to-work algorithms. Disclosures: All authors: no reported disclosures.

2023-11-26

Open Forum Infectious Diseases (published)

Author Correction: 30×30 biodiversity gains rely on national coordination

Isaac Eckert

Andrea Brown

Dominique Caron

Federico Riva

Laura J. Pollock

2023-11-26

Nature Communications (published)

Exploring the multidimensional nature of repetitive and restricted behaviors and interests (RRBI) in autism: neuroanatomical correlates and clinical implications

Aline Lefebvre

Nicolas Traut

Amandine Pedoux

Anna Maruani

Anita Beggiato

Monique Elmaleh

David Germanaud

Anouck Amestoy

Myriam Ly‐Le Moal

Christopher H. Chatham

Lorraine Murtagh

Manuel Bouvard

Marianne Alisson

Marion Leboyer

Thomas Bourgeron

Roberto Toro

Guillaume Dumas

Clara A. Moreau

Richard Delorme

2023-11-26

Molecular Autism (published)

scGeneRythm: Using Neural Networks and Fourier Transformation to Cluster Genes by Time-Frequency Patterns in Single-Cell Data

Yiming Jia

Hao Wu

Jun Ding

2023-11-26

bioRxiv (preprint)

Laurence Perreault-Levasseur

The search for the lost attractor

Mario Pasquato

Syphax Haddad

Pierfrancesco Di Cintio

Alexandre Adam

Pablo Lemos

Noé Dia

Mircea Petrache

Ugo Niccolò Di Carlo

Alessandro Alberto Trani

Yashar Hezaveh

2023-11-26

arXiv (preprint)

Hessian Aware Low-Rank Perturbation for Order-Robust Continual Learning

Jiaqi Li

Rui Wang

Yuanhao Lai

Changjian Shui

Sabyasachi Sahoo

Charles X. Ling

Shichun Yang

Boyu Wang

Christian Gagné

Fan Zhou

Continual learning aims to learn a series of tasks sequentially without forgetting the knowledge acquired from the previous ones. In this work, we propose the Hessian Aware Low-Rank Perturbation algorithm for continual learning. By modeling the parameter transitions along the sequential tasks with the weight matrix transformation, we propose to apply the low-rank approximation on the task-adaptive parameters in each layer of the neural networks. Specifically, we theoretically demonstrate the quantitative relationship between the Hessian and the proposed low-rank approximation. The approximation ranks are then globally determined according to the marginal increment of the empirical loss estimated by the layer-specific gradient and low-rank approximation error. Furthermore, we control the model capacity by pruning less important parameters to diminish the parameter growth. We conduct extensive experiments on various benchmarks, including a dataset with large-scale tasks, and compare our method against some recent state-of-the-art methods to demonstrate the effectiveness and scalability of our proposed method. Empirical results show that our method performs better on different benchmarks, especially in achieving task order robustness and handling the forgetting issue. The source code is at https://github.com/lijiaqi/HALRP.
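For readers unfamiliar with the low-rank approximation error mentioned in this abstract, the sketch below shows the generic building block only: a truncated SVD of a weight matrix and its Frobenius-norm error. It is not the HALRP algorithm (see the linked repository for that); the function name and the random stand-in matrix are assumptions made for illustration.

```python
import numpy as np

def low_rank_approximation(weight, rank):
    """Best rank-`rank` approximation of `weight` in Frobenius norm (truncated SVD)."""
    u, s, vt = np.linalg.svd(weight, full_matrices=False)
    approx = (u[:, :rank] * s[:rank]) @ vt[:rank]
    # Eckart-Young: the error is the norm of the discarded singular values.
    error = np.sqrt(np.sum(s[rank:] ** 2))
    return approx, error

rng = np.random.default_rng(0)
weight = rng.normal(size=(8, 6))  # stand-in for one layer's weight matrix
approx, error = low_rank_approximation(weight, rank=2)
```

Because the error is available directly from the singular values, it can be evaluated for every candidate rank without forming the approximation, which is convenient when ranks are chosen across layers.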

2023-11-25

arXiv (preprint)

Low Compute Unlearning via Sparse Representations

Vedant Shah

Frederik Träuble

Ashish Malik

Hugo Larochelle

Michael Curtis Mozer

Sanjeev Arora

Yoshua Bengio

Anirudh Goyal

Machine unlearning, which involves erasing knowledge about a forget set from a trained model, can prove to be costly and infeasible using existing techniques. We propose a low-compute unlearning technique based on a discrete representational bottleneck. We show that the proposed technique efficiently unlearns the forget set and incurs negligible damage to the model's performance on the rest of the dataset. We evaluate the proposed technique on the problem of class unlearning using four datasets: CIFAR-10, CIFAR-100, LACUNA-100 and ImageNet-1k. We compare the proposed technique to SCRUB, a state-of-the-art approach which uses knowledge distillation for unlearning. Across all four datasets, the proposed technique performs as well as, if not better than, SCRUB while incurring almost no computational cost.

2023-11-25

arXiv (preprint)

Fourier neural operator for real-time simulation of 3D dynamic urban microclimate

Wenhui Peng

Shaoxiang Qin

Senwen Yang

Jianchun Wang

Xue Liu

Liangzhu (Leon) Wang

Global urbanization has underscored the significance of urban microclimates for human comfort, health, and building/urban energy efficiency. They profoundly influence building design and urban planning as major environmental impacts. Understanding local microclimates is essential for cities to prepare for climate change and effectively implement resilience measures. However, analyzing urban microclimates requires considering a complex array of outdoor parameters within computational domains at the city scale over a longer period than indoors. As a result, numerical methods like Computational Fluid Dynamics (CFD) become computationally expensive when evaluating the impact of urban microclimates. The rise of deep learning techniques has opened new opportunities for accelerating the modeling of complex non-linear interactions and system dynamics. Recently, the Fourier Neural Operator (FNO) has been shown to be very promising in accelerating the solution of Partial Differential Equations (PDEs) and the modeling of fluid dynamic systems. In this work, we apply the FNO network for real-time three-dimensional (3D) urban wind field simulation. The training and testing data are generated from CFD simulation of the urban area, based on the semi-Lagrangian approach and fractional stepping method to simulate urban microclimate features for modeling large-scale urban problems. Numerical experiments show that the FNO model can accurately reconstruct the instantaneous spatial velocity field. We further evaluate the trained FNO model on unseen data with different wind directions, and the results show that the FNO model can generalize well on different wind directions. More importantly, the FNO approach can make predictions within milliseconds on the graphics processing unit, making real-time simulation of 3D dynamic urban microclimate possible.
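As a rough illustration of the operation at the heart of a Fourier Neural Operator, the sketch below applies a spectral convolution to a one-dimensional, single-channel signal. It is a simplification and not the model used in the paper: the work above is three-dimensional, and a full FNO layer also mixes channels per mode and adds a pointwise linear path and a nonlinearity. The function name and the `weights` argument are assumptions.

```python
import numpy as np

def spectral_conv_1d(x, weights):
    """Fourier-layer core: FFT, scale the lowest modes, zero the rest, inverse FFT.

    x: real signal of shape (n,); weights: complex array of shape (modes,),
    with modes <= n // 2 + 1.
    """
    n = x.shape[0]
    modes = weights.shape[0]
    x_hat = np.fft.rfft(x)                     # shape: (n // 2 + 1,)
    out_hat = np.zeros_like(x_hat)
    out_hat[:modes] = x_hat[:modes] * weights  # per-mode multipliers (learned in an FNO)
    return np.fft.irfft(out_hat, n=n)

# Usage: keep only the three lowest modes of a two-frequency signal.
t = np.arange(16) / 16
signal = np.sin(2 * np.pi * t) + np.sin(2 * np.pi * 5 * t)
filtered = spectral_conv_1d(signal, np.ones(3, dtype=complex))
```

With unit weights the layer acts as a low-pass filter; in a trained network the multipliers are learned, and inference reduces to a few FFTs and elementwise products.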

2023-11-24

Building and Environment (published)