Publications

Preserving Privacy in GANs Against Membership Inference Attack

Mohammadhadi Shateri

Francisco Messina

Fabrice Labeau

Generative Adversarial Networks (GANs) have been widely used for generating synthetic data for cases where there is a limited size real-worl… (voir plus)d dataset or when data holders are unwilling to share their data samples. Recent works showed that GANs, due to overfitting and memorization, might leak information regarding their training data samples. This makes GANs vulnerable to Membership Inference Attacks (MIAs). Several defense strategies have been proposed in the literature to mitigate this privacy issue. Unfortunately, defense strategies based on differential privacy are proven to reduce extensively the quality of the synthetic data points. On the other hand, more recent frameworks such as PrivGAN and PAR-GAN are not suitable for small-size training datasets. In the present work, the overfitting in GANs is studied in terms of the discriminator, and a more general measure of overfitting based on the Bhattacharyya coefficient is defined. Then, inspired by Fano's inequality, our first defense mechanism against MIAs is proposed. This framework, which requires only a simple modification in the loss function of GANs, is referred to as the maximum entropy GAN or MEGAN and significantly improves the robustness of GANs to MIAs. As a second defense strategy, a more heuristic model based on minimizing the information leaked from generated samples about the training data points is presented. This approach is referred to as mutual information minimization GAN (MIMGAN) and uses a variational representation of the mutual information to minimize the information that a synthetic sample might leak about the whole training data set. Applying the proposed frameworks to some commonly used data sets against state-of-the-art MIAs reveals that the proposed methods can reduce the accuracy of the adversaries to the level of random guessing accuracy with a small reduction in the quality of the synthetic data samples.

2023-12-12

IEEE Transactions on Information Forensics and Security (inconnu)

doi.org

arxiv.org

Privacy-preserving analysis of time-to-event data under nested case-control sampling

Lamin Juwara

Archer Yang

Ana M Velly

Paramita Saha-Chaudhuri

2023-12-12

Statistical Methods in Medical Research (publié)

doi.org

Relative Almost Sure Regret Bounds for Certainty Equivalence Control of Markov Jump Systems

Borna Sayedana

Mohammad Afshari

Peter E. Caines

Aditya Mahajan

In this paper, we consider learning and control problem in an unknown Markov jump linear system (MJLS) with perfect state observations. We f… (voir plus)irst establish a generic upper bound on regret for any learning based algorithm. We then propose a certainty equivalence-based learning alagrithm and show that this algorithm achieves a regret of

2023-12-12

2023 62nd IEEE Conference on Decision and Control (CDC) (publié)

doi.org

Weighted-Norm Bounds on Model Approximation in MDPs with Unbounded Per-Step Cost

Berk Bozkurt

Aditya Mahajan

Ashutosh Nayyar

Yi Ouyang

We consider the problem of designing a control policy for an infinite-horizon discounted cost Markov Decision Process (MDP) …

2023-12-12

IEEE Conference on Decision and Control (publié)

doi.org

A Hitchhiker's Guide to Geometric GNNs for 3D Atomic Systems

Alexandre AGM Duval

Simon V. Mathis

Chaitanya K. Joshi

Victor Schmidt

Santiago Miret

Fragkiskos D. Malliaros

Taco Cohen

Pietro Lio’

Yoshua Bengio

Michael M. Bronstein

2023-12-11

ArXiv (prépublication)

doi.org

arxiv.org

Efficient Graphics Representation with Differentiable Indirection

Sayantan Datta

Carl Marshall

Zhao Dong

Zhengqin Li

D. Nowrouzezahrai

We introduce differentiable indirection – a novel learned primitive that employs differentiable multi-scale lookup tables as an effective … (voir plus)substitute for traditional compute and data operations across the graphics pipeline. We demonstrate its flexibility on a number of graphics tasks, i.e., geometric and image representation, texture mapping, shading, and radiance field representation. In all cases, differentiable indirection seamlessly integrates into existing architectures, trains rapidly, and yields both versatile and efficient results.

2023-12-10

SIGGRAPH Asia 2023 Conference Papers (publié)

doi.org

arxiv.org

Explorable Mesh Deformation Subspaces from Unstructured 3D Generative Models

Arman Maesumi

Paul Guerrero

Noam Aigerman

Vladimir Kim

Matthew Fisher

Siddhartha Chaudhuri

Daniel Ritchie

2023-12-10

SIGGRAPH Asia 2023 Conference Papers (publié)

doi.org

arxiv.org

Model Breadcrumbs: Scaling Multi-Task Model Merging with Sparse Masks

MohammadReza Davari

Eugene Belilovsky

2023-12-10

ArXiv (prépublication)

doi.org

arxiv.org

Lagrangian Properties and Control of Soft Robots Modeled with Discrete Cosserat Rods

Lekan Molu

Shaoru Chen

Audrey Sedal

The characteristic ``in-plane"bending associated with soft robots' deformation make them preferred over rigid robots in sophisticated manipu… (voir plus)lation and movement tasks. Executing such motion strategies to precision in soft deformable robots and structures is however fraught with modeling and control challenges given their infinite degrees-of-freedom. Imposing \textit{piecewise constant strains} (PCS) across (discretized) Cosserat microsolids on the continuum material however, their dynamics become amenable to tractable mathematical analysis. While this PCS model handles the characteristic difficult-to-model ``in-plane"bending well, its Lagrangian properties are not exploited for control in literature neither is there a rigorous study on the dynamic performance of multisection deformable materials for ``in-plane"bending that guarantees steady-state convergence. In this sentiment, we first establish the PCS model's structural Lagrangian properties. Second, we exploit these for control on various strain goal states. Third, we benchmark our hypotheses against an Octopus-inspired robot arm under different constant tip loads. These induce non-constant ``in-plane"deformation and we regulate strain states throughout the continuum in these configurations. Our numerical results establish convergence to desired equilibrium throughout the continuum in all of our tests. Within the bounds here set, we conjecture that our methods can find wide adoption in the control of cable- and fluid-driven multisection soft robotic arms; and may be extensible to the (learning-based) control of deformable agents employed in simulated, mixed, or augmented reality.

2023-12-09

ArXiv (prépublication)

doi.org

arxiv.org

Filtering Pixel Latent Variables for Unmixing Noisy and Undersampled Volumetric Images

Catherine Bouchard

Andréanne Deschênes

Vincent Boulanger

Jean-Michel Bellavance

Julia Chabbert

Alexy Pelletier-Rioux

Flavie Lavoie-Cardinal

Christian Gagné

2023-12-07

ArXiv (prépublication)

arxiv.org

Harnessing Predictive Modeling and Software Analytics in the Age of LLM-Powered Software Development (Invited Talk)

Foutse Khomh

2023-12-07

International Conference on Predictive Models in Software Engineering (publié)

doi.org

Bug Characterization in Machine Learning-based Systems

Mohammad Mehdi Morovati

Amin Nikanjam

Florian Tambon

Foutse Khomh

Z. Jiang

Rapid growth of applying Machine Learning (ML) in different domains, especially in safety-critical areas, increases the need for reliable ML… (voir plus) components, i.e., a software component operating based on ML. Understanding the bugs characteristics and maintenance challenges in ML-based systems can help developers of these systems to identify where to focus maintenance and testing efforts, by giving insights into the most error-prone components, most common bugs, etc. In this paper, we investigate the characteristics of bugs in ML-based software systems and the difference between ML and non-ML bugs from the maintenance viewpoint. We extracted 447,948 GitHub repositories that used one of the three most popular ML frameworks, i.e., TensorFlow, Keras, and PyTorch. After multiple filtering steps, we select the top 300 repositories with the highest number of closed issues. We manually investigate the extracted repositories to exclude non-ML-based systems. Our investigation involved a manual inspection of 386 sampled reported issues in the identified ML-based systems to indicate whether they affect ML components or not. Our analysis shows that nearly half of the real issues reported in ML-based systems are ML bugs, indicating that ML components are more error-prone than non-ML components. Next, we thoroughly examined 109 identified ML bugs to identify their root causes, symptoms, and calculate their required fixing time. The results also revealed that ML bugs have significantly different characteristics compared to non-ML bugs, in terms of the complexity of bug-fixing (number of commits, changed files, and changed lines of code). Based on our results, fixing ML bugs are more costly and ML components are more error-prone, compared to non-ML bugs and non-ML components respectively. Hence, paying a significant attention to the reliability of the ML components is crucial in ML-based systems.

2023-12-04

Empirical Software Engineering (publié)

doi.org

arxiv.org

TRAIL : IA responsable pour les professionnels et les leaders

Fondateur en résidence Mila Ventures

Avantage IA : productivité dans la fonction publique

Publications

TRAIL : IA responsable pour les professionnels et les leaders

Fondateur en résidence Mila Ventures

Avantage IA : productivité dans la fonction publique

Mots-clés populaires:

Publications