Publications

Geometry-Aware Generative Autoencoders for Metric Learning and Generative Modeling on Data Manifolds

Xingzhi Sun

Danqi Liao

Kincaid MacDonald

Yanlei Zhang

Guillaume Huguet

Guy Wolf

Ian Adelstein

Tim G. J. Rudner

Smita Krishnaswamy

Non-linear dimensionality reduction methods have proven successful at learning low-dimensional representations of high-dimensional point clo… (see more)uds on or near data manifolds. However, existing methods are not easily extensible—that is, for large datasets, it is prohibitively expensive to add new points to these embeddings. As a result, it is very difficult to use existing embeddings generatively, to sample new points on and along these manifolds. In this paper, we propose GAGA (geometry-aware generative autoencoders) a framework which merges the power of generative deep learning with non-linear manifold learning by: 1) learning generalizable geometry-aware neural network embeddings based on non-linear dimensionality reduction methods like PHATE and diffusion maps, 2) deriving a non-euclidean pullback metric on the embedded space to generate points faithfully along manifold geodesics, and 3) learning a flow on the manifold that allows us to transport populations. We provide illustration on easily-interpretable synthetic datasets and showcase results on simulated and real single cell datasets. In particular, we show that the geodesic-based generation can be especially important for scientific datasets where the manifold represents a state space and geodesics can represent dynamics of entities over this space.

2024-06-17

ICML.cc/2024/Workshop/GRaM (published)

Is a Good Description Worth a Thousand Pictures? Reducing Multimodal Alignment to Text-Based, Unimodal Alignment

Amin Memarian

Touraj Laleh

Irina Rish

Ardavan S. Nobandegani

Generative AI systems (ChatGPT, Llama, etc.) are increasingly adopted across a range of high-stake domains, including healthcare and crimina… (see more)l justice system. This rapid adoption indeed raises moral and ethical concerns. The emerging field of AI alignment aims to make AI systems that respect human values. In this work, we focus on evaluating the ethics of multimodal AI systems involving both text and images --- a relatively under-explored area, as most alignment work is currently focused on language models. Specifically, here we investigate whether the multimodal alignment problem (i.e., the problem of aligning a multimodal system) could be effectively reduced to the (text-based) unimodal alignment problem, wherein a language model would make a moral judgment purely based on a description of an image. Focusing on GPT-4 and LLaVA as two prominent examples of multimodal systems, here we demonstrate, rather surprisingly, that this reduction can be achieved with a relatively small loss in moral judgment performance in the case of LLaVa, and virtually no loss in the case of GPT-4.

2024-06-17

ICML.cc/2024/Workshop/MFHAIA (poster)

IDs for AI Systems

Alan Chan

Noam Kolt

Peter Wills

Usman Anwar

Christian Schroeder de Witt

Nitarshan Rajkumar

Lewis Hammond

Lennart Heim

Markus Anderljung

2024-06-17

ArXiv (preprint)

IDs for AI Systems

Alan Chan

Noam Kolt

Peter Wills

Usman Anwar

Christian Schroeder de Witt

Nitarshan Rajkumar

Lewis Hammond

Lennart Heim

Markus Anderljung

2024-06-17

ArXiv (preprint)

IDs for AI Systems

Alan Chan

Noam Kolt

Peter Wills

Usman Anwar

Christian Schroeder de Witt

Nitarshan Rajkumar

Lewis Hammond

Lennart Heim

Markus Anderljung

2024-06-17

ArXiv (preprint)

IDs for AI Systems

Alan Chan

Noam Kolt

Peter Wills

Usman Anwar

Christian Schroeder de Witt

Nitarshan Rajkumar

Lewis Hammond

Lennart Heim

Markus Anderljung

2024-06-17

ArXiv (preprint)

IDs for AI Systems

Alan Chan

Noam Kolt

Peter Wills

Usman Anwar

Christian Schroeder de Witt

Nitarshan Rajkumar

Lewis Hammond

Lennart Heim

Markus Anderljung

AI systems are increasingly pervasive, yet information needed to decide whether and how to engage with them may not exist or be accessible. … (see more)A user may not be able to verify whether a system has certain safety certifications. An investigator may not know whom to investigate when a system causes an incident. It may not be clear whom to contact to shut down a malfunctioning system. Across a number of domains, IDs address analogous problems by identifying particular entities (e.g., a particular Boeing 747) and providing information about other entities of the same class (e.g., some or all Boeing 747s). We propose a framework in which IDs are ascribed to instances of AI systems (e.g., a particular chat session with Claude 3), and associated information is accessible to parties seeking to interact with that system. We characterize IDs for AI systems, provide concrete examples where IDs could be useful, argue that there could be significant demand for IDs from key actors, analyze how those actors could incentivize ID adoption, explore a potential implementation of our framework for deployers of AI systems, and highlight limitations and risks. IDs seem most warranted in settings where AI systems could have a large impact upon the world, such as in making financial transactions or contacting real humans. With further study, IDs could help to manage a world where AI systems pervade society.

2024-06-17

ArXiv (preprint)

IDs for AI Systems

Alan Chan

Noam Kolt

Peter Wills

Usman Anwar

Christian Schroeder de Witt

Nitarshan Rajkumar

Lewis Hammond

Lennart Heim

Markus Anderljung

AI systems are increasingly pervasive, yet information needed to decide whether and how to engage with them may not exist or be accessible. … (see more)A user may not be able to verify whether a system has certain safety certifications. An investigator may not know whom to investigate when a system causes an incident. It may not be clear whom to contact to shut down a malfunctioning system. Across a number of domains, IDs address analogous problems by identifying particular entities (e.g., a particular Boeing 747) and providing information about other entities of the same class (e.g., some or all Boeing 747s). We propose a framework in which IDs are ascribed to instances of AI systems (e.g., a particular chat session with Claude 3), and associated information is accessible to parties seeking to interact with that system. We characterize IDs for AI systems, provide concrete examples where IDs could be useful, argue that there could be significant demand for IDs from key actors, analyze how those actors could incentivize ID adoption, explore a potential implementation of our framework for deployers of AI systems, and highlight limitations and risks. IDs seem most warranted in settings where AI systems could have a large impact upon the world, such as in making financial transactions or contacting real humans. With further study, IDs could help to manage a world where AI systems pervade society.

2024-06-17

ArXiv (preprint)

IDs for AI Systems

Alan Chan

Noam Kolt

Peter Wills

Usman Anwar

Christian Schroeder de Witt

Nitarshan Rajkumar

Lewis Hammond

Lennart Heim

Markus Anderljung

2024-06-17

ArXiv (preprint)

IDs for AI Systems

Alan Chan

Noam Kolt

Peter Wills

Usman Anwar

Christian Schroeder de Witt

Nitarshan Rajkumar

Lewis Hammond

Lennart Heim

Markus Anderljung

AI systems are increasingly pervasive, yet information needed to decide whether and how to engage with them may not exist or be accessible. … (see more)A user may not be able to verify whether a system has certain safety certifications. An investigator may not know whom to investigate when a system causes an incident. It may not be clear whom to contact to shut down a malfunctioning system. Across a number of domains, IDs address analogous problems by identifying particular entities (e.g., a particular Boeing 747) and providing information about other entities of the same class (e.g., some or all Boeing 747s). We propose a framework in which IDs are ascribed to instances of AI systems (e.g., a particular chat session with Claude 3), and associated information is accessible to parties seeking to interact with that system. We characterize IDs for AI systems, provide concrete examples where IDs could be useful, argue that there could be significant demand for IDs from key actors, analyze how those actors could incentivize ID adoption, explore a potential implementation of our framework for deployers of AI systems, and highlight limitations and risks. IDs seem most warranted in settings where AI systems could have a large impact upon the world, such as in making financial transactions or contacting real humans. With further study, IDs could help to manage a world where AI systems pervade society.

2024-06-17

ArXiv (preprint)

Improving Molecular Modeling with Geometric GNNs: an Empirical Study

Ali Ramlaoui

Théo Saulus

Basile Terver

Victor Schmidt

David Rolnick

Fragkiskos D. Malliaros

Alexandre AGM Duval

2024-06-17

ICML.cc/2024/Workshop/ML4LMS (poster)