From May 7 to May 11, 2024, dozens of Mila researchers will attend the twelfth International Conference on Learning Representations (ICLR 2024) in Vienna, Austria. This year, they will share 54 scientific papers at the main conference, showcasing their groundbreaking artificial intelligence (AI) research to peers from all around the world.
Here is a list of papers accepted at ICLR 2024 that contain at least one Mila-affiliated author:
Title | Authors | |
Mastering Memory Tasks with World Models | Mohammad Reza Samsami, Artem Zholus, Janarthanan Rajendran, Sarath Chandar | https://openreview.net/pdf?id=1vDArHJ68h |
Course Correcting Koopman Representations | Mahan Fathi, Clement Gehring, Jonathan Pilault, David Kanaa, Pierre-luc Bacon, Ross Goroshin | https://openreview.net/pdf?id=A18gWgc5mi |
Large Language Models as Generalizable Policies for Embodied Tasks | Andrew Szot, Max Schwarzer, Harsh Agrawal, Bogdan Mazoure, Rin Metcalf, Katherine Metcalf, Walter Talbott, Natalie Mackraz, Devon Hjelm, Alexander Toshev | https://openreview.net/pdf?id=u6imHU4Ebu |
Object-centric architectures enable efficient causal representation learning | Amin Mansouri, Jason Hartford, Yan Zhang, Yoshua Bengio | https://openreview.net/pdf?id=r9FsiXZxZt |
On Diffusion Modeling for Anomaly Detection | Victor Livernoche, Vineet Jain, Yashar Hezaveh, Siamak Ravanbakhsh | https://openreview.net/pdf?id=lR3rk7ysXz |
Synaptic Weight Distributions Depend on the Geometry of Plasticity | Roman Pogodin, Jonathan Cornford, Arna Ghosh, Gauthier Gidel, Guillaume Lajoie, Blake Aaron Richards | https://openreview.net/pdf?id=x5txICnnjC |
Delta-AI: Local objectives for amortized inference in sparse graphical models | Jean-Pierre R. Falet, Hae Beom Lee, Nikolay Malkin, Chen Sun, Dragos Secrieru, Dinghuai Zhang, Guillaume Lajoie, Yoshua Bengio | https://openreview.net/pdf?id=LemSSn8htt |
Expected flow networks in stochastic environments and two-player zero-sum games | Marco Jiralerspong, Bilun Sun, Danilo Vucetic, Tianyu Zhang, Yoshua Bengio, Gauthier Gidel, Nikolay Malkin | https://openreview.net/pdf?id=uH0FGECSEI |
Searching for High-Value Molecules Using Reinforcement Learning and Transformers | Raj Ghugare, Santiago Miret, Adriana Hugessen, Mariano Phielipp, Glen Berseth | https://openreview.net/pdf?id=O8mZO2ri33 |
Sufficient conditions for offline reactivation in recurrent neural networks | Nanda H Krishna, Colin Bredenberg, Daniel Levenstein, Blake Aaron Richards, Guillaume Lajoie | https://openreview.net/pdf?id=RVrINT6MT7 |
Efficient Dynamics Modeling in Interactive Environments with Koopman Theory | Arnab Kumar Mondal, Siba Smarak Panigrahi, Sai Rajeswar, K. Siddiqi, Siamak Ravanbakhsh | https://openreview.net/pdf?id=CmUWQ9s2D_ |
TACTiS-2: Better, Faster, Simpler Attentional Copulas for Multivariate Time Series | Arjun Ashok, Étienne Marcotte, Valentina Zantedeschi, Nicolas Chapados, Alexandre Drouin | https://openreview.net/pdf?id=xtOydkE1Ku |
Motif: Intrinsic Motivation from Artificial Intelligence Feedback | Martin Klissarov, P. D'Oro, Shagun Sodhani, Roberta Raileanu, Pierre-luc Bacon, Pascal Vincent, Amy Zhang, Mikael Henaff | https://openreview.net/pdf?id=8v8AVAo6E5 |
On the Stability of Iterative Retraining of Generative Models on their own Data | Quentin Bertrand, Avishek Joey Bose, Alexandre Duplessis, Marco Jiralerspong, Gauthier Gidel | https://openreview.net/pdf?id=JORAfH2xFd |
Leveraging Unpaired Data for Vision-Language Generative Models via Cycle Consistency | Tianhong Li, Sangnie Bhardwaj, Yonglong Tian, Han Zhang, Jarred Barber, Dina Katabi, Guillaume Lajoie, Huiwen Chang, Dilip Krishnan | https://openreview.net/pdf?id=kNjrhD67LP |
GOAt: Explaining Graph Neural Networks via Graph Output Attribution | Shengyao Lu, Keith G Mills, Jiao He, Bang Liu, Di Niu | https://openreview.net/pdf?id=2Q8TZWAHv4 |
Improving Intrinsic Exploration by Creating Stationary Objectives | Roger Creus Castanyer, Joshua Romoff, Glen Berseth | https://openreview.net/pdf?id=YbZxT0SON4 |
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo | Haque Ishfaq, Qingfeng Lan, Pan Xu, A. Rupam Mahmood, Doina Precup, Anima Anandkumar, Kamyar Azizzadenesheli | https://openreview.net/pdf?id=6u1z0RH6u1 |
Cycle Consistency Driven Object Discovery | Aniket Didolkar, Anirudh Goyal, Yoshua Bengio | https://openreview.net/pdf?id=f1xnBr4WD6 |
Pre-Training and Fine-Tuning Generative Flow Networks | L. Pan, Moksh Jain, Kanika Madan, Yoshua Bengio | https://openreview.net/pdf?id=2KY3WwgcTi |
Towards Foundation Models for Knowledge Graph Reasoning | Mikhail Galkin, Xinyu Yuan, Hesham Mostafa, Jian Tang, Zhaocheng Zhu | https://openreview.net/forum?id=jVEoydFOl9 |
PhyloGFN: Phylogenetic inference with generative flow networks | Ming Yang Zhou, Zichao Yan, Elliot Layne, Nikolay Malkin, Dinghuai Zhang, Moksh Jain, Mathieu Blanchette, Yoshua Bengio | https://openreview.net/pdf?id=hB7SlfEmze |
Reward Model Ensembles Help Mitigate Overoptimization | Thomas Coste, Usman Anwar, Robert Kirk, David Krueger | https://openreview.net/pdf?id=NiQYQEPUsA |
Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning | Mingde Zhao, Safa Alver, Harm van Seijen, Romain Laroche, Doina Precup, Yoshua Bengio | https://openreview.net/pdf?id=eo9dHwtTFt |
Amortizing intractable inference in large language models | Edward J. Hu, Moksh Jain, Eric Elmoznino, Younesse Kaddar, Guillaume Lajoie, Yoshua Bengio, Nikolay Malkin | https://openreview.net/pdf?id=Ouj6p4ca60 |
Piecewise Linear Parametrization of Policies: Towards Interpretable Deep Reinforcement Learning | Maxime Wabartha, Joelle Pineau | https://openreview.net/pdf?id=Zbt9z0a95l |
Closing the Gap between TD Learning and Supervised Learning -- A Generalisation Point of View. | Raj Ghugare, Matthieu Geist, Glen Berseth, Benjamin Eysenbach | https://openreview.net/forum?id=qg5JENs0N4 |
Diffusion Generative Flow Samplers: Improving learning signals through partial trajectory optimization | Dinghuai Zhang, Ricky T. Q. Chen, Cheng-Hao Liu, Aaron Courville, Yoshua Bengio | https://openreview.net/pdf?id=OIsahq1UYC |
Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets | Dominique Beaini, Shenyang Huang, Joao Alex Cunha, Zhiyi Li, Gabriela Moisescu-Pareja, Oleksandr Dymov, S. Maddrell-Mander, Callum McLean, Frederick Wenkel, Luis Müller, Jama Hussein Mohamud, Alipanah Parviz, Michael Craig, Michal Koziarski, Jiarui Lu, Zhaocheng Zhu, Cristian Gabellini, Kerstin Klaser, Josef Dean, Cas Wognum, Maciej Sypetkowski, Guillaume Rabusseau, Reihaneh Rabbany, Jian Tang, Ioannis Koutis, Christopher Morris, Mirco Ravanelli, Guy Wolf, Prudencio Tossou, Hadrien Mary, Thérence Bois, A. Fitzgibbon, Blazej Banaszewski, Chad Martin, Dominic Masters | https://openreview.net/pdf?id=Zc2aIcucwc |
Ghost on the Shell: An Expressive Representation of General 3D Shapes | Zhen Liu, Yao Feng, Yuliang Xiu, Weiyang Liu, Liam Paull, Michael J. Black, Bernhard Scholkopf | https://openreview.net/pdf?id=Ad87VjRqUw |
How connectivity structure shapes rich and lazy learning in neural circuits | Yuhan Helena Liu, Aristide Baratin, Jonathan Cornford, Stefan Mihalas, Eric Shea-Brown, Guillaume Lajoie | https://openreview.net/pdf?id=slSmYGc8ee |
Würstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models | Pablo Pernias, Dominic Rampas, Mats L. Richter, Christopher Joseph Pal, Marc Aubreville | https://openreview.net/pdf?id=gU58d5QeGv |
Less or More From Teacher: Exploiting Trilateral Geometry For Knowledge Distillation | Chengming Hu, Haolun Wu, Xuan Li, Chen Ma, Xi Chen, Boyu Wang, Jun Yan, Xue Liu | https://openreview.net/pdf?id=OZitfSXpdT |
The Curse of Diversity in Ensemble-Based Exploration | Zhixuan Lin, P. D'Oro, Evgenii Nikishin, Aaron Courville | https://openreview.net/pdf?id=M3QXCOTTk4 |
Poly-View Contrastive Learning | Amitis Shidani, Dan Busbridge, Devon Hjelm, Jason Ramapuram, Eeshan Gunesh Dhekane, Russell Webb | https://openreview.net/pdf?id=iHcTLIor0m |
Improving Natural Language Understanding with Computation-Efficient Retrieval Augmentation | Shangyu Wu, Ying Xiong, Yufei CUI, Xue Liu, Buzhou Tang, Tei-Wei Kuo, Chun Jason Xue | https://openreview.net/pdf?id=JtKGkz9fAe |
Local Search GFlowNets | Minsu Kim, Taeyoung Yun, Emmanuel Bengio, Dinghuai Zhang, Yoshua Bengio, Sungsoo Ahn, Jinkyoo Park | https://openreview.net/pdf?id=6cFcw1Rxww |
Balancing Act: Sparse Models with Constrained Disparate Impact | Meraj Hashemizadeh, Juan Ramirez, Rohan Sukumaran, Golnoosh Farnadi, Simon Lacoste-Julien, Jose Gallego-Posada | https://openreview.net/pdf?id=Xz13DtbOVW |
The Cost of Scaling Down Large Language Models: Reducing Model Size Affects Memory before In-context Learning | Tian Jin, Nolan Clement, X. Dong, Vaishnavh Nagarajan, Michael Carbin, Jonathan Ragan-Kelley, Gintare Karolina Dziugaite | https://openreview.net/pdf?id=ldJXXxPE0L |
Tree Cross Attention | Leo Feng, Frederick Tung, Hossein Hajimirsadeghi, Yoshua Bengio, M. O. Ahmed | https://openreview.net/pdf?id=Vw24wtSddM |
Decoupling regularization from the action space | Sobhan Mohammadpour, Emma Frejinger, Pierre-luc Bacon | https://openreview.net/pdf?id=UaMgmoKEBj |
Bridging State and History Representations: Understanding Self-Predictive RL | Tianwei Ni, Benjamin Eysenbach, Erfan SeyedSalehi, Michel Ma, Clement Gehring, Aditya Mahajan, Pierre-luc Bacon | https://openreview.net/pdf?id=ms0VgzSGF2 |
LOQA: Learning with Opponent Q-Learning Awareness | Milad Aghajohari, Juan Agustin Duque, Tim Cooijmans, Aaron Courville | https://openreview.net/pdf?id=FDQF6A1s6M |
Reasoning with Latent Diffusion in Offline Reinforcement Learning | Siddarth Venkatraman, Shivesh Khaitan, Ravi Tej Akella, John Dolan, Jeff Schneider, Glen Berseth | https://openreview.net/pdf?id=tGQirjzddO |
Empirical Analysis of Model Selection for Heterogeneous Causal Effect Estimation | Divyat Mahajan, Ioannis Mitliagkas, Brady Neal, Vasilis Syrgkanis | https://openreview.net/pdf?id=yuy6cGt3KL |
Str2Str: A Score-based Framework for Zero-shot Protein Conformation Sampling | Jiarui Lu, Bozitao Zhong, Zuobai Zhang, Jian Tang | https://openreview.net/pdf?id=C4BikKsgmK |
What happens when you fine-tuning your model? Mechanistic analysis of procedurally generated tasks. | Samyak Jain, Robert Kirk, E. S. Lubana, Robert P. Dick, Hidenori Tanaka, Tim Rocktäschel, Edward Grefenstette, David Krueger | https://openreview.net/pdf?id=A0HKeKl4Nl |
Evaluating Representation Learning on the Protein Structure Universe | Arian Rokkum Jamasb, Alex Morehead, Zuobai Zhang, Chaitanya K. Joshi, Kieran Didi, Simon V Mathis, Charles Harris, Jian Tang, Jianlin Cheng, Pietro Lio, Tom Leon Blundell | https://openreview.net/pdf?id=sTYuRVrdK3 |
Intelligent Switching for Reset-Free RL | Darshan Patil, Janarthanan Rajendran, Glen Berseth, Sarath Chandar | https://openreview.net/pdf?id=Nq45xeghcL |
Learning Multi-Agent Communication with Contrastive Learning | Yat Long Lo, Biswa Sengupta, Jakob Foerster, Michael Noukhovitch | https://openreview.net/pdf?id=vZZ4hhniJU |
SE(3)-Stochastic Flow Matching for Protein Backbone Generation | Avishek Joey Bose*, Tara Akhound-Sadegh*, Guillaume Huguet, Kilian FATRAS, Jarrid Rector-Brooks, Cheng-Hao Liu, Andrei Cristian Nica, Maksym Korablyov, Michael M. Bronstein, Alexander Tong | https://openreview.net/pdf?id=kJFIH23hXb |
Understanding In-Context Learning in Transformers and LLMs by Learning to Learn Discrete Functions | Satwik Bhattamishra, Arkil Patel, Phil Blunsom, Varun Kanade | https://openreview.net/pdf?id=ekeyCgeRfC |
Ensemble Distillation for Unsupervised Constituency Parsing | Behzad Shayegh, Yanshuai Cao, Xiaodan Zhu, Jackie CK Cheung, Lili Mou | https://openreview.net/pdf?id=RR8y0WKrFv |
GraphPulse: Topological representations for temporal graph property prediction | Kiarash Shamsi, Farimah Poursafaei, Shenyang Huang, Bao Tran Gia Ngo, Baris Coskunuzer, Cuneyt Gurcan Akcora | https://openreview.net/pdf?id=DZqic2sPTY |