Almost 90 Mila-Affiliated Papers Accepted at ICLR 2025

From April 24 to April 28, 2025, dozens of Mila researchers will attend the Thirteenth International Conference on Learning Representations (ICLR 2025) in Singapore. This year, they will share 87 scientific papers at the main conference and dozens of papers at workshops, showcasing their groundbreaking artificial intelligence (AI) research to peers from all around the world.

Here is the list of papers accepted at ICLR 2025 that have at least one Mila-affiliated author.

Main Conference

Paper | Authors | PDF
Pitfalls of Evidence-Based AI Policy | Stephen Casper, David Krueger, Dylan Hadfield-Menell | https://openreview.net/pdf?id=8nyIAanfST
Advantage Alignment Algorithms | Juan Agustin Duque, Milad Aghajohari, Tim Cooijmans, razvan ciuca, Tianyu Zhang, Gauthier Gidel, Aaron Courville | https://openreview.net/pdf?id=QFO1asgas2
Solving Hidden Monotone Variational Inequalities with Surrogate Losses | Ryan D'Orazio, Danilo Vucetic, Zichu Liu, Junhyung Lyle Kim, Ioannis Mitliagkas, Gauthier Gidel | https://openreview.net/pdf?id=4ZX2a3OKEV
Contractive Dynamical Imitation Policies for Efficient Out-of-Sample Recovery | Amin Abyaneh, Mahrokh Ghoddousi Boroujeni, Hsiu-Chin Lin, Giancarlo Ferrari-Trecate | https://openreview.net/pdf?id=lILEtkWOXD
AdaFisher: Adaptive Second Order Optimization via Fisher Information | Damien MARTINS GOMES, Yanlei Zhang, Eugene Belilovsky, Guy Wolf, Mahdi S. Hosseini | https://openreview.net/pdf?id=puTxuiK2qO
Expressivity of Neural Networks with Random Weights and Learned Biases | Ezekiel Williams, Alexandre Payeur, Avery Hee-Woon Ryoo, Thomas Jiralerspong, Matthew G Perich, Luca Mazzucato, Guillaume Lajoie | https://openreview.net/pdf?id=5xwx1Myosu
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning | Haque Ishfaq, Guangyuan Wang, Mohammad Sami Nur Islam, Doina Precup | https://openreview.net/pdf?id=FvQsk3la17
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL | Ghada Sokar, Johan Samir Obando Ceron, Aaron Courville, Hugo Larochelle, Pablo Castro | https://openreview.net/pdf?id=8oCrlOaYcc
Boosting Latent Diffusion with Perceptual Objectives | Tariq Berrada, Pietro Astolfi, Melissa Hall, Marton Havasi, Yohann Benchetrit, Adriana Romero, Karteek Alahari, Michal Drozdzal, Jakob Verbeek | https://openreview.net/pdf?id=y4DtzADzd1
The Pitfalls of Memorization: When Memorization Hurts Generalization | Reza Bayat, Mohammad Pezeshki, Elvis Dohmatob, David Lopez-Paz, Pascal Vincent | https://openreview.net/pdf?id=vVhZh9ZpIM
Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces | DiJia Su, Sainbayar Sukhbaatar, Michael Rabbat, Yuandong Tian, Qinqing Zheng | https://openreview.net/pdf?id=bmbRCRiNDu
Training Language Models to Self-Correct via Reinforcement Learning | Aviral Kumar, Vincent Zhuang, Rishabh Agarwal, Yi Su, John D Co-Reyes, Avi Singh, Kate Baumli, Shariq Iqbal, Colton Bishop, Rebecca Roelofs, Lei M Zhang, Kay McKinney, Disha Shrivastava, Cosmin Paduraru, George Tucker, Doina Precup, Feryal Behbahani, Aleksandra Faust | https://openreview.net/pdf?id=CjwERcAU7w
PETRA: Parallel End-to-end Training with Reversible Architectures | Stephane Rivaud, Louis Fournier, Thomas Pumir, Eugene Belilovsky, Michael Eickenberg, Edouard Oyallon | https://openreview.net/pdf?id=0fhzSFsGUT
The Journey Matters: Average Parameter Count over Pre-training Unifies Sparse and Dense Scaling Laws | Tian Jin, Ahmed Imtiaz Humayun, Utku Evci, Suvinay Subramanian, Amir Yazdanbakhsh, Dan Alistarh, Gintare Karolina Dziugaite | https://openreview.net/pdf?id=ud8FtE1N4N
SymmCD: Symmetry-Preserving Crystal Generation with Diffusion Models | Daniel Levy, Siba Smarak Panigrahi, Sékou-Oumar Kaba, Qiang Zhu, Kin Long Kelvin Lee, Mikhail Galkin, Santiago Miret, Siamak Ravanbakhsh | https://openreview.net/pdf?id=V7x2KZQn2v
Input Space Mode Connectivity in Deep Neural Networks | Jakub Vrabel, Ori Shem-Ur, Yaron Oz, David Krueger | https://openreview.net/pdf?id=3qeOy7HwUT
MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation | Lu Li, Tianyu Zhang, Zhiqi Bu, Suyuchen Wang, Huan He, Jie Fu, Yonghui Wu, Jiang Bian, Yong Chen, Yoshua Bengio | https://openreview.net/pdf?id=1v7SRWsYve
Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold | Lazar Atanackovic, Xi Zhang, Brandon Amos, Mathieu Blanchette, Leo J Lee, Yoshua Bengio, Alexander Tong, Kirill Neklyudov | https://openreview.net/pdf?id=9SYczU3Qgm
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models | Seanie Lee, Haebin Seong, Dong Bok Lee, Minki Kang, Xiaoyin Chen, Dominik Wagner, Yoshua Bengio, Juho Lee, Sung Ju Hwang | https://openreview.net/pdf?id=y3zswp3gek
Action abstractions for amortized sampling | Oussama Boussif, Lena Nehale Ezzine, Joseph D Viviano, Michał Koziarski, Moksh J. Jain, Nikolay Malkin, Emmanuel Bengio, Rim Assouel, Yoshua Bengio | https://openreview.net/pdf?id=ispjankYab
Towards Interpreting Visual Information Processing in Vision-Language Models | Clement Neo, Luke Ong, Philip Torr, Mor Geva, David Krueger, Fazl Barez | https://openreview.net/pdf?id=chanJGoa7f
Neuroplastic Expansion in Deep Reinforcement Learning | Jiashun Liu, Johan Samir Obando Ceron, Aaron Courville, Ling Pan | https://openreview.net/pdf?id=20qZK2T7fa
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models | Michael Noukhovitch, Shengyi Huang, Sophie Xhonneux, Arian Hosseini, Rishabh Agarwal, Aaron Courville | https://openreview.net/pdf?id=FhTAG591Ve
Multi-agent cooperation through learning-aware policy gradients | Alexander Meulemans, Seijin Kobayashi, Johannes Von Oswald, Nino Scherrer, Eric Elmoznino, Blake Aaron Richards, Guillaume Lajoie, Blaise Aguera y Arcas, João Sacramento | https://openreview.net/pdf?id=GkWA6NjePN
Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning | Md Rifat Arefin, Gopeshh Subbaraj, Nicolas Gontier, Yann LeCun, Irina Rish, Ravid Shwartz-Ziv, Christopher Pal | https://openreview.net/pdf?id=30oIfmrcFO
Accelerating Training with Neuron Interaction and Nowcasting Networks | Boris Knyazev, Abhinav Moudgil, Guillaume Lajoie, Eugene Belilovsky, Simon Lacoste-Julien | https://openreview.net/pdf?id=cUFIil6hEG
MaestroMotif: Skill Design from Artificial Intelligence Feedback | Martin Klissarov, Mikael Henaff, Roberta Raileanu, Shagun Sodhani, Pascal Vincent, Amy Zhang, Pierre-Luc Bacon, Doina Precup, Marlos C. Machado, Pierluca D'Oro | https://openreview.net/pdf?id=or8mMhmyRV
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching | Arnav Kumar Jain, Harley Wiltzer, Jesse Farebrother, Irina Rish, Glen Berseth, Sanjiban Choudhury | https://openreview.net/pdf?id=LvRQgsvd5V
Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference | Matthew Riemer, Gopeshh Subbaraj, Glen Berseth, Irina Rish | https://openreview.net/pdf?id=fXb9BbuyAD
Towards General-Purpose Model-Free Reinforcement Learning | Scott Fujimoto, Pierluca D'Oro, Amy Zhang, Yuandong Tian, Michael Rabbat | https://openreview.net/pdf?id=R1hIXdST22
Structure Language Models for Protein Conformation Generation | Jiarui Lu, Xiaoyin Chen, Stephen Zhewen Lu, Chence Shi, Hongyu Guo, Yoshua Bengio, Jian Tang | https://openreview.net/pdf?id=15AkNhFX1R
Adaptive teachers for amortized samplers | Minsu Kim, Sanghyeok Choi, Taeyoung Yun, Emmanuel Bengio, Leo Feng, Jarrid Rector-Brooks, Sungsoo Ahn, Jinkyoo Park, Nikolay Malkin, Yoshua Bengio | https://openreview.net/pdf?id=BdmVgLMvaf
ParetoFlow: Guided Flows in Multi-Objective Optimization | Ye Yuan, Can Chen, Christopher Pal, Xue Liu | https://openreview.net/pdf?id=mLyyB4le5u
Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets | Zhen Liu, Tim Z. Xiao, Weiyang Liu, Yoshua Bengio, Dinghuai Zhang | https://openreview.net/pdf?id=Aye5wL6TCn
PQMass: Probabilistic Assessment of the Quality of Generative Models using Probability Mass Estimation | Pablo Lemos, Sammy Nasser Sharief, Nikolay Malkin, Salma Salhi, Connor Stone, Laurence Perreault-Levasseur, Yashar Hezaveh | https://openreview.net/pdf?id=n7qGCmluZr
Fully-inductive Node Classification on Arbitrary Graphs | Jianan Zhao, Zhaocheng Zhu, Mikhail Galkin, Hesham Mostafa, Michael M. Bronstein, Jian Tang | https://openreview.net/pdf?id=1Qpt43cqhg
Learning diverse attacks on large language models for robust red-teaming and safety tuning | Seanie Lee, Minsu Kim, Lynn Cherif, David Dobre, Juho Lee, Sung Ju Hwang, Kenji Kawaguchi, Gauthier Gidel, Yoshua Bengio, Nikolay Malkin, Moksh J. Jain | https://openreview.net/pdf?id=1mXufFuv95
InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation | Gaurav Sahu, Abhay Puri, Juan A. Rodriguez, Amirhossein Abaskohi, Mohammad Chegini, Alexandre Drouin, Perouz Taslakian, Valentina Zantedeschi, Alexandre Lacoste, David Vazquez, Nicolas Chapados, Christopher Pal, Sai Rajeswar, Issam Hadj Laradji | https://openreview.net/pdf?id=ZGqd0cbBvm
MatExpert: Decomposing Materials Discovery By Mimicking Human Experts | Qianggang Ding, Santiago Miret, Bang Liu | https://openreview.net/pdf?id=AUBvo4sxVL
On the Modeling Capabilities of Large Language Models for Sequential Decision Making | Martin Klissarov, R Devon Hjelm, Alexander T Toshev, Bogdan Mazoure | https://openreview.net/pdf?id=vodsIF3o7N
Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo | João Loula, Benjamin LeBrun, Li Du, Ben Lipkin, Clemente Pasti, Gabriel Grand, Tianyu Liu, Yahya Emara, Marjorie Freedman, Jason Eisner, Ryan Cotterell, Vikash Mansinghka, Alexander K. Lew, Tim Vieira, Timothy J. O'Donnell | https://openreview.net/pdf?id=xoXn62FzD0
Proving Olympiad Inequalities by Synergizing LLMs and Symbolic Reasoning | Zenan Li, Zhaoyu Li, Wen Tang, Xian Zhang, Yuan Yao, Xujie Si, Fan Yang, Kaiyu Yang, Xiaoxing Ma | https://openreview.net/pdf?id=FiyS0ecSm0
BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks | Juan A. Rodriguez, Xiangru Jian, Siba Smarak Panigrahi, Tianyu Zhang, Aarash Feizi, Abhay Puri, Akshay Kalkunte Suresh, François Savard, Ahmed Masry, Shravan Nayak, Rabiul Awal, Mahsa Massoud, Amirhossein Abaskohi, Zichao Li, Suyuchen Wang, Pierre-Andre Noel, Mats Leon Richter, Saverio Vadacchino, Shubham Agarwal, Sanket Biswas, Sara Shanian, Ying Zhang, Sathwik Tejaswi Madhusudhan, Joao Monteiro, Krishnamurthy Dj Dvijotham, Torsten Scholak, Nicolas Chapados, Sepideh Kharaghani, Sean Hughes, M. Özsu, Siva Reddy, Marco Pedersoli, Yoshua Bengio, Christopher Pal, Issam Hadj Laradji, Spandana Gella, Perouz Taslakian, David Vazquez, Sai Rajeswar | https://openreview.net/pdf?id=b1ivBPLb1n
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning | Samuel Garcin, Trevor McInroe, Pablo Castro, Christopher G. Lucas, David Abel, Prakash Panangaden, Stefano V Albrecht | https://openreview.net/pdf?id=tErHYBGlWc
What Secrets Do Your Manifolds Hold? Understanding the Local Geometry of Generative Models | Ahmed Imtiaz Humayun, Ibtihel Amara, Cristina Nader Vasconcelos, Deepak Ramachandran, Candice Schumann, Junfeng He, Katherine A Heller, Golnoosh Farnadi, Negar Rostamzadeh, Mohammad Havaei | https://openreview.net/pdf?id=etif9j1CnG
On the Transfer of Object-Centric Representation Learning | Aniket Rajiv Didolkar, Andrii Zadaianchuk, Anirudh Goyal, Michael Curtis Mozer, Yoshua Bengio, Georg Martius, Maximilian Seitzer | https://openreview.net/pdf?id=bSq0XGS3kW
Forgetting Transformer: Softmax Attention with a Forget Gate | Zhixuan Lin, Evgenii Nikishin, Xu He, Aaron Courville | https://openreview.net/pdf?id=q2Lnyegkr8
Influence Functions for Scalable Data Attribution in Diffusion Models | Bruno Mlodozeniec, Runa Eschenhagen, Juhan Bae, Alexander Immer, David Krueger, Richard E. Turner | https://openreview.net/pdf?id=esYrEndGsr
GlycanML: A Multi-Task and Multi-Structure Benchmark for Glycan Machine Learning | Minghao Xu, Yunteng Geng, Yihang Zhang, Ling Yang, Jian Tang, Wentao Zhang | https://openreview.net/pdf?id=owEQ0FTfVj
Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection | Yun Zhu, Jia-Chen Gu, Caitlin Sikora, Ho Ko, Yinxiao Liu, Chu-Cheng Lin, Lei Shu, Liangchen Luo, Lei Meng, Bang Liu, Jindong Chen | https://openreview.net/pdf?id=HE6pJoNnFp
Towards Improving Exploration through Sibling Augmented GFlowNets | Kanika Madan, Alex Lamb, Emmanuel Bengio, Glen Berseth, Yoshua Bengio | https://openreview.net/pdf?id=HH4KWP8RP5
Protecting against simultaneous data poisoning attacks | Neel Alex, Shoaib Ahmed Siddiqui, Amartya Sanyal, David Krueger | https://openreview.net/pdf?id=rK0YJwL69S
AssembleFlow: Rigid Flow Matching with Inertial Frames for Molecular Assembly | Hongyu Guo, Yoshua Bengio, Shengchao Liu | https://openreview.net/pdf?id=jckKNzYYA6
VCR: Pixel-Level Complex Reasoning by Restoring Occluded Text | Tianyu Zhang, Suyuchen Wang, Lu Li, Ge Zhang, Perouz Taslakian, Sai Rajeswar, Jie Fu, Bang Liu, Yoshua Bengio | https://openreview.net/pdf?id=s0Z4csHOoE
CarbonSense: A Multimodal Dataset and Baseline for Carbon Flux Modelling | Matthew Fortier, Mats Leon Richter, Oliver Sonnentag, Christopher Pal | https://openreview.net/pdf?id=l8zRnvD95l
Handling Delay in Real-Time Reinforcement Learning | Ivan Anokhin, Rishav, Matthew Riemer, Stephen Chung, Irina Rish, Samira Ebrahimi Kahou | https://openreview.net/pdf?id=YOc5t8PHf2
Beyond FVD: An Enhanced Evaluation Metrics for Video Generation Distribution Quality | Ge Ya Luo, Gian Mario Favero, Zhi Hao Luo, Alexia Jolicoeur-Martineau, Christopher Pal | https://openreview.net/pdf?id=cC3LxGZasH
Multi-session, multi-task neural decoding from distinct cell-types and brain regions | Mehdi Azabou, Krystal Xuejing Pan, Vinam Arora, Ian Jarratt Knight, Eva L Dyer, Blake Aaron Richards | https://openreview.net/pdf?id=IuU0wcO0mo
Credit-based self organizing maps: training deep topographic networks with minimal performance degradation | Amir Ozhan Dehghani, Xinyu Qian, Asa Farahani, Pouya Bashivan | https://openreview.net/pdf?id=wMgr7wBuUo
Accelerating neural network training: An analysis of the AlgoPerf competition | Priya Kasimbeg, Frank Schneider, Runa Eschenhagen, Juhan Bae, Chandramouli Shama Sastry, Mark Saroufim, BOYUAN FENG, Less Wright, Edward Z. Yang, Zachary Nado, Sourabh Medapati, Philipp Hennig, Michael Rabbat, George E. Dahl | https://openreview.net/pdf?id=CtM5xjRSfm
Interpreting Emergent Planning in Model-Free Reinforcement Learning | Thomas Bush, Stephen Chung, Usman Anwar, Adrià Garriga-Alonso, David Krueger | https://openreview.net/pdf?id=DzGe40glxs
A Generalist Hanabi Agent | Arjun V Sudhakar, Hadi Nekoei, Mathieu Reymond, Miao Liu, Janarthanan Rajendran, Sarath Chandar | https://openreview.net/pdf?id=pCj2sLNoJq
Scaling Stick-Breaking Attention: An Efficient Implementation and In-depth Study | Shawn Tan, Songlin Yang, Aaron Courville, Rameswar Panda, Yikang Shen | https://openreview.net/pdf?id=r8J3DSD5kF
Selective Unlearning via Representation Erasure Using Domain Adversarial Training | Nazanin Mohammadi Sepahvand, Eleni Triantafillou, Hugo Larochelle, Doina Precup, James J. Clark, Daniel M. Roy, Gintare Karolina Dziugaite | https://openreview.net/pdf?id=KzSGJy1PIf
OSCAR: Operating System Control via State-Aware Reasoning and Re-Planning | Xiaoqiang Wang, Bang Liu | https://openreview.net/pdf?id=VuTrZzrPfn
AFlow: Automating Agentic Workflow Generation | Jiayi Zhang, Jinyu Xiang, Zhaoyang Yu, Fengwei Teng, Xiong-Hui Chen, Jiaqi Chen, Mingchen Zhuge, Xin Cheng, Sirui Hong, Jinlin Wang, Bingnan Zheng, Bang Liu, Yuyu Luo, Chenglin Wu | https://openreview.net/pdf?id=z5uVAKwmjf
MMTEB: Massive Multilingual Text Embedding Benchmark | Kenneth Enevoldsen, Isaac Chung, Imene Kerboua, Márton Kardos, Ashwin Mathur, David Stap, Jay Gala, Wissam Siblini, Dominik Krzemiński, Genta Indra Winata, Saba Sturua, Saiteja Utpala, Mathieu Ciancone, Marion Schaeffer, Diganta Misra, Shreeya Dhakal, Jonathan Rystrøm, Roman Solomatin, Ömer Veysel Çağatan, Akash Kundu, Martin Bernstorff, Shitao Xiao, Akshita Sukhlecha, Bhavish Pahwa, Rafał Poświata, Kranthi Kiran GV, Shawon Ashraf, Daniel Auras, Björn Plüster, Jan Philipp Harries, Loïc Magne, Isabelle Mohr, Dawei Zhu, Hippolyte Gisserot-Boukhlef, Tom Aarsen, Jan Kostkan, Konrad Wojtasik, Taemin Lee, Marek Suppa, Crystina Zhang, Roberta Rocca, Mohammed Hamdy, Andrianos Michail, John Yang, Manuel Faysse, Aleksei Vatolin, Nandan Thakur, Manan Dey, Dipam Vasani, Pranjal A Chitale, Simone Tedeschi, Nguyen Tai, Artem Snegirev, Mariya Hendriksen, Michael Günther, Mengzhou Xia, Weijia Shi, Xing Han Lu, Jordan Clive, Gayatri K, Maksimova Anna, Silvan Wehrli, Maria Tikhonova, Henil Shalin Panchal, Aleksandr Abramov, Malte Ostendorff, Zheng Liu, Simon Clematide, Lester James Validad Miranda, Alena Fenogenova, Guangyu Song, Ruqiya Bin Safi, Wen-Ding Li, Alessia Borghini, Federico Cassano, Lasse Hansen, Sara Hooker, Chenghao Xiao, Vaibhav Adlakha, Orion Weller, Siva Reddy, Niklas Muennighoff | https://openreview.net/pdf?id=zl3pfz4VCV
Safety Representations for Safer Policy Learning | Kaustubh Mani, Vincent Mai, Charlie Gauthier, Annie S Chen, Samer B. Nashed, Liam Paull | https://openreview.net/pdf?id=gJG4IPwg6l
Mastering Task Arithmetic: τJp as a Key Indicator for Weight Disentanglement | Kotaro Yoshida, Yuji Naraki, Takafumi Horie, Ryosuke Yamaki, Ryotaro Shimizu, Yuki Saito, Julian McAuley, Hiroki Naganuma | https://openreview.net/pdf?id=1VwWi6zbxs
3DMolFormer: A Dual-channel Framework for Structure-based Drug Discovery | Xiuyuan Hu, Guoqing Liu, Can Chen, Yang Zhao, Hao Zhang, Xue Liu | https://openreview.net/pdf?id=RgE1qiO2ek
An Auditing Test to Detect Behavioral Shift in Language Models | Leo Richter, Xuanli He, Pasquale Minervini, Matt J. Kusner |
Zero-Shot Whole-Body Humanoid Control via Behavioral Foundation Models | Andrea Tirinzoni, Ahmed Touati, Jesse Farebrother, Mateusz Guzek, Anssi Kanervisto, Yingchen Xu, Alessandro Lazaric, Matteo Pirotta |
AIMS.au: A Dataset for the Analysis of Modern Slavery Countermeasures in Corporate Statements | Adriana Eufrosina Bora, Pierre-Luc St-Charles, Mirko Bronzi, Arsene Fansi Tchango, Bruno Rousseau, Kerrie Mengersen |
Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction | Jarrid Rector-Brooks, Mohsin Hasan, Zhangzhi Peng, Zachary Quinn, Chenghao Liu, Sarthak Mittal, Nouha Dziri, Michael Bronstein, Yoshua Bengio, Pranam Chatterjee, Alexander Tong, Avishek Joey Bose | https://openreview.net/pdf?id=Ombm8S40zN
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL | Claas A Voelcker, Marcel Hussing, Eric Eaton, Amir-massoud Farahmand, Igor Gilitschenski |
A Truncated Newton Method for Optimal Transport | Mete Kemertas, Amir-massoud Farahmand, Allan Douglas Jepson |
MuPT: A Generative Symbolic Music Pretrained Transformer | Xingwei Qu, yuelin bai, Yinghao Ma, Ziya Zhou, Ka Man Lo, Jiaheng Liu, Ruibin Yuan, Lejun Min, Xueling Liu, Tianyu Zhang, Xeron Du, Shuyue Guo, Yiming Liang, Yizhi LI, Shangda Wu, Junting Zhou, Tianyu Zheng, Ziyang Ma, Fengze Han, Wei Xue, Gus Xia, Emmanouil Benetos, Xiang Yue, Chenghua Lin, Xu Tan, Wenhao Huang, Jie Fu, Ge Zhang |
Spectra: Surprising Effectiveness of Pretraining Ternary Language Models at Scale | Ayush Kaushal, Tejas Vaidhya, Arnab Kumar Mondal, Tejas Pandey, Aaryan Bhagat, Irina Rish |
INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge | Angelika Romanou, Negar Foroutan, Anna Sotnikova, Sree Harsha Nelaturu, Shivalika Singh, Rishabh Maheshwary, Micol Altomare, Zeming Chen, Mohamed A. Haggag, Snegha A, Alfonso Amayuelas, Azril Hafizi Amirudin, Danylo Boiko, Michael Chang, Jenny Chim, Gal Cohen, Aditya Kumar Dalmia, Abraham Diress, Sharad Duwal, Daniil Dzenhaliou, Daniel Fernando Erazo Florez, Fabian Farestam, Joseph Marvin Imperial, Shayekh Bin Islam, Perttu Isotalo, Maral Jabbarishiviari, Börje F. Karlsson, Eldar Khalilov, Christopher Klamm, Fajri Koto, Dominik Krzemiński, Gabriel Adriano de Melo, Syrielle Montariol, Yiyang Nan, Joel Niklaus, Jekaterina Novikova, Johan Samir Obando Ceron, Debjit Paul, Esther Ploeger, Jebish Purbey, Swati Rajwal, Selvan Sunitha Ravi, Sara Rydell, Roshan Santhosh, Drishti Sharma, Marjana Prifti Skenduli, Arshia Soltani Moakhar, Bardia soltani moakhar, Ayush Kumar Tarun, Azmine Toushik Wasi, Thenuka Ovin Weerasinghe, Serhan Yilmaz, Mike Zhang, Imanol Schlag, Marzieh Fadaee, Sara Hooker, Antoine Bosselut |
RevisEval: Improving LLM-as-a-Judge via Response-Adapted References | Qiyuan Zhang, Yufei Wang, Tiezheng YU, Yuxin Jiang, Chuhan Wu, Liangyou Li, Yasheng Wang, Xin Jiang, Lifeng Shang, Ruiming Tang, Fuyuan Lyu, Chen Ma |
Let Your Features Tell The Differences: Understanding Graph Convolution By Feature Splitting | Yilun Zheng, Xiang Li, Sitao Luan, Xiaojiang Peng, Lihui Chen | https://openreview.net/pdf?id=I9omfcWfMp
ZETA: Leveraging Z-order Curves for Efficient Top-K Attention | Qiuhuao Zeng, Jerry Huang, Peng Lu, Gezheng Xu, Boxing Chen, Charles Ling, Boyu Wang |
Improving Equivariant Networks with Probabilistic Symmetry Breaking | Hannah Lawrence, Vasco Portilheiro, Yan Zhang, Sékou-Oumar Kaba |
Bridging the Data Provenance Gap Across Text, Speech, and Video | Shayne Longpre, Nikhil Singh, Manuel Cherep, Kushagra Tiwary, Joanna Materzynska, William Brannon, Robert Mahari, Naana Obeng-Marnu, Manan Dey, Mohammed Hamdy, Nayan Saxena, Ahmad Mustafa Anis, Emad A. Alghamdi, Vu Minh Chien, Da Yin, Kun Qian, Yizhi LI, Minnie Liang, An Dinh, Shrestha Mohanty, Deividas Mataciunas, Tobin South, Jianguo Zhang, Ariel N. Lee, Campbell S. Lund, Christopher Klamm, Damien Sileo, Diganta Misra, Enrico Shippole, Kevin Klyman, Lester James Validad Miranda, Niklas Muennighoff, Seonghyeon Ye, Seungone Kim, Vipul Gupta, Vivek Sharma, Xuhui Zhou, Caiming Xiong, Luis Villa, Stella Biderman, Alex Pentland, Sara Hooker, Jad Kabbara |
Multi-Modal and Multi-Attribute Generation of Single Cells with CFGen | Alessandro Palma, Till Richter, Hanyi Zhang, Manuel Lubetzki, Alexander Tong, Andrea Dittadi, Fabian J Theis |
The Superposition of Diffusion Models Using the Itô Density Estimator | Marta Skreta, Lazar Atanackovic, Joey Bose, Alexander Tong, Kirill Neklyudov |
Efficient Evolutionary Search Over Chemical Space with Large Language Models | Haorui Wang, Marta Skreta, Cher-Tian Ser, Wenhao Gao, Lingkai Kong, Felix Strieth-Kalthoff, Chenru Duan, Yuchen Zhuang, Yue Yu, Yanqiao Zhu, Yuanqi Du, Alán Aspuru-Guzik, Kirill Neklyudov, Chao Zhang |

Workshops

Paper | Authors | PDF
Performative Prediction on Games and Mechanism Design | António Góis, Mehrnaz Mofakhami, Fernando P. Santos, Simon Lacoste-Julien, Gauthier Gidel |
Preference Optimization for Concept Bottleneck Models | Emiliano Penaloza, Tianyue H. Zhang, Laurent Charlin, Mateo Espinosa Zarlenga | https://openreview.net/pdf?id=Bz92EvEeD1
Design Editing for Offline Model-based Optimization | Ye Yuan, Youyuan Zhang, Can Chen, Haolun Wu, Melody Zixuan Li, Jianmo Li, James J. Clark, Xue Liu |
Mitigating Shortcut Learning with Diffusion Counterfactuals and Diverse Ensembles | Luca Scimeca, Alexander Rubinstein, Damien Teney, Seong Joon Oh, Yoshua Bengio | https://openreview.net/pdf?id=fF1KXgAhKN
Solving Bayesian inverse problems with diffusion priors and off-policy RL | Luca Scimeca, Siddarth Venkatraman, Moksh Jain, Minsu Kim, Marcin Sendera, Mohsin Hasan, Alexandre Adam, Yashar Hezaveh, Laurence Perreault-Levasseur, Yoshua Bengio, Glen Berseth, Nikolay Malkin |
Outsourced diffusion sampling: Efficient posterior inference in latent spaces of generative models | Siddarth Venkatraman, Mohsin Hasan, Minsu Kim, Luca Scimeca, Marcin Sendera, Yoshua Bengio, Glen Berseth, Nikolay Malkin |
Shaping Inductive Bias in Diffusion Models through Frequency-Based Noise Control | Thomas Jiralerspong, Berton Earnshaw, Jason Hartford, Yoshua Bengio, Luca Scimeca |
Societal Alignment Frameworks Can Improve LLM Alignment | Karolina Stańczak, Nicholas Meade, Mehar Bhatia, Hattie Zhou, Konstantin Böttinger, Jeremy Barnes, Jason Stanley, Jessica Montgomery, Richard Zemel, Nicolas Papernot, Nicolas Chapados, Denis Therien, Timothy P. Lillicrap, Ana Marasović, Sylvie Delacroix, Gillian K. Hadfield, Siva Reddy |
AffinityFlow: Guided Flows for Antibody Affinity Maturation | Can Chen, Karla-Luise Herpoldt, Chenchao Zhao, Zichen Wang, Marcus Collins, Shang Shang, Ron Benson | https://arxiv.org/pdf/2503.00069
Temporal Difference Flows | Jesse Farebrother, Matteo Pirotta, Andrea Tirinzoni, Remi Munos, Alessandro Lazaric, Ahmed Touati |
CrystalGym: A New Benchmark for Materials Discovery Using Reinforcement Learning | Prashant Govindarajan, Mathieu Reymond, Antoine Clavaud, Mariano Phielipp, Santiago Miret, Sarath Chandar |
Curly Flow Matching for Learning Non-Gradient Field Dynamics | Katarina Petrovic, Lazar Atanackovic, Kacper Kapusniak, Michael Bronstein, Avishek Joey Bose, Alexander Tong |
Scalable Equilibrium Sampling with Sequential Boltzmann Generators | Charlie Tan, Avishek Joey Bose, Chen Lin, Leon Klein, Michael Bronstein, Alexander Tong |
Timing is important: Risk-aware Fund Allocation based on Time-Series Forecasting | Fuyuan Lyu, Linfeng Du, Yunpeng Weng, Qiufang Ying, Zhiyan Xu, wenzou, Haolun Wu, xiuqiang He, Xing Tang |
Exploring Sparse Adapters for Scalable Merging of Parameter Efficient Experts | Samin Yeasar Arnob, Zhan Su, Minseon Kim, Oleksiy Ostapenko, Doina Precup, Lucas Caccia, Alessandro Sordoni |
A Joint Space-Time Encoder for Geographic Time-Series Data | David Mickisch, Konstantin Klemmer, Mélisande Teng, David Rolnick |
Alberta Wells Dataset: Pinpointing Oil and Gas Wells from Satellite Imagery | Pratinav Seth, Michelle Lin, Brefo Dwamena Yaw, Jade Boutot, Mary Kang, David Rolnick |
Assessing SAM for tree crown instance segmentation from drone imagery | Mélisande Teng, Arthur Ouaknine, Etienne Laliberté, Yoshua Bengio, David Rolnick, Hugo Larochelle |
Physics-based data-driven model for CO2 gas diffusion electrodes to drive automated laboratories | Ivan Grega, Félix Therrien, Abhishek Soni, Karry Ocean, Kevan Dettelbach, Ribwar Ahmadi, Mehrdad Mokhtari, Curtis P. Berlinguette, Yoshua Bengio | https://openreview.net/pdf?id=B73xYizsLV
On the Role of Prompt Multiplicity in LLM Hallucination Evaluation | |
Hyper-Align: Efficient Modality Alignment via Hypernetworks | Jaisidh Singh, Diganta Misra, Boris Knyazev, Antonio Orvieto |
Feynman-Kac Correctors in Diffusion: Annealing, Guidance, and Product of Experts | Marta Skreta, Tara Akhound-Sadegh, Viktor Ohanesian, Roberto Bondesan, Alan Aspuru-Guzik, Arnaud Doucet, Rob Brekelmans, Alexander Tong, Kirill Neklyudov |
Path Planning for Masked Diffusion Models with Applications to Biological Sequence Generation | Fred Zhangzhi Peng, Zachary Bezemek, Sawan Patel, Jarrid Rector-Brooks, Sherwood Yao, Alexander Tong, Pranam Chatterjee |
SOAPI: Siamese-guided generation of Off-Target-Avoiding Protein Interactions | Sophia Vincoff, Oscar Davis, Alexander Tong, Joey Bose, Pranam Chatterjee |
Gumbel-Softmax Score and Flow Matching for Discrete Biological Sequence Generation | Sophia Tang, Yinuo Zhang, Alexander Tong, Pranam Chatterjee |
Simulation-Free Structure Learning For Stochastic Dynamics | Adam Stecklov, Noah El Rimawi-Fine, Lucas Nelson, Stephen Y. Zhang, Lazar Atanackovic, Alexander Tong, Mathieu Blanchette |
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding | Ahmed Masry, Juan A. Rodriguez, Tianyu Zhang, Suyuchen Wang, Chao Wang, Aarash Feizi, Akshay Kalkunte Suresh, Abhay Puri, Xiangru Jian, Pierre-André Noël, Sathwik Tejaswi Madhusudhan, Marco Pedersoli, Bang Liu, Nicolas Chapados, Yoshua Bengio, Enamul Hoque, Christopher Pal, Issam H. Laradji, David Vazquez, Perouz Taslakian, Spandana Gella, Sai Rajeswar |
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation | Rabiul Awal, Mahsa Massoud, Zichao Li, Aarash Feizi, Suyuchen Wang, Christopher Pal, Aishwarya Agrawal, David Vazquez, Siva Reddy, Juan A. Rodriguez, Perouz Taslakian, Spandana Gella, Sai Rajeswar |
ASYNC-TB: Scaling Off-Policy Exploration for LLM Reinforcement Learning | Brian R. Bartoldson, Siddarth Venkatraman, James Diffenderfer, Moksh Jain, Tal Ben-Nun, Seanie Lee, Minsu Kim, Johan Obando-Ceron, Yoshua Bengio, Bhavya Kailkhura | https://openreview.net/pdf?id=iSyxl2dKyz
Unlearning Geo-Cultural Stereotypes in Multilingual LLMs | Alireza Dehghanpour Farashah, Aditi Khandelwal, Negar Rostamzadeh, Golnoosh Farnadi |
Generative Verifiers: Reward Modeling as Next Token Prediction | |
DASFormer: Self-supervised Pretraining for Earthquake Monitoring | Qianggang Ding, Zhichao Shen, Weiqiang Zhu, Bang Liu | https://openreview.net/forum?id=LnWM7aVaFE
TradExpert: Revolutionizing Trading with Mixture of Expert LLMs | Qianggang Ding, Haochen Shi, Jiadong Guo, Bang Liu |
ICLR 2025 Workshop on Tackling Climate Change with Machine Learning: Data-Centric Approaches in ML for Climate Action | Konstantin Klemmer, Melissa Chapman, Lily Xu, Poon Kin Ho, Mélisande Teng, Patrick Emami, Yoshua Bengio |
Integrating Generative and Experimental Platforms for Biomolecular Design | Chenghao Liu, Jarrid Rector-Brooks, Soojung Yang, Sidney Lisanza, Francesca-Zhoufan Li, Hannes Stärk, Jacob Gershon, Lauren Hong, Pranam Chatterjee, Tommi Jaakkola, Regina Barzilay, David Baker, Frances Arnold, Yoshua Bengio |