Follow Mila Researchers at ICLR 2025


Here is a schedule featuring Mila-affiliated researchers presenting their work at ICLR 2025. All times are in Singapore Standard Time (SST).

 A PDF version is also available here 

 

Mila @ ICLR 2025, 24.04.2025

Poster Session 1 - 10 a.m. (SST)

  • #8 Efficient Evolutionary Search Over Chemical Space with Large Language Models: Haorui Wang, Marta Skreta, Cher-Tian Ser, Wenhao Gao, Lingkai Kong, Felix Strieth-Kalthoff, Chenru Duan, Yuchen Zhuang, Yue Yu, Yanqiao Zhu, Yuanqi Du, Alán Aspuru-Guzik, Kirill Neklyudov, Chao Zhang
  • #13 Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold: Lazar Atanackovic, Xi Zhang, Brandon Amos, Mathieu Blanchette, Leo J Lee, Yoshua Bengio, Alexander Tong, Kirill Neklyudov
  • #135 AIMS.au: A Dataset for the Analysis of Modern Slavery Countermeasures in Corporate Statements: Adriana Eufrosina Bora, Pierre-Luc St-Charles, Mirko Bronzi, Arsene Fansi Tchango, Bruno Rousseau, Kerrie Mengersen
  • #138 Accelerating neural network training: An analysis of the AlgoPerf competition: Priya Kasimbeg, Frank Schneider, Runa Eschenhagen, Juhan Bae, Chandramouli Shama Sastry, Mark Saroufim, BOYUAN FENG, Less Wright, Edward Z. Yang, Zachary Nado, Sourabh Medapati, Philipp Hennig, Michael Rabbat, George E. Dahl
  • #164 Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction: Jarrid Rector-Brooks, Mohsin Hasan, Zhangzhi Peng, Zachary Quinn, Chenghao Liu, Sarthak Mittal, Nouha Dziri, Michael Bronstein, Yoshua Bengio, Pranam Chatterjee, Alexander Tong, Avishek Joey Bose
  • #280 BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks: Juan A. Rodriguez, Xiangru Jian, Siba Smarak Panigrahi, Tianyu Zhang, Aarash Feizi, Abhay Puri, Akshay Kalkunte Suresh, François Savard, Ahmed Masry, Shravan Nayak, Rabiul Awal, Mahsa Massoud, Amirhossein Abaskohi, Zichao Li, Suyuchen Wang, Pierre-Andre Noel, Mats Leon Richter, Saverio Vadacchino, Shubham Agarwal, Sanket Biswas, Sara Shanian, Ying Zhang, Sathwik Tejaswi Madhusudhan, Joao Monteiro, Krishnamurthy Dj Dvijotham, Torsten Scholak, Nicolas Chapados, Sepideh Kharaghani, Sean Hughes, M. Özsu, Siva Reddy, Marco Pedersoli, Yoshua Bengio, Christopher Pal, Issam Hadj Laradji, Spandana Gella, Perouz Taslakian, David Vazquez, Sai Rajeswar
  • #292 An Auditing Test to Detect Behavioral Shift in Language Models: Leo Richter, Xuanli He, Pasquale Minervini, Matt Kusner
  • #319 HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models: Seanie Lee, Haebin Seong, Dong Bok Lee, Minki Kang, Xiaoyin Chen, Dominik Wagner, Yoshua Bengio, Juho Lee, Sung Ju Hwang
  • #377 Solving Hidden Monotone Variational Inequalities with Surrogate Losses: Ryan D'Orazio, Danilo Vucetic, Zichu Liu, Junhyung Lyle Kim, Ioannis Mitliagkas, Gauthier Gidel
  • #404 Towards General-Purpose Model-Free Reinforcement Learning: Scott Fujimoto, Pierluca D'Oro, Amy Zhang, Yuandong Tian, Michael Rabbat
  • #424 Interpreting Emergent Planning in Model-Free Reinforcement Learning: Thomas Bush, Stephen Chung, Usman Anwar, Adrià Garriga-Alonso, David Krueger
  • #494 Pitfalls of Evidence-Based AI Policy: Stephen Casper, David Krueger, Dylan Hadfield-Menell
  • #506 Selective Unlearning via Representation Erasure Using Domain Adversarial Training: Nazanin Mohammadi Sepahvand, Eleni Triantafillou, Hugo Larochelle, Doina Precup, James J. Clark, Daniel M. Roy, Gintare Karolina Dziugaite
  • #555 Zero-Shot Whole-Body Humanoid Control via Behavioral Foundation Models: Andrea Tirinzoni, Ahmed Touati, Jesse Farebrother, Mateusz Guzek, Anssi Kanervisto, Yingchen Xu, Alessandro Lazaric, Matteo Pirotta
  • #635 On the Modeling Capabilities of Large Language Models for Sequential Decision Making: Martin Klissarov, R Devon Hjelm, Alexander T Toshev, Bogdan Mazoure
  • #625 Protecting against simultaneous data poisoning attacks: Neel Alex, Shoaib Ahmed Siddiqui, Amartya Sanyal, David Krueger

Oral Session 1 - 10:30 a.m. (SST)

Oral 1C

Influence Functions for Scalable Data Attribution in Diffusion Models: Bruno Mlodozeniec, Runa Eschenhagen, Juhan Bae, Alexander Immer, David Krueger, Richard E. Turner

Poster Session 2 - 3 p.m. (SST)

  • #7 SymmCD: Symmetry-Preserving Crystal Generation with Diffusion Models: Daniel Levy, Siba Smarak Panigrahi, Sékou-Oumar Kaba, Qiang Zhu, Kin Long Kelvin Lee, Mikhail Galkin, Santiago Miret, Siamak Ravanbakhsh
  • #43 MuPT: A Generative Symbolic Music Pretrained Transformer: Xingwei Qu, yuelin bai, Yinghao Ma, Ziya Zhou, Ka Man Lo, Jiaheng Liu, Ruibin Yuan, Lejun Min, Xueling Liu, Tianyu Zhang, Xeron Du, Shuyue Guo, Yiming Liang, Yizhi LI, Shangda Wu, Junting Zhou, Tianyu Zheng, Ziyang Ma, Fengze Han, Wei Xue, Gus Xia, Emmanouil Benetos, Xiang Yue, Chenghua Lin, Xu Tan, Wenhao Huang, Jie Fu, Ge Zhang
  • #196 Fully-inductive Node Classification on Arbitrary Graphs: Jianan Zhao, Zhaocheng Zhu, Mikhail Galkin, Hesham Mostafa, Michael M. Bronstein, Jian Tang
  • #297 INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge: Angelika Romanou, Negar Foroutan, Anna Sotnikova, Sree Harsha Nelaturu, Shivalika Singh, Rishabh Maheshwary, Micol Altomare, Zeming Chen, Mohamed A. Haggag, Snegha A, Alfonso Amayuelas, Azril Hafizi Amirudin, Danylo Boiko, Michael Chang, Jenny Chim, Gal Cohen, Aditya Kumar Dalmia, Abraham Diress, Sharad Duwal, Daniil Dzenhaliou, Daniel Fernando Erazo Florez, Fabian Farestam, Joseph Marvin Imperial, Shayekh Bin Islam, Perttu Isotalo, Maral Jabbarishiviari, Börje F. Karlsson, Eldar Khalilov, Christopher Klamm, Fajri Koto, Dominik Krzemiński, Gabriel Adriano de Melo, Syrielle Montariol, Yiyang Nan, Joel Niklaus, Jekaterina Novikova, Johan Samir Obando Ceron, Debjit Paul, Esther Ploeger, Jebish Purbey, Swati Rajwal, Selvan Sunitha Ravi, Sara Rydell, Roshan Santhosh, Drishti Sharma, Marjana Prifti Skenduli, Arshia Soltani Moakhar, Bardia soltani moakhar, Ayush Kumar Tarun, Azmine Toushik Wasi, Thenuka Ovin Weerasinghe, Serhan Yilmaz, Mike Zhang, Imanol Schlag, Marzieh Fadaee, Sara Hooker, Antoine Bosselut
  • #306 VCR: Pixel-Level Complex Reasoning by Restoring Occluded Text: Tianyu Zhang, Suyuchen Wang, Lu Li, Ge Zhang, Perouz Taslakian, Sai Rajeswar, Jie Fu, Bang Liu, Yoshua Bengio
  • #342 The Journey Matters: Average Parameter Count over Pre-training Unifies Sparse and Dense Scaling Laws: Tian Jin, Ahmed Imtiaz Humayun, Utku Evci, Suvinay Subramanian, Amir Yazdanbakhsh, Dan Alistarh, Gintare Karolina Dziugaite
  • #371 AdaFisher: Adaptive Second Order Optimization via Fisher Information: Damien MARTINS GOMES, Yanlei Zhang, Eugene Belilovsky, Guy Wolf, Mahdi S. Hosseini
  • #429 Action abstractions for amortized sampling: Oussama Boussif, Lena Nehale Ezzine, Joseph D Viviano, Michał Koziarski, Moksh J. Jain, Nikolay Malkin, Emmanuel Bengio, Rim Assouel, Yoshua Bengio
  • #515 Influence Functions for Scalable Data Attribution in Diffusion Models: Bruno Mlodozeniec, Runa Eschenhagen, Juhan Bae, Alexander Immer, David Krueger, Richard E. Turner
  • #539 PETRA: Parallel End-to-end Training with Reversible Architectures: Stephane Rivaud, Louis Fournier, Thomas Pumir, Eugene Belilovsky, Michael Eickenberg, Edouard Oyallon
  • #588 Proving Olympiad Inequalities by Synergizing LLMs and Symbolic Reasoning: Zenan Li, Zhaoyu Li, Wen Tang, Xian Zhang, Yuan Yao, Xujie Si, Fan Yang, Kaiyu Yang, Xiaoxing Ma
  • #615 PQMass: Probabilistic Assessment of the Quality of Generative Models using Probability Mass Estimation: Pablo Lemos, Sammy Nasser Sharief, Nikolay Malkin, Salma Salhi, Connor Stone, Laurence Perreault-Levasseur, Yashar Hezaveh

Oral Session 2 - 3:30 p.m. (SST)

Oral 2A

Interpreting Emergent Planning in Model-Free Reinforcement Learning: Thomas Bush, Stephen Chung, Usman Anwar, Adrià Garriga-Alonso, David Krueger

 

Mila @ ICLR 2025, 25.04.2025

Poster Session 3 - 10 a.m. (SST)

  • #4 Structure Language Models for Protein Conformation Generation: Jiarui Lu, Xiaoyin Chen, Stephen Zhewen Lu, Chence Shi, Hongyu Guo, Yoshua Bengio, Jian Tang
  • #12 GlycanML: A Multi-Task and Multi-Structure Benchmark for Glycan Machine Learning: Minghao Xu, Yunteng Geng, Yihang Zhang, Ling Yang, Jian Tang, Wentao Zhang
  • #17 Multi-Modal and Multi-Attribute Generation of Single Cells with CFGen: Alessandro Palma, Till Richter, Hanyi Zhang, Manuel Lubetzki, Alexander Tong, Andrea Dittadi, Fabian J Theis
  • #56 Expressivity of Neural Networks with Random Weights and Learned Biases: Ezekiel Williams, Alexandre Payeur, Avery Hee-Woon Ryoo, Thomas Jiralerspong, Matthew G Perich, Luca Mazzucato, Guillaume Lajoie
  • #60 Credit-based self organizing maps: training deep topographic networks with minimal performance degradation: Amir Ozhan Dehghani, Xinyu Qian, Asa Farahani, Pouya Bashivan
  • #131 Scaling Stick-Breaking Attention: An Efficient Implementation and In-depth Study: Shawn Tan, Songlin Yang, Aaron Courville, Rameswar Panda, Yikang Shen
  • #138 RevisEval: Improving LLM-as-a-Judge via Response-Adapted References: Qiyuan Zhang, Yufei Wang, Tiezheng YU, Yuxin Jiang, Chuhan Wu, Liangyou Li, Yasheng Wang, Xin Jiang, Lifeng Shang, Ruiming Tang, Fuyuan Lyu, Chen Ma
  • #171 Boosting Latent Diffusion with Perceptual Objectives: Tariq Berrada, Pietro Astolfi, Melissa Hall, Marton Havasi, Yohann Benchetrit, Adriana Romero, Karteek Alahari, Michal Drozdzal, Jakob Verbeek
  • #262 InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation: Gaurav Sahu, Abhay Puri, Juan A. Rodriguez, Amirhossein Abaskohi, Mohammad Chegini, Alexandre Drouin, Perouz Taslakian, Valentina Zantedeschi, Alexandre Lacoste, David Vazquez, Nicolas Chapados, Christopher Pal, Sai Rajeswar, Issam Hadj Laradji
  • #311 AFlow: Automating Agentic Workflow Generation: Jiayi Zhang, Jinyu Xiang, Zhaoyang Yu, Fengwei Teng, XiongHui Chen, Jiaqi Chen, Mingchen Zhuge, Xin Cheng, Sirui Hong, Jinlin Wang, Bingnan Zheng, Bang Liu, Yuyu Luo, Chenglin Wu
  • #314 MMTEB: Massive Multilingual Text Embedding Benchmark: Kenneth Enevoldsen, Isaac Chung, Imene Kerboua, Márton Kardos, Ashwin Mathur, David Stap, Jay Gala, Wissam Siblini, Dominik Krzemiński, Genta Indra Winata, Saba Sturua, Saiteja Utpala, Mathieu Ciancone, Marion Schaeffer, Diganta Misra, Shreeya Dhakal, Jonathan Rystrøm, Roman Solomatin, Ömer Veysel Çağatan, Akash Kundu, Martin Bernstorff, Shitao Xiao, Akshita Sukhlecha, Bhavish Pahwa, Rafał Poświata, Kranthi Kiran GV, Shawon Ashraf, Daniel Auras, Björn Plüster, Jan Philipp Harries, Loïc Magne, Isabelle Mohr, Dawei Zhu, Hippolyte Gisserot-Boukhlef, Tom Aarsen, Jan Kostkan, Konrad Wojtasik, Taemin Lee, Marek Suppa, Crystina Zhang, Roberta Rocca, Mohammed Hamdy, Andrianos Michail, John Yang, Manuel Faysse, Aleksei Vatolin, Nandan Thakur, Manan Dey, Dipam Vasani, Pranjal A Chitale, Simone Tedeschi, Nguyen Tai, Artem Snegirev, Mariya Hendriksen, Michael Günther, Mengzhou Xia, Weijia Shi, Xing Han Lu, Jordan Clive, Gayatri K, Maksimova Anna, Silvan Wehrli, Maria Tikhonova, Henil Shalin Panchal, Aleksandr Abramov, Malte Ostendorff, Zheng Liu, Simon Clematide, Lester James Validad Miranda, Alena Fenogenova, Guangyu Song, Ruqiya Bin Safi, Wen-Ding Li, Alessia Borghini, Federico Cassano, Lasse Hansen, Sara Hooker, Chenghao Xiao, Vaibhav Adlakha, Orion Weller, Siva Reddy, Niklas Muennighoff
  • #316 Input Space Mode Connectivity in Deep Neural Networks: Jakub Vrabel, Ori Shem-Ur, Yaron Oz, David Krueger
  • #378 A Truncated Newton Method for Optimal Transport: Mete Kemertas, Amir-massoud Farahmand, Allan Douglas Jepson
  • #391 Safety Representations for Safer Policy Learning: Kaustubh Mani, Vincent Mai, Charlie Gauthier, Annie S Chen, Samer B. Nashed, Liam Paull
  • #464 MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation: Lu Li, Tianyu Zhang, Zhiqi Bu, Suyuchen Wang, Huan He, Jie Fu, Yonghui Wu, Jiang Bian, Yong Chen, Yoshua Bengio
  • #572 MatExpert: Decomposing Materials Discovery By Mimicking Human Experts: Qianggang Ding, Santiago Miret, Bang Liu
  • #573 Adaptive teachers for amortized samplers: Minsu Kim, Sanghyeok Choi, Taeyoung Yun, Emmanuel Bengio, Leo Feng, Jarrid Rector-Brooks, Sungsoo Ahn, Jinkyoo Park, Nikolay Malkin, Yoshua Bengio
  • #578 Towards Improving Exploration through Sibling Augmented GFlowNets: Kanika Madan, Alex Lamb, Emmanuel Bengio, Glen Berseth, Yoshua Bengio
  • #609 Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference: Matthew Riemer, Gopeshh Subbaraj, Glen Berseth, Irina Rish
  • #612 ZETA: Leveraging Z-order Curves for Efficient Top-K Attention: Qiuhuao Zeng, Jerry Huang, Peng Lu, Gezheng Xu, Boxing Chen, Charles Ling, Boyu Wang
  • #627 MaestroMotif: Skill Design from Artificial Intelligence Feedback: Martin Klissarov, Mikael Henaff, Roberta Raileanu, Shagun Sodhani, Pascal Vincent, Amy Zhang, Pierre-Luc Bacon, Doina Precup, Marlos C. Machado, Pierluca D'Oro

Poster Session 4 - 3 p.m. (SST)

  • #4 3DMolFormer: A Dual-channel Framework for Structure-based Drug Discovery: Xiuyuan Hu, Guoqing Liu, Can Chen, Yang Zhao, Hao Zhang, Xue Liu
  • #97 Beyond FVD: An Enhanced Evaluation Metrics for Video Generation Distribution Quality: Ge Ya Luo, Gian Mario Favero, Zhi Hao Luo, Alexia Jolicoeur-Martineau, Christopher Pal
  • #141 The Superposition of Diffusion Models Using the Itô Density Estimator: Marta Skreta, Lazar Atanackovic, Joey Bose, Alexander Tong, Kirill Neklyudov
  • #209 Learning diverse attacks on large language models for robust red-teaming and safety tuning: Seanie Lee, Minsu Kim, Lynn Cherif, David Dobre, Juho Lee, Sung Ju Hwang, Kenji Kawaguchi, Gauthier Gidel, Yoshua Bengio, Nikolay Malkin, Moksh J. Jain
  • #282 Forgetting Transformer: Softmax Attention with a Forget Gate: Zhixuan Lin, Evgenii Nikishin, Xu He, Aaron Courville
  • #369 Accelerating Training with Neuron Interaction and Nowcasting Networks: Boris Knyazev, Abhinav Moudgil, Guillaume Lajoie, Eugene Belilovsky, Simon Lacoste-Julien
  • #395 Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning: Haque Ishfaq, Guangyuan Wang, Mohammad Sami Nur Islam, Doina Precup
  • #410 Multi-agent cooperation through learning-aware policy gradients: Alexander Meulemans, Seijin Kobayashi, Johannes Von Oswald, Nino Scherrer, Eric Elmoznino, Blake Aaron Richards, Guillaume Lajoie, Blaise Aguera y Arcas, João Sacramento
  • #428 Handling Delay in Real-Time Reinforcement Learning: Ivan Anokhin, Rishav, Matthew Riemer, Stephen Chung, Irina Rish, Samira Ebrahimi Kahou
  • #497 Mastering Task Arithmetic: τJp as a Key Indicator for Weight Disentanglement: Kotaro Yoshida, Yuji Naraki, Takafumi Horie, Ryosuke Yamaki, Ryotaro Shimizu, Yuki Saito, Julian McAuley, Hiroki Naganuma
  • #582 Faster, More Efficient RLHF through Off-Policy Asynchronous Learning: Michael Noukhovitch, Shengyi Huang, Sophie Xhonneux, Arian Hosseini, Rishabh Agarwal, Aaron Courville

Oral Session 4 - 3:30 p.m. (SST)

Oral 4B

AFlow: Automating Agentic Workflow Generation: Jiayi Zhang, Jinyu Xiang, Zhaoyang Yu, Fengwei Teng, XiongHui Chen, Jiaqi Chen, Mingchen Zhuge, Xin Cheng, Sirui Hong, Jinlin Wang, Bingnan Zheng, Bang Liu, Yuyu Luo, Chenglin Wu

Oral 4F

MaestroMotif: Skill Design from Artificial Intelligence Feedback: Martin Klissarov, Mikael Henaff, Roberta Raileanu, Shagun Sodhani, Pascal Vincent, Amy Zhang, Pierre-Luc Bacon, Doina Precup, Marlos C. Machado, Pierluca D'Oro

 

Mila @ ICLR 2025, 26.04.2025

Poster Session 5 - 10 a.m. (SST)

  • #31 Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces: DiJia Su, Sainbayar Sukhbaatar, Michael Rabbat, Yuandong Tian, Qinqing Zheng
  • #39 Contractive Dynamical Imitation Policies for Efficient Out-of-Sample Recovery: Amin Soleimani Abyaneh, Mahrokh Boroujeni, Hsiu-Chin Lin, Giancarlo Ferrari-Trecate
  • #106 On the Transfer of Object-Centric Representation Learning: Aniket Rajiv Didolkar, Andrii Zadaianchuk, Anirudh Goyal, Michael Curtis Mozer, Yoshua Bengio, Georg Martius, Maximilian Seitzer
  • #128 CarbonSense: A Multimodal Dataset and Baseline for Carbon Flux Modelling: Matthew Fortier, Mats Leon Richter, Oliver Sonnentag, Christopher Pal
  • #183 ParetoFlow: Guided Flows in Multi-Objective Optimization: Ye Yuan, Can Chen, Christopher Pal, Xue Liu
  • #203 Let Your Features Tell The Differences: Understanding Graph Convolution By Feature Splitting: Yilun Zheng, Xiang Li, Sitao Luan, Xiaojiang Peng, Lihui Chen
  • #224 Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning: Md Rifat Arefin, Gopeshh Subbaraj, Nicolas Gontier, Yann LeCun, Irina Rish, Ravid Shwartz-Ziv, Christopher Pal
  • #259 OSCAR: Operating System Control via State-Aware Reasoning and Re-Planning: Xiaoqiang Wang, Bang Liu
  • #384 MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL: Claas A Voelcker, Marcel Hussing, Eric Eaton, Amir-massoud Farahmand, Igor Gilitschenski
  • #392 Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning: Samuel Garcin, Trevor McInroe, Pablo Castro, Christopher G. Lucas, David Abel, Prakash Panangaden, Stefano V Albrecht
  • #397 Advantage Alignment Algorithms: Juan Agustin Duque, Milad Aghajohari, Tim Cooijmans, razvan ciuca, Tianyu Zhang, Gauthier Gidel, Aaron Courville
  • #399 A Generalist Hanabi Agent: Arjun V Sudhakar, Hadi Nekoei, Mathieu Reymond, Miao Liu, Janarthanan Rajendran, Sarath Chandar
  • #555 Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets: Zhen Liu, Tim Z. Xiao, Weiyang Liu, Yoshua Bengio, Dinghuai Zhang
  • #558 Training Language Models to Self-Correct via Reinforcement Learning: Aviral Kumar, Vincent Zhuang, Rishabh Agarwal, Yi Su, John D Co-Reyes, Avi Singh, Kate Baumli, Shariq Iqbal, Colton Bishop, Rebecca Roelofs, Lei M Zhang, Kay McKinney, Disha Shrivastava, Cosmin Paduraru, George Tucker, Doina Precup, Feryal Behbahani, Aleksandra Faust
  • #572 Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching: Arnav Kumar Jain, Harley Wiltzer, Jesse Farebrother, Irina Rish, Glen Berseth, Sanjiban Choudhury
  • #604 What Secrets Do Your Manifolds Hold? Understanding the Local Geometry of Generative Models: Ahmed Imtiaz Humayun, Ibtihel Amara, Cristina Nader Vasconcelos, Deepak Ramachandran, Candice Schumann, Junfeng He, Katherine A Heller, Golnoosh Farnadi, Negar Rostamzadeh, Mohammad Havaei
  • #611 AssembleFlow: Rigid Flow Matching with Inertial Frames for Molecular Assembly: Hongyu Guo, Yoshua Bengio, Shengchao Liu
  • #634 Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo: João Loula, Benjamin LeBrun, Li Du, Ben Lipkin, Clemente Pasti, Gabriel Grand, Tianyu Liu, Yahya Emara, Marjorie Freedman, Jason Eisner, Ryan Cotterell, Vikash Mansinghka, Alexander K. Lew, Tim Vieira, Timothy J. O'Donnell
  • #207 Improving Equivariant Networks with Probabilistic Symmetry Breaking: Hannah Lawrence, Vasco Portilheiro, Yan Zhang, Sékou-Oumar Kaba

Poster Session 6 - 3 p.m. (SST)

  • #217 Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection: Yun Zhu, Jia-Chen Gu, Caitlin Sikora, Ho Ko, Yinxiao Liu, Chu-Cheng Lin, Lei Shu, Liangchen Luo, Lei Meng, Bang Liu, Jindong Chen
  • #231 Spectra: Surprising Effectiveness of Pretraining Ternary Language Models at Scale: Ayush Kaushal, Tejas Vaidhya, Arnab Kumar Mondal, Tejas Pandey, Aaryan Bhagat, Irina Rish
  • #246 Towards Interpreting Visual Information Processing in Vision-Language Models: Clement Neo, Luke Ong, Philip Torr, Mor Geva, David Krueger, Fazl Barez
  • #304 The Pitfalls of Memorization: When Memorization Hurts Generalization: Reza Bayat, Mohammad Pezeshki, Elvis Dohmatob, David Lopez-Paz, Pascal Vincent
  • #361 Neuroplastic Expansion in Deep Reinforcement Learning: Jiashun Liu, Johan Samir Obando Ceron, Aaron Courville, Ling Pan
  • #363 Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL: Ghada Sokar, Johan Samir Obando Ceron, Aaron Courville, Hugo Larochelle, Pablo Castro
  • #494 Bridging the Data Provenance Gap Across Text, Speech, and Video: Shayne Longpre, Nikhil Singh, Manuel Cherep, Kushagra Tiwary, Joanna Materzynska, William Brannon, Robert Mahari, Naana Obeng-Marnu, Manan Dey, Mohammed Hamdy, Nayan Saxena, Ahmad Mustafa Anis, Emad A. Alghamdi, Vu Minh Chien, Da Yin, Kun Qian, Yizhi LI, Minnie Liang, An Dinh, Shrestha Mohanty, Deividas Mataciunas, Tobin South, Jianguo Zhang, Ariel N. Lee, Campbell S. Lund, Christopher Klamm, Damien Sileo, Diganta Misra, Enrico Shippole, Kevin Klyman, Lester James Validad Miranda, Niklas Muennighoff, Seonghyeon Ye, Seungone Kim, Vipul Gupta, Vivek Sharma, Xuhui Zhou, Caiming Xiong, Luis Villa, Stella Biderman, Alex Pentland, Sara Hooker, Jad Kabbara
  • #550 Multi-session, multi-task neural decoding from distinct cell-types and brain regions: Mehdi Azabou, Krystal Xuejing Pan, Vinam Arora, Ian Jarratt Knight, Eva L Dyer, Blake Aaron Richards

Oral Session 6 - 3:30 p.m. (SST)

Oral 6A

Training Language Models to Self-Correct via Reinforcement Learning: Aviral Kumar, Vincent Zhuang, Rishabh Agarwal, Yi Su, John D Co-Reyes, Avi Singh, Kate Baumli, Shariq Iqbal, Colton Bishop, Rebecca Roelofs, Lei M Zhang, Kay McKinney, Disha Shrivastava, Cosmin Paduraru, George Tucker, Doina Precup, Feryal Behbahani, Aleksandra Faust

Oral 6B

Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo: João Loula, Benjamin LeBrun, Li Du, Ben Lipkin, Clemente Pasti, Gabriel Grand, Tianyu Liu, Yahya Emara, Marjorie Freedman, Jason Eisner, Ryan Cotterell, Vikash Mansinghka, Alexander K. Lew, Tim Vieira, Timothy J. O'Donnell

Oral 6D

Advantage Alignment Algorithms: Juan Agustin Duque, Milad Aghajohari, Tim Cooijmans, razvan ciuca, Tianyu Zhang, Gauthier Gidel, Aaron Courville