Capturing Individual Human Preferences with Reward Features
André Barreto
Vincent Dumoulin
Yiran Mao
Nicolas Perez-Nieves
Bobak Shahriari
Yann Dauphin
Offline Model-Based Optimization: Comprehensive Review
Minsu Kim
Jiayao Gu
Zixuan Liu
Can Chen
RL4Med-DDPO: Reinforcement Learning for Controlled Guidance Towards Diverse Medical Image Generation using Vision-Language Foundation Models
Meditation induces shifts in neural oscillations, brain complexity and critical dynamics: Novel insights from MEG
Annalisa Pascarella
Philipp Thölke
David Meunier
Jordan O’Byrne
Tarek Lajnef
Antonino Raffone
Roberto Guidotti
Vittorio Pizzella
Laura Marzetti
UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction
Xiangru Jian
Kevin Qinghong Lin
Juan A. Rodriguez
Montek Kalsi
M. T. Özsu
David Vazquez
Perouz Taslakian
Sai Rajeswar
Human Annotator
Hitting the right pitch: Cortical tracking of fundamental frequency changes across speech rates in auditory and sensorimotor regions
Yorguin-Jose Mantilla-Ramos
Ana-Sofía Hincapié-Casas
Annalisa Pascarella
Tarek Lajnef
Richard M. Leahy
Emily B.J. Coffey
Véronique Boulenger
Tapered Off-Policy REINFORCE: Stable and efficient reinforcement learning for LLMs
Joshua Greaves
Alex Fréchette
Carolyne Pelletier
Eric Thibodeau-Laufer
Sándor Toth
Sam Work
Sparse Decomposition of Graph Neural Networks
Yaochen Hu
Mai Zeng
Ge Zhang
Pavel Rumiantsev
Yingxue Zhang
Negotiative Alignment: Embracing Disagreement to Achieve Fairer Outcomes -- Insights from Urban Studies
Sample Compression for Continual Learning