Clustering units in neural networks: upstream vs downstream information
Richard D Lange
Konrad Paul Kording
It has been hypothesized that some form of"modular"structure in artificial neural networks should be useful for learning, compositionality, … (see more)and generalization. However, defining and quantifying modularity remains an open problem. We cast the problem of detecting functional modules into the problem of detecting clusters of similar-functioning units. This begs the question of what makes two units functionally similar. For this, we consider two broad families of methods: those that define similarity based on how units respond to structured variations in inputs ("upstream"), and those based on how variations in hidden unit activations affect outputs ("downstream"). We conduct an empirical study quantifying modularity of hidden layer representations of simple feedforward, fully connected networks, across a range of hyperparameters. For each model, we quantify pairwise associations between hidden units in each layer using a variety of both upstream and downstream measures, then cluster them by maximizing their"modularity score"using established tools from network science. We find two surprising results: first, dropout dramatically increased modularity, while other forms of weight regularization had more modest effects. Second, although we observe that there is usually good agreement about clusters within both upstream methods and downstream methods, there is little agreement about the cluster assignments across these two families of methods. This has important implications for representation-learning, as it suggests that finding modular representations that reflect structure in inputs (e.g. disentanglement) may be a distinct goal from learning modular representations that reflect structure in outputs (e.g. compositionality).
Studying the Practices of Deploying Machine Learning Projects on Docker
Moses Openja
Forough Majidi
Bhagya Chembakottu
Heng Li
A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games
Samuel Sokota
Ryan D’orazio
J. Z. Kolter
Nicolas Loizou
Marc Lanctot
Noam Brown
Christian Kroer
This work studies an algorithm, which we call magnetic mirror descent, that is inspired by mirror descent and the non-Euclidean proximal gra… (see more)dient algorithm. Our contribution is demonstrating the virtues of magnetic mirror descent as both an equilibrium solver and as an approach to reinforcement learning in two-player zero-sum games. These virtues include: 1) Being the first quantal response equilibria solver to achieve linear convergence for extensive-form games with first order feedback; 2) Being the first standard reinforcement learning algorithm to achieve empirically competitive results with CFR in tabular settings; 3) Achieving favorable performance in 3x3 Dark Hex and Phantom Tic-Tac-Toe as a self-play deep reinforcement learning algorithm.
The distribution, ecology and predicted habitat use of the Critically Endangered angelshark (Squatina squatina) in coastal waters of Wales and the central Irish Sea
Joanna Barker
Jake Davies
Monika Goralczyk
Surshti Patel
John O'Connor
Jim Evans
Jackson Wesley Evans
Rowland Sharp
Matthew Gollock
Fenella R. Wood
Frank Wood
James Rosindell
Charlie Bartlett
Brett J. Garner
Dafydd Jones
D. J. Jones
Declan Quigley
Ben Wray
Billy Wray
Abstract The angelshark (Squatina squatina) has the northernmost range of any angel shark species, but there is limited information on its d… (see more)istribution, habitat use and ecology at higher latitudes. To address this, Angel Shark Project: Wales gathered 2231 S. squatina records and 142 anecdotal resources from fishers, coastal communities and archives. These spanned the coastal waters of Wales and the central Irish Sea and were dated from 1812 to 2020, with 97.62% of records within 11.1 km (6 nm) of the coast. Commercial, recreational and charter boat fishers provided the majority of S. squatina records (97.18%), with significantly more sightings from three decades (1970s, 1980s and 1990s) and in the months of September, June, August and July (in descending order). The coastal area between Bardsey Island and Strumble Head had the most S. squatina records (n = 1279), with notable concentrations also found in Carmarthen Bay, Conwy Bay and the Outer Severn Estuary. Species distribution models (SDM) identified four environmental variables that had significant influence on S. squatina distribution, depth, chlorophyll‐a concentration, sea surface temperature (SST) and salinity, and these varied between the quarters (Q) of the year. SDM model outputs predicted a larger congruous area of suitable habitat in Q3 (3176 km2) compared to Q2 (2051 km2), with suitability along the three glacial moraines (Sarn Badrig, Sarn‐y‐Bwch and Sarn Cynfelyn) strongly presented. Comparison of modelled environmental variables at the location of S. squatina records for each Q identified reductions in depth and salinity, and increases in chlorophyll‐a and SST when comparing Q2 or Q3 with Q1 or Q4. This shift may suggest S. squatina are making seasonal movements to shallow coastal waters in Q2 and Q3. This is supported by 23 anecdotal resources and may be driven by reproductive behaviour, as there were 85 records of S. squatina individuals ≤60 cm in the dataset, inferred as recently born or juvenile life‐history stages. The results have helped fill significant evidence gaps identified in the Wales Angelshark Action Plan and immediate next research steps are suggested.
Leveraging Integer Linear Programming to Learn Optimal Fair Rule Lists
Julien Ferry
Sébastien Gambs
Marie-José Huguet
Mohamed
Siala
On Neural Architecture Inductive Biases for Relational Tasks
Current deep learning approaches have shown good in-distribution generalization performance, but struggle with out-of-distribution generaliz… (see more)ation. This is especially true in the case of tasks involving abstract relations like recognizing rules in sequences, as we find in many intelligence tests. Recent work has explored how forcing relational representations to remain distinct from sensory representations, as it seems to be the case in the brain, can help artificial systems. Building on this work, we further explore and formalize the advantages afforded by 'partitioned' representations of relations and sensory details, and how this inductive bias can help recompose learned relational structure in newly encountered settings. We introduce a simple architecture based on similarity scores which we name Compositional Relational Network (CoRelNet). Using this model, we investigate a series of inductive biases that ensure abstract relations are learned and represented distinctly from sensory data, and explore their effects on out-of-distribution generalization for a series of relational psychophysics tasks. We find that simple architectural choices can outperform existing models in out-of-distribution generalization. Together, these results show that partitioning relational representations from other information streams may be a simple way to augment existing network architectures' robustness when performing out-of-distribution relational computations.
Few-shot Question Generation for Personalized Feedback in Intelligent Tutoring Systems
Devang Kulshreshtha
Muhammad Shayan
Robert Belfer
Iulian V. Serban
Ekaterina Kochmar
Interacting brains revisited: A cross‐brain network neuroscience perspective
Christian Gerloff
Kerstin Konrad
Christina Büsing
Vanessa Reindl
Technologically-assisted communication attenuates inter-brain synchrony
Linoy Schwartz
Jonathan Levy
Yaara Endevelt-Shapira
Amir Djalovski
Olga Hayut
Ruth Pinkenson Feldman
Modeling electronic health record data using a knowledge-graph-embedded topic model
Yuesong Zou
Ahmad Pesaranghader
Aman Verma
The rapid growth of electronic health record (EHR) datasets opens up promising opportunities to understand human diseases in a systematic wa… (see more)y. However, effective extraction of clinical knowledge from the EHR data has been hindered by its sparsity and noisy information. We present KG-ETM, an end-to-end knowledge graph-based multimodal embedded topic model. KG-ETM distills latent disease topics from EHR data by learning the embedding from the medical knowledge graphs. We applied KG-ETM to a large-scale EHR dataset consisting of over 1 million patients. We evaluated its performance based on EHR reconstruction and drug imputation. KG-ETM demonstrated superior performance over the alternative methods on both tasks. Moreover, our model learned clinically meaningful graph-informed embedding of the EHR codes. In additional, our model is also able to discover interpretable and accurate patient representations for patient stratification and drug recommendations.
Vendor-neutral sequences and fully transparent workflows improve inter-vendor reproducibility of quantitative MRI
Agah Karakuzu
Labonny Biswas
Nikola Stikov
Purpose We developed an end-to-end workflow that starts with a vendor-neutral acquisition and tested the hypothesis that vendor-neutral sequ… (see more)ences decrease inter-vendor variability of T1, MTR and MTsat measurements. Methods We developed and deployed a vendor-neutral 3D spoiled gradient-echo (SPGR) sequence on three clinical scanners by two MRI vendors. We then acquired T1 maps on the ISMRM-NIST system phantom, as well as T1, MTR and MTsat maps in three healthy participants. We performed hierarchical shift function analysis in vivo to characterize the differences between scanners when the vendor-neutral sequence is used instead of commercial vendor implementations. Inter-vendor deviations were compared for statistical significance to test the hypothesis. Results In the phantom, the vendor-neutral sequence reduced inter-vendor differences from 8 - 19.4% to 0.2 - 5% with an overall accuracy improvement, reducing ground truth T1 deviations from 7 - 11% to 0.2 - 4%. In vivo we found that the variability between vendors is significantly reduced (p = 0.015) for all maps (T1, MTR and MTsat) using the vendor-neutral sequence. Conclusion We conclude that vendor-neutral workflows are feasible and compatible with clinical MRI scanners. The significant reduction of inter-vendor variability using vendor-neutral sequences has important implications for qMRI research and for the reliability of multicenter clinical trials.
Genetic correlates of phenotypic heterogeneity in autism
Varun Warrier
Xinhe Zhang
Patrick Reed
Alexandra Havdahl
Tyler M. Moore
Freddy Cliquet
Claire Leblond
Thomas Rolland
Anders Rosengren
Antonia San Jose Hannah Daisy Jessica Jessica Claire Bethany Eva Tony Declan Rosemary Jack Jessica Nicola Meng-Chuan Gwilym Amber Emily Hisham Julia Sara Ambrosino Sarai Yvonne Tabitha Miriam Alyssia Iris Maarten Anna Ver Loren Nico Sarah Larry Carsten Annika Daniel Ineke Yvette Maartje Elzbieta Elodie Kristiina Rouslan Guillaume Yang-Min Thomas Caceres
Antonia San Jose Hannah Daisy Jessica Jessica Claire Betha Caceres Hayward Crawley Faulkner Sabet Ellis Oakle
Antonia San José Cáceres
Hannah Hayward
Daisy Crawley
Jessica Faulkner
Jessica Sabet
Claire Ellis
Beth Oakley
Eva Loth
Tony Charman … (see 67 more)
Declan Murphy
Rosemary Holt
Jack Waldman
Jessica Upadhyay
Nicola Gunby
Meng-Chuan Lai
Gwilym Renouf
Amber N. V. Ruigrok
Emily Taylor
Hisham Ziauddeen
Julia Deakin
Sara Ambrosino di Bruttopilo
Sarai van Dijk
Yvonne Rijks
Tabitha Koops
Miriam Douma
Alyssia Spaan
Iris Selten
Maarten Steffers
Anna Ver Loren van Themaat
Nico Bast
Sarah Baumeister
Larry O’Dwyer
Carsten Bours
Annika Rausch
Daniel von Rhein
Ineke Cornelissen
Yvette de Bruin
Maartje Graauwmans
Elzbieta Kostrzewa
Elodie Cauvet
Kristiina Tammimies
Rouslan Sitnikow
Yang-Min Kim
Thomas Bourgeron
David M. Jonas Thomas Preben Bo Ole Merete Hougaard
David M. Hougaard
Jonas Bybjerg-Grauholm
Thomas Werge
Preben Bo Mortensen
Ole Mors
Merete Nordentoft
Dwaipayan Armandina Carrie Isabelle Tracey Paula Alex Graham J. Alexander E. P. Lidia V. Tal Madeline A. Deepak P. Jonathan Adhya
Dwaipayan Armandina Carrie Isabelle Tracey Paula Alex Graham Adhya Alamanza Allison Garvey Parsons Smith Tsompa
Dwaipayan Adhya
Armandina Alamanza
Carrie Allison
Isabelle Garvey
Tracey Parsons
Paula Smith
Alex Tsompanidis
Graham J. Burton
Alexander E. P. Heazell
Lidia V. Gabis
Tal Biron-Shental
Madeline A. Lancaster
Deepak P. Srivastava
Jonathan Mill
David H. Rowitch
Matthew E. Hurles
Daniel H. Geschwind
Anders D. Børglum
Elise B. Robinson
Jakob Grove
Hilary C. Martin
Simon Baron-Cohen