Publications

Lightweight, Pre-trained Transformers for Remote Sensing Timeseries
Ivan Zvonkov
Mirali Purohit
Hannah Kerner
Machine learning methods for satellite data have a range of societally relevant applications, but labels used to train models can be difficult or impossible to acquire. Self-supervision is a natural solution in settings with limited labeled data, but current self-supervised models for satellite data fail to take advantage of the characteristics of that data, including the temporal dimension (which is critical for many applications, such as monitoring crop growth) and availability of data from many complementary sensors (which can significantly improve a model's predictive performance). We present Presto (the Pretrained Remote Sensing Transformer), a model pre-trained on remote sensing pixel-timeseries data. By designing Presto specifically for remote sensing data, we can create a significantly smaller but performant model. Presto excels at a wide variety of globally distributed remote sensing tasks and performs competitively with much larger models while requiring far less compute. Presto can be used for transfer learning or as a feature extractor for simple models, enabling efficient deployment at scale.
The Influence of Age, Sex, and Socioeconomic Status on Glycemic Control Among People With Type 1 and Type 2 Diabetes in Canada: Patient-Led Longitudinal Retrospective Cross-sectional Study With Multiple Time Points of Measurement
Seyedmostafa Mousavi
Dana Tannenbaum Greenberg
Ruth Ndjaboué
Michelle Greiver
Olivia Drescher
Selma Chipenda Dansokho
Denis Boutin
Jean-Marc Chouinard
Sylvie Dostie
Robert Fenton
Marley Greenberg
Jonathan McGavock
Adhiyat Najam
Monia Rekik
Tom Weisz
Donald J Willison
Holly O Witteman
Controllable Image Generation via Collage Representations
Arantxa Casanova
Marlene Careil
Adriana Romero
Christopher Pal
Jakob Verbeek
Social Media as a Vector for Escort Ads: A Study on OnlyFans advertisements on Twitter.
Online sex trafficking is on the rise and a majority of trafficking victims report being advertised online. The use of OnlyFans as a platform for adult content is also increasing, with Twitter as its main advertising tool. Furthermore, we know that traffickers usually work within a network and control multiple victims. Consequently, we suspect that there may be networks of traffickers promoting multiple OnlyFans accounts belonging to their victims. To this end, we present the first study of OnlyFans advertisements on Twitter in the context of finding organized activities. Preliminary analysis of this space shows that most tweets related to OnlyFans contain generic text, making text-based methods less reliable. Instead, focusing on what ties the authors of these tweets together, we propose a novel method for uncovering coordinated networks of users based on their behaviour. Our method, called Multi-Level Clustering (MLC), combines two levels of clustering that consider both the network structure and embedded node attribute information. It focuses jointly on user connections (through mentions) and content (through shared URLs). We apply MLC to real-world data of 2 million tweets pertaining to OnlyFans and analyse the detected groups. We also evaluate our method on synthetically generated data (with injected ground truth) and show its superior performance compared to competitive baselines. Finally, we discuss examples of organized clusters as case studies and provide interesting conclusions to our study.
Towards ethical multimodal systems
Esma Aimeur
Tri-process model of interpersonal mindfulness: theoretical framework and study protocol
Bassam Khoury
Viktoriya Manova
Lena Adel
Michael Lifshitz
Rodrigo C. Vergara
Harmehr Sekhon
Soham Rej
According to the Centers for Disease Control and Prevention, over 14% of the US population practice mindfulness meditation. The effects of mindfulness training on physical and mental health have been consistently documented, but its effects on interpersonal relationships are not yet fully understood or investigated. Interpersonal relationships play a crucial role in the wellbeing of individuals and society, and therefore warrant further study. The aim of this paper is to present a tri-process theoretical model of interpersonal mindfulness and a study protocol to validate the proposed model. Specifically, according to the proposed model, mindfulness meditation training increases the self-awareness, self-regulation, and prosociality of those receiving the training, which ameliorates the quality of interpersonal interactions and the socioemotional support provided to other individuals. Finally, better socioemotional support increases the support receiver’s ability to regulate their emotions. Using a multiphasic longitudinal design involving 640 participants randomized into 480 dyads, the proposed protocol aims to validate the tri-process model and to investigate its mechanisms of action. The proposed study has important theoretical and social implications and will allow devising new and more effective interpersonal mindfulness programs with applications in multiple fields.
Understanding the normative leadership of the World Health Organization (WHO): a mixed-method approach
Miriam Cohen
Jean-Louis Denis
Pierre Larouche
Marie-Andree Girard
Medical SAM Adapter: Adapting Segment Anything Model for Medical Image Segmentation
Junde Wu
Rao Fu
Huihui Fang
Yuanpei Liu
Zhao-Yang Wang
Yanwu Xu
Yueming Jin
The Segment Anything Model (SAM) has recently gained popularity in the field of image segmentation due to its impressive capabilities in various segmentation tasks and its prompt-based interface. However, recent studies and individual experiments have shown that SAM underperforms in medical image segmentation, owing to a lack of medical-specific knowledge. This raises the question of how to enhance SAM's segmentation capability for medical images. In this paper, instead of fine-tuning the SAM model, we propose the Medical SAM Adapter (Med-SA), which incorporates domain-specific medical knowledge into the segmentation model using a light yet effective adaptation technique. In Med-SA, we propose Space-Depth Transpose (SD-Trans) to adapt 2D SAM to 3D medical images and Hyper-Prompting Adapter (HyP-Adpt) to achieve prompt-conditioned adaptation. We conduct comprehensive evaluation experiments on 17 medical image segmentation tasks across various image modalities. Med-SA outperforms several state-of-the-art (SOTA) medical image segmentation methods, while updating only 2% of the parameters. Our code is released at https://github.com/KidsWithTokens/Medical-SAM-Adapter.
Ranking code clones to support maintenance activities
Osama Ehsan
Ying Zou
Dong Qiu
Rhythmic Information Sampling in the Brain during Visual Recognition
Laurent Caplette
Karim Jerbi CoCo Lab
Frédéric Gosselin
Towards Compute-Optimal Transfer Learning
Massimo Caccia
Alexandre Galashov
Arthur Douillard
Amal Rannen-Triki
Dushyant Rao
Michela Paganini
Marc'aurelio Ranzato
When Do Graph Neural Networks Help with Node Classification? Investigating the Impact of Homophily Principle on Node Distinguishability
Qincheng Lu
Jiaqi Zhu
Xiao-Wen Chang
Jure Leskovec
The homophily principle, i.e., that nodes with the same labels are more likely to be connected, has been believed to be the main reason for the performance superiority of Graph Neural Networks (GNNs) over Neural Networks on node classification tasks. Recent research suggests that, even in the absence of homophily, the advantage of GNNs still exists as long as nodes from the same class share similar neighborhood patterns. However, this argument only considers intra-class Node Distinguishability (ND) but neglects inter-class ND, which provides an incomplete understanding of the effect of homophily on GNNs. In this paper, we first demonstrate this deficiency with examples and argue that an ideal situation for ND is to have smaller intra-class ND than inter-class ND. To formulate this idea and study ND in depth, we propose the Contextual Stochastic Block Model for Homophily (CSBM-H) and define two metrics, Probabilistic Bayes Error (PBE) and negative generalized Jeffreys divergence, to quantify ND. With these metrics, we visualize and analyze how graph filters, node degree distributions and class variances influence ND, and investigate the combined effect of intra- and inter-class ND. We also discovered the mid-homophily pitfall, which occurs widely in graph datasets. Furthermore, we verified that, in real-world tasks, the superiority of GNNs is indeed closely related to both intra- and inter-class ND regardless of homophily levels. Grounded in this observation, we propose a new hypothesis-testing based performance metric beyond homophily, which is non-linear, feature-based and can provide a statistical threshold value for the superiority of GNNs. Experiments indicate that it is significantly more effective than the existing homophily metrics at revealing the advantages and disadvantages of graph-aware models on both synthetic and benchmark real-world datasets.