Publications

Memory-Aware Functional IR for Higher-Level Synthesis of Accelerators
Christof Schlaak
Tzung-Han Juang
Specialized accelerators deliver orders of magnitude higher performance than general-purpose processors. The ever-changing nature of modern workloads is pushing the adoption of Field Programmable Gate Arrays (FPGAs) as the substrate of choice. However, FPGAs are hard to program directly using Hardware Description Languages (HDLs). Even modern high-level HDLs, e.g., Spatial and Chisel, still require hardware expertise. This article adopts functional programming concepts to provide a hardware-agnostic, higher-level programming abstraction. During synthesis, these abstractions are mechanically lowered into a functional Intermediate Representation (IR) that defines a specific hardware design point. This novel IR expresses different forms of parallelism and standard memory features such as asynchronous off-chip memories or synchronous on-chip buffers. Exposing such features at the IR level is essential for achieving high performance. The viability of this approach is demonstrated on two stencil computations and by exploring the optimization space of matrix-matrix multiplication. Starting from a high-level representation of these algorithms, our compiler automatically produces low-level VHSIC Hardware Description Language (VHDL) code. Several design points are evaluated on an Intel Arria 10 FPGA, demonstrating the ability of the IR to exploit different hardware features. This article also shows that the designs produced are competitive with highly tuned OpenCL implementations and outperform hardware-agnostic OpenCL code.
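To give a concrete flavour of the programming style this abstraction implies, the Python sketch below writes matrix-matrix multiplication as nested map/reduce-style primitives and notes, in comments, where memory-placement annotations would attach. The placement names (host_to_offchip, offchip_to_onchip, par_map) are invented for illustration and are not the paper's actual IR constructs.

```python
# Hypothetical sketch (not the paper's actual IR): matrix-matrix multiply
# expressed as nested functional primitives, with illustrative comments
# marking where data placement (off-chip DRAM vs. on-chip buffers) would go.

def dot(row, col):
    # reduce(+) over an element-wise multiply of two vectors
    return sum(a * b for a, b in zip(row, col))

def matmul(A, B):
    # map over rows of A; for each row, map over columns of B
    B_cols = list(zip(*B))
    return [[dot(row, col) for col in B_cols] for row in A]

# In a memory-aware IR, the same computation would additionally carry
# placement decisions, e.g. (names are made up for illustration):
#   A_off  = host_to_offchip(A)                         # asynchronous off-chip memory
#   A_tile = offchip_to_onchip(A_off)                   # synchronous on-chip buffer
#   C      = par_map(lambda r: [dot(r, c) for c in B_cols], A_tile)  # spatial parallelism
# so that lowering to VHDL can generate the corresponding memory controllers.

if __name__ == "__main__":
    A = [[1, 2], [3, 4]]
    B = [[5, 6], [7, 8]]
    print(matmul(A, B))  # [[19, 22], [43, 50]]
```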
Fortuitous Forgetting in Connectionist Networks
Hattie Zhou
Ankit Vani
Forgetting is often seen as an unwanted characteristic in both human and machine learning. However, we propose that forgetting can in fact be favorable to learning. We introduce "forget-and-relearn" as a powerful paradigm for shaping the learning trajectories of artificial neural networks. In this process, the forgetting step selectively removes undesirable information from the model, and the relearning step reinforces features that are consistently useful under different conditions. The forget-and-relearn framework unifies many existing iterative training algorithms in the image classification and language emergence literature, and allows us to understand the success of these algorithms in terms of the disproportionate forgetting of undesirable information. We leverage this understanding to improve upon existing algorithms by designing more targeted forgetting operations. Insights from our analysis provide a coherent view on the dynamics of iterative training in neural networks and offer a clear path towards performance improvements.
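The overall shape of the forget-and-relearn loop can be sketched in a few lines of Python. The sketch below assumes a PyTorch classifier whose final linear layer is exposed as model.head, and it uses head re-initialisation as a stand-in forgetting step; the paper's forgetting operations are more targeted than this.

```python
# Minimal forget-and-relearn sketch, assuming a generic PyTorch classifier.
# Resetting the classifier head is just one illustrative forgetting operation.
import torch
import torch.nn as nn

def train(model, loader, epochs, lr=1e-3):
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        for x, y in loader:
            opt.zero_grad()
            loss_fn(model(x), y).backward()
            opt.step()

def forget(model):
    # Forgetting step: selectively remove information, here by resetting the
    # final linear layer while keeping earlier-layer features intact.
    model.head.reset_parameters()

def forget_and_relearn(model, loader, cycles=3, epochs_per_cycle=5):
    train(model, loader, epochs_per_cycle)           # initial training
    for _ in range(cycles):
        forget(model)                                # remove undesirable information
        train(model, loader, epochs_per_cycle)       # relearn; consistently useful features are reinforced
    return model
```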
Medical Doctors in Health Reforms
Jean-Louis Denis
Sabrina Germain
Gianluca Veronesi
Health and legal experts from England and Canada consider the influence of medical doctors on reforms in this comparative study. With reflections on participation since the inception of publicly funded healthcare systems, they show how the status of doctors affects change.
New Insights on Reducing Abrupt Representation Change in Online Continual Learning
Lucas Caccia
Rahaf Aljundi
Nader Asadi
Tinne Tuytelaars
In the online continual learning paradigm, agents must learn from a changing distribution while respecting memory and compute constraints. Experience Replay (ER), where a small subset of past data is stored and replayed alongside new data, has emerged as a simple and effective learning strategy. In this work, we focus on the change in representations of observed data that arises when previously unobserved classes appear in the incoming data stream, and new classes must be distinguished from previous ones. We shed new light on this question by showing that applying ER causes the newly added classes' representations to overlap significantly with the previous classes, leading to highly disruptive parameter updates. Based on this empirical analysis, we propose a new method which mitigates this issue by shielding the learned representations from drastic adaptation to accommodate new classes. We show that using an asymmetric update rule pushes new classes to adapt to the older ones (rather than the reverse), which is more effective especially at task boundaries, where much of the forgetting typically occurs. Empirical results show significant gains over strong baselines on standard continual learning benchmarks.
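A rough sketch of Experience Replay with an asymmetric update rule is shown below, assuming a PyTorch classifier and a replay buffer object with add/sample methods. The specific asymmetry used here (restricting the incoming-data loss to the new classes while replayed data uses the full loss) is an illustrative approximation of the idea, not the paper's exact method.

```python
# Sketch of an ER step with an asymmetric loss; buffer, new_classes, and the
# masking scheme are assumptions made for this example.
import torch
import torch.nn.functional as F

def er_step(model, opt, x_new, y_new, buffer, new_classes, batch_size=32):
    opt.zero_grad()

    # Incoming data: mask out old-class logits so the update adapts the new
    # classes to the existing representation rather than the reverse.
    logits_new = model(x_new)
    mask = torch.full_like(logits_new, float("-inf"))
    mask[:, new_classes] = 0.0
    loss = F.cross_entropy(logits_new + mask, y_new)

    # Replayed data: standard cross-entropy over all classes.
    if len(buffer) >= batch_size:
        x_old, y_old = buffer.sample(batch_size)
        loss = loss + F.cross_entropy(model(x_old), y_old)

    loss.backward()
    opt.step()
    buffer.add(x_new, y_new)   # e.g. a reservoir-sampling buffer, assumed to exist
```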
R5: Rule Discovery with Reinforced and Recurrent Relational Reasoning
Shengyao Lu
Keith G Mills
Shangling Jui
Di Niu
Systematicity, i.e., the ability to recombine known parts and rules to form new sequences while reasoning over relational data, is critical to machine intelligence. A model with strong systematicity is able to train on small-scale tasks and generalize to large-scale tasks. In this paper, we propose R5, a relational reasoning framework based on reinforcement learning that reasons over relational graph data and explicitly mines underlying compositional logical rules from observations. R5 has strong systematicity and is robust to noisy data. It consists of a policy value network equipped with Monte Carlo Tree Search to perform recurrent relational prediction and a backtrack rewriting mechanism for rule mining. By alternately applying the two components, R5 progressively learns a set of explicit rules from data and performs explainable and generalizable relation prediction. We conduct extensive evaluations on multiple datasets. Experimental results show that R5 outperforms various embedding-based and rule induction baselines on relation prediction tasks while achieving a high recall rate in discovering ground truth rules.
Lacking social support is associated with structural divergences in hippocampus–default network co-variation patterns
Chris Zajner
Nathan Spreng
Multilevel development of cognitive abilities in an artificial neural network
Konstantin Volzhenin
Jean-Pierre Changeux
Several neuronal mechanisms have been proposed to account for the formation of cognitive abilities through postnatal interactions with the physical and socio-cultural environment. Here, we introduce a three-level computational model of information processing and acquisition of cognitive abilities. We propose minimal architectural requirements to build these levels and show how the parameters affect their performance and relationships. The first, sensorimotor, level handles local nonconscious processing, here during a visual classification task. The second, cognitive, level globally integrates the information from multiple local processors via long-ranged connections and synthesizes it in a global but still nonconscious manner. The third and cognitively highest level handles the information globally and consciously. It is based on the Global Neuronal Workspace (GNW) theory and is referred to as the conscious level. We use trace and delay conditioning tasks to challenge the second and third levels, respectively. Results first highlight the necessity of epigenesis through selection and stabilization of synapses at both local and global scales to allow the network to solve the first two tasks. At the global scale, dopamine appears necessary to properly provide credit assignment despite the temporal delay between perception and reward. At the third level, the presence of interneurons becomes necessary to maintain a self-sustained representation within the GNW in the absence of sensory input. While balanced spontaneous intrinsic activity facilitates epigenesis at both local and global scales, the balanced excitatory-inhibitory ratio increases performance. Finally, we discuss the plausibility of the model in both neurodevelopmental and artificial intelligence terms.
Neural correlates of local parallelism during naturalistic vision
John Wilder
Morteza Rezanejad
Sven Dickinson
Allan Jepson
Dirk B. Walther
Human observers can rapidly perceive complex real-world scenes. Grouping visual elements into meaningful units is an integral part of this process. Yet, so far, the neural underpinnings of perceptual grouping have only been studied with simple lab stimuli. We here uncover the neural mechanisms of one important perceptual grouping cue, local parallelism. Using a new, image-computable algorithm for detecting local symmetry in line drawings and photographs, we manipulated the local parallelism content of real-world scenes. We decoded scene categories from patterns of brain activity obtained via functional magnetic resonance imaging (fMRI) in 38 human observers while they viewed the manipulated scenes. Decoding was significantly more accurate for scenes containing strong local parallelism compared to weak local parallelism in the parahippocampal place area (PPA), indicating a central role of parallelism in scene perception. To investigate the origin of the parallelism signal, we performed a model-based fMRI analysis of the public BOLD5000 dataset, looking for voxels whose activation time course matches that of the locally parallel content of the 4916 photographs viewed by the participants in the experiment. We found a strong relationship with average local symmetry in visual areas V1-4, PPA, and retrosplenial cortex (RSC). Notably, the parallelism-related signal peaked first in V4, suggesting V4 as the site for extracting parallelism from the visual input. We conclude that local parallelism is a perceptual grouping cue that influences neuronal activity throughout the visual hierarchy, presumably starting at V4. Parallelism plays a key role in the representation of scene categories in PPA.
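As an illustration of the decoding methodology, the sketch below shows cross-validated scene-category classification from voxel patterns using scikit-learn; the arrays, classifier choice, and cross-validation scheme are assumptions made for this example, not the study's actual analysis pipeline.

```python
# Hypothetical multi-voxel pattern decoding sketch; inputs and classifier are
# illustrative assumptions, not the study's pipeline.
from sklearn.model_selection import cross_val_score
from sklearn.svm import LinearSVC

def decoding_accuracy(voxel_patterns, scene_categories, folds=5):
    """voxel_patterns: (n_trials, n_voxels) array of region responses (e.g., PPA)
       scene_categories: (n_trials,) array of category labels"""
    clf = LinearSVC(C=1.0, max_iter=10000)
    scores = cross_val_score(clf, voxel_patterns, scene_categories, cv=folds)
    return scores.mean()

# Comparing decoding for strong vs. weak local parallelism scenes
# (X_strong, y_strong, X_weak, y_weak assumed to be pre-extracted):
# acc_strong = decoding_accuracy(X_strong, y_strong)
# acc_weak   = decoding_accuracy(X_weak, y_weak)
```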
Digital Ageism: Challenges and Opportunities in Artificial Intelligence for Older Adults
Charlene H Chu
Rune Nyrup
Kathleen Leslie
Jiamin Shi
Andria Bianchi
Alexandra Lyn
Molly McNicholl
Shehroz S Khan
A. Grenier
Artificial intelligence (AI) and machine learning are changing our world through their impact on sectors including health care, education, employment, finance, and law. AI systems are developed using data that reflect the implicit and explicit biases of society, and there are significant concerns about how the predictive models in AI systems amplify inequity, privilege, and power in society. The widespread applications of AI have led to mainstream discourse about how AI systems are perpetuating racism, sexism, and classism; yet, concerns about ageism have been largely absent in the AI bias literature. Given the globally aging population and proliferation of AI, there is a need to critically examine the presence of age-related bias in AI systems. This forum article discusses ageism in AI systems and introduces a conceptual model that outlines intersecting pathways of technology development that can produce and reinforce digital ageism in AI systems. We also describe the broader ethical and legal implications and considerations for future directions in digital ageism research to advance knowledge in the field and deepen our understanding of how ageism in AI is fostered by broader cycles of injustice.