Directed Scattering for Knowledge Graph-Based Cellular Signaling Analysis
Aarthi Venkat
Joyce Chew
Ferran Cardoso Rodriguez
Christopher J. Tape
Michael Perlmutter
Directed graphs are a natural model for many phenomena, in particular scientific knowledge graphs such as molecular interaction or chemical reaction networks that define cellular signaling relationships. In these situations, source nodes typically have distinct biophysical properties from sinks. Due to their ordered and unidirectional relationships, many such networks also have hierarchical and multiscale structure. However, the majority of methods performing node- and edge-level tasks in machine learning do not take these properties into account, and thus have not been leveraged effectively for scientific tasks such as cellular signaling network inference. We propose a new framework called Directed Scattering Autoencoder (DSAE), which uses a directed version of a geometric scattering transform, combined with the non-linear dimensionality reduction properties of an autoencoder and the geometric properties of hyperbolic space, to learn latent hierarchies. We show that this method outperforms numerous others on tasks such as embedding directed graphs and learning cellular signaling networks.
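To make the scattering component concrete, below is a minimal sketch (Python/NumPy) of first-order directed geometric scattering features built from a lazy, out-degree-normalized random walk. The operator choice, dyadic scales, and everything downstream (the autoencoder and hyperbolic embedding of DSAE) are illustrative assumptions, not the paper's exact construction.

```python
import numpy as np

def directed_scattering_features(A: np.ndarray, x: np.ndarray, num_scales: int = 4) -> np.ndarray:
    """Hedged sketch of first-order directed geometric scattering.

    A: (n, n) adjacency matrix of a directed graph (A[i, j] != 0 means edge i -> j).
    x: (n, f) node signal matrix.
    Returns per-node features of shape (n, f * (num_scales + 1)).
    """
    A = A.astype(float)
    deg = A.sum(axis=1, keepdims=True)
    # Out-degree-normalized walk; rows with no out-edges stay zero (lazy self-loop only).
    P = np.divide(A, deg, out=np.zeros_like(A), where=deg > 0)
    M = 0.5 * (np.eye(len(A)) + P)  # lazy walk preserves edge directionality

    feats = [x.copy()]
    Mpow = M.copy()        # M^(2^0)
    prev = Mpow @ x        # M^1 x
    for _ in range(num_scales):
        Mpow = Mpow @ Mpow            # square the operator: M^(2^j)
        curr = Mpow @ x
        feats.append(np.abs(prev - curr))  # |Psi_j x|, the scattering nonlinearity
        prev = curr
    return np.concatenate(feats, axis=1)

# Toy usage: a 3-node directed chain with a random scalar signal.
A = np.array([[0, 1, 0], [0, 0, 1], [0, 0, 0]])
print(directed_scattering_features(A, np.random.rand(3, 1)).shape)  # (3, 5)
```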
Focal Modulation Networks for Interpretable Sound Classification
Luca Della Libera
The increasing success of deep neural networks has raised concerns about their inherent black-box nature, posing challenges related to interpretability and trust. While there has been extensive exploration of interpretation techniques in vision and language, interpretability in the audio domain has received limited attention, primarily focusing on post-hoc explanations. This paper addresses the problem of interpretability by design in the audio domain by utilizing the recently proposed attention-free focal modulation networks (FocalNets). We apply FocalNets to the task of environmental sound classification for the first time and evaluate their interpretability properties on the popular ESC-50 dataset. Our method outperforms a similarly sized vision transformer in terms of both accuracy and interpretability. Furthermore, it is competitive with PIQ, a method specifically designed for post-hoc interpretation in the audio domain.
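As a rough illustration of the attention-free mechanism FocalNets rely on, here is a simplified focal modulation layer for 1D (audio) feature sequences in PyTorch. It follows the general recipe (query projection, hierarchical depth-wise context aggregation, gating, element-wise modulation), but kernel sizes, level counts, and normalization details are assumptions that differ from the official implementation.

```python
import torch
import torch.nn as nn

class FocalModulation(nn.Module):
    """Simplified focal modulation layer (sketch, not the official FocalNets code)."""

    def __init__(self, dim: int, focal_levels: int = 2, kernel_size: int = 3):
        super().__init__()
        self.focal_levels = focal_levels
        # One projection yields the query, the context, and per-level gates.
        self.proj_in = nn.Linear(dim, 2 * dim + focal_levels + 1)
        # Stacked depth-wise convolutions grow the context window per level.
        self.layers = nn.ModuleList(
            nn.Sequential(
                nn.Conv1d(dim, dim, kernel_size, padding=kernel_size // 2, groups=dim),
                nn.GELU(),
            )
            for _ in range(focal_levels)
        )
        self.h = nn.Conv1d(dim, dim, 1)  # builds the modulator
        self.proj_out = nn.Linear(dim, dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, time, dim)
        dim = x.shape[-1]
        q, ctx, gates = torch.split(self.proj_in(x), [dim, dim, self.focal_levels + 1], dim=-1)
        ctx = ctx.transpose(1, 2)      # (batch, dim, time)
        gates = gates.transpose(1, 2)  # (batch, levels + 1, time)
        ctx_all = 0.0
        for level, layer in enumerate(self.layers):
            ctx = layer(ctx)  # progressively larger receptive field
            ctx_all = ctx_all + ctx * gates[:, level:level + 1]
        # Global average context acts as the final focal level.
        ctx_all = ctx_all + ctx.mean(dim=2, keepdim=True) * gates[:, self.focal_levels:]
        modulator = self.h(ctx_all).transpose(1, 2)  # (batch, time, dim)
        return self.proj_out(q * modulator)          # attention-free token mixing

# Toy usage on a batch of 100-frame, 64-dim audio features.
print(FocalModulation(64)(torch.randn(2, 100, 64)).shape)  # torch.Size([2, 100, 64])
```

The gate maps are what give the interpretability-by-design property discussed in the paper: they indicate which time regions each context level emphasizes.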
PathOCL: Path-Based Prompt Augmentation for OCL Generation with GPT-4
Seif Abukhalaf
Mohammad Hamdaqa
The rapid progress of AI-powered programming assistants, such as GitHub Copilot, has facilitated the development of software applications. These assistants rely on large language models (LLMs), which are foundation models (FMs) that support a wide range of tasks related to understanding and generating language. LLMs have demonstrated their ability to express UML model specifications using formal languages like the Object Constraint Language (OCL). However, the context size of the prompt is limited by the number of tokens an LLM can process. This limitation becomes significant as the size of UML class models increases. In this study, we introduce PathOCL, a novel path-based prompt augmentation technique designed to facilitate OCL generation. PathOCL addresses the limitations of LLMs, specifically their token-processing limit and the challenges posed by large UML class models. PathOCL is based on the concept of chunking, which selectively augments the prompts with a subset of UML classes relevant to the English specification. Our findings demonstrate that PathOCL, compared to augmenting the complete UML class model (UML-Augmentation), generates a higher number of valid and correct OCL constraints using the GPT-4 model. Moreover, the average prompt size crafted using PathOCL decreases significantly when scaling the size of the UML class models.
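A minimal sketch of the chunking idea: score UML classes against the English specification and augment the prompt with only the top-scoring subset. The lexical-overlap scorer and function names below are hypothetical stand-ins; PathOCL's actual selection operates over paths of associated classes in the UML class graph.

```python
import re

def select_relevant_classes(spec: str, uml_classes: dict, top_k: int = 3) -> list:
    """Rank UML classes by lexical overlap with the specification (illustrative only)."""
    spec_tokens = set(re.findall(r"[a-z]+", spec.lower()))

    def overlap(name: str) -> int:
        text = name + " " + uml_classes[name]
        return len(spec_tokens & set(re.findall(r"[a-z]+", text.lower())))

    return sorted(uml_classes, key=overlap, reverse=True)[:top_k]

def build_prompt(spec: str, uml_classes: dict, top_k: int = 3) -> str:
    """Augment the prompt with only the selected chunk of the class model."""
    chunk = select_relevant_classes(spec, uml_classes, top_k)
    context = "\n".join(f"class {name}: {uml_classes[name]}" for name in chunk)
    return (
        "Write an OCL constraint for the specification below.\n"
        f"Relevant UML classes:\n{context}\n"
        f"Specification: {spec}\nOCL:"
    )

# Toy usage: only Loan and Book reach the prompt; Publisher is pruned.
model = {
    "Book": "attributes: title, copies; association: Loan",
    "Loan": "attributes: dueDate; associations: Book, Member",
    "Publisher": "attributes: name, address",
}
print(build_prompt("a loan's due date must be in the future", model, top_k=2))
```

Keeping the prompt to a relevant chunk is what lets the approach scale to class models that would otherwise exceed the LLM's token limit.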
Resource-Efficient Separation Transformer
Luca Della Libera
Samuele Cornell
Frédéric Lepoutre
François Grondin
Transformers have recently achieved state-of-the-art performance in speech separation. These models, however, are computationally demanding and require many learnable parameters. This paper explores Transformer-based speech separation with a reduced computational cost. Our main contribution is the development of the Resource-Efficient Separation Transformer (RE-SepFormer), a self-attention-based architecture that reduces the computational burden in two ways. First, it uses non-overlapping blocks in the latent space. Second, it operates on compact latent summaries calculated from each chunk. The RE-SepFormer achieves competitive performance on the popular WSJ0-2Mix and WHAM! datasets in both causal and non-causal settings. Remarkably, it scales significantly better than previous Transformer-based architectures in terms of memory and inference time, making it more suitable for processing long mixtures.
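The two cost-saving ideas translate naturally into code. The PyTorch sketch below processes non-overlapping chunks locally, attends over one mean-pooled summary per chunk for global context, and broadcasts the result back. It illustrates the mechanism under those assumptions; it is not the RE-SepFormer architecture itself (which uses a dual-path design with further details).

```python
import torch
import torch.nn as nn

class SummaryAttention(nn.Module):
    """Sketch: local attention within non-overlapping chunks + global attention
    over compact per-chunk summaries (one vector per chunk)."""

    def __init__(self, dim: int, chunk_size: int, num_heads: int = 4):
        super().__init__()
        self.chunk_size = chunk_size
        self.intra = nn.TransformerEncoderLayer(dim, num_heads, batch_first=True)
        self.inter = nn.TransformerEncoderLayer(dim, num_heads, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, time, dim); time must be a multiple of chunk_size here.
        b, t, d = x.shape
        chunks = x.reshape(b, t // self.chunk_size, self.chunk_size, d)
        # Local modelling inside each non-overlapping chunk.
        local = self.intra(chunks.reshape(-1, self.chunk_size, d))
        local = local.reshape(b, -1, self.chunk_size, d)
        # Global modelling over one summary per chunk: attention cost drops
        # from O(t^2) to O((t / chunk_size)^2).
        summaries = self.inter(local.mean(dim=2))        # (b, n_chunks, d)
        # Broadcast global context back to every frame of its chunk.
        return (local + summaries.unsqueeze(2)).reshape(b, t, d)

# Toy usage: 2 mixtures, 128 latent frames, 64-dim features, chunks of 16.
print(SummaryAttention(64, chunk_size=16)(torch.randn(2, 128, 64)).shape)
```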
SKILL: Similarity-aware Knowledge distILLation for Speech Self-Supervised Learning
Luca Zampierin
Ghouthi Boukli Hacene
Bac Nguyen
Towards Practical Tool Usage for Continually Learning LLMs
Jerry Huang
Prasanna Parthasarathi
Mehdi Rezagholizadeh
Large language models (LLMs) show an innate skill for solving language-based tasks. However, insights suggest an inability to adjust to information or task-solving skills becoming outdated, as their knowledge, stored directly within their parameters, remains static in time. Tool use helps by offloading work to systems that the LLM can access through an interface, but LLMs that use tools must still adapt to nonstationary environments for prolonged use, as new tools can emerge and existing tools can change. Nevertheless, tools require less specialized knowledge; we therefore hypothesize that they are better suited for continual learning (CL), as they rely less on parametric memory for solving tasks and instead focus on learning when to apply pre-defined tools. To verify this, we develop a synthetic benchmark and then aggregate existing NLP tasks to form a more realistic testing scenario. While we demonstrate that scaling model size is not a solution regardless of tool usage, continual learning techniques can enable tool-using LLMs to adapt faster while forgetting less, highlighting their potential as continual learners.
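A toy illustration of why tool use suits nonstationary settings: if the model only learns a routing policy over pre-defined tools, the tool backends can change underneath it without invalidating that policy. The tool names and tasks below are invented for illustration and are not from the paper's benchmark.

```python
from datetime import date

# Swappable tool backends: updating an implementation (a nonstationary
# environment) does not require relearning the routing behaviour below.
TOOLS = {
    "exchange_rate": lambda cur: {"USD": 1.0, "EUR": 1.09}[cur],
    "today": lambda: date.today().isoformat(),
}

def answer(task: str) -> str:
    """Stand-in for a learned tool-selection policy: route by task type."""
    if "rate" in task:
        return str(TOOLS["exchange_rate"](task.split()[-1]))
    if "date" in task:
        return TOOLS["today"]()
    return "no tool matches"  # would fall back to parametric knowledge

print(answer("what is the exchange rate for EUR"))  # stays correct even if
# the 'exchange_rate' backend is replaced, with no change to the policy.
```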
Why People Contribute Software Documentation
Deeksha M. Arya
Martin P. Robillard
Software technologies are used by a large population of programmers with diverse backgrounds. To fulfill their need for information, enthusiasts contribute numerous learning resources that vary in style and content, and that act as documentation for the corresponding technology. We interviewed 26 volunteer contributors to understand why they create such documentation. We surface five motivations our informants had for contributing documentation, including to overcome issues they had faced with documentation and to capture their own learning. Among other findings, our observations suggest that the unique experience and background of documentation contributors provide the opportunity to create documentation that caters to users whose information needs and preferences are similar to those of the contributor.
Assessing Numerical Analysis Performance with the Practi Mobile App
Kristin Garn
Raymond J. Spiteri
Model-independent Approach of the JUNO 8B Solar Neutrino Program
Jun Zhao
B. Yue
Haoqi Lu
Yufeng Li
J. Ling
Zeyuan Yu
Angel Abusleme
Thomas Adam
Shakeel Ahmad
Rizwan Ahmed
Sebastiano Aiello
Muhammad Akram
Abid Aleem
Tsagkarakis Alexandros
Fengpeng An
Q. An
Giuseppe Andronico
Nikolay Anfimov
Vito Antonelli
Tatiana Antoshkina
Burin Asavapibhop
J. André
Didier Auguste
Weidong Bai
Nikita Balashov
Wander Baldini
Andrea Barresi
Davide Basilico
Eric Baussan
Marco Bellato
Antonio Bergnoli
Thilo Birkenfeld
Sylvie Blin
D. Blum
Simon Blyth
Anastasia Bolshakova
Mathieu Bongrand
Clément Bordereau
Dominique Breton
Augusto Brigatti
Riccardo Brugnera
Riccardo Bruno
Antonio Budano
Jose Busto
I. Butorov
Anatael Cabrera
Barbara Caccianiga
Hao Cai
Xiao Cai
Yanke Cai
Z. Cai
Riccardo Callegari
Antonio Cammi
Agustin Campeny
C. Cao
Guofu Cao
Jun Cao
Rossella Caruso
C. Cerna
Chi Chan
Jinfan Chang
Yun Chang
Guoming Chen
Pingping Chen
Po-An Chen
Shaomin Chen
Xurong Chen
Yixue Chen
Yu Chen
Zhiyuan Chen
Zikang Chen
Jie Cheng
Yaping Cheng
Alexander Chepurnov
Alexey Chetverikov
Davide Chiesa
Pietro Chimenti
Artem Chukanov
Gérard Claverie
Catia Clementi
Barbara Clerbaux
Marta Colomer Molla
Selma Conforti Di Lorenzo
Daniele Corti
Flavio Dal Corso
Olivia Dalager
C. Taille
Z. Y. Deng
Ziyan Deng
Wilfried Depnering
Marco Diaz
Xuefeng Ding
Yayun Ding
Bayu Dirgantara
Sergey Dmitrievsky
Tadeas Dohnal
Dmitry Dolzhikov
Georgy Donchenko
Jianmeng Dong
Evgeny Doroshkevich
Marcos Dracos
Frédéric Druillole
Ran Du
S. X. Du
Stefano Dusini
Martin Dvorak
Timo Enqvist
H. Enzmann
Andrea Fabbri
D. Fan
Lei Fan
Jian Fang
Wen Fang
Marco Fargetta
Dmitry Fedoseev
Zheng-hao Fei
Li-Cheng Feng
Qichun Feng
R. Ford
Amélie Fournier
H. Gan
Feng Gao
Alberto Garfagnini
Arsenii Gavrikov
Marco Giammarchi
Nunzio Giudice
Maxim Gonchar
G. Gong
Hui Gong
Yuri Gornushkin
A. Gottel
Marco Grassi
Maxim Gromov
Vasily Gromov
M. H. Gu
X. Gu
Yunting Gu
M. Guan
Yuduo Guan
Nunzio Guardone
Cong Guo
Jingyuan Guo
Wanlei Guo
Xinheng Guo
Yuhang Guo
Paul Hackspacher
Caren Hagner
Ran Han
Yang Han
Miao He
W. He
Tobias Heinz
Patrick Hellmuth
Yue-kun Heng
Rafael Herrera
Yuenkeung Hor
Shaojing Hou
Yee Hsiung
Bei-Zhen Hu
Hang Hu
Jianrun Hu
Jun Hu
Shouyang Hu
Tao Hu
Yuxiang Hu
Zhuojun Hu
Guihong Huang
Hanxiong Huang
Kaixuan Huang
Wenhao Huang
Xinglong Huang
X. T. Huang
Yongbo Huang
Jiaqi Hui
L. Huo
Wenju Huo
Cédric Huss
Safeer Hussain
Ara Ioannisian
Roberto Isocrate
Beatrice Jelmini
Ignacio Jeria
Xiaolu Ji
Huihui Jia
Junji Jia
Siyu Jian
Di Jiang
Wei Jiang
Xiaoshan Jiang
X. Jing
Cécile Jollet
L. Kalousis
Philipp Kampmann
Li Kang
Rebin Karaparambil
Narine Kazarian
Amina Khatun
Khanchai Khosonthongkee
Denis Korablev
K. Kouzakov
Alexey Krasnoperov
Nikolay Kutovskiy
Pasi Kuusiniemi
Tobias Lachenmaier
Cecilia Landini
Sébastien Leblanc
Victor Lebrin
F. Lefèvre
R. Lei
Rupert Leitner
Jason Leung
Daozheng Li
Demin Li
Fei Li
Fule Li
Gaosong Li
Huiling Li
Mengzhao Li
Min Li
Nan Li
Qingjiang Li
Ruhui Li
Ruiting Lei
Shanfeng Li
Tao Li
Teng Li
Weidong Li
Wei-guo Li
Xiaomei Li
Xiaonan Li
Xinglong Li
Yi Li
Yichen Li
Zepeng Li
Zhaohan Li
Zhibing Li
Ziyuan Li
Zonghui Li
Hao Liang
Jiaming Yan
Ayut Limphirat
Gen Lin
Shengxin Lin
Tao Lin
Ivano Lippi
Yang Liu
Haidong Liu
H. Liu
Hongbang Liu
Hongjuan Liu
Hongtao Liu
Hui Liu
Jianglai Liu
Jinchang Liu
Min Liu
Qian Liu
Q. Liu
Runxuan Liu
Shubin Liu
Shulin Liu
Xiaowei Liu
Xiwen Liu
Yong Liu
Yunzhe Liu
Alexey Lokhov
Paolo Lombardi
Claudio Lombardo
K. Loo
Chuan Lu
Jingbin Lu
Junguang Lu
Shuxiang Lu
Bayarto Lubsandorzhiev
Sultim Lubsandorzhiev
Livia Ludhova
Arslan Lukanov
Daibin Luo
F. Luo
Guang Luo
Shu Luo
Wu Luo
Xiaojie Luo
Vladimir Lyashuk
B. Ma
Bing Ma
R. Q. Ma
Si Ma
Xiaoyan Ma
Xubo Ma
Jihane Maalmi
Jingyu Mai
Yury Malyshkin
Roberto Carlos Mandujano
Fabio Mantovani
Francesco Manzali
Xin Mao
Yajun Mao
S. Mari
F. Marini
Cristina Martellini
Gisèle Martin-Chassard
Agnese Martini
Matthias Mayer
Davit Mayilyan
Ints Mednieks
Yu Meng
Anselmo Meregaglia
Emanuela Meroni
David J. Meyhofer
Mauro Mezzetto
Jonathan Andrew Miller
Lino Miramonti
Paolo Montini
Michele Montuschi
Axel Muller
M. Nastasi
D. Naumov
Elena Naumova
Diana Navas-Nicolas
Igor Nemchenok
Minh Thuan Nguyen Thi
Alexey Nikolaev
F. Ning
Zhe Ning
Hiroshi Nunokawa
Lothar Oberauer
Juan Pedro Ochoa-Ricoux
Alexander Olshevskiy
Domizia Orestano
Fausto Ortica
Rainer Othegraven
A. Paoloni
Sergio Parmeggiano
Y. P. Pei
Nicomede Pelliccia
Anguo Peng
Yu Peng
Yuefeng Peng
Z-R Peng
Frédéric Perrot
P. Petitjean
Fabrizio Petrucci
Oliver Pilarczyk
Luis Felipe Piñeres Rico
Artyom Popov
Pascal Poussot
Ezio Previtali
Fazhi Qi
M. Qi
Sen Qian
X. Qian
Zhen Qian
Hao-xue Qiao
Zhonghua Qin
S. Qiu
Gioacchino Ranucci
Neill Raper
A. Re
Henning Rebber
Abdel Rebii
Mariia Redchuk
Bin Ren
Jie Ren
Barbara Ricci
Mariam Rifai
Mathieu Roche
Narongkiat Rodphai
Aldo M. Romani
Bedřich Roskovec
X. Ruan
Arseniy Rybnikov
Andrey Sadovsky
Paolo Saggese
Simone Sanfilippo
Anut Sangka
Utane Sawangwit
Julia Sawatzki
Michaela Schever
Cédric Schwab
Konstantin Schweizer
Alexandr Selyunin
Andrea Serafini
Giulio Settanta
M. Settimo
Zhuang Shao
V. Sharov
Arina Shaydurova
Jingyan Shi
Yanan Shi
Vitaly Shutov
Andrey Sidorenkov
Fedor Šimkovic
Chiara Sirignano
Jaruchit Siripak
Monica Sisti
Maciej Slupecki
Mikhail Smirnov
Oleg Smirnov
Thiago Sogo-Bezerra
Sergey Sokolov
Julanan Songwadhana
Boonrucksar Soonthornthum
Albert Sotnikov
Ondřej Šrámek
Warintorn Sreethawong
A. Stahl
Luca Stanco
Konstantin Stankevich
Dušan Štefánik
Hans Steiger
Jochen Steinmann
Tobias Sterr
M. Stock
Virginia Strati
Alexander Studenikin
Jun Su
Shifeng Sun
Xilei Sun
Yongjie Sun
Yongzhao Sun
Zhengyang Sun
Narumon Suwonjandee
Michal Szelezniak
Qiang Tang
Quan Tang
Xiao Tang
Alexander Tietzsch
Igor Tkachev
Tomas Tmej
M. Torri
K. Treskov
Andrea Triossi
Giancarlo Troni
Wladyslaw Trzaska
Cristina Tuve
Nikita Ushakov
Vadim Vedin
Giuseppe Verde
Maxim Vialkov
Benoit Viaud
Cornelius Moritz Vollbrecht
C. Volpe
Katharina von Sturm
Vit Vorobel
Dmitriy Voronin
Lucia Votano
Pablo Walker
Caishen Wang
Chung-Hsiang Wang
En Wang
Guoli Wang
Jian Wang
Jun Wang
Lucinda W. Wang
Meifen Wang
Meng Wang
Ruiguang Wang
Siguang Wang
Wei Wang
Wenshuai Wang
Xi Wang
Xiangyue Wang
Yangfu Wang
Yaoguang Wang
Yi Xing Wang
Yifang Wang
Yuanqing Wang
Yuman Wang
Zhe Wang
Zheng Wang
Zhimin Wang
Zongyi Wang
Apimook Watcharangkool
Wei Wei
Wenlu Wei
Yadong Wei
K. Wen
Kaile Wen
Christopher Wiebusch
S. Wong
Bjoern Wonsak
Diru Wu
Qun Wu
Zhi Wu
Michael Wurm
Jacques Wurtz
Christian Wysotzki
Yufei Xi
D. Xia
Xiang Xiao
Xiaochuan Xie
Yu-guang Xie
Z. P. Xie
Zhao-Liang Xin
Z. Xing
Benda D. Xu
Chengze Xu
Donglian Xu
Fanrong Xu
The physics potential of detecting 8B solar neutrinos will be exploited at the Jiangmen Underground Neutrino Observatory (JUNO) in a model-independent manner by using three distinct channels: the charged-current (CC), neutral-current (NC), and elastic-scattering (ES) interactions. Due to the largest-ever mass of 13C nuclei in the liquid scintillator detector and the expected low background level, 8B solar neutrinos are observable in the CC and NC interactions on 13C for the first time. By virtue of optimized event selections and muon veto strategies, backgrounds from accidental coincidences, muon-induced isotopes, and external backgrounds can be greatly suppressed. Excellent signal-to-background ratios can be achieved in the CC, NC, and ES channels to guarantee the observation of the 8B solar neutrinos. From the sensitivity studies performed in this work, we show that JUNO, with 10 yr of data, can reach 1σ precision levels of 5%, 8%, and 20% for the 8B neutrino flux, sin²θ₁₂, and Δm²₂₁, respectively. Probing the details of both solar physics and neutrino physics in this way would be unique and helpful. In addition, when combined with the Sudbury Neutrino Observatory measurement, the world's best precision of 3% is expected for the measurement of the 8B neutrino flux.
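For reference, the three detection channels above are, schematically (reactions as in standard treatments of neutrino interactions on 13C; thresholds and cross sections omitted, with the NC channel tagged by the de-excitation gamma of the excited 13C):

```latex
\begin{align*}
\text{ES:}\quad & \nu_x + e^- \;\to\; \nu_x + e^- \\
\text{CC:}\quad & \nu_e + {}^{13}\mathrm{C} \;\to\; e^- + {}^{13}\mathrm{N} \\
\text{NC:}\quad & \nu_x + {}^{13}\mathrm{C} \;\to\; \nu_x + {}^{13}\mathrm{C}^{*},
\qquad {}^{13}\mathrm{C}^{*} \;\to\; {}^{13}\mathrm{C} + \gamma
\end{align*}
```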
Towards Causal Deep Learning for Vulnerability Detection
Md Mahbubur Rahman
Ira Ceka
Chengzhi Mao
Saikat Chakraborty
Baishakhi Ray
Wei Le
Deep learning vulnerability detection has shown promising results in recent years. However, an important challenge that still blocks it from being very useful in practice is that the model is not robust under perturbation and cannot generalize well over out-of-distribution (OOD) data, e.g., when a trained model is applied to unseen projects in the real world. We hypothesize that this is because the model learns non-robust features, e.g., variable names, that have spurious correlations with labels. When the perturbed and OOD datasets no longer have the same spurious features, the model prediction fails. To address this challenge, in this paper we introduce causality into deep learning vulnerability detection. Our approach, CausalVul, consists of two phases. First, we design novel perturbations to discover spurious features that the model may use to make predictions. Second, we apply causal learning algorithms, specifically do-calculus, on top of existing deep learning models to systematically remove the use of spurious features and thus promote causality-based prediction. Our results show that CausalVul consistently improved the accuracy, robustness, and OOD performance of all the state-of-the-art models and datasets we experimented with. To the best of our knowledge, this is the first work that introduces do-calculus-based causal learning to software engineering models and shows that it is indeed useful for improving model accuracy, robustness, and generalization. Our replication package is located at https://figshare.com/s/0ffda320dcb96c249ef2.
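As a sketch of the first phase, one natural spurious-feature probe is a semantics-preserving identifier renaming: if a detector's prediction flips when variable names change, the model is leaning on non-robust features. The code below is an illustrative toy under that assumption, not the paper's perturbation suite or its do-calculus training step.

```python
import random
import re

# Minimal keyword list so C keywords and type names survive renaming.
KEYWORDS = {"int", "char", "if", "return", "for", "while", "sizeof"}

def rename_variables(code: str, base: str = "var") -> str:
    """Consistently rename lowercase identifiers in a C-like snippet.
    A real implementation would use a parser instead of regexes."""
    idents = sorted(set(re.findall(r"\b[a-z_][a-z0-9_]*\b", code)) - KEYWORDS)
    mapping = {name: f"{base}{i}" for i, name in enumerate(idents)}
    random.shuffle(idents)  # order is irrelevant; shuffling stresses name reliance
    return re.sub(r"\b[a-z_][a-z0-9_]*\b",
                  lambda m: mapping.get(m.group(0), m.group(0)), code)

snippet = "int n = read_len(); char buf[8]; for (int i = 0; i <= n; i++) buf[i] = 0;"
print(rename_variables(snippet))
# A robust detector should still flag the off-by-one overflow after renaming.
```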
Deep learning for high-resolution dose prediction in high dose rate brachytherapy for breast cancer treatment
Sébastien Quetin
Boris Bahoric
Farhad Maleki
Objective. Monte Carlo (MC) simulations are the benchmark for accurate radiotherapy dose calculations, notably in patient-specific high-dose-rate brachytherapy (HDR BT), in cases where considering tissue heterogeneities is critical. However, the lengthy computational time limits the practical application of MC simulations. Prior research used deep learning (DL) for dose prediction as an alternative to MC simulations. While dose predictions as accurate as MC were attained, graphics processing unit limitations constrained these predictions to large voxels of 3 mm × 3 mm × 3 mm. This study aimed to enable dose predictions as accurate as MC simulations in 1 mm × 1 mm × 1 mm voxels within a clinically acceptable timeframe. Approach. Computed tomography scans of 98 breast cancer patients treated with Iridium-192-based HDR BT were used: 70 for training, 14 for validation, and 14 for testing. A new cropping strategy based on the distance to the seed was devised to reduce the volume size, enabling efficient training of 3D DL models using 1 mm × 1 mm × 1 mm dose grids. Additionally, novel DL architectures with layer-level fusion were proposed to predict MC-simulated dose to medium-in-medium (D_m,m). These architectures fuse information from the TG-43 dose to water-in-water (D_w,w) with the patient tissue composition at the layer level. Different inputs describing patient body composition were investigated. Main results. The proposed approach demonstrated state-of-the-art performance, on par with the MC D_m,m maps but 300 times faster. The mean absolute percent error for dosimetric indices between the MC and DL-predicted complete treatment plans was 0.17% ± 0.15% for the planning target volume V_100, 0.30% ± 0.32% for the skin D_2cc, 0.82% ± 0.79% for the lung D_2cc, 0.34% ± 0.29% for the chest wall D_2cc, and 1.08% ± 0.98% for the heart D_2cc. Significance. Unlike the time-consuming MC simulations, the proposed novel strategy efficiently converts TG-43 D_w,w maps into precise D_m,m maps at high resolution, enabling clinical integration.
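A minimal sketch of layer-level fusion: a small 3D CNN that concatenates features from the tissue-composition stream into the TG-43 dose stream at every convolutional layer, rather than only at the input. Channel widths, depth, and activations are illustrative assumptions; the paper's architectures are larger and differ in detail.

```python
import torch
import torch.nn as nn

class LayerFusion3D(nn.Module):
    """Sketch of a two-stream 3D CNN with fusion at every layer."""

    def __init__(self, width: int = 16, depth: int = 3):
        super().__init__()
        self.dose_convs = nn.ModuleList()
        self.tissue_convs = nn.ModuleList()
        for i in range(depth):
            c_in = 1 if i == 0 else width
            # Dose branch always sees its own features plus the tissue features.
            self.dose_convs.append(nn.Conv3d(c_in + width, width, 3, padding=1))
            self.tissue_convs.append(nn.Conv3d(c_in, width, 3, padding=1))
        self.head = nn.Conv3d(width, 1, 1)  # predicts the D_m,m map

    def forward(self, dose_tg43: torch.Tensor, tissue: torch.Tensor) -> torch.Tensor:
        # Both inputs: (batch, 1, z, y, x) on the seed-centred, cropped 1 mm grid.
        d, t = dose_tg43, tissue
        for dconv, tconv in zip(self.dose_convs, self.tissue_convs):
            t = torch.relu(tconv(t))
            d = torch.relu(dconv(torch.cat([d, t], dim=1)))  # fuse at this layer
        return self.head(d)

# Toy usage on a 32^3 crop: TG-43 D_w,w map in, predicted D_m,m map out.
model = LayerFusion3D()
print(model(torch.randn(1, 1, 32, 32, 32), torch.randn(1, 1, 32, 32, 32)).shape)
```

Fusing at every layer, rather than concatenating inputs once, lets the dose branch modulate its features by tissue composition at each receptive-field scale, which is the design intuition the abstract describes.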