Portrait de Narges Armanfard

Narges Armanfard

Membre académique associé
Professeure agrégée, McGill University, Département de génie électrique et informatique
Sujets de recherche
Apprentissage actif
Apprentissage automatique médical
Apprentissage de représentations
Apprentissage en ligne
Apprentissage multimodal
Apprentissage par renforcement
Apprentissage profond
Détection d'anomalies
IA appliquée
IA en santé
Méthodes de réduction de la dimensionnalité
Modèles génératifs
Réseaux de neurones en graphes
Vision par ordinateur

Biographie

Narges Armanfard (Ph. D., ing.) est la fondatrice et la chercheuse principale du laboratoire iSMART. Elle est professeure adjointe au Département de génie électrique et informatique de l'Université McGill et membre académique associé à Mila – Institut québécois d'intelligence artificielle. Elle est également affiliée au Centre sur les machines intelligentes de McGill (CIM), à l'Initiative de McGill en médecine computationnelle (MiCM) et à l'Institut de génie aérospatial de McGill (MIAE). Sa recherche porte sur le développement d'algorithmes novateurs pour divers domaines tels que l'analyse de données de séries temporelles, la vision par ordinateur, l'apprentissage par renforcement et l'apprentissage par représentation pour des tâches telles que le regroupement de données, la classification et la détection d'anomalies. Ses contributions au domaine de l'IA ont été reconnues par de nombreux prix, décernés notamment par le Conseil de recherches en sciences naturelles et en génie du Canada, AgeWell, Vanier-Banting, les Fonds de recherche du Québec, ainsi que l'Université McMaster, l'Université McGill, l'Université de Toronto, les Instituts de recherche en santé du Canada et Scale AI.

Étudiants actuels

Baccalauréat - McGill
Maîtrise recherche - McGill
Doctorat - McGill
Doctorat - McGill
Doctorat - McGill
Doctorat - McGill
Doctorat - McGill
Maîtrise recherche - McGill
Doctorat - McGill
Maîtrise recherche - McGill
Maîtrise recherche - McGill
Postdoctorat - McGill
Maîtrise recherche - McGill

Publications

Development and Feasibility Study of HOPE Model for Prediction of Depression Among Older Adults Using Wi-Fi-based Motion Sensor Data: Machine Learning Study
Shayan Nejadshamsi
Negar Ghourchian
Howard Bergman
Roland Grad
Machelle Wilchesky
Vladimir Khanassov
Isabelle Vedel
Depression, characterized by persistent sadness and loss of interest in daily activities, greatly reduces quality of life. Early detection i… (voir plus)s vital for effective treatment and intervention. While many studies use wearable devices to classify depression based on physical activity, these often rely on intrusive methods. Additionally, most depression classification studies involve large participant groups and use single-stage classifiers without explainability. This study aims to assess the feasibility of classifying depression using nonintrusive Wi-Fi–based motion sensor data using a novel machine learning model on a limited number of participants. We also conduct an explainability analysis to interpret the model’s predictions and identify key features associated with depression classification. In this study, we recruited adults aged 65 years and older through web-based and in-person methods, supported by a McGill University health care facility directory. Participants provided consent, and we collected 6 months of activity and sleep data via nonintrusive Wi-Fi–based sensors, along with Edmonton Frailty Scale and Geriatric Depression Scale data. For depression classification, we proposed a HOPE (Home-Based Older Adults’ Depression Prediction) machine learning model with feature selection, dimensionality reduction, and classification stages, evaluating various model combinations using accuracy, sensitivity, precision, and F1-score. Shapely addictive explanations and local interpretable model-agnostic explanations were used to explain the model’s predictions. A total of 6 participants were enrolled in this study; however, 2 participants withdrew later due to internet connectivity issues. Among the 4 remaining participants, 3 participants were classified as not having depression, while 1 participant was identified as having depression. The most accurate classification model, which combined sequential forward selection for feature selection, principal component analysis for dimensionality reduction, and a decision tree for classification, achieved an accuracy of 87.5%, sensitivity of 90%, and precision of 88.3%, effectively distinguishing individuals with and those without depression. The explainability analysis revealed that the most influential features in depression classification, in order of importance, were “average sleep duration,” “total number of sleep interruptions,” “percentage of nights with sleep interruptions,” “average duration of sleep interruptions,” and “Edmonton Frailty Scale.” The findings from this preliminary study demonstrate the feasibility of using Wi-Fi–based motion sensors for depression classification and highlight the effectiveness of our proposed HOPE machine learning model, even with a small sample size. These results suggest the potential for further research with a larger cohort for more comprehensive validation. Additionally, the nonintrusive data collection method and model architecture proposed in this study offer promising applications in remote health monitoring, particularly for older adults who may face challenges in using wearable devices. Furthermore, the importance of sleep patterns identified in our explainability analysis aligns with findings from previous research, emphasizing the need for more in-depth studies on the role of sleep in mental health, as suggested in the explainable machine learning study.
Deep Clustering with Self-Supervision using Pairwise Similarities
Deep clustering incorporates embedding into clustering to find a lower-dimensional space appropriate for clustering. In this paper, we propo… (voir plus)se a novel deep clustering framework with self-supervision using pairwise similarities (DCSS). The proposed method consists of two successive phases. In the first phase, we propose to form hypersphere-like groups of similar data points, i.e. one hypersphere per cluster, employing an autoencoder that is trained using cluster-specific losses. The hyper-spheres are formed in the autoencoder's latent space. In the second phase, we propose to employ pairwise similarities to create a
Unveiling the Flaws: A Critical Analysis of Initialization Effect on Time Series Anomaly Detection
Deep learning for time-series anomaly detection (TSAD) has gained significant attention over the past decade. Despite the reported improveme… (voir plus)nts in several papers, the practical application of these models remains limited. Recent studies have cast doubt on these models, attributing their results to flawed evaluation techniques. However, the impact of initialization has largely been overlooked. This paper provides a critical analysis of the initialization effects on TSAD model performance. Our extensive experiments reveal that TSAD models are highly sensitive to hyperparameters such as window size, seed number, and normalization. This sensitivity often leads to significant variability in performance, which can be exploited to artificially inflate the reported efficacy of these models. We demonstrate that even minor changes in initialization parameters can result in performance variations that overshadow the claimed improvements from novel model architectures. Our findings highlight the need for rigorous evaluation protocols and transparent reporting of preprocessing steps to ensure the reliability and fairness of anomaly detection methods. This paper calls for a more cautious interpretation of TSAD advancements and encourages the development of more robust and transparent evaluation practices to advance the field and its practical applications.
Forward-Backward Knowledge Distillation for Continual Clustering
Unsupervised Continual Learning (UCL) is a burgeoning field in machine learning, focusing on enabling neural networks to sequentially learn … (voir plus)tasks without explicit label information. Catastrophic Forgetting (CF), where models forget previously learned tasks upon learning new ones, poses a significant challenge in continual learning, especially in UCL, where labeled information of data is not accessible. CF mitigation strategies, such as knowledge distillation and replay buffers, often face memory inefficiency and privacy issues. Although current research in UCL has endeavored to refine data representations and address CF in streaming data contexts, there is a noticeable lack of algorithms specifically designed for unsupervised clustering. To fill this gap, in this paper, we introduce the concept of Unsupervised Continual Clustering (UCC). We propose Forward-Backward Knowledge Distillation for unsupervised Continual Clustering (FBCC) to counteract CF within the context of UCC. FBCC employs a single continual learner (the ``teacher'') with a cluster projector, along with multiple student models, to address the CF issue. The proposed method consists of two phases: Forward Knowledge Distillation, where the teacher learns new clusters while retaining knowledge from previous tasks with guidance from specialized student models, and Backward Knowledge Distillation, where a student model mimics the teacher's behavior to retain task-specific knowledge, aiding the teacher in subsequent tasks. FBCC marks a pioneering approach to UCC, demonstrating enhanced performance and memory efficiency in clustering across various tasks, outperforming the application of clustering algorithms to the latent space of state-of-the-art UCL algorithms.
AI healthcare research: Pioneering iSMART Lab
Dr Narges Armanfard, Professor, talks us through the AI healthcare research at McGill University which is spearheading a groundbreaking init… (voir plus)iative – the iSMART Lab. Access to high-quality healthcare is not just a fundamental human right; it is the bedrock of our societal wellbeing, with the crucial roles played by doctors, nurses, and hospitals. Yet, healthcare systems globally face mounting challenges, particularly from aging populations. Dr Narges Armanfard, affiliated with McGill University and Mila Quebec AI Institute in Montreal, Canada, has spearheaded a groundbreaking initiative – the iSMART Lab. This laboratory represents a revolutionary leap into the future of healthcare, with its pioneering research in AI for health applications garnering significant attention. Renowned for its innovative integration of AI across diverse domains, iSMART Lab stands at the forefront of harnessing Artificial Intelligence to elevate and streamline health services.
Neural network prediction of the effect of thermomechanical controlled processing on mechanical properties
Sushant Sinha
Denzel Guye
Xiaoping Ma
Kashif Rehman
Stephen Yue
Deep Reinforcement Learning in Human Activity Recognition: A Survey and Outlook.
Human activity recognition (HAR) is a popular research field in computer vision that has already been widely studied. However, it is still a… (voir plus)n active research field since it plays an important role in many current and emerging real-world intelligent systems, like visual surveillance and human-computer interaction. Deep reinforcement learning (DRL) has recently been used to address the activity recognition problem with various purposes, such as finding attention in video data or obtaining the best network structure. DRL-based HAR has only been around for a short time, and it is a challenging, novel field of study. Therefore, to facilitate further research in this area, we have constructed a comprehensive survey on activity recognition methods that incorporate DRL. Throughout the article, we classify these methods according to their shared objectives and delve into how they are ingeniously framed within the DRL framework. As we navigate through the survey, we conclude by shedding light on the prominent challenges and lingering questions that await the attention of future researchers, paving the way for further advancements and breakthroughs in this exciting domain.
Open-Set Multivariate Time-Series Anomaly Detection
Thi Kieu Khanh Ho
Language-Guided Reinforcement Learning for Hard Attention in Few-Shot Learning
Bahareh Nikpour
Attention mechanisms have demonstrated significant potential in enhancing learning models by identifying key portions of input data, particu… (voir plus)larly in scenarios with limited training samples. Inspired by human perception, we propose that focusing on essential data segments, rather than the entire dataset, can improve the accuracy and reliability of the learning models. However, identifying these critical data segments, or"hard attention finding,"is challenging, especially in few-shot learning, due to the scarcity of training data and the complexity of model parameters. To address this, we introduce LaHA, a novel framework that leverages language-guided deep reinforcement learning to identify and utilize informative data regions, thereby improving both interpretability and performance. Extensive experiments on benchmark datasets validate the effectiveness of LaHA.
Artificial Intelligence for Detection of Dementia Using Motion Data: A Scoping Review
Jory Katz
Howard Bergman
Roland Grad
Vladimir Khanassov
Genevieve Gore
Isabelle Vedel
Machelle Wilchesky
Negar Ghourchian
S. A. Rahimi
Background: Dementia is a neurodegenerative disease resulting in the loss of cognitive and psychological functions. Artificial intelligence … (voir plus)(AI) may help in detection and screening of dementia; however, little is known in this area. Objectives: The objective of this study was to identify and evaluate AI interventions for detection of dementia using motion data. Method: The review followed the framework proposed by O’Malley’s and Joanna Briggs Institute methodological guidance for scoping reviews. We adhered to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews (PRISMA-ScR) checklist for reporting the results. An information specialist performed a comprehensive search from the date of inception until November 2020, in five bibliographic databases: MEDLINE, EMBASE, Web of Science Core Collection, CINAHL, and IEEE Xplore. We included studies aimed at the deployment and testing or implementation of AI interventions using motion data for the detection of dementia among a diverse population, encompassing varying age, sex, gender, economic backgrounds, and ethnicity, extending to their health care providers across multiple health care settings. Studies were excluded if they focused on Parkinson’s or Huntington’s disease. Two independent reviewers screened the abstracts, titles, and then read the full-texts. Disagreements were resolved by consensus, and if this was not possible, the opinion of a third reviewer was sought. The reference lists of included studies were also screened. Results: After removing duplicates, 2,632 articles were obtained. After title and abstract screening and full-text screening, 839 articles were considered for categorization. The authors categorized the papers into six categories, and data extraction and synthesis was performed on 20 included papers from the motion tracking data category. The included studies assessed cognitive performance (n = 5, 25%); screened dementia and cognitive decline (n = 8, 40%); investigated visual behaviours (n = 4, 20%); and analyzed motor behaviors (n = 3, 15%). Conclusions: We presented evidence of AI systems being employed in the detection of dementia, showcasing the promising potential of motion tracking within this domain. Although some progress has been made in this field recently, there remain notable research gaps that require further exploration and investigation. Future endeavors need to compare AI interventions using motion data with traditional screening methods or other tech-enabled dementia detection mechanisms. Besides, future works should aim at understanding how gender and sex, and ethnic and cultural sensitivity can contribute to refining AI interventions, ensuring they are accessible, equitable, and beneficial across all society.
Deep Multirepresentation Learning for Data Clustering
Deep clustering incorporates embedding into clustering in order to find a lower-dimensional space suitable for clustering task. Conventional… (voir plus) deep clustering methods aim to obtain a single global embedding subspace (aka latent space) for all the data clusters. In contrast, in this paper, we propose a deep multi-representation learning (DML) framework for data clustering whereby each difficult to cluster data group is associated with its own distinct optimized latent space, and all the easy to cluster data groups are associated to a general common latent space. Autoencoders are employed for generating the cluster-specific and general latent spaces. To specialize each autoencoder in its associated data cluster(s), we propose a novel and effective loss function which consists of weighted reconstruction and clustering losses of the data points, where higher weights are assigned to the samples more probable to belong to the corresponding cluster(s). Experimental results on benchmark datasets demonstrate that the proposed DML framework and loss function outperform state-of-the-art clustering approaches. In addition, the results show that the DML method significantly outperforms the SOTA on imbalanced datasets as a result of assigning an individual latent space to the difficult clusters. <br>
Spatial Hard Attention Modeling via Deep Reinforcement Learning for Skeleton-Based Human Activity Recognition
Bahareh Nikpour
Deep learning-based algorithms have been very successful in skeleton-based human activity recognition. Skeleton data contains 2-D or 3-D coo… (voir plus)rdinates of human body joints. The main focus of most of the existing skeleton-based activity recognition methods is on designing new deep architectures to learn discriminative features, where all body joints are considered equally important in recognition. However, the importance of joints varies as an activity proceeds within a video and across different activities. In this work, we hypothesize that selecting relevant joints, prior to recognition, can enhance performance of the existing deep learning-based recognition models. We propose a spatial hard attention finding method that aims to remove the uninformative and/or misleading joints at each frame. We formulate the joint selection problem as a Markov decision process and employ deep reinforcement learning to train the proposed spatial-attention-aware agent. No extra labels are needed for the agent’s training. The agent takes a sequence of features extracted from skeleton video as input and outputs a sequence of probabilities for joints. The proposed method can be considered as a general framework that can be integrated with the existing skeleton-based activity recognition methods for performance improvement purposes. We obtain very competitive activity recognition results on three commonly used human activity recognition datasets.