Vladimir Makarenkov

ClustRecNet: A Novel End-to-End Deep Learning Framework for Clustering Algorithm Recommendation

Mohammadreza Bakhtyari

Bogdan Mazoure

Renato Cordeiro De Amorim

Guillaume Rabusseau

2025-09-29

ArXiv (preprint)

Towards an Interpretable Machine Learning Model for Predicting Antimicrobial Resistance

Mohamed Mediouni

Abdoulaye Banire Diallo

2025-08-01

Journal of Global Antimicrobial Resistance (published)

Quantifying antimicrobial resistance in food-producing animals in North America

Mohamed Mediouni

Abdoulaye Banire Diallo

The global misuse of antimicrobial medication has further exacerbated the problem of antimicrobial resistance (AMR), enriching the pool of genetic mechanisms previously adopted by bacteria to evade antimicrobial drugs. AMR can be either intrinsic or acquired. It can be acquired either by selective genetic modification or by horizontal gene transfer, which allows microorganisms to incorporate novel genes from other organisms or environments into their genomes. To avoid an eventual antimicrobial mistreatment, the use of antimicrobials in farm animals has recently been reconsidered in many countries. We present a systematic review of the literature discussing the cases of AMR and the related restrictions applied in North American countries (including Canada, Mexico, and the USA). The Google Scholar, PubMed, Embase, Web of Science, and Cochrane databases were searched to find plausible information on antimicrobial use and resistance in food-producing animals, covering the time period from 2015 to 2024. A total of 580 articles addressing the issue of antibiotic resistance in food-producing animals in North America met our inclusion criteria. Different AMR rates, depending on the bacterium being observed, the antibiotic class being used, and the farm animal being considered, have been identified. We determined that the highest average AMR rates have been observed for pigs (60.63% on average), intermediate rates for cattle (48.94% on average), and the lowest for poultry (28.43% on average). We also found that Cephalosporins, Penicillins, and Tetracyclines are the antibiotic classes with the highest average AMR rates (65.86%, 61.32%, and 58.82%, respectively), whereas the use of Sulfonamides and Quinolones leads to the lowest average AMR (21.59% and 28.07%, respectively). Moreover, our analysis of antibiotic-resistant bacteria shows that Streptococcus suis (S. suis) and S. aureus exhibit the highest average AMR rates (71.81% and 69.48%, respectively), whereas Campylobacter spp. exhibits the lowest one (29.75%). The highest average AMR percentage, 57.46%, was observed in Mexico, followed by Canada at 45.22%, and the USA at 42.25%, which is most probably due to the presence of various AMR control strategies, such as stewardship programs and AMR surveillance bodies, existing in Canada and the USA. Our review highlights the need for better strategies and regulations to control the spread of AMR in North America.

2025-05-27

Frontiers in Microbiology (published)

Improving clustering quality evaluation in noisy Gaussian mixtures

Renato Cordeiro De Amorim

2025-03-01

ArXiv (preprint)

Improving clustering quality evaluation in noisy Gaussian mixtures

Renato Cordeiro De Amorim

2025-03-01

arXiv (published)

Improving internal cluster quality evaluation in noisy Gaussian mixtures

Renato Cordeiro De Amorim

Clustering is a fundamental technique in machine learning and data analysis, widely used across various domains. Internal clustering validation measures, such as the Average Silhouette Width, Calinski-Harabasz, and Davies-Bouldin indices, play a crucial role in assessing clustering quality when external ground truth labels are unavailable. However, these measures can be affected by feature relevance, potentially leading to unreliable evaluations in high-dimensional or noisy data sets. In this paper, we introduce a Feature Importance Rescaling (FIR) method designed to enhance internal clustering validation by adjusting feature contributions based on their dispersion. Our method systematically attenuates noise features, making cluster compactness and separation clearer and, as a consequence, aligning internal validation measures more closely with the ground truth. Through extensive experiments on synthetic data sets under different configurations, we demonstrate that FIR consistently improves the correlation between internal validation indices and the ground truth, particularly in settings with noisy or irrelevant features. The results show that FIR increases the robustness of clustering evaluation, reduces variability in performance across different data sets, and remains effective even when clusters exhibit significant overlap. These findings highlight the potential of FIR as a valuable enhancement for internal clustering validation, making it a practical tool for unsupervised learning tasks where labelled data is not available.

2025-03-01

ArXiv (preprint)
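
To make the idea of dispersion-based feature rescaling in the abstract above concrete, here is a minimal sketch in plain Python. The abstract does not state the FIR formula, so the weighting used here (weights inversely proportional to each feature's within-cluster dispersion, normalised to sum to one) is an illustrative assumption and not the paper's definition. The function names and the toy data are hypothetical.

```python
def within_cluster_dispersion(data, labels):
    """Per-feature sum of squared deviations from each cluster's own mean."""
    n_features = len(data[0])
    clusters = {}
    for row, label in zip(data, labels):
        clusters.setdefault(label, []).append(row)
    dispersion = [0.0] * n_features
    for rows in clusters.values():
        for j in range(n_features):
            mean_j = sum(r[j] for r in rows) / len(rows)
            dispersion[j] += sum((r[j] - mean_j) ** 2 for r in rows)
    return dispersion


def rescale_features(data, labels, eps=1e-12):
    """Assumed rule: weight each feature by the inverse of its dispersion."""
    dispersion = within_cluster_dispersion(data, labels)
    inverse = [1.0 / (d + eps) for d in dispersion]  # eps avoids division by zero
    total = sum(inverse)
    weights = [v / total for v in inverse]
    scaled = [[w * x for w, x in zip(weights, row)] for row in data]
    return weights, scaled


# Hypothetical toy data: feature 0 separates the two clusters, feature 1 is noise.
data = [
    [0.0, 0.1], [0.1, 0.9], [0.2, 0.5],   # cluster 0
    [1.0, 0.8], [1.1, 0.2], [0.9, 0.6],   # cluster 1
]
labels = [0, 0, 0, 1, 1, 1]

weights, scaled = rescale_features(data, labels)
```

In this toy case the noise feature receives a much smaller weight than the informative one, so an internal index computed on `scaled` would be driven mostly by the feature that actually separates the clusters.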

BayTTA: Uncertainty-aware medical image classification with optimized test-time augmentation using Bayesian model averaging

Zeinab Sherkatghanad

Moloud Abdar

Mohammadreza Bakhtyari

Test-time augmentation (TTA) is a well-known technique employed during the testing phase of computer vision tasks. It involves aggregating multiple augmented versions of input data. Combining predictions using a simple average formulation is a common and straightforward approach after performing TTA. This paper introduces a novel framework for optimizing TTA, called BayTTA (Bayesian-based TTA), which is based on Bayesian Model Averaging (BMA). First, we generate a model list associated with different variations of the input data created through TTA. Then, we use BMA to combine model predictions weighted by their respective posterior probabilities. Such an approach allows one to take into account model uncertainty, and thus to enhance the predictive performance of the related machine learning or deep learning model. We evaluate the performance of BayTTA on various public data, including three medical image datasets comprising skin cancer, breast cancer, and chest X-ray images and two well-known gene editing datasets, CRISPOR and GUIDE-seq. Our experimental results indicate that BayTTA can be effectively integrated into state-of-the-art deep learning models used in medical image analysis as well as into some popular pre-trained CNN models such as VGG-16, MobileNetV2, DenseNet201, ResNet152V2, and InceptionResNetV2, improving their accuracy and robustness.

2024-06-25

ArXiv (preprint)
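
The contrast drawn in the BayTTA abstract above, simple averaging versus posterior-weighted averaging of TTA predictions, can be sketched in a few lines of plain Python. The abstract does not say how the posterior probabilities are obtained; this sketch assumes a uniform prior and uses a held-out log-likelihood per model as the evidence term, which is one common approximation and not necessarily the authors' procedure. All names and numbers are hypothetical.

```python
import math


def bma_weights(log_likelihoods, log_priors=None):
    """Posterior model probabilities, proportional to likelihood times prior."""
    n = len(log_likelihoods)
    if log_priors is None:
        log_priors = [-math.log(n)] * n  # uniform prior (assumption)
    scores = [ll + lp for ll, lp in zip(log_likelihoods, log_priors)]
    top = max(scores)  # subtract the maximum for numerical stability
    unnormalised = [math.exp(s - top) for s in scores]
    total = sum(unnormalised)
    return [u / total for u in unnormalised]


def combine(predictions, weights):
    """Weighted average of per-model class-probability vectors."""
    n_classes = len(predictions[0])
    return [
        sum(w * p[c] for w, p in zip(weights, predictions))
        for c in range(n_classes)
    ]


# Hypothetical class probabilities from three models, one per augmented view.
preds = [[0.9, 0.1], [0.6, 0.4], [0.2, 0.8]]
# Hypothetical held-out log-likelihoods used as model evidence.
val_log_lik = [-10.0, -12.0, -20.0]

simple_average = combine(preds, [1.0 / len(preds)] * len(preds))
weights = bma_weights(val_log_lik)
bma_average = combine(preds, weights)
```

With these numbers the third model, which fits the held-out data worst, contributes almost nothing to `bma_average`, whereas it counts for a full third of `simple_average`.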

A self-attention-based CNN-Bi-LSTM model for accurate state-of-charge estimation of lithium-ion batteries

Zeinab Sherkatghanad

Amin Ghazanfari

2024-05-01

Journal of Energy Storage (published)