Publications

Herbarium collections remain essential in the age of community science

Isaac Eckert

Anne Bruneau

D. Metsger

Simon Joly

T. Dickinson

Laura J. Pollock

2024-08-31

Nature Communications (publié)

Progres: Prompted Generative Rescoring on ASR N-Best

Ada Defne Tur

Adel Moumen

Mirco Ravanelli

Large Language Models (LLMs) have shown their ability to improve the performance of speech recognizers by effectively rescoring the n-best h… (voir plus)ypotheses generated during the beam search process. However, the best way to exploit recent generative instruction-tuned LLMs for hypothesis rescoring is still unclear. This paper proposes a novel method that uses instruction-tuned LLMs to dynamically expand the n-best speech recognition hypotheses with new hypotheses generated through appropriately-prompted LLMs. Specifically, we introduce a new zero-shot method for ASR n-best rescoring, which combines confidence scores, LLM sequence scoring, and prompt-based hypothesis generation. We compare Llama-3-Instruct, GPT-3.5 Turbo, and GPT-4 Turbo as prompt-based generators with Llama-3 as sequence scorer LLM. We evaluated our approach using different speech recognizers and observed significant relative improvement in the word error rate (WER) ranging from 5% to 25%.

2024-08-30

ArXiv (prépublication)

Progres: Prompted Generative Rescoring on ASR N-Best

Ada Defne Tur

Adel Moumen

Mirco Ravanelli

Large Language Models (LLMs) have shown their ability to improve the performance of speech recognizers by effectively rescoring the n-best h… (voir plus)ypotheses generated during the beam search process. However, the best way to exploit recent generative instruction-tuned LLMs for hypothesis rescoring is still unclear. This paper proposes a novel method that uses instruction-tuned LLMs to dynamically expand the n-best speech recognition hypotheses with new hypotheses generated through appropriately-prompted LLMs. Specifically, we introduce a new zero-shot method for ASR n-best rescoring, which combines confidence scores, LLM sequence scoring, and prompt-based hypothesis generation. We compare Llama-3-Instruct, GPT-3.5 Turbo, and GPT-4 Turbo as prompt-based generators with Llama-3 as sequence scorer LLM. We evaluated our approach using different speech recognizers and observed significant relative improvement in the word error rate (WER) ranging from 5% to 25%.

2024-08-30

ArXiv (prépublication)

Active Semantic Mapping and Pose Graph Spectral Analysis for Robot Exploration

Rongge Zhang

Haechan Mark Bong

Giovanni Beltrame

Exploration in unknown and unstructured environments is a pivotal requirement for robotic applications. A robot’s exploration behavior can… (voir plus) be inherently affected by the performance of its Simultaneous Localization and Mapping (SLAM) subsystem, although SLAM and exploration are generally studied separately. In this paper, we formulate exploration as an active mapping problem and extend it with semantic information. We introduce a novel active metric-semantic SLAM approach, leveraging recent research advances in information theory and spectral graph theory: we combine semantic mutual information and the connectivity metrics of the underlying pose graph of the SLAM subsystem. We use the resulting utility function to evaluate different trajectories to select the most favorable strategy during exploration. Exploration and SLAM metrics are analyzed in experiments. Running our algorithm on the Habitat dataset, we show that, while maintaining efficiency close to the state-of-the-art exploration methods, our approach effectively increases the performance of metric-semantic SLAM with a 21% reduction in average map error and a 9% improvement in average semantic classification accuracy.

2024-08-27

ArXiv (prépublication)

ARGV: 3D genome structure exploration using augmented reality

Chrisostomos Drogaris

Yanlin Zhang

Éric Zhang

Elena Nazarova

Roman Sarrazin-Gendron

Sélik Wilhelm-Landry

Yan Cyr

Jacek Majewski

Mathieu Blanchette

Jérôme Waldispühl

2024-08-27

BMC Bioinformatics (publié)

A long-context RNA foundation model for predicting transcriptome architecture

Ali Saberi

Benedict Choi

Sean Wang

Aldo Hernández-Corchado

Mohsen Naghipourfar

Arsham Mikaeili Namini

Vijay Ramani

Amin Emad

Hamed S. Najafabadi

Hani Goodarzi

Linking DNA sequence to genomic function remains one of the grand challenges in genetics and genomics. Here, we combine large-scale single-m… (voir plus)olecule transcriptome sequencing of diverse cancer cell lines with cutting-edge machine learning to build LoRNASH, an RNA foundation model that learns how the nucleotide sequence of unspliced pre-mRNA dictates transcriptome architecture—the relative abundances and molecular structures of mRNA isoforms. Owing to its use of the StripedHyena architecture, LoRNASH handles extremely long sequence inputs (∼65 kilobase pairs), allowing for quantitative, zero-shot prediction of all aspects of transcriptome architecture, including isoform abundance, isoform structure, and the impact of DNA sequence variants on transcript structure and abundance. We anticipate that our public data release and proof-of-concept model will accelerate varying aspects of RNA biotechnology. More broadly, we envision the use of LoRNASH as a foundation for fine-tuning of any transcriptome-related downstream prediction task, including cell-type specific gene expression, splicing, and general RNA processing.

2024-08-27

bioRxiv (prépublication)

MeshUp: Multi-Target Mesh Deformation via Blended Score Distillation

Hyunwoo Kim

Itai Lang

Noam Aigerman

Thibault Groueix

Vladimir Kim

Rana Hanocka

We propose MeshUp, a technique that deforms a 3D mesh towards multiple target concepts, and intuitively controls the region where each conce… (voir plus)pt is expressed. Conveniently, the concepts can be defined as either text queries, e.g.,"a dog"and"a turtle,"or inspirational images, and the local regions can be selected as any number of vertices on the mesh. We can effectively control the influence of the concepts and mix them together using a novel score distillation approach, referred to as the Blended Score Distillation (BSD). BSD operates on each attention layer of the denoising U-Net of a diffusion model as it extracts and injects the per-objective activations into a unified denoising pipeline from which the deformation gradients are calculated. To localize the expression of these activations, we create a probabilistic Region of Interest (ROI) map on the surface of the mesh, and turn it into 3D-consistent masks that we use to control the expression of these activations. We demonstrate the effectiveness of BSD empirically and show that it can deform various meshes towards multiple objectives. Our project page is at https://threedle.github.io/MeshUp.

2024-08-27

ArXiv (prépublication)

Pushing the frontiers in climate modelling and analysis with machine learning

Veronika Eyring

William D. Collins

Pierre Gentine

Elizabeth A. Barnes

Marcelo Barreiro

Tom Beucler

Marc Bocquet

Christopher S. Bretherton

Hannah M. Christensen

Katherine Dagon

David John Gagne

David Hall

Dorit Hammerling

Stephan Hoyer

Fernando Iglesias-Suarez

Ignacio Lopez-Gomez

Marie C. McGraw

Gerald A. Meehl

Maria J. Molina

Claire Monteleoni … (voir 9 de plus)

Juliane Mueller

Michael S. Pritchard

Jakob Runge

Philip Stier

Oliver Watt-Meyer

Katja Weigel

Rose Yu

Laure Zanna

2024-08-23

Nature Climate Change (publié)

Pushing the frontiers in climate modelling and analysis with machine learning

Veronika Eyring

William D. Collins

Pierre Gentine

Elizabeth A. Barnes

Marcelo Barreiro

Tom Beucler

Marc Bocquet

Christopher S. Bretherton

Hannah M. Christensen

Katherine Dagon

David John Gagne

David Hall

Dorit Hammerling

Stephan Hoyer

Fernando Iglesias-Suarez

Ignacio Lopez-Gomez

Marie C. McGraw

Gerald A. Meehl

Maria J. Molina

Claire Monteleoni … (voir 9 de plus)

Juliane Mueller

Michael S. Pritchard

Jakob Runge

Philip Stier

Oliver Watt-Meyer

Katja Weigel

Rose Yu

Laure Zanna

2024-08-23

Nature Climate Change (publié)

Pushing the frontiers in climate modelling and analysis with machine learning

Veronika Eyring

William D. Collins

Pierre Gentine

Elizabeth A. Barnes

Marcelo Barreiro

Tom Beucler

Marc Bocquet

Christopher S. Bretherton

Hannah M. Christensen

Katherine Dagon

David John Gagne

David Hall

Dorit Hammerling

Stephan Hoyer

Fernando Iglesias-Suarez

Ignacio Lopez-Gomez

Marie C. McGraw

Gerald A. Meehl

Maria J. Molina

Claire Monteleoni … (voir 9 de plus)

Juliane Mueller

Michael S. Pritchard

Jakob Runge

Philip Stier

Oliver Watt-Meyer

Katja Weigel

Rose Yu

Laure Zanna

2024-08-23

Nature Climate Change (publié)

Pushing the frontiers in climate modelling and analysis with machine learning

Veronika Eyring

William D. Collins

Pierre Gentine

Elizabeth A. Barnes

Marcelo Barreiro

Tom Beucler

Marc Bocquet

Christopher S. Bretherton

Hannah M. Christensen

Katherine Dagon

David John Gagne

David Hall

Dorit Hammerling

Stephan Hoyer

Fernando Iglesias-Suarez

Ignacio Lopez-Gomez

Marie C. McGraw

Gerald A. Meehl

Maria J. Molina

Claire Monteleoni … (voir 9 de plus)

Juliane Mueller

Michael S. Pritchard

Jakob Runge

Philip Stier

Oliver Watt-Meyer

Katja Weigel

Rose Yu

Laure Zanna

2024-08-23

Nature Climate Change (publié)