Davide Baldelli

PhD - Polytechnique Montréal

Supervisor

Sarath Chandar

Co-supervisor

Amal Zouaq

Research Topics

Deep Learning

Natural Language Processing

Website

Google Scholar

GitHub

Blog Posts

December 19, 2025

Improving CAD Design With LLMs

Prashant Govindarajan

Davide Baldelli

Quentin Fournier

Sarath Chandar

Read the article

Publications

Probabilistic Calibration Is a Trainable Capability in Language Models

Sruthi Kuriakose

Language models are increasingly used in settings where outputs must satisfy user-specified randomness constraints, yet their generation pro… (see more)babilities are often poorly calibrated to those targets. We study whether this capability can be improved directly through fine-tuning. Concretely, we fine-tune language models on synthetic prompts that require sampling from mathematical distributions, and compare two Calibration Fine-Tuning variants: a soft-target method that converts the desired output distribution into trie-derived next-token targets, and a hard-target method that trains on sampled completions from the same target distribution. Across 12 models spanning four families, both methods substantially improve structured-sampling fidelity on held-out distribution families and unseen parameter settings, showing that probabilistic calibration is a trainable capability. Under our selected training configurations, the two methods exhibit different empirical profiles: hard-target fine-tuning is often strongest on structured numeric sampling, while soft-target fine-tuning performs better on broader stochastic generation benchmarks, including open-ended random generation, multiple-choice answer-position balancing, and NoveltyBench. The gains sometimes reduce downstream capability, especially arithmetic reasoning, with costs varying by model. Overall, our results show that probabilistic calibration can be improved through fine-tuning, with our hard-target configuration favoring exact numeric fidelity and our soft-target configuration favoring broader stochastic transfer. Code is available at https://github.com/chandar-lab/calibration-finetuning.

2026-05-11

arXiv (preprint)

doi.org

arxiv.org

LLMs Can't Play Hangman: On the Necessity of a Private Working Memory for Language Agents

Davide Baldelli

Ali Parviz

Amal Zouaq

A. Chandar

2026-01-10

ArXiv (preprint)

doi.org

arxiv.org

CADmium: Fine-Tuning Code Language Models for Text-Driven Sequential CAD Design

Prashant Govindarajan

Davide Baldelli

Jay Pathak

Quentin Fournier

A. Chandar

Computer-aided design (CAD) is the digital construction of 2D and 3D objects, and is central to a wide range of engineering and manufacturin… (see more)g applications like automobile and aviation. Despite its importance, CAD modeling remains largely a time-intensive, manual task. Recent works have attempted to automate this process with small transformer-based models and handcrafted CAD sequence representations. However, there has been little effort to leverage the potential of large language models (LLMs) for sequential CAD design. In this work, we introduce a new large-scale dataset of more than 170k CAD models annotated with high-quality, human-like descriptions generated with our pipeline based on GPT-4.1. Using this dataset, we fine-tune powerful code-LLMs to generate CAD sequences represented in a JSON-based format from natural language descriptions, demonstrating the viability and effectiveness of this approach for text-conditioned CAD generation. Because simple metrics often fail to reflect the quality of generated objects, we introduce geometric and topological metrics based on sphericity, mean curvature, and Euler characteristic to provide richer structural insights. Our experiments and ablation studies on both synthetic and human-annotated data demonstrate that CADmium is able to automate CAD design, drastically speeding up the design of new objects. The dataset, code, and fine-tuned models are available online.

2025-07-12

ArXiv (preprint)

doi.org

arxiv.org

Mila Ventures Launchpad

AI Policy Compass

AI Policy Fellowship Publications

Davide Baldelli

Blog Posts

Publications

Mila Ventures Launchpad

AI Policy Compass

AI Policy Fellowship Publications

Popular keywords:

Davide Baldelli

Blog Posts

Publications