ALMAnaCH, Inria

20/06/25 at 11:00 - Big Blue Button

20 Jun 2025 at 11:00
Big Blue Button

Speaker: Hal Daumé III, University of Maryland, USA

Fairness and Trustworthiness in Generative AI

Abstract: How can we define, measure, and mitigate societal harms in generative AI technologies? I'll discuss a range of research over the past five years that aim to understand how to think about fairness and AI-related harms beyond decision making systems, and provide several concrete examples of how to measure (and sometimes how to mitigate) potential harms that arise in generative AI systems, and how this relates to a person's trust in those systems. These are far from solved problems, so I'll conclude by pointing to some directions I think are crucial for us to solve as a community.

Bio: Hal Daumé III is the Director of AIM, the AI Interdisciplinary Institute at Maryland. He is a Volpi-Cupal endowed Professor of Computer Science and Language Science at the University of Maryland, where he also leads TRAILS, an NSF & NIST-funded institute on Trustworthy AI. His research focus is on developing natural language processing systems that interact naturally with people, promote their self-efficacy, while mitigating societal harms. Together with his students and colleagues, he has received five best paper awards, a best demo award, and a test of time award. He has been program chair for the International Conference on Machine Learning in 2020 (together with Aarti Singh) and for the North American Association for Computational Linguistics in 2013 (together with Katrin Kirchhoff), and he was an inaugural diversity and inclusion co-chair at the Neural Information Processing Systems Conference in 2018 (with Katherine Heller).

Download slides here:

21/03/25 at 11:00 - Big Blue Button

21 Mar 2025 at 11:00
Big Blue Button

Speaker: Aina Garí Soler, Inria

Word Meaning Representation and Negotiation

Abstract: Word meaning is one of the most elusive notions in Linguistics, characterized by its abstract, complex, subjective, and ever-evolving nature. To what extent can distributional-based computational methods capture this complexity? And do we all truly mean the same things when we use the same words?
In this talk, I will present my research on these and related questions, focusing on two key directions: the computational representation of word meaning and the dynamics of word understanding in interaction. I will discuss how subword-based tokenization impacts word representations and how speakers navigate and resolve word-related misunderstandings in conversation.

Download slides here:

7/03/25 at 11:00 - Big Blue Button

7 Mar 2025 at 11:00
Big Blue Button

Speaker: Florian Cafiero, PSL

A Riddle in a Haystack: Using Large Language Models for the Detection of Rare Phenomena

Abstract: Recent advances in natural language processing allow us to harness the reasoning power of large language models not only to massively annotate texts but also to detect rare and complex phenomena within large corpora. I will present two case studies illustrating this approach. In the first case, I seek to identify obstruction strategies used by certain countries during international climate negotiations (COP, etc.), analyzing the meeting reports. In the second case, I extend stylometric research on the Willy workshop (Colette, Curnonsky, etc.), an author who almost never wrote alone, to attempt to identify passages that are uniquely his own. I do so by detecting the cryptic puns and whimsical proper names that he was so fond of.

21/02/25 at 11:00 - Big Blue Button

21 Feb 2025 at 11:00
Big Blue Button

Speaker: Syrielle Montariol, EPFL, Switzerland

Multimodal perception and reasoning

Abstract: Building on the strong textual processing capabilities of large language models, large vision-language models (VLMs) extend LLMs to handle visual inputs. They have brought significant improvements to multi-modal tasks such as visual question answering and image captioning. In particular, they paved the way for tasks involving complex visual reasoning. However, the transfer of LLM's internal knowledge and their reasoning ability to multimodal tasks remains limited. In this talk, I will present two of my recent work on evaluating and improving VLMs' perception and reasoning capabilities.

Download slides here:

7/02/25 at 11:00 - Big Blue Button

7 Feb 2025 at 11:00
Big Blue Button

Speaker: Cécile Pierrot¹ & Camille Desenclos²˒¹, ¹Inria & ²Université de Picardie

Percer le secret des lettres chiffrées de Charles Quint: un travail interdisciplinaire

N.B. This seminar will be held in French 🇫🇷.
Abstract: Hiver 2020. Une dépêche presque intégralement chiffrée est redécouverte dans les 16 kilomètres de rayonnages de la bibliothèque du patrimoine de Nancy. Elle semble datée de 1546 et signée de la main de l'empereur Charles Quint. Mais comment percer le mystère d'une lettre chiffrée multiséculaire ? Venez suivre l'histoire de ce décryptage qui ouvre la voie d'une recherche à la frontière entre cryptographie, histoire, et intelligence artificielle.

20/12/24 at 11:00 - Big Blue Button

20 Dec 2024 at 11:00
Big Blue Button

Speaker: Caio Corro, INSA Rennes

Named-Entity Recognition: Resurrecting Old School Machine Learning in the Era of Deep Learning

N.B. This seminar will be held in French 🇫🇷.
Abstract: In this talk, I will show that we can bridge old-school methods (finite-state automaton and k-means) with neural networks to achieve SOTA results.

First, I will present my EMNLP 2024 paper [1] on discontinuous named-entity recognition, an overlooked setting in the literature. SOTA methods are based on complex pipelines with intricate neural architectures. I will show that using finite-state automaton, we can build a word tagging method that achieves competitive experimental results while being 40x-50x faster than SOTA. Unlike previous attempts to use work tagging in this setting, the proposed approach guarantees well-formedness of predictions.

Second, I will present our COLING 2025 paper [2] on few-shot learning for named-entity recognition. Many approaches in this setting are based on variants of nearest neighbor classification. Unfortunately, they cannot leverage unlabeled data. We propose a novel approach for semi-supervised few-shot learning based on joint k-means and subspace selection. For named-entity recognition, a difficulty arises from the fact that most words are tagged with O (outside a mention): when we include a large amount of unlabeled data, the model can easily collapse to assigning tag O for all words. To prevent this issue, we include a ratio-constraint in the fine-tuning step.

[1] A fast and sound tagging method for discontinuous named-entity recognition (Caio Corro) https://arxiv.org/abs/2409.16243
[2] Few-shot domain adaptation for named-entity recognition via joint constrained k-means and subspace selection (Ayoub Hammal, Benno Uthayasooriyar, Caio Corro) https://arxiv.org/abs/2412.00426

15/11/24 at 11:00 - Big Blue Button

15 Nov 2024 at 11:00
Big Blue Button

Speaker: Raphaël Baena, Imagine Group, École des Ponts ParisTech

A General Framework for Text Line Detection and Recognition

Abstract: In this seminar, I will present a quick overview of my Ph.D research on transfer learning and generalization, followed by a detailed discussion of our recent NeurIPS paper on General Detection-based Text Line Recognition (DTLR). DTLR is a novel approach for recognizing text lines, whether printed or handwritten, across diverse scripts, including Latin, Chinese, and ciphered characters.
Most HTR methods have focused on autoregressive decoding, predicting characters one after the other. Our method shows strong results across various scripts, even those typically addressed by specialized techniques. In particular, we achieve state-of-the-art performance for Chinese script recognition on the CASIA v2 dataset and for cipher recognition on the Borg and Copiale datasets. Finally, I will highlight several collaborative applications and extensions of this work with historians.

11/10/24 at 11:00 - Big Blue Button

11 Oct 2024 at 11:00
Big Blue Button

Speaker: Marine Carpuat, University of Maryland, USA

Beyond Translation: Human-Centered NLP for Cross-Lingual Communication

Abstract: How can we develop NLP technology to effectively support cross-lingual communication, especially given recent progress in machine translation and multilingual language models? In this talk, I will present two main threads of work that aim to broaden the scope of machine translation to more directly support people's needs.
In the first thread, I'll consider the difficulty people face when weighing the potential benefits of machine translation against the risks it may pose. This difficulty arises because users—who typically do not speak either the input or output language—often cannot assess translation quality. I will present results from a human study in medical settings, which highlights the strengths and weaknesses of state-of-the-art quality estimation techniques.
Next, I'll discuss how even accurate translations can fail when users lack background knowledge that is implied in the source language. I will introduce techniques for automatically generating explicitations that explain missing context by considering cultural differences between source and target audiences.
Throughout, I will discuss ongoing research directions aimed at developing human-centered NLP approaches for cross-lingual communication.

Download slides here:

4/07/24 at 11:00 - Big Blue Button

4 Jul 2024 at 11:00
Big Blue Button

Speaker: Paolo Rosso, Universitat Politècnica de València, ValgrAI

Beyond the detection of fake news and explicit hate speech: conspiracy theories and implicit hate speech with stereotypes, jokes and sarcasm

Abstract: The rise of social media has offered a fast and easy way for the propagation of disinformation and conspiracy theories. Despite the research attention that has received, disinformation detection remains an open problem and users keep sharing texts that contain false statements. In this talk I will comment on some studies on the detection of conspiracy theories. In the framework of the PAN Lab, recently we organised a challenge to discriminate between conspiracy narratives and critical thinking. Finally, I will address the other side of harmful information: hate speech. I will present the work done to analyse misogyny and sexism, also in memes, and the work done in collaboration with the Spanish observatory against racism and xenophobia. Moreover, I will briefly present a study of the usage of stereotypes against immigrants by the members of the Spanish Congress of Deputies. Hate speech is often conveyed covertly employing stereotypes and figurative language devices such as irony or sarcasm. I will finally show how hurtful humour is often employed to spread prejudice in social media towards women and feminists, the LGBTIQ community, immigrants and racially discriminated people, and overweight people.

21/06/24 at 11:00 - Big Blue Button

21 Jun 2024 at 11:00
Big Blue Button

Speaker: Nicolas Rollet, Télécom Paris

Interaction Humain-Machine, IA parlante et ethnométhodologie : le diable est dans les détails

N.B. This seminar will be held in French 🇫🇷.
Abstract: Depuis 2015, Nicolas Rollet s’emploie à étudier les interactions humain-machine, que celles-ci soient équipées d’une IA ou non. L’approche ethnométhodologique, et son pendant interactionnel, l’Analyse de Conversation, offrent des outils d’analyse permettant d’interroger de manière détaillée et comme des accomplissements pratiques :

- qu’est-ce qui est « social » dans une interaction sociale
- en quoi les humains s’orientent-ils vers un agent artificiel en tant que partenaire social
- comment le corps, la vision constituent des ressources pour organiser des activités sociales
- qu’est-ce qui « manque » aux IA parlantes pour parler naturellement.

Pour discuter ces points, plusieurs terrains de recherche seront mobilisés : interactions humain-robot, interactions vidéo à distance dans un service d’urgence, interactions dans un cabinet d’échographie prénatale.

Bio: Nicolas Rollet, diplômé d’un doctorat en sciences du langage (ILPGA, Sorbonne Nouvelle Paris 3, 2012), s’est spécialisé dans les études d’interactions dans des contextes ordinaires et professionnels tels que les réunions familiales, les répétitions musicales, l’utilisation de bibliothèque numérique, l’interaction médicale d’urgence, l’interaction homme-robot ou encore les séances d’échographie prénatales (Télécom Paris, SAMU-Centre15, CNRS, BNF, CNR114). Ses travaux s’inscrivent dans le cadre de l’Ethnométhodologie, analyse de la conversation accompagnée d’une sensibilité ethnographique. Il s’intéresse, entre autres, à la manière dont le langage s’intègre au corps et à l’intégration de dispositifs techniques dans des activités sociales complexes. Il est également membre du collectif Encyclopédie de la parole, depuis sa création en 2007. A ce titre il est associé à la production de nombreuses oeuvres : performances, spectacles, installations sonores, conférences (Kunsten Festival, Festival d’Automne, Palais de Tokyo, Théâtre de Montreuil, MAMCO Genève, Festival des Arts de la parole Bordeaux, KAAT Yokohama..). Il est également auteur de plusieurs textes de prose (Leo Scheer, Les Petits Matins, Argol).

31/05/24 at 11:00 - Big Blue Button

31 May 2024 at 11:00
Big Blue Button

Speaker: Marco Bronzini, University of Trento

Unveiling LLMs: The Evolution of Latent Representations in a Temporal Knowledge Graph

Abstract: Large Language Models (LLMs) demonstrate an impressive capacity to recall a vast range of common factual knowledge information. However, unravelling the underlying reasoning of LLMs and explaining their internal mechanisms of exploiting this factual knowledge remain active areas of investigation.
Our work analyzes the factual knowledge encoded in the latent representation of LLMs when prompted to assess the truthfulness of factual claims.
We propose an end-to-end framework that jointly decodes the factual knowledge embedded in the latent space of LLMs from a vector space to a set of ground predicates and represents its evolution across the layers using a temporal knowledge graph. Our framework relies on the technique of activation patching which intervenes in the inference computation of a model by dynamically altering its latent representations.
Consequently, we neither rely on external models nor training processes.
We showcase our framework with local and global interpretability analyses using two claim verification datasets: FEVER and CLIMATE-FEVER. The local interpretability analysis exposes different latent errors from representation to multi-hop reasoning errors. On the other hand, the global analysis uncovered patterns in the underlying evolution of the model's factual knowledge (e.g., store-and-seek factual information).
By enabling graph-based analyses of the latent representations, this work represents a step towards the mechanistic interpretability of LLMs. https://arxiv.org/abs/2404.03623

Download slides here:

5/04/24 at 11:00 - Big Blue Button

5 Apr 2024 at 11:00
Big Blue Button

Speaker: Philippe Blache, CNRS

Etudier le signal cérébral associé au traitement du langage en conversation. Limites, état de nos connaissances, perspectives

N.B. This seminar will be held in French 🇫🇷.
Abstract: Comprendre comment fonctionne le langage nécessite de l'étudier dans sa globalité. Lorsqu'on pose cette question dans une perspective cognitive, il est de plus nécessaire de l'étudier dans son contexte naturel, typiquement celui de la conversation. Quels sont les mécanismes permettant à deux individus d'encoder, transmettre et décoder l'information pendant ce type d'interaction ? Je propose dans cette présentation d'aborder plus particulièrement l'étude des bases cérébrales de l'interaction : le signal cérébral que nous observons nous apprend-il quelque chose sur le traitement du langage ? Les méthodes reposant sur l'électro-encéphalographie (technique d'imagerie la plus facile à mettre en œuvre) consistent essentiellement soit à étudier les potentiels évoqués par phénomène localisé temporellement, soit dynamiquement à utiliser des fonctions de corrélation entre les signaux linguistiques et cérébraux. Il est par exemple possible d'observer une potentiel négatif de grande amplitude en cas d'incongruité sémantique, une baisse de la bande de fréquence 8-12Hz du cerveau associée à la préparation de la réponse à une question ou encore une corrélation entre l'enveloppe acoustique de la parole et la dynamique oscillatoire du cerveau. Ces observations sont cependant très focalisées, et la question qui est désormais posée est celle de la possibilité de rechercher de telles corrélations pendant une conversation naturelle. Le problème ici est double : 1/ limite des modèles de prédiction linguistique adaptés à la langue parlée et 2/ limites de méthodes de traitement du signal cérébral en condition naturelle, extrêmement bruité. Pendant cette présentation, je décrirai plus précisément ces problèmes, l'état de nos connaissances y compris méthodologiques pour traiter ce type de signal et quelques pistes de recherche pour avancer dans cette direction.

15/03/24 at 11:00 - Big Blue Button

15 Mar 2024 at 11:00
Big Blue Button

Speaker: Alexandra Birch, University of Edinburgh

Translation and Large Language Models

Abstract: What is the future of translation research in the era of large language models? Brown et al. in 2020 showed that prompting GPT3 with a few examples of translation could result in translations which were higher quality than SOTA supervised models at the time (into English and only for French, German). Until this point, research on machine translation had been central to the field of natural language processing, often attracting the most submissions in annual NLP conferences and leading to many breakthroughs in the field. Since then, there has been enormous interest in models which can perform a wide variety of tasks and interest in translation as a separate sub-field has somewhat diminished. However, translation remains a compelling and widely used technology. So what is the promise of LLMs for translation and how should we best use them? What opportunities do LLMs unlock and what challenges remain? How can the field of translation still contribute to NLP? I will touch on some of my own research but I focus on these broader questions.

Download slides here:

9/02/24 at 11:00 - Big Blue Button

9 Feb 2024 at 11:00
Big Blue Button