Research projects

EU projects

EHRI “European Holocaust Research Infrastructure”

EHRI “European Holocaust Research Infrastructure”

Digitalisation of the archival research on the Holocaust.

DARIAH

DARIAH

Digital Research Infrastructure for the Arts and Humanities.

CounteR

CounteR

Privacy-first situational awareness platform for violent terrorism and crime prediction, counter-radicalisation and citizen protection.

ANR projects

ParSiTi

ParSiTi

Context-aware parsing and machine translation of user-generated content.

BASNUM

BASNUM

Computational exploitation of Henri Basnage de Beauval’s (1701) encyclopaedic dictionary.

PARSE-ME

PARSE-ME

Multi-word expressions in parsing.

TIME-US

TIME-US

Digital study of remuneration and time budget textile trades in XVIIIth and XIXth century France.

Other national projects

PRAIRIE

PRAIRIE

An institute for interdisciplinary research and education in Artificial Intelligence, foudned by 5 academic an 16 industrial members.

Cap'FALC

Cap'FALC

Development of a text simplification algorithm and an accessible tool to ease the production of FALC (the French equivalent of “Easy read”).

LECTAUREP

LECTAUREP

Development of a platform for the transcription, reading and automatic analysis of notarial deeds present in the National Archives.

DAHN

DAHN

Digitalisation and computational exploitation of archives of historical interest.

NER4archives

NER4archives

Named entity recognition for finding aids in XML-EAD, a standard for encoding descriptive information regarding archival records.

EFL

EFL

Empirical foundations of linguistics, including computational linguistics and natural language processing.

LiLT

LiLT

Linguistic issues in language technology.

Huma-Num

Huma-Num

Very large research infrastructure (TGIR) aimed at facilitating the digitalisation of humanities and social sciences.

Matériaux Anciens et Patrimoniaux

Matériaux Anciens et Patrimoniaux

The Domaine d’intérêt majeur (DIM) Heritage and ancient materials is the Île-de-France research network dedicated to the study of ancient and patrimonial materials.

Nénufar

Nénufar

Digitalisation and exploitation of the early editions of the Petit Larousse dictionary.

International projects

Universal Dependencies Project

Universal Dependencies Project

The Universal Dependencies project is an open community effort with over 300 contributors producing nearly 200 treebanks in over 100 languages.

BigScience

BigScience

Informal international research collaboration on very large language models.

Past projects

EU projects
  • enCollect (COST, 2017-2020): Combining language learning and crowdsourcing for developing language teaching materials and more generic language resources for NLP.
  • DESIR (H2020, 2017-2019): The DESIR project aims at contributing to the sustainability of the DARIAH infrastructure along all its dimensions: dissemination, growth, technology, robustness, trust and education. Inria is responsable for providing of a portfolio of text analytics services based on GROBID and entity-fishing.
  • HIRMEOS (H2020, 2017-2019): Integration of Research Monographs in the European Open Science infrastructure.
  • Parthenos (H2020, 2015-2019): Strengthening the cohesion of research in the broad sector of Linguistic Studies, Humanities, Cultural Heritage, History, Archaeology and related fields through a thematic cluster of European Research Infrastructures, integrating initiatives, e-infrastructures and other world-class infrastructures, and building bridges between different, although tightly interrelated, fields.
  • Iperion CH (H2020, 2015-2019): Coordinating infrastructural activities in the cultural heritage domain.
ANR projects
  • Profiterole (ANR, 2017-2021): Modelling and analysis of Medieval French.
  • SoSweet (ANR, 2015-2020): Studying sociolinguistic variability on Twitter, comparing linguistic and graph-based views on tweets.
  • VerDI (ANR RAPID, 2015-2018): Automatic identification of information concealment on the internet.
Other national projects
  • OPALINe (PIA, 2017-2020): Development of tools for the accessibility of digital books for visually impaired people.
International projects
  • NLP Resources for Analyzing Reactions to Major Events in Hebrew and French Social Media (PHC Maïmonide, 2018-2019): Building NLP resources for analyzing reactions to major events in Hebrew and French social media.
  • MCM-NL (ANR-NSF, 2016-2020): Exploring correlations between data from neuro-imagery (fMRI, EEG) and data from NLP tools (mostly parsers). The data comes from “Le Petit Prince” read in French and English, and parsed with different parsers.