09h00. Reception of the public in room 70 (Attention!!! BnF is closed to the public on Monday mornings. Special access to room 70 is organized by BnF services via the East Hall entrance between 8:30 and 10:00 am, subject to registration on the half-day participant list).
09h30. Opening: SoDUCo team, The SoDUCo research program. Review
10:00 - 12:15 Metadata, cataloguing and data dissemination session
This half-day session will look at the issues of cataloguing, data dissemination and metadata for digital humanities research projects involved in an open science approach. We will present the geo-catalog developed as part of the SoDUCo project to build corpora of documents adapted to research questions from scattered sources, then keep track of the various treatments applied to them, as well as the intermediate and final states of the digital data extracted from them. In particular, we will be discussing the notions of collections and corpora, interoperability and durability, as well as continuing the reflections initiated during previous days on the relationship between inter-institutional heritage cataloguing and cataloguing geared towards research use. How can researchers and documentation specialists work together? How can we organize ourselves to make the most of SoDUCo and the SoDUCo-BnF workshop, and create the conditions for cumulativity from both the point of view of research and that of heritage institutions? What principles, what tools and what working model can we adopt to go further together on other subjects/objects and with other players/publics?
12h00. Lunch break.
13h45. Public reception in room 70.
14h00 - 14h45 - Keynote
14h45. Coffee break
15h00 - 17h30 - Data extraction and structuring session
This half-day session looks at the issues involved in the automatic analysis of semi-structured historical documents, from the detection of their layout to the spatio-temporal structuring of the information they contain. We’ll be presenting the work carried out in the SoDUCo project on 19th-century Parisian trade directories to enable spatio-temporal tracking of each business listed in these directories. On the basis of these experiences, we feel it would be useful to broaden the discussion on two aspects that still open up numerous research prospects: the automatic analysis of ancient documents and the representation of geohistorical knowledge.
17h30. Closing the day.
09h15. Public reception in room 70.
09h30 - 12h00 - Session Historical digital data - critical assessments and prospects for use
The “Digital historical data” morning session aims to compare the experience of researchers who regularly build and analyze digital data corpora based on archival documents (text and maps). At a time when we are witnessing an explosion in the production of open digital data from old sources, the classic questions of the conditions of appropriation and reuse of these new “sources” by third parties are being posed in a heightened way. In particular, this morning’s session will explore these issues in the context of large-scale digital data corpora produced by fully automated document extraction and enrichment processes. Based on the work carried out as part of the SoDUCo program, and in particular the Annuaires historiques parisiens corpus and the géolocalisation historique of directory entries on old maps, we would like to address the following questions in the course of the morning: How and under what conditions can such “digital sources” be effectively grasped? How can we understand them, assess their “quality” and their relationship with the original sources? How can we compare or combine the wide variety of works that can be derived from these datasets? Finally, we would like to collectively discuss the prospects for analyzing and exploiting the Annuaires historiques parisiens corpus.
12h00. Lunch break.
To complement the 4th session of the SoDUCo-BnF Workshop, members of the SoDUCo consortium are organizing 3 training workshops on Tuesday afternoon, November 7, with the help of the BnF-DataLab. These workshops are open by invitation or registration, and are organized in parallel. They have been designed to enable different audiences to appropriate and reuse some of the resources produced as part of the SoDUCo research program. Each workshop will propose a general scenario of use, and will then focus on putting it into practice with the trainees.
BnF DataLab Workshop poster and program
BnF-DataLab, Bibliothèque nationale de France, Site François Mitterrand, Paris.
13h15. Trainees welcome from 1.15pm in the east hall of the BnF.
13h30. Start of DataLab workshops :
ATELIER N°1 – Data mining and mapping for the general public (Access by registration and selection by the SoDUCo team)
How to access data and work with SoDUCo resources.
Org. P.-A. Le Ny & M. Hersent & M. Fernandez & J. Perret & P. Critsofoli, Salle 12 pers. [Cf. link to registration form (coming soon)]
ATELIER N°2 – Using the SoDUCo chain for extracting and enriching digital text documents (closed to the public - access by invitation only)
Org. : E. Carlinet & J. Chazalon & B. [& Nathalie Abadie] Salle 6 pers. (1)
ATELIER N°3 – Professional Geohistorical Knowledge Graphs (closed to the public - access by invitation only)
Build specialized uses of the Parisian directories corpus (data alignment, work on sub-populations, etc.).
Org. N. Abadie & S. Tual & J. Gravier & C. Bernard [& Pascal Critsofoli] Salle 6 pers. (2)
16h30. Break
16h45. Collective assessment and discussion of the workshops
17h30. Closing of the day.