Estimating Loss in the 15th-c. Vernacular Song Repertoire

This notebook uses the functionality in the iceberg package to estimate survival rates for literate song across various languages in fifteenth-century Europe. The capta are sourced from the alpha version of the Digital Index of Late Medieval Song (DILMS).

The capta in question are fairly simple, just pairs of SongName and SourceName for several linguistic buckets (these became song_name and source_name in later releases of DILMS). A sample of the captaset for all languages is shown below; for more details about the delicate and fraught nature of language in DILMS see that project's documentation.

Let's import all of the linguistically-segmented captasets, estimate survival rates using 256 cross-validation experiments, and plot.

Taken as a whole, quite a lot of literate 15th-century song survived the ravages of history, almost 80%! The estimated survival rate is highly dependent on language, though, with Francophone songs far more likely to survive than any other corpus of songs, and Anglophone songs particularly unlikely to survive (probably <30%).