This is an online appendix to the article:

Partanen, Niko and Rueter, Jack 2025: Dialect Cartography of Erzya and Moksha Languages: Digitized Historical Sources and Evaluation of the Contemporary Data. In: Journal of Data Mining and Digital Humanities.

Three distinct regions are annotated and discussed in the article. These are:

  1. Mismatching polygons and points
  2. Settlements of Paasonen that have no polygons
  3. Polygons, under which there are no settlements

This study aims to pave a way to more comprehensive documentation and understanding of the Erzya and Moksha communities, their exact locations, interconnections and histories. Our primary sources have been the maps from the URHIA project:

Rantanen, T., Tolvanen, H., Roose, M., Ylikoski, J. & Vesakoski, O. (2022) “Best practices for spatial language data harmonization, sharing and map creation – A case study of Uralic” PLoS ONE 17(6): e0269648. https://doi.org/10.1371/journal.pone.0269648.

The information about Erzya and Moksha settlements displayed on the map as points is derived from Heikki Paasonen’s fieldwork documented in H. Paasonens Mordwinisches Wörterbuch and text collections in the volumes of Mordwinische Volksdichtung, published by the Finno-Ugrian Society.