• Managing historical linguistic data for computational phylogenetics and computer-assisted language comparison

    Author(s):
    Robert Forkel, Russell D. Gray, Simon J. Greenhill, Johann-Mattis List, Christoph Rzymski, Tiago Tresoldi (see profile)
    Date:
    2019
    Group(s):
    Classical Philology and Linguistics, Digital Humanists, Linguistics
    Subject(s):
    Comparative linguistics, Historical linguistics, Research--Data processing
    Item Type:
    Book chapter
    Tag(s):
    computational historical linguistics, data managment, phylogenetics, Research data management
    Permanent URL:
    http://dx.doi.org/10.17613/pwva-kz72
    Abstract:
    THIS IS A PRE-PRINT, PLEASE CITE IT AS: Tresoldi, Tiago; Rzymski, Christoph; Forkel, Robert; Greenhill, Simon J.; List, Johann-Mattis; and Gray, Russell D. (2019) “Managing historical linguistic data for computational phylogenetics and computer-assisted language comparison (PRE-PRINT)”. Jena: Max-Planck-Institute for the Science of Human History. The popularisation of computer-based methods in comparative linguistics has led to a greater awareness of issues resulting from limited data sustainability and proper data management. In this use-case and its accompanying tutorial, we present principles of data management as applied to computational phylogenetics and computer-assisted language comparison, showcasing the solutions we recommend. Instead of enumerating the many possibilities to code and use linguistic data to conduct a phylogenetic analysis, we illustrate our suggestions for phylogenetic data management in a workflow based on a concrete analysis, showing how data should be managed with the help of a published dataset, exploring the information, file formats, processes, and software involved, explaining and showing how to collect and store cross-linguistic information, how to guarantee that datasets are cross-linguistically comparable, how to store intermediate and final results of the analyses, and how to share data in a reusable form by relying in the tools and principles of the CLDF initiative.
    Notes:
    PLEASE NOTE THAT THIS IS A PRE-PRINT
    Metadata:
    Published as:
    Book chapter    
    Status:
    Published
    Last Updated:
    4 years ago
    License:
    All Rights Reserved
    Share this:

    Downloads

    Item Name: pdf tresoldi-phylogenetics-2.pdf
      Download View in browser
    Activity: Downloads: 460