    Bibliographic Reference Analysis in Archival Data Using Supervised Machine Learning and Grammatical Features

    Author
    Philips, James Patrick
    Abstract
    Bibliographic references are integral to scholarly discourse in humanities disciplines. While prior work has focused on reference extraction and parsing, little research has investigated the classification of footnotes containing bibliographic citations and author commentary using supervised machine learning methodologies. For this thesis, we contextualize bibliographic reference analysis within the broader domain of archival document processing through an original literature survey of current techniques, tools, and trends in the field of historical document processing. Next, we review related work on bibliographic citation identification and reference parsing. Finally, using a historiographic dataset drawn from the JSTOR humanities archive, we train and compare the performance of a suite of single and hybrid machine learning classifiers on a novel, previously unexplored bibliographic reference classification task. Moreover, as a part of this analysis, we compare the performance of traditional features and novel, grammatical features drawn from natural language processing. Our work demonstrates the superiority of hybrid models for classification of scholarly footnotes containing historiographic bibliographic references, the transferability of features from reference extraction to this research problem, and the viability of training machine learning models for this task utilizing novel, grammatical features.
    URI
    http://hdl.handle.net/10342/9733
    Subject
     Bibliographic references; supervised machine learning; grammar 
    Date
    2021-11-19
    Citation:
    APA:
    Philips, James Patrick. (November 2021). Bibliographic Reference Analysis in Archival Data Using Supervised Machine Learning and Grammatical Features (Master's Thesis, East Carolina University). Retrieved from the ScholarShip (http://hdl.handle.net/10342/9733).


    MLA:
    Philips, James Patrick. Bibliographic Reference Analysis in Archival Data Using Supervised Machine Learning and Grammatical Features. Master's Thesis. East Carolina University, November 2021. The ScholarShip. http://hdl.handle.net/10342/9733. Accessed January 31, 2023.
    Chicago:
    Philips, James Patrick, “Bibliographic Reference Analysis in Archival Data Using Supervised Machine Learning and Grammatical Features” (Master's Thesis, East Carolina University, November 2021).
    AMA:
    Philips, James Patrick. Bibliographic Reference Analysis in Archival Data Using Supervised Machine Learning and Grammatical Features [Master's Thesis]. Greenville, NC: East Carolina University; November 2021.
    Collections
    • Master's Theses
    Publisher
    East Carolina University


    East Carolina University has created ScholarShip, a digital archive for the scholarly output of the ECU community.
