

A significant portion of scientific knowledge resides within scholarly publications, both in print and digital formats. Recent advancements in natural language processing and information extraction techniques have enhanced the accessibility of this knowledge for further automated querying and processing. Structured and semantically-aware representations, such as ontologies, play a crucial role in simplifying and integrating access to this vast pool of knowledge. While several ontologies have been developed to capture the structure and discourse of scientific publications, there is a notable scarcity of ontologies for succinctly representing named entities that are present in scholarly documents.
This paper introduces the Ontology for Named Entity Representation (OnNER) to address this gap. OnNER is designed to represent named entities – the terms identified and labeled using named entity recognition (NER) methods – from scholarly publications. The ontology provides a structured semantic representation of the named entities, how they are labeled, and where they occur. We discuss the overall design of OnNER, its integration with other ontologies, and demonstrate how the ontology facilitates advanced querying of named entities’ presence and collocation within and across publications.