Cristin-resultat-ID: 1675041
Sist endret: 11. februar 2019, 08:48
Resultat
Vitenskapelig foredrag
2019

Ontology-based Semantic Search For Open Government Data

Bidragsytere:
  • Shanshan Jiang
  • Thomas Fjæstad Hagelien
  • Marit Kjøsnes Natvig og
  • Jingyue Li

Presentasjon

Navn på arrangementet: 2019 IEEE 13th International Conference on Semantic Computing (ICSC)
Sted: Newport Beach, California
Dato fra: 30. januar 2019
Dato til: 1. februar 2019

Om resultatet

Vitenskapelig foredrag
Publiseringsår: 2019

Beskrivelse Beskrivelse

Tittel

Ontology-based Semantic Search For Open Government Data

Sammendrag

Open data are increasingly available in amount, but often with unprecise or incomplete description. It is time consuming and difficult to discover relevant datasets. Current open data catalogues provide mostly keyword-based search without the ability to understand the user’s intent and the contextual meaning of the datasets. Ontology-based semantic search has been well explored in semantic web as an attempt to improve the quality of search for relevant documents and web pages. This paper applies semantic and machine learning technologies to open data. It presents an approach for search of open government datasets, a relatively underexplored domain, where the semantics of data relies on metadata that describes the data. The idea is to link the published datasets with concepts from a well-defined ontology and allow searching based on hybrid indexing. A simplified ontology for the transport domain is constructed to demonstrate and test the idea. A prototype search engine has been implemented which supports both manual and automatic linking to concepts in the ontology and exploits hybrid indexing based on these linking methods. Natural language processing (NLP) techniques are applied to dataset linking and indexing and enable the independency of the natural language used for describing the datasets. The manual linking of datasets to ontology concepts is intended for domain experts and data publishers, while the automatic linking is based on the provided dataset descriptions. The automatic linking reduces the overhead of manual concepts linking and the dependency on domain experts. Preliminary results have indicated that semantic search based on ontologies is a promising approach to increase search quality and efficiency for open data search. The success of the automatic mechanism does however depend on the quality and comprehensiveness of the dataset descriptions.

Bidragsytere

Shanshan Jiang

  • Tilknyttet:
    Forfatter
    ved Software Engineering, Safety and Security ved SINTEF AS

Thomas Fjæstad Hagelien

  • Tilknyttet:
    Forfatter
    ved Klima og miljø ved SINTEF Ocean

Marit Kjøsnes Natvig

  • Tilknyttet:
    Forfatter
    ved Software Engineering, Safety and Security ved SINTEF AS

Jingyue Li

  • Tilknyttet:
    Forfatter
    ved Institutt for datateknologi og informatikk ved Norges teknisk-naturvitenskapelige universitet
1 - 4 av 4