SEMANTIC SEARCH BASED ON NATURAL LANGUAGE PROCESSING – A NUMISMATIC EXAMPLE

Authors

  • Karsten Tolle Databases and Information Systems (DBIS), Johann Wolfgang Goethe-University of Frankfurt, Robert-Mayer-Straße 10, D-60325 Frankfurt am Main, Germany
  • Patricia Klinger Databases and Information Systems (DBIS), Johann Wolfgang Goethe-University of Frankfurt, Robert-Mayer-Straße 10, D-60325 Frankfurt am Main, Germany
  • Sebastian Gampe Databases and Information Systems (DBIS), Johann Wolfgang Goethe-University of Frankfurt, Robert-Mayer-Straße 10, D-60325 Frankfurt am Main, Germany
  • Ulrike Peter Berlin-Brandenburgische Akademie der Wissenschaften (BBAW), Unter den Linden 8, D-10117 Berlin, Germany

DOI:

https://doi.org/10.14795/j.v5i3.334

Keywords:

Natural Language Processing, Ontology, Iconography

Abstract

Iconographic representations on ancient artifacts are described in many existing databases and literature as human readable text. We applied Natural Language Processing (NLP) approaches in order to extract the semantics out of these textual descriptions and in this way enable semantic searches over them. This allows more sophisticated requests compared to the common existing keyword searches. As we show in our experiments based on numismatic datasets, the approach is generic in the sense that once the system is trained on one dataset, it can be applied without any further manual work also to datasets that have similar content. Of course, additional adaptions would further improve the results. Since the approach requires manual work only during the training phase, it can easily be applied to huge datasets without manual work and therefore without major extra costs. In fact, in our experience bigger datasets generate even better results because there is more data for training. Since our approach is not bound to a certain domain and the numismatic datasets are just an example, it could serve as a blueprint for many other areas. It could also help to build bridges between disciplines since textual iconographic descriptions are to be found also for pottery, sculpture and elsewhere.

Downloads

Download data is not yet available.

References

BIRD/KLEIN/LOPER 2009 - Bird, St./ Klein, E./ Loper, E., Natural Language Processing with Python – Analyzing Text with the Natural Language Toolkit (Sebastopol: O'Reilly).

BISHOP 2006 - Bishop, Chr. M., Pattern Recognition and Machine Learning (New York: Springer).

CELESTI et alii 2017 -Celesti, A./ Salamone G./ Sapienza A./ Spinelli M./ Puglisi M./ Calatabiano M., An Innovative Cloud-Based System for the Diachronic Analysis in Numismatics. DOI: 10.1145/3084546 [Journal on Computing and Cultural Heritage 10/4, 23.1– 23.18, October 2017].

GRAF 2009 - Graf, F., Apollo (Abingdon: Routledge).

KEMKES 2013 - Kemkes, M., Caracalla - Kaiser, Tyrann, Feldherr. In: Archäologisches Landesmuseum Baden-Württemberg (ed.), Caracalla. Kaiser, Tyrann, Feldherr (Darmstadt), 7-32.

KONSTANTINOVA 2014 - Konstantinova, N., Review of Relation Extraction Methods: What Is New Out There?. DOI: 10.1007/978-3-319-12580-0_2. [In: Ignatov, D./ Khachai, M. Yu./ Panchenko, A. / Konstantinova, N./ Yavorskiy, R. E. (eds.), Analysis of Images, Social Networks and Texts. Third International Conference, AIST 2014, Yekaterinburg, Russia, April 10-12, 2014 Revised Selected Papers (New York), 15–28].

MAAß 1993 - Maaß, M., Das antike Delphi. Orakel, Schätze und Monumente (Darmstadt: Wissenschaftliche Buchgesellschaft).

LAMBRINUDAKIS et alii 1984 - Lambrinudakis, W. et al, s. v. Apollon, Lexicon Iconographicum Mythologiae Classicae, II/1, Zürich/München, 183–326

PEDREGOSA et alii 2011 - Pedregosa, F./ Varoquaux, G./ Gramfort, A./ Michel, V./ Thirion, B./ Grisel, O./ Blondel, M./ Prettenhofer, P./ Weiss, R./ Dubourg, V./ Vanderplas, J./ Passos, A./ Cournapeau, D./ Brucher, M./ Perrot, M./ Duchesnav, E., Scikit-learn: Machine Learning in Python, Journal of Machine Learning Research 12, 2825–2830.

SARKAR 2016 - Sarkar, D., Text Analytics with Python: A Practical Real-World Approach to Gaining Actionable Insights from your Data (New York: Apress)

Downloads

Published

2018-09-12

Issue

Section

Digital Numismatics

How to Cite

SEMANTIC SEARCH BASED ON NATURAL LANGUAGE PROCESSING – A NUMISMATIC EXAMPLE. (2018). JOURNAL OF ANCIENT HISTORY AND ARCHAEOLOGY, 5(3). https://doi.org/10.14795/j.v5i3.334

Similar Articles

1-10 of 48

You may also start an advanced similarity search for this article.