SEMANTIC SEARCH BASED ON NATURAL LANGUAGE PROCESSING – A NUMISMATIC EXAMPLE
DOI:
https://doi.org/10.14795/j.v5i3.334Keywords:
Natural Language Processing, Ontology, IconographyAbstract
Iconographic representations on ancient artifacts are described in many existing databases and literature as human readable text. We applied Natural Language Processing (NLP) approaches in order to extract the semantics out of these textual descriptions and in this way enable semantic searches over them. This allows more sophisticated requests compared to the common existing keyword searches. As we show in our experiments based on numismatic datasets, the approach is generic in the sense that once the system is trained on one dataset, it can be applied without any further manual work also to datasets that have similar content. Of course, additional adaptions would further improve the results. Since the approach requires manual work only during the training phase, it can easily be applied to huge datasets without manual work and therefore without major extra costs. In fact, in our experience bigger datasets generate even better results because there is more data for training. Since our approach is not bound to a certain domain and the numismatic datasets are just an example, it could serve as a blueprint for many other areas. It could also help to build bridges between disciplines since textual iconographic descriptions are to be found also for pottery, sculpture and elsewhere.Downloads
References
BIRD/KLEIN/LOPER 2009 - Bird, St./ Klein, E./ Loper, E., Natural Language Processing with Python – Analyzing Text with the Natural Language Toolkit (Sebastopol: O'Reilly).
BISHOP 2006 - Bishop, Chr. M., Pattern Recognition and Machine Learning (New York: Springer).
CELESTI et alii 2017 -Celesti, A./ Salamone G./ Sapienza A./ Spinelli M./ Puglisi M./ Calatabiano M., An Innovative Cloud-Based System for the Diachronic Analysis in Numismatics. DOI: 10.1145/3084546 [Journal on Computing and Cultural Heritage 10/4, 23.1– 23.18, October 2017].
GRAF 2009 - Graf, F., Apollo (Abingdon: Routledge).
KEMKES 2013 - Kemkes, M., Caracalla - Kaiser, Tyrann, Feldherr. In: Archäologisches Landesmuseum Baden-Württemberg (ed.), Caracalla. Kaiser, Tyrann, Feldherr (Darmstadt), 7-32.
KONSTANTINOVA 2014 - Konstantinova, N., Review of Relation Extraction Methods: What Is New Out There?. DOI: 10.1007/978-3-319-12580-0_2. [In: Ignatov, D./ Khachai, M. Yu./ Panchenko, A. / Konstantinova, N./ Yavorskiy, R. E. (eds.), Analysis of Images, Social Networks and Texts. Third International Conference, AIST 2014, Yekaterinburg, Russia, April 10-12, 2014 Revised Selected Papers (New York), 15–28].
MAAß 1993 - Maaß, M., Das antike Delphi. Orakel, Schätze und Monumente (Darmstadt: Wissenschaftliche Buchgesellschaft).
LAMBRINUDAKIS et alii 1984 - Lambrinudakis, W. et al, s. v. Apollon, Lexicon Iconographicum Mythologiae Classicae, II/1, Zürich/München, 183–326
PEDREGOSA et alii 2011 - Pedregosa, F./ Varoquaux, G./ Gramfort, A./ Michel, V./ Thirion, B./ Grisel, O./ Blondel, M./ Prettenhofer, P./ Weiss, R./ Dubourg, V./ Vanderplas, J./ Passos, A./ Cournapeau, D./ Brucher, M./ Perrot, M./ Duchesnav, E., Scikit-learn: Machine Learning in Python, Journal of Machine Learning Research 12, 2825–2830.
SARKAR 2016 - Sarkar, D., Text Analytics with Python: A Practical Real-World Approach to Gaining Actionable Insights from your Data (New York: Apress)
Downloads
Published
Issue
Section
License
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal, we use CC BY-NC-ND license (Attribution-NonCommercial-NoDerivs) wich only allowing others to download your works and share them with others as long as they credit you, but they can’t change them in any way or use them commercially.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).