Debating AI in Archaeology: applications, implications, and ethical considerations

Martina Tenzer; Giada Pistilli; Alex Brandsen; Alex Shenfield

doi:10.11141/ia.67.8

1. Introduction

Although it might seem recent, given the current hype around Large Language Models (LLMs) and generative Artificial Intelligence (AI) models for content generation (such as ChatGPT), AI is not a new development, and deployment of the technology in the fields of archaeology and heritage studies with both object and remote sensing applications has been widely documented (Bickler 2021). However, given developments and advances of AI tools in the field of text-based analysis specifically, this will be the primary focus of this article.

The term Artificial Intelligence was coined in 1956 (Russell and Norvig 2016) and described a hypothetical computer technology developed by Alan Turing (Turing 1950). Following the first AI hype in the 1950s and 60s (over-promising the capabilities of AI technology but under-performing due to the lack of computational power), AI research was interrupted by the AI winter of the 1970s and early 1980s. However, after 60 years of exponential growth, AI tools have now entered the mainstream e.g. chess computers, recommendation systems, and spam filters. Other applications are now leveraging the recent developments in LLMs e.g. the Google search function, instant translations, and closed captioning.

Increasing computational capabilities have enabled the development of Machine Learning (ML) and Neural Networks (NN). In particular, Deep Learning (DL) with its ability to learn features of interest in parallel, e.g. the attention mechanism in LLMs, pushed AI capabilities. These systems are particularly good at detecting correlations and patterns, and can categorise, predict, or extract data in the context of natural language processing. LLMs, such as Google's BARD, OpenAI's ChatGPT, or Meta's LLaMA now form the basis of a new generation of Open Source LLMs, such as Open Assistant (Köpf et al. 2023). These tools can learn and draw from extensive datasets that are based on the wide knowledge of the Internet, including data from, for example, Wikipedia, GitHub, and Google data search.

Following an early adoption of AI technologies in archaeology for objects and remote sensing applications (Bickler 2021; Argyrou and Agapiou 2022), Natural Language Processing (NLP), ML and DL are now being used for processing vast amounts of data accumulated over decades of research. This knowledge deposited in archives and grey literature can be efficiently analysed, structured, and disseminated using AI technologies, an approach that offers new insights and knowledge extraction from archaeological archives on a scale that has never been seen before.

However, while the deployment of AI technologies based on LLMs are capable of processing big data in archaeology and other fields, their application also has ethical implications. The lack of transparency of content and quality of the training data has been shown to reinforce social inequalities, misinformation, privacy issues, racial discrimination, risk to natural resources, and human workforce exploitation. Some of these are the same concerns across the discipline of archaeology and cultural heritage management (CHM), specifically regarding sensibilities around privacy, bias, and model creation in the context of policy and decision-making.

In this article, we focus on archaeology as part of that wider debate and present examples of successful AI applications in archaeology with text-based analysis as a primary focus. We then provide insight into the ethical implications associated with AI before discussing the implications of its use and its application in a safe, sustainable, and socially just way in the future. We want to initiate the discussion as to whether AI is a blessing or a curse for the discipline.

2. Applications of AI in archaeology and CHM

Archaeologists have a long tradition of adopting, adapting, and introducing technologies from other disciplines. For example, the pantograph preceded digital photography or survey methods (Novaković 2018), Lidar has proved useful for detecting sites particularly across difficult terrain (Cohen et al. 2020), and AI image recognition techniques have been introduced in archaeology for remote sensing (Verschoof-van der Vaart et al. 2020) and object recognition (Anichini et al. 2021).

However, adopting AI technology for text analysis is more challenging. Language is complex with ambiguities and hidden meaning beyond the pure text structure. NLP has immensely benefited from the integration of LLMs. ML and DL have been applied to, for example, archaeological prediction and detection (Resler et al. 2021) and convolutional neural networks (CNN) have been used to translate old Sumerian and Akkadian cuneiform tablets (Gutherz et al. 2023). Generative AI is helping to recreate the landscapes of the past for more immersive research of the past (Cobb 2023). Big data has been successfully linked in the project 'Unpath'd Waters (Eagles 2022).

A current cultural heritage project applied NLP, in particular Topic Modelling (TM) and ML, to explore the values attributed by people to familiar cultural landscapes (Tenzer 2022; Tenzer and Schofield 2023). Social media data, online surveys, and interviews provided sufficiently large datasets to infer heritage values from a 'bottom-up' or people-centred perspective. TM allows the identification of patterns as themes latent in or emerging from the data, which guarantees an assumption-free approach to empirical data.

AI can also help to deal with the data deluge being experienced by archaeologists (Bevan 2015). The AGNES project facilitates large-scale synthesising research in the Netherlands, by integrating ML into a search engine which aims to index all the texts about archaeology in the region, some 200,000 documents. Specifically, it uses Named Entity Recognition to automatically detect all time periods, artefacts, and place names, which can then be used in search queries. This allows for more exhaustive and precise searches. In a case study on early medieval cremations, researchers found that 30% more cremations were being found in the literature than were previously known (Brandsen and Lippok 2021).

As well as AI-assisted search and TM, recent advances in the application of LLMs in NLP have shown promise in the identification of personally identifiable information (PII) and potential copyright infringements in digital publishing of archival data from modern historical periods. Legislative requirements (including those imposed by the EU's General Data Protection Regulations and extensions of copyright terms) mean that publishers of historical and heritage archives currently need to spend significant amounts of time and manual effort ensuring compliance in these fields. Supporting publishing and editorial teams in this process has significant benefits in terms of both the amount of material that can be digitised and published and in catching cases of infringing content that might have otherwise been missed.

However, as useful as the technology seems to be, it comes with a human and environmental cost. In the next section, we will present the challenges and risks of AI deployment from an ethical and environmental view as a counterbalance to the advantages and opportunities.

3. Ethical considerations - exclusion, limitation, bias

The latest AI advancements have given rise to several ethical considerations that warrant thorough examination. In particular, concerns have been raised regarding the transparency of the content and quality of the training data used in AI applications (Bender et al. 2021). These factors have been shown to perpetuate social inequalities (Casilli 2019), propagate misinformation (Wilner 2018), and compromise privacy (Véliz 2021). Furthermore, the use of AI technologies has been linked to instances of racial discrimination (Raji et al. 2020), the endangerment of natural resources, and the exploitation of human labour (Crawford 2021).

Within the discipline, concerns surrounding privacy, bias, and model creation, are critical for formulating policies and decision-making. For instance, AI algorithms in analysing archaeological data could inadvertently lead to biased interpretations of historical events or the reinforcement of existing power structures if the models used are not designed with ethical considerations in mind, specifically, potential harms of fostering a linguistic monoculture, unintentionally strengthening existing power structures, and becoming a monocultural value carrier (Johnson et al. 2022; Pistilli 2022). Since archaeology is also about understanding human history through material remains, language becomes a key component of cultural heritage and identity. If archaeological narratives are dominated by a single language or cultural perspective, this can lead to a skewed understanding of the past, privileging certain histories over others.

There is also a need for explainability and transparency in the approach to data collection in qualitative research. As shown in the heritage case study (Tenzer 2022; Tenzer and Schofield 2023), AI can help analyse vast amounts of social media data or survey responses. However, generating models based on such data can introduce or reinforce biases by excluding already marginalised groups for example. Shaping policies on models trained on such data would introduce these societal inequalities into systems of governance. The public also needs to have the opportunity to opt-out with regard to data privacy, particularly in the cases where vast data sets are scraped or mined from the internet for AI training purposes.

While AI has the potential to analyse vast amounts of data and is particularly good at pattern detection (e.g. Casini et al. 2023), the technology has the potential to replace human volunteers in citizen science projects (Ponti and Seredko 2022). This can lead to a decrease in inclusive and engaging projects within archaeology. Excluding the public from the process of data collection and knowledge creation, and instead reducing their participation to the final product of archaeological investigations, can lead to their alienation from archaeology.

Finally, 'garbage in, garbage out' and 'black box' effects carry the risk of creating new content from already flawed data and in an opaque process (Huggett 2021). Kansteiner (2022) and Clavert and Gensburger (2023) warn about the risk of using ChatGPT to reshape historical narratives:

'If we think that the stories and images we consume influence our memories, identities, and future behaviour, we should be very wary about letting AI craft our future entertainment on the basis of our morally and politically deeply flawed cultural heritage' (Kansteiner 2022, 124)

In the same vein, generative AI technology will take realities of cultural heritage into a new dimension with challenges for authenticity and speculative interpretation in a new era of knowledge production and presentation (Spennemann 2023). A similar effect can be expected in the analysis of large archaeological datasets, shaping a narrative of the past based on weights (parameters in neural networks) in hidden layers (Cobb 2023).

Four key messages around ethical considerations result from these observations:

The issue of biases emerging from the data used for training AI models is serious. It is therefore crucial to ensure data are as representative as possible. Researchers across the discipline of archaeology and CHM should work closely with data scientists and social scientists to design representative sampling strategies and data gathering methods, and to develop protocols for assessing and correcting for bias in datasets.
The intersection of data science, philosophy, and archaeology suggests the advent of a new kind of archaeological specialism. Within this area of practice, archaeologists will need to understand the nuances of AI and ML, and be well-versed in ethical considerations. Furthermore, users of the new technology have to understand the agency and autonomy of new technology for, as Huggett (2021, 428) argues, “in some cases the system can appear to replace human expertise”.
The use of AI in shaping historical narratives is controversial. While AI has the potential to analyse large datasets and reveal patterns not always discernible to human eyes, it also carries the risk of propagating flawed interpretations of the past, particularly if the underlying data are biased. Stringent checks will therefore be needed on the application of AI in this context. This includes the implementation of explainable AI (XAI) techniques to make the decision-making processes of these systems understandable to humans. However, the implementation of XAI techniques, even in simple application domains, is challenging. Two contrasting XAI philosophies exist (Barredo Arrieta et al. 2020)
- designing inherently interpretable AI/ML systems
- applying post-hoc explainability models (such as SHAP (Lundberg and Lee 2017)) to try and explain decisions made by AI models. A key disadvantage of inherently interpretable AI models is that they limit the power and complexity of such approaches, particularly in leveraging the latest generations of generative AI systems. However, criticism has been levelled at post-hoc methods regarding how closely their explanations relate to the decisions made by AI algorithms.
Ethical guidelines for AI applications in archaeology and heritage practice need to be drafted and widely adopted to prevent misuse and to promote the responsible use of these powerful technologies. However, crafting ethical guidelines for AI use in archaeology requires a balance between preventing misuse and adapting to the varied legal and practical contexts of global research environments. Discussions at the World Archaeological Congress (WAC 2024) and studies on remote sensing practices (Fisher et al. 2021) stress the challenge of developing standards that accommodate the distinct local regulations and the particularities of conducting research across different cultures and regions. Nevertheless, Davis (2020, 1) argues, that a high level of automation based on algorithms has the potential to create 'consistent definitions which permit reproducible research designs', demonstrating the advantages of automation for compatibility and reproducibility of data.

4. Discussion

Recent developments and the rapid adoption of AI technology into archaeology and heritage practice, as presented here, show the need for a debate around ethical implications and sustainable applications of AI. To enable the discourse, we have presented the advantages and capabilities of the applications, which allow more time and resource efficient workflows (Tenzer 2022; Tenzer and Schofield 2023), and enable the analysis and reuse of 'big data' accumulated over decades of archaeological investigations currently dormant in archives and grey literature (Brandsen and Lippok 2021). We also provide different views on the implications of AI applications from archaeology, heritage studies, data science and philosophy, showing inherent challenges regarding limitation, bias and social impact (Bender et al. 2021; Casilli 2019; Crawford 2021; Véliz 2021).

Interdisciplinary/cross-disciplinary research and collaboration will be necessary in the near future to apply this technology to a wide variety of disciplines. Collaboration between data science, sociology, philosophy and archaeology is becoming increasingly important. Understanding how AI technology can influence epistemology and hermeneutics has to focus the discussion on the agency and cognitive artefacts of the technology in view of the output (Huggett 2021, 421). University courses bridging the complex knowledge of the various disciplines will be increasingly necessary. The projects presented here and the collaboration of the authors of this article exemplify how cooperation can work to foster mutually beneficial collaboration.

The discipline also needs to understand how AI deployment will impact on future employment for archaeologists and the changing work environment. What are the prospects for future archaeologists in professional and academic careers? Do we need to become computer scientists ourselves and teach this to our students? Ultimately, will AI replace archaeologists? Harari (2017) argues that there is 'only a 0.7% chance'. AI can replace the monotonous tasks of daily work and carry out the large-scale analyses that precede archaeological work. However, the technology is evolving with increasing speed, and making predictions about the future impact on the profession, especially after the COVID-19 pandemic, is difficult going forward.

AI deployment in the discipline needs to run alongside the development of strategies and best practice guidelines to safeguard the responsible, fair, and sustainable use of this new technology. Exploitation of human and natural resources, and the cost on the environment, needs to be highlighted, and potential risks to reinforce social inequality must be considered.

Archaeology and CHM scholars are well equipped to study and deal with the societal effects of AI, as they already look at large scale influences on society, and have the theories, methods, and background for these analyses. But to do so here, they first need to understand AI methods and their implications.

5. Conclusion

In post-phenomenological ontology, humans are experiencing the world with and through technology (Gattiglia 2022; Ihde 2009). While we are at a point where machines not only assist humans (first machine revolution), but replace humans in the production or creative workflow (second machine revolution), we need to reorientate and redefine objectives. AI is here to stay, and the question will be how to use it responsibly and sustainably.

This means alignment: where does the technology work towards humanity's values and goals? Where are the dangers and risks of losing control? What are the benefits for society and humanity as a whole (not for the benefit of a few, but for the improvement of the environment, health, and society of the many)? Where does the development go from here? How can AI shape the future of the past - by increasing our understanding of the past, using the vast amount of archaeological and historical data to create material that promotes and conveys this knowledge? Where does the future of the discipline lie regarding cooperation and education?

We are at a point where archaeology and heritage practice cannot just benefit from these technological developments and advances, but must also contribute to the ethical and practical discussion of AI in human culture and society. Coming back to the initial question as to whether AI in archaeology and CHM is a blessing or a curse, we have provided examples of the advantages and beneficial applications of the technology, but have also highlighted the challenges that need to be resolved before AI can be used safely and democratically. The debate is wide open.

Acknowledgements

This article is part of an AHRC/UKRI WRoCAH-funded PhD project at the University of York. Grant reference number: AH/R012733/1.

Anichini, F., Dershowitz, N., Dubbini, N., Gattiglia, G., Itkin, B. and Wolf, L. 2021 'The automatic recognition of ceramics from only one photo: The ArchAIDE app', Journal of Archaeological Science: Reports 36, 102788. https://doi.org/10.1016/j.jasrep.2020.102788

Argyrou, A. and Agapiou, A. 2022 'A review of artificial intelligence and remote sensing for archaeological research', Remote Sensing 14(23), 6000. https://doi.org/10.3390/rs14236000

Barredo Arrieta, A., Díaz-Rodríguez, N., Del Ser, J., Bennetot, A., Tabik, S., Barbado, A., Garcia, A., Gil-Lopez, S., Molina, D., Benjamins, R., Chatila., R. and Herrera, F. 2020 'Explainable artificial intelligence (XAI): concepts, taxonomies, opportunities and challenges toward responsible AI', Information Fusion 58, 82-115. https://doi.org/10.1016/j.inffus.2019.12.012

Bender, E.M., Gebru, T., McMillan-Major, A. and Shmitchell, S. 2021 'On the dangers of stochastic parrots: can language models be too big?' in Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency. FAccT '21. 1 March 2021, New York, NY, USA: Association for Computing Machinery. 610-623. https://doi.org/10.1145/3442188.3445922

Bevan, A. 2015 'The data deluge', Antiquity 89(348), 1473-1484. https://doi.org/10.15184/aqy.2015.102

Bickler, S. H. 2021 'Machine learning arrives in archaeology', Advances in Archaeological Practice 9(2), 186-191. https://doi.org/10.1017/aap.2021.6

Brandsen, A. and Lippok, F. 2021 'A burning question - using an intelligent grey literature search engine to change our views on early medieval burial practices in the Netherlands', Journal of Archaeological Science 133, 105456. https://doi.org/10.1016/j.jas.2021.105456

Casilli, A. 2019 En Attendant les Robots: Enquête sur le Travail du Clic, Paris: Seuil.

Casini, L., Marchetti, N., Montanucci, A., Orrù, V. and Roccetti, M. 2023 'A human-AI collaboration workflow for archaeological sites detection', Scientific Reports 13(1), 8699. https://doi.org/10.1038/s41598-023-36015-5

Clavert, F. and Gensburger, S. 2023 'Is artificial intelligence the future of collective memory? Bridging AI scholarship and Memory Studies', Call for papers for the second volume of the Memory Studies Review (Brill publishing), 2024. https://www.c2dh.uni.lu/sites/default/files/cfp_is_artificial_intelligence_the_future_of_collective_memory_-_2023.pdf [Last accessed: 2 November 2023]

Cobb, P. 2023 'Large Language Models and Generative AI, Oh My!: Archaeology in the Time of ChatGPT, Midjourney, and Beyond', Advances in Archaeological Practice 11, 363-369. https://doi.org/10.1017/aap.2023.20

Cohen, A., Klassen, S. and Evans, D. 2020 'Ethics in Archaeological Lidar', Journal of Computer Applications in Archaeology 3(1), 76-91. https://doi.org/10.5334/jcaa.48

Crawford, K. 2021 Atlas of AI: Power, Politics, and the Planetary Costs of Artificial Intelligence, New Haven: Yale University Press. https://doi.org/10.12987/9780300252392

Davis, D. 2020 'Defining what we study: The contribution of machine automation in archaeological research', Digital Applications in Archaeology and Cultural Heritage 18, e00152. https://doi.org/10.1016/j.daach.2020.e00152

Eagles, P. 2022 'Artificial intelligence for data enhancement, linking and exploration', Unpath'd Waters [website] https://unpathdwaters.org.uk/artificial-intelligence-for-data-enhancement-linking-and-exploration/ [Last accessed 2 November 2023].

Fisher, M., Fradley, M., Flohr, P., Rouhani, B. and Simi, F. 2021 'Ethical considerations for remote sensing and open data in relation to the endangered archaeology in the Middle East and North Africa project', Archaeological Prospection 28(3), 279-292. https://doi.org/10.1002/arp.1816

Gattiglia, G. 2022 'A postphenomenological perspective on digital and algorithmic archaeology', Archeologia e Calcolatori 33(2), 319-334. https://doi.org/10.19282/ac.33.2.2022.17

Gutherz, G., Gordin, S., Sáenz, L., Levy, O. and Berant, J. 2023 'Translating Akkadian to English with neural machine translation', PNAS Nexus 2(5), pgad096. https://doi.org/10.1093/pnasnexus/pgad096

Harari, Y.N. 2017 'The rise of the useless class', ideas.ted.com [website] https://ideas.ted.com/the-rise-of-the-useless-class/ [Last accessed: 17 May 2023].

Huggett, J. 2021 'Algorithmic agency and autonomy in archaeological practice', Open Archaeology 7, 417-434. https://doi.org/10.1515/opar-2020-0136

Ihde, D. 2009 Postphenomenology and Technoscience: The Peking University Lectures, SUNY series in the Philosophy of the Social Sciences, New York, NY, USA: Suny Press. https://sunypress.edu/Books/P/Postphenomenology-and-Technoscience [Last accessed: 17 May 2023].

Johnson, R.L., Pistilli, G., Menédez-González, N., Denisse Dias Duran, L., Panai, E., Kalpokiene, J. and Bertulfo, D.J. 2022 'The ghost in the machine has an American accent: value conflict in GPT-3', arXiv. https://doi.org/10.48550/arXiv.2203.07785

Kansteiner, W. 2022 'Digital doping for historians: can history, memory, and historical theory be rendered artificially intelligent?', History and Theory 61(4), 119-133. https://doi.org/10.1111/hith.12282

Köpf, A., Kilher, Y., von Rütte, D., Anagnostidis, S., Tam, Z.-R, Stevens, K., Barhoum, A., Duc, N.M., Stanley, O., Nagyfi, R., ES, S., Suri, S., Glushkov, D., Dantuluri, A., Maguire, A., Schuhmann, C., Nguyen, H. and Mattick, A. 2023 'OpenAssistant conversations -- Ddemocratizing large language model alignment', arXiv. https://doi.org/10.48550/arXiv.2304.07327 [Last accessed: 10 May 2023].

Lundberg, S. and Lee, S.-I. 2017 'A unified approach to interpreting model predictions', arXiv. https://doi.org/10.48550/arXiv.1705.07874

Novaković, P. 2018 'Impact of the large-scale excavations in the Slovene preventive archaeology'. https://www.academia.edu/42083263/Impact_of_the_large_scale_excavations_in_the_Slovene_preventive_archaeology[Last accessed: 17 May 2023].

Pistilli, G. 2022 'What lies behind AGI: ethical concerns related to LLMs', Revue Ethique et Numérique, hal-03607808. https://hal.science/hal-03607808 [Last accessed: 17 May 2023].

Ponti, M. and Seredko, A. 2022 'Human-machine-learning integration and task allocation in citizen science', Humanities and Social Sciences Communications 9(1), 1-15. https://doi.org/10.1057/s41599-022-01049-z

Raji, I.D., Gebru, T., Mitchell, M., Buolamwini, J. and Lee, J. and Denton, E. 2020 'Saving Face: investigating the ethical concerns of facial recognition auditing' in Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society. AIES '20. 7 February 2020, New York, NY, USA: Association for Computing Machinery. 145-151. https://doi.org/10.1145/3375627.3375820

Resler, A., Yeshurun, R., Natalio, F., and Giryes, R. 2021 'A deep-learning model for predictive archaeology and archaeological community detection', Humanities and Social Sciences Communications 8(1), 1-10. https://doi.org/10.1057/s41599-021-00970-z

Russell, S. and Norvig, P. 2016 Artificial Intelligence: A Modern Approach, Global Edition, Boston: Pearson. https://aima.cs.berkeley.edu/global-index.html [Last accessed: 17 May 2023].

Spennemann, D. H. 2023 'Generative artificial intelligence, human agency and the future of cultural heritage', SSRN Electronic Journal. https://doi.org/10.2139/ssrn.4583327

Tenzer, M. 2022 'Tweets in the Peak: Twitter analysis - the impact of Covid-19 on cultural landscapes', Internet Archaeology 59. https://doi.org/10.11141/ia.59.6

Tenzer, M. and Schofield, J. 2023 'Using Topic Modelling to reassess heritage values from a people-centred perspective - Applications from the north of England', Cambridge Archaeological Journal. https://doi.org/10.1017/S0959774323000203

Turing, A.M. 1950 'I.-computing machinery and intelligence', Mind LIX(236), 433-460. https://doi.org/10.1093/mind/LIX.236.433

Véliz, C. 2021 Privacy is Power: Why and How You Should Take Back Control of Your Data, Corgi.

Verschoof-van der Vaart, W. B., Lambers, K., Kowalczyk, W. and Bourgeois, Q.P.J. 2020 'Combining deep learning and location-based ranking for large-scale archaeological prospection of LiDAR data from the Netherlands', ISPRS International Journal of Geo-Information 9(5), 293. https://doi.org/10.3390/ijgi9050293

World Archaeological Congress (WAC) 2024 Code of Ethics, World Archaeological Congress. https://worldarchaeologicalcongress.com/code-of-ethics/ [Last accessed: 26 February 2024].

Wilner, A. S. 2018 'Cybersecurity and its discontents: artificial intelligence, the Internet of Things, and digital misinformation', International Journal 73(2), 308-316. https://doi.org/10.1177/0020702018782496