
GeologyOracle has been released today, free to use. “It is the first AI assistant specifically trained for geoscience. A typical GeologyOracle session resembles an instant messaging chat, i.e., the human user inputs information, which can be in the form of text, images, or audio, and the AI system simulates the conversation by generating a real-time text in response to the user input.”. Link here: https://geologyoracle.com/
I congratulate the university researchers on their noble intent and hard work to create this system to support geoscience researchers, students and geotourism to support UNESCO Geoparks. They have some genuinely innovative ideas and it is a useful contribution to this area. There is no registration process, so no personal details are captured which is good.
I reached out to the authors and am awaiting details on whether they intend to openly release under FAIR principles (1) the earth science “training” data they used to configure the system, (2) the 152 questions used to benchmark the performance of the system, and (3) open-source the code/prompts written on top of GPT-4o. These would support open-science for the whole geological community.
I have some ethical concerns on this first release, which may be supported by UNESCO Geoparks, which targets geoscientists. These include:
(1) None of the answers cite their source. For a scientific AI tool this is problematic, providing no easy way for answers to be verified or authors to be recognised. This would also go against UNESCO’s own recommendations on AI Ethics. Considering this is in part likely a Retrieval Augmented Generation (RAG) system, it should be easy to do this.
(2) The system uses GPT-4. Nowhere does it state on the website that the text and images you upload may be used by OpenAI for training data which could find their way into commercial models. So confidential content should not be uploaded, which may not be an issue for geotourism, but might be for other geoscientists. Geoscientists should be made aware.
(3) Of the 6 tools provided, I feel the ‘Grade a geoscience text’ tool for teachers and students that provides a score on a scale of 1 to 10 for your work, may not be something that is necessarily ‘good’ to be promoted in UNESCO’s name. One for ethical debate.
From a performance point of view, I compared answers, both text and images, to results from using the generic GPT-4o app. My quick tests appear to indicate tGeologyOracle did not perform as simply using native GPT-4o (which has no additional special geological training). So it is unclear to what extent GeologyOracle improves on what is already available in terms of AI/Large Language Models (Including vision). This was not tested in the paper either, perhaps an area for further research.