AWS Machine Learning Blog | May 3, 00:26
WordFinder app: Harnessing generative AI on AWS for aphasia communication

This post describes the WordFinder mobile app, developed by the QARC team in collaboration with AWS. The app uses AWS generative AI to help people with aphasia expand their vocabulary and communicate more effectively through image recognition and semantic word association. Starting from an image, WordFinder combines services such as Amazon Rekognition and Amazon Bedrock to suggest related words that help users express their ideas. The project illustrates AWS technology applied to social good, showing how technology can improve the lives of a specific community.

📸 With WordFinder, users can take or select a photo; the app uses Amazon Rekognition to identify the objects in the image and extract labels.

💡 The recognized object labels form an initial word list; users can then select related words and explore semantically to find a more precise way to express themselves.

⚙️ At its core, WordFinder uses generative AI models available through Amazon Bedrock (such as Anthropic's Claude), invoked through API Gateway and a Lambda function, to return words semantically related to the initial list.

📱 The app is built with React Native and deployed with AWS Amplify for cross-platform support; users authenticate securely through Amazon Cognito.

In this post, we showcase how Dr. Kori Ramajoo, Dr. Sonia Brownsett, Prof. David Copland, from QARC, and Scott Harding, a person living with aphasia, used AWS services to develop WordFinder, a mobile, cloud-based solution that helps individuals with aphasia increase their independence through the use of AWS generative AI technology.

In the spirit of giving back to the community and harnessing the art of the possible for positive change, AWS hosted the Hack For Purpose event in 2023. This hackathon brought together teams from AWS customers across Queensland, Australia, to tackle pressing challenges faced by social good organizations.

The University of Queensland’s Queensland Aphasia Research Centre (QARC)’s mission is to improve access to technology for people living with aphasia, a communication disability that can impact an individual’s ability to express and understand spoken and written language.

The challenge: Overcoming communication barriers

In 2023, it was estimated that more than 140,000 people in Australia were living with aphasia. This number is expected to grow to over 300,000 by 2050. Aphasia can make everyday tasks like online banking, using social media, and trying new devices challenging. The goal was to create a mobile app that could assist people with aphasia by generating a word list of the objects in a user-selected image and extending the list with related words, enabling them to explore alternative communication methods.

Overview of the solution

The following screenshot shows an example of navigating the WordFinder app, including sign in, image selection, object definition, and related words.


    In the preceding screenshot, the following scenario unfolds: 

    1. Sign in: The first screen shows a simple sign-in page where users enter their email and password. It includes options to create an account or recover a forgotten password.
    2. Image selection: After signing in, users are prompted to Pick an image to search. This screen is initially blank.
    3. Photo access: The next screen shows a popup requesting access to the user's photos, with a grid of sample images visible in the background.
    4. Image chosen: After an image is selected (in this case, a picture of a koala), the app displays the image along with initial tags or classifications such as Animal, Bear, Mammal, Wildlife, and Koala.
    5. Related words: The final screen shows a list of related words based on the selection of Related Words next to Koala on the previous screen. This step is crucial for people with aphasia, who often have difficulty with word-finding and verbal expression. By exploring related words (such as habitat terms like tree and eucalyptus, or descriptive words like fur and marsupial), users can bridge communication gaps when the exact word they want isn't immediately accessible. This semantic network approach aligns with common aphasia therapy techniques, helping users find alternative ways to express their thoughts when specific words are difficult to recall.

This flow demonstrates how users can use the app to search for words and concepts by starting with an image, then drilling down into related terminology—a visual approach to expanding vocabulary or finding associated words.

The following diagram illustrates the solution architecture on AWS.

In the following sections, we discuss the flow and key components of the solution in more detail.

    Secure access using Route 53 and Amplify 
      The journey begins with the user accessing the WordFinder app through a domain managed by Amazon Route 53, a highly available and scalable cloud DNS web service. AWS Amplify hosts the React Native frontend, providing a seamless cross-platform experience. 
    Secure authentication with Amazon Cognito 
      Before accessing the core features, the user must securely authenticate through Amazon Cognito. Cognito provides robust user identity management and access control, making sure that only authenticated users can interact with the app’s services and resources. 
    Image capture and storage with Amplify and Amazon S3 
      After being authenticated, the user can capture an image of a scene, item, or scenario they wish to recall words from. AWS Amplify streamlines the process by automatically storing the captured image in an Amazon Simple Storage Service (Amazon S3) bucket, a highly available, cost-effective, and scalable object storage service. 
    Object recognition with Amazon Rekognition 
      As soon as the image is stored in the S3 bucket, Amazon Rekognition, a powerful computer vision and machine learning service, is triggered. Amazon Rekognition analyzes the image, identifying objects present and returning labels with confidence scores. These labels form the initial word prompt list within the WordFinder app, kickstarting the word-finding journey. 
    Semantic word associations with API Gateway and Lambda 
      While the initial word list generated by Amazon Rekognition provides a solid starting point, the user might be seeking a more specific or related word. To address this challenge, the WordFinder app sends the initial word list to an AWS Lambda function through Amazon API Gateway, a fully managed service that securely handles API requests. 
    Generative AI and prompt engineering with Lambda and Amazon Bedrock
      The Lambda function, acting as an intermediary, crafts a carefully designed prompt and submits it to Amazon Bedrock, a fully managed service that offers access to high-performing foundation models (FMs) from leading AI companies, including Anthropic's Claude. Claude's language understanding and generation capabilities produce semantically related words and concepts based on the initial word list. This process is driven by prompt engineering, where carefully crafted prompts guide the model to return relevant and contextually appropriate word associations.
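The object-recognition step in the flow above can be sketched in a few lines. This is illustrative, not the app's actual code: the 80% confidence threshold and the sample response fragment are assumptions, but the response shape matches what Rekognition's DetectLabels API returns.

```python
# Sketch of the object-recognition step: filtering a Rekognition
# DetectLabels response into the initial word list shown to the user.
# The confidence threshold and sample data are illustrative assumptions.

def labels_to_words(response: dict, min_confidence: float = 80.0) -> list[str]:
    """Return label names at or above the threshold, highest confidence first."""
    labels = sorted(response.get("Labels", []),
                    key=lambda label: label["Confidence"], reverse=True)
    return [l["Name"] for l in labels if l["Confidence"] >= min_confidence]

# A fragment in the shape Rekognition returns for the koala example:
sample = {
    "Labels": [
        {"Name": "Koala", "Confidence": 99.1},
        {"Name": "Animal", "Confidence": 98.7},
        {"Name": "Mammal", "Confidence": 98.7},
        {"Name": "Wildlife", "Confidence": 97.3},
        {"Name": "Plant", "Confidence": 55.0},  # below threshold, dropped
    ]
}
print(labels_to_words(sample))  # ['Koala', 'Animal', 'Mammal', 'Wildlife']
```

Low-confidence labels are dropped so the initial word list stays short and relevant for the user.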

WordFinder app component details

In this section, we take a closer look at the components of the WordFinder app.

React Native and Expo

WordFinder was built using React Native, a popular framework for building cross-platform mobile apps. To streamline development, the team used Expo, which provides write-once, run-anywhere capabilities across the Android and iOS operating systems.

Amplify

Amplify played a crucial role in accelerating the app's development and provisioning the necessary backend infrastructure. Amplify is a set of tools and services that enables developers to build and deploy secure, scalable, full-stack apps. In this architecture, the frontend of the WordFinder app is hosted on Amplify. The solution uses several Amplify components:

Related words

The generated initial word list is the first step toward finding the desired word, but the labels returned by Amazon Rekognition might not be the exact word someone is looking for. The project team considered how to implement a thesaurus-style lookup capability. They initially explored different programming libraries, but found this approach rigid and limited, often returning only synonyms rather than entities related to the source word. The libraries also added the overhead of packaging and maintaining the library and its dataset going forward.

To address these challenges and improve responses for related entities, the project team turned to generative AI. By using generative AI foundation models (FMs), the team offloaded the ongoing overhead of maintaining this solution while improving the flexibility and curation of the related words and entities returned to users. The team integrated this capability using the following services:

Benefits of API Gateway and Lambda

The project team briefly considered using the AWS SDK for JavaScript v3 and credentials sourced from Amazon Cognito to directly interface with Amazon Bedrock. Although this would work, there were several benefits associated with implementing API Gateway and a Lambda function:

Prompt engineering

One of the core features of WordFinder is its ability to generate related words and concepts based on a user-provided source word. This source word (obtained from the mobile app through an API request) is embedded into the following prompt by the Lambda function, replacing {word}:

prompt = "I have Aphasia. Give me the top 10 most common words that are related words to the word supplied in the prompt context. Your response should be a valid JSON array of just the words. No surrounding context. {word}"

The team tested several prompts and approaches during the hackathon, and this simple guiding prompt was found to give reliable, accurate, and repeatable results regardless of the word supplied by the user.
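The substitution and model invocation might look like the following sketch. The prompt text is the one given above; the model ID and the Messages API request shape follow Bedrock's Anthropic integration, but the specific Claude version is an assumption, and the actual call requires AWS credentials and boto3 at runtime.

```python
import json

# Illustrative sketch only: embedding the post's prompt and calling Claude
# through the Amazon Bedrock runtime. The model ID and request body shape
# are assumptions; the team's actual model version isn't stated in the post.

MODEL_ID = "anthropic.claude-3-haiku-20240307-v1:0"  # assumed model version

PROMPT_TEMPLATE = (
    "I have Aphasia. Give me the top 10 most common words that are "
    "related words to the word supplied in the prompt context. "
    "Your response should be a valid JSON array of just the words. "
    "No surrounding context. {word}"
)

def build_request_body(word: str) -> str:
    """Substitute the source word into the prompt and build the request body."""
    return json.dumps({
        "anthropic_version": "bedrock-2023-05-31",
        "max_tokens": 256,
        "messages": [{"role": "user",
                      "content": PROMPT_TEMPLATE.format(word=word)}],
    })

def related_words(word: str) -> list[str]:
    """Invoke the model; needs AWS credentials and boto3 at runtime."""
    import boto3  # imported here so the rest of the sketch runs without the SDK
    client = boto3.client("bedrock-runtime")
    resp = client.invoke_model(modelId=MODEL_ID, body=build_request_body(word))
    payload = json.loads(resp["body"].read())
    # Claude returns a JSON array of words as text, per the prompt's instruction
    return json.loads(payload["content"][0]["text"])
```

Asking for "a valid JSON array of just the words" lets the Lambda function parse the response directly, with no free-text cleanup.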

After the model responds, the Lambda function bundles the related words and returns them to the mobile app. Upon receipt of this data, the WordFinder app updates and displays the new list of words for the user who has aphasia. The user might then find their word, or drill deeper into other related words.
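The bundling step can be sketched as an API Gateway proxy-style Lambda handler. This is a hypothetical shape, not the team's code: `fetch_related_words` is a stand-in for the Bedrock call, injected so the flow can be exercised locally with a stub.

```python
import json

# Sketch of the Lambda handler shaping the response for API Gateway
# (proxy integration format). `fetch_related_words` is a hypothetical
# stand-in for the Bedrock call, injected so the flow can run locally.

def handler(event, context=None, fetch_related_words=None):
    word = json.loads(event.get("body") or "{}").get("word", "")
    if not word:
        return {"statusCode": 400,
                "body": json.dumps({"error": "word is required"})}
    words = fetch_related_words(word) if fetch_related_words else []
    return {
        "statusCode": 200,
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps({"word": word, "related": words}),
    }

# Local check with a stub standing in for the real model call:
stub = lambda w: ["tree", "eucalyptus", "fur", "marsupial"]
resp = handler({"body": json.dumps({"word": "Koala"})}, fetch_related_words=stub)
print(resp["statusCode"])  # 200
```

Returning the source word alongside the related words lets the app label the new list, so the user can keep drilling into further words from there.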

To maintain efficient resource utilization and cost optimization, the architecture incorporates several resource cleanup mechanisms:

Conclusion

The QARC team and Scott Harding worked closely with AWS to develop WordFinder, a mobile app that addresses communication challenges faced by individuals living with aphasia. Their winning entry at the 2023 AWS Queensland Hackathon showcased the power of involving those with lived experiences in the development process. Harding’s insights helped the tech team understand the nuances and impact of aphasia, leading to a solution that empowers users to find their words and stay connected.


About the Authors

Kori Ramijoo is a research speech pathologist at QARC. She has extensive experience in aphasia rehabilitation, technology, and neuroscience. Kori leads the Aphasia Tech Hub at QARC, enabling people with aphasia to access technology. She provides consultations to clinicians and provides advice and support to help people with aphasia gain and maintain independence. Kori is also researching design considerations for technology development and use by people with aphasia.

Scott Harding lives with aphasia after a stroke. He has a background in Engineering and Computer Science. Scott is one of the Directors of the Australian Aphasia Association and is a consumer representative and advisor on various state government health committees and nationally funded research projects. He has interests in the use of AI in developing predictive models of aphasia recovery.

Sonia Brownsett is a speech pathologist with extensive experience in neuroscience and technology. She has been a postdoctoral researcher at QARC and led the aphasia tech hub as well as a research program on the brain mechanisms underpinning aphasia recovery after stroke and in other populations including adults with brain tumours and epilepsy.

David Copland is a speech pathologist and Director of QARC. He has worked for over 20 years in the field of aphasia rehabilitation. His work seeks to develop new ways to understand, assess and treat aphasia including the use of brain imaging and technology. He has led the creation of comprehensive aphasia treatment programs that are being implemented into health services.

Mark Promnitz is a Senior Solutions Architect at Amazon Web Services, based in Australia. In addition to helping his enterprise customers leverage the capabilities of AWS, he can often be found talking about Software as a Service (SaaS), data and cloud-native architectures on AWS.

Kurt Sterzl is a Senior Solutions Architect at Amazon Web Services, based in Australia.  He enjoys working with public sector customers like UQ QARC to support their research breakthroughs.
