AWS Machine Learning Blog 01月14日
How BQA streamlines education quality reporting using Amazon Bedrock
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本文介绍了如何利用智能文档处理(IDP)和生成式AI技术来优化教育评估流程。巴林教育和培训质量管理局(BQA)通过与AWS合作,利用Amazon Bedrock和SageMaker等服务,构建了一套自动化解决方案。该方案能够从教育机构提交的报告和证据中提取文本,进行总结和评估,并生成符合标准的反馈。通过自动化文档处理,BQA显著提高了评估的效率和准确性,减少了人工审查的时间和成本,同时也促进了教育机构的合规性和质量提升。

🚀 **IDP与生成式AI结合**: 将智能文档处理与生成式AI相结合,利用大型语言模型(LLMs)不仅能提取文档数据,还能理解、总结并生成洞察,实现更智能的自动化文档处理流程。

📑 **BQA的挑战与解决方案**: BQA面临报告提交不完整、证据不足以及人工审查耗时等挑战。通过AWS云服务,BQA构建了一个解决方案,利用Amazon Bedrock和Titan Express模型,实现文档自动化处理和评估。

⚙️ **自动化工作流程**: 该方案包括文档上传、文本提取、内容总结、合规性评估等多个环节。Amazon Textract用于提取文本,SageMaker JumpStart部署Llama模型进行文本总结,而Amazon Bedrock则用于生成AI摘要和评论,形成完整的自动化流程。

📝 **精细化的Prompt工程**: 通过精心设计的提示词(Prompt),确保Amazon Titan Express模型生成的评估结果符合预期的结构和格式。提示词包括证据提交、评估标准、评估指令和响应格式等,从而实现更准确的评估反馈。

Given the value of data today, organizations across various industries are working with vast amounts of data across multiple formats. Manually reviewing and processing this information can be a challenging and time-consuming task, with a margin for potential errors. This is where intelligent document processing (IDP), coupled with the power of generative AI, emerges as a game-changing solution.

Enhancing the capabilities of IDP is the integration of generative AI, which harnesses large language models (LLMs) and generative techniques to understand and generate human-like text. This integration allows organizations to not only extract data from documents, but to also interpret, summarize, and generate insights from the extracted information, enabling more intelligent and automated document processing workflows.

The Education and Training Quality Authority (BQA) plays a critical role in improving the quality of education and training services in the Kingdom Bahrain. BQA reviews the performance of all education and training institutions, including schools, universities, and vocational institutes, thereby promoting the professional advancement of the nation’s human capital.

BQA oversees a comprehensive quality assurance process, which includes setting performance standards and conducting objective reviews of education and training institutions. The process involves the collection and analysis of extensive documentation, including self-evaluation reports (SERs), supporting evidence, and various media formats from the institutions being reviewed.

The collaboration between BQA and AWS was facilitated through the Cloud Innovation Center (CIC) program, a joint initiative by AWS, Tamkeen, and leading universities in Bahrain, including Bahrain Polytechnic and University of Bahrain. The CIC program aims to foster innovation within the public sector by providing a collaborative environment where government entities can work closely with AWS consultants and university students to develop cutting-edge solutions using the latest cloud technologies.

As part of the CIC program, BQA has built a proof of concept solution, harnessing the power of AWS services and generative AI capabilities. The primary purpose of this proof of concept was to test and validate the proposed technologies, demonstrating their viability and potential for streamlining BQA’s reporting and data management processes.

In this post, we explore how BQA used the power of Amazon Bedrock, Amazon SageMaker JumpStart, and other AWS services to streamline the overall reporting workflow.

The challenge: Streamlining self-assessment reporting

BQA has traditionally provided education and training institutions with a template for the SER as part of the review process. Institutions are required to submit a review portfolio containing the completed SER and supporting material as evidence, which sometimes did not adhere fully to the established reporting standards.

The existing process had some challenges:

These challenges highlighted the need for a more streamlined and efficient approach to the submission and review process.

Solution overview

The proposed solution uses Amazon Bedrock and the Amazon Titan Express model to enable IDP functionalities. The architecture seamlessly integrates multiple AWS services with Amazon Bedrock, allowing for efficient data extraction and comparison.

Amazon Bedrock is a fully managed service that provides access to high-performing foundation models (FMs) from leading AI startups and Amazon through a unified API. It offers a wide range of FMs, allowing you to choose the model that best suits your specific use case.

The following diagram illustrates the solution architecture.

The solution consists of the following steps:

    Relevant documents are uploaded and stored in an Amazon Simple Storage Service (Amazon S3) bucket. An event notification is sent to an Amazon Simple Queue Service (Amazon SQS) queue to align each file for further processing. Amazon SQS serves as a buffer, enabling the different components to send and receive messages in a reliable manner without being directly coupled, enhancing scalability and fault tolerance of the system. The text extraction AWS Lambda function is invoked by the SQS queue, processing each queued file and using Amazon Textract to extract text from the documents. The extracted text data is placed into another SQS queue for the next processing step. The text summarization Lambda function is invoked by this new queue containing the extracted text. This function sends a request to SageMaker JumpStart, where a Meta Llama text generation model is deployed to summarize the content based on the provided prompt. In parallel, the InvokeSageMaker Lambda function is invoked to perform comparisons and assessments. It compares the extracted text against the BQA standards that the model was trained on, evaluating the text for compliance, quality, and other relevant metrics. The summarized data and assessment results are stored in an Amazon DynamoDB table Upon request, the InvokeBedrock Lambda function invokes Amazon Bedrock to generate generative AI summaries and comments. The function constructs a detailed prompt designed to guide the Amazon Titan Express model in evaluating the university’s submission.

Prompt engineering using Amazon Bedrock

To take advantage of the power of Amazon Bedrock and make sure the generated output adhered to the desired structure and formatting requirements, a carefully crafted prompt was developed according to the following guidelines:

To use this prompt template, you can create a custom Lambda function with your project. The function should handle the retrieval of the required data, such as the indicator name, the university’s submitted evidence, and the rubric criteria. Within the function, include the prompt template and dynamically populate the placeholders (${indicatorName}, ${JSON.stringify(allContent)}, and ${JSON.stringify(c.comment)}) with the retrieved data.

The Amazon Titan Text Express model will then generate the evaluation response based on the provided prompt instructions, adhering to the specified format and guidelines. You can process and analyze the model’s response within your function, extracting the compliance score, relevant analysis, and evidence.

The following is an example prompt template:

for (const c of comments) {        const prompt = `        Below is the evidence submitted by the university under the indicator "${indicatorName}":        ${JSON.stringify(allContent)}             Analyze and Evaluate the university's eviedence based on the provided rubric criteria:        ${JSON.stringify(c.comment)}        - If the evidence does not relate to the indicator, indicate that it is not applicable (N/A) without any additional commentary.               Choose one from the below compliance score based on evidence submitted:       1. Non-compliant: The comment does not meet the criteria or standards.        2.Compliant with recommendation: The comment meets the criteria but includes a suggestion or recommendation for improvement.        3. Compliant: The comment meets the criteria or standards.        THE END OF THE RESPONSE THERE SHOULD BE EITHER SCORE: [SCORE: COMPLIANT OR NON-COMPLIANT OR COMPLIANT WITH RECOMMENDATION]        Write your response in concise bullet points, focusing strictly on relevant analysis and evidence.        **LIMIT YOUR RESPONSE TO 100 WORDS ONLY.**        `;        logger.info(`Prompt for comment ${c.commentId}: ${prompt}`);        const body = JSON.stringify({          inputText: prompt,          textGenerationConfig: {            maxTokenCount: 4096,            stopSequences: [],            temperature: 0,            topP: 0.1,          },        });

The following screenshot shows an example of the Amazon Bedrock generated response.

Results

The implementation of Amazon Bedrock enabled institutions with transformative benefits. By automating and streamlining the collection and analysis of extensive documentation, including SERs, supporting evidence, and various media formats, institutions can achieve greater accuracy and consistency in their reporting processes and readiness for the review process. This not only reduces the time and cost associated with manual data processing, but also improves compliance with the quality expectations, thereby enhancing the credibility and quality of their institutions.

For BQA the implementation helped in achieving one of its strategic objectives focused on streamlining their reporting processes and achieve significant improvements across a range of critical metrics, substantially enhancing the overall efficiency and effectiveness of their operations.

Key success metrics anticipated include:

The following screenshot shows an example generating new evaluations using Amazon Bedrock

Conclusion

This post outlined the implementation of Amazon Bedrock at the Education and Training Quality Authority (BQA), demonstrating the transformative potential of generative AI in revolutionizing the quality assurance processes in the education and training sectors. For those interested in exploring the technical details further, the full code for this implementation is available in the following GitHub repo. If you are interested in conducting a similar proof of concept with us, submit your challenge idea to the Bahrain Polytechnic or University of Bahrain CIC website.


About the Author

Maram AlSaegh is a Cloud Infrastructure Architect at Amazon Web Services (AWS), where she supports AWS customers in accelerating their journey to cloud. Currently, she is focused on developing innovative solutions that leverage generative AI and machine learning (ML) for public sector entities.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

智能文档处理 生成式AI Amazon Bedrock 教育评估 自动化
相关文章