AWS Machine Learning Blog 07月17日 06:36
Accenture scales video analysis with Amazon Nova and Amazon Bedrock Agents
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本文介绍了Accenture Spotlight,一个利用Amazon Nova和Amazon Bedrock Agents构建的可扩展、经济高效的视频亮点生成解决方案。Spotlight能够自动分析长视频,生成个性化的短视频片段,广泛应用于体育集锦、内容匹配和实时零售推广等领域。它通过多层代理工作流实现视频处理自动化,并提供质量控制。Spotlight的架构灵活可配置,能有效平衡速度与质量,为内容创作者提供了新的可能性。

🎬Spotlight利用Amazon Nova基础模型和Amazon Bedrock Agents,提供可扩展且经济高效的视频亮点生成方案,能够快速分析长视频内容,生成个性化短视频。

⚽️Spotlight已在多个行业场景中应用,包括个性化短视频生成、体育编辑与集锦制作、内容匹配以及实时零售推广。例如,在体育领域,Spotlight可以根据用户偏好自动生成足球、F1等赛事的精彩片段。

⚙️Spotlight的架构核心在于任务驱动的模型选择和代理编排。它根据任务复杂度和时延要求,动态选择传统AI模型或Amazon Nova模型,从而实现速度与智能的平衡。同时,专门的代理负责特定分析任务,并通过中央协调代理进行管理。

✅Spotlight具有跨行业应用、实时处理和成本效益高等优势。与传统方法相比,Spotlight在视频处理延迟、集锦审核成本和整体生成成本上都有显著优势,可实现更高效的视频内容创作。

This post was written with Ilan Geller, Kamal Mannar, Debasmita Ghosh, and Nakul Aggarwal of Accenture.

Video highlights offer a powerful way to boost audience engagement and extend content value for content publishers. These short, high-impact clips capture key moments that drive viewer retention, amplify reach across social media, reinforce brand identity, and open new avenues for monetization. However, traditional highlight creation workflows are slow and labor-intensive. Editors must manually review footage, identify significant moments, cut clips, and add transitions or narration—followed by manual quality checks and formatting for distribution. Although this provides editorial control, it creates bottlenecks that don’t scale efficiently.

This post showcases how Accenture Spotlight delivers a scalable, cost-effective video highlight generation solution using Amazon Nova and Amazon Bedrock Agents. Amazon Nova foundation models (FMs) deliver frontier intelligence and industry-leading price-performance. With Spotlight, content owners can configure AI models and agents to support diverse use cases across the media industry while offering a human-in-the-loop option for quality assurance and collaborative refinement. This maintains accuracy, editorial oversight, and alignment with brand guidelines—without compromising on speed or scalability.

Real-world use cases

Spotlight has been applied across a range of industry scenarios, including:

Spotlight’s architecture

Spotlight’s architecture addresses the challenge of scalable video processing, efficiently analyzing and generating content while maintaining speed and quality. It incorporates both task-specific models and Amazon Nova FMs that are orchestrated by specialized Amazon Bedrock agents. Key architectural highlights include:

Spotlight uses a multi-layered agent workflow to automate video processing and generation while maintaining quality control. For example, to generate dynamic video highlights, Spotlight uses three specialized “super agents” that work in coordination under a central orchestrator agent’s supervision. Each super agent is powered by Amazon Nova models, and is supported by a collection of utility agents (see the following diagram). These agents work together to understand video content, generate high-quality highlights, and maintain alignment with user requirements and brand standards.

The workflow consists of the following super agents and utility agents:

See Spotlight in action:

Solution overview

To interact with Spotlight, users access a frontend UI where they provide natural language input to specify their objective. Spotlight then employs its agentic workflow powered by Amazon Nova to achieve its given task. The following diagram illustrates the solution architecture for video highlight generation.

The workflow consists of the following key components (as numbered in the preceding diagram):

    Frontend UI for user interaction:
      Users interact through a web portal secured by Amazon Cognito authentication and delivered using Amazon CloudFront. Amazon API Gateway serves a restful endpoint for video processing and highlight generation services.
    Live video stream processing:
      AWS Elemental MediaLive processes incoming video stream and triggers AWS Lambda to initiate workflows. (Spotlight also accepts video archive content as media files for processing and highlight generation.)
    Video processing workflow orchestrated with AWS Step Functions:
      Open source models hosted on Amazon SageMaker enable speech analysis and computer vision for person and object detection. The video processing agent powered by Amazon Nova Pro analyzes video and generates fine-grained metadata (for example, identifying patterns from viral videos). The reviewer agent powered by Amazon Nova Premier maintains alignment with brand standards. Open source utility tooling is used for pre-analysis tasks.
    Highlight generation workflow orchestrated with Step Functions:
      Amazon Nova Pro analyzes the user query for clips of interest to understand intent, and reformulates the query for downstream processing. The short video generation agent powered by Amazon Nova Pro constructs a video highlight using segment recommendations. The reviewer agent powered by Amazon Nova Premier makes sure the constructed highlight aligns with quality, brand, and contextual expectations. AWS Elemental Media Convert and open source tooling enable video highlight construction and postprocessing (such as subtitle layover, aspect ratio change, and transitions).
    Storage and monitoring:
      Amazon Simple Storage Service (Amazon S3) stores metadata extracted from processing workflows, reference content (such as scripts and brand guidelines), and generated outputs. Amazon CloudWatch maintains end-to-end system health and monitors performance.

Key benefits

Spotlight’s approach to video processing and generation creates dynamic value. Additionally, its technical design using Amazon Nova and an integrated agentic workflow helps content owners realize gains in their video processing and editorial operations. Key benefits for Spotlight include:

The following table provides is a comparative analysis of Spotlight’s video processing approach to conventional approaches for video highlight creation.

Metric Spotlight Performance Conventional Approach
Video Processing Latency Minutes for 2–3-hour sessions Hours to days
Highlight Review Cost (3–5 minutes) 10 times lower with Amazon Nova High cost using conventional approaches
Overall Highlight Generation Cost 10 times lower using serverless and on-demand LLM deployment Manual workflows with high operational overhead
Deployment Architecture Fully serverless with scalable LLM invocation Typically resource-heavy and statically provisioned
Use Case Flexibility Sports, media editing, retail personalization, and more Often tailored to a single use case

Conclusion

Spotlight represents a cutting-edge agentic solution designed to tackle complex media processing and customer personalization challenges using generative AI. With modular, multi-agent workflows built on Amazon Nova, Spotlight seamlessly enables dynamic short-form video generation. The solution’s core framework is also extensible to diverse industry use cases that require multimodal content analysis at scale.

As an AWS Premier Tier Services Partner and Managed Services Provider (MSP), Accenture brings deep cloud and industry expertise. Accenture and AWS have worked together for more than a decade to help organizations realize value from their applications and data. Accenture brings its industry understanding and generative AI specialists to build and adapt generative AI solutions to client needs. Together with AWS, through the Accenture AWS Business Group (AABG), we help enterprises unlock business value by rapidly scaling generative AI solutions tailored to their needs—driving innovation and transformation in the cloud.

Try out Spotlight for your own use case, and share your feedback in the comments.


About the authors

Ilan Geller is a Managing Director in the Data and AI practice at Accenture. He is the Global AWS Partner Lead for Data and AI and the Center for Advanced AI. His roles at Accenture have primarily been focused on the design, development, and delivery of complex data, AI/ML, and most recently Generative AI solutions.

Dr. Kamal Mannar is a Global Computer Vision Lead at Accenture’s Center for Advanced AI, with over 20 years of experience applying AI across industries like agriculture, healthcare, energy, and telecom. He has led large-scale AI transformations, built scalable GenAI and computer vision solutions, and holds 10+ patents in areas including deep learning, wearable AI, and vision transformers. Previously, he headed AI at Vulcan AI, driving cutting-edge innovation in precision agriculture. Kamal holds a Ph.D. in Industrial & Systems Engineering from the University of Wisconsin–Madison.

Debasmita Ghosh is working as Associate Director in Accenture with 21 years of experience in Information Technology (8 years in AI/Gen AI capability), who currently among multiple responsibilities leads Computer Vision practice in India. She has presented her paper on Handwritten Text Recognition in multiple conferences including MCPR 2020, GHCI 2020. She has patent granted on Handwritten Text Recognition solution and received recognition from Accenture under the Accenture Inventor Award Program being named as an inventor on a granted patent. She has multiple papers on Computer Visions solutions like Table Extraction including non-uniform and borderless tables accepted and presented in the ComPE 2021 and CCVPR 2021 international conferences. She has managed projects across multiple technologies (Oracle Apps, SAP). As a programmer, she has worked during various phases of SDLC with experience on Oracle Apps Development across CRM, Procurement, Receivables, SCM, SAP Professional Services, SAP CRM. Debasmita holds M.Sc. in Statistics from Calcutta University.

Nakul Aggarwal is a Subject Matter Expert in Computer Vision and Generative AI at Accenture, with around 7 years of experience in developing and delivering cutting-edge solutions across computer vision, multimodal AI, and agentic systems. He holds a Master’s degree from the Indian Institute of Technology (IIT) Delhi and has authored several research papers presented at international conferences. He holds two patents in AI and currently leads multiple projects focused on multimodal and agentic AI. Beyond technical delivery, he plays a key role in mentoring teams and driving innovation by bridging advanced research with real-world enterprise applications.

Aramide Kehinde is Global Partner Solutions Architect for Amazon Nova at AWS. She works with high growth companies to build and deliver forward thinking technology solutions using AWS Generative AI. Her experience spans multiple industries, including Media & Entertainment, Financial Services, and Healthcare. Aramide enjoys building the intersection of AI and creative arenas and spending time with her family.

Rajdeep Banerjee is a Senior Partner Solutions Architect at AWS helping strategic partners and clients in the AWS cloud migration and digital transformation journey. Rajdeep focuses on working with partners to provide technical guidance on AWS, collaborate with them to understand their technical requirements, and designing solutions to meet their specific needs. He is a member of Serverless technical field community. Rajdeep is based out of Richmond, Virginia.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Spotlight 视频生成 人工智能 Amazon Nova 短视频
相关文章