AWS Blogs · April 9, 05:33
AWS announces Pixtral Large 25.02 model in Amazon Bedrock serverless
At the AWS Summit in Paris, AWS announced the launch of the Pixtral Large 25.02 model in Amazon Bedrock, making AWS the first major cloud provider to offer it as a fully managed, serverless model. Developed by Mistral AI, the model combines advanced vision capabilities with strong language understanding, offers a 128K context window, and supports many natural and programming languages. Through Bedrock, users can access the model without managing any infrastructure and can use cross-Region inference to improve the performance of global applications. The author shares how he used the Converse API to solve his daughter's physics exam problems, demonstrating the model's handling of multimodal input.

👁️ Pixtral Large is Mistral AI's first multimodal model, combining vision and language understanding with a 128K context window, which makes it well suited to complex visual reasoning tasks.

🌍 The model supports many languages, including English, French, and German, as well as more than 80 programming languages such as Python and Java, enabling code generation and interpretation.

💡 Pixtral Large is offered serverlessly in Amazon Bedrock: users manage no infrastructure, pay only for what they use, and can rely on cross-Region inference to improve the performance of global applications.

👨‍👩‍👧‍👦 The author used the Converse API to solve his daughter's physics exam problems, demonstrating the model's seamless handling of images and text: it understands diagrams and mathematical notation and provides step-by-step solution approaches.
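As a quick illustration of the cross-Region inference mentioned above: Amazon Bedrock addresses a cross-Region deployment through a geography-prefixed inference profile ID (`eu.` or `us.` in front of the base model ID). A minimal Python sketch; the helper function is my own illustration, not part of any AWS SDK:

```python
# Cross-Region inference profiles prepend a geography prefix ("eu." or "us.")
# to the base Bedrock model ID. This helper is illustrative only.
BASE_MODEL_ID = "mistral.pixtral-large-2502-v1:0"

def inference_profile_id(geo: str) -> str:
    """Return the cross-Region inference profile ID for a geography prefix."""
    if geo not in ("us", "eu"):
        raise ValueError(f"unsupported geography: {geo}")
    return f"{geo}.{BASE_MODEL_ID}"

print(inference_profile_id("eu"))  # eu.mistral.pixtral-large-2502-v1:0
print(inference_profile_id("us"))  # us.mistral.pixtral-large-2502-v1:0
```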

<section class="blog-post-content lb-rtxt"><p>Today at AWS Summit Paris, we announced that the <a href="https://aws.amazon.com/bedrock/mistral/?trk=4b29643c-e00f-4ab6-ab9c-b1fb47aa1708&amp;sc_channel=blog">Pixtral Large 25.02 model</a> is now available in <a href="https://aws.amazon.com/bedrock/">Amazon Bedrock</a> as a fully managed, serverless offering. AWS is the first major cloud provider to deliver Pixtral Large as a fully managed, serverless model.</p><p>Working with <a href="https://aws.amazon.com/what-is/large-language-model/?trk=4b29643c-e00f-4ab6-ab9c-b1fb47aa1708&amp;sc_channel=blog">large foundation models (FMs)</a> often requires significant infrastructure planning, specialized expertise, and ongoing optimization to handle the computational demands effectively. Many customers find themselves managing complex environments or making trade-offs between performance and cost when deploying these sophisticated models.</p><p>The Pixtral Large model, developed by <a href="https://mistral.ai">Mistral AI</a>, represents their first multimodal model that combines advanced vision capabilities with powerful language understanding. A 128K context window makes it ideal for complex visual reasoning tasks. The model delivers exceptional performance on key benchmarks including <a href="https://mathvista.github.io/">MathVista</a>, <a href="https://www.docvqa.org/">DocVQA</a>, and <a href="https://visualqa.org/">VQAv2</a>, demonstrating its effectiveness across document analysis, chart interpretation, and natural image understanding.</p><p>One of the most powerful aspects of Pixtral Large is its multilingual capability.
The model supports dozens of languages including English, French, German, Spanish, Italian, Chinese, Japanese, Korean, Portuguese, Dutch, and Polish, making it accessible to global teams and applications. It’s also trained on more than 80 programming languages including Python, Java, C, C++, JavaScript, Bash, Swift, and Fortran, providing robust code generation and interpretation capabilities.</p><p>Developers will appreciate the model’s agent-centric design with built-in function calling and JSON output formatting, which simplifies integration with existing systems. Its strong system prompt adherence improves reliability when working with <a href="https://aws.amazon.com/what-is/retrieval-augmented-generation/?trk=4b29643c-e00f-4ab6-ab9c-b1fb47aa1708&amp;sc_channel=blog">Retrieval Augmented Generation (RAG)</a> applications and large context scenarios.</p><p>With Pixtral Large in Amazon Bedrock, you can now access this advanced model without having to provision or manage any infrastructure. The serverless approach lets you scale usage based on actual demand without upfront commitments or capacity planning. You pay only for what you use, with no idle resources.</p><p><strong>Cross-Region inference<br /></strong> Pixtral Large is now available in Amazon Bedrock across multiple <a href="https://docs.aws.amazon.com/glossary/latest/reference/glos-chap.html#region?trk=4b29643c-e00f-4ab6-ab9c-b1fb47aa1708&amp;sc_channel=blog">AWS Regions</a> through cross-Region inference.</p><p><a href="https://docs.aws.amazon.com/bedrock/latest/userguide/cross-region-inference.html?trk=4b29643c-e00f-4ab6-ab9c-b1fb47aa1708&amp;sc_channel=blog">With Amazon Bedrock cross-Region inference</a>, you can access a single FM across multiple geographic Regions while maintaining high availability and low latency for global applications.
For example, when a model is deployed in both European and US Regions, you can access it through Region-specific API endpoints using distinct prefixes: <code>eu.model-id</code> for European Regions and <code>us.model-id</code> for US Regions. This approach enables Amazon Bedrock to route inference requests to the geographically closest endpoint, reducing latency while helping to meet regulatory compliance by keeping data processing within desired geographic boundaries. The system automatically handles traffic routing and load balancing across these Regional deployments, providing seamless scalability and redundancy without requiring you to keep track of the individual Regions where the model is actually deployed.</p><p><strong>See it in action<br /></strong> As a developer advocate, I’m constantly exploring how our newest capabilities can solve real problems. Recently, I had a perfect opportunity to test the new multimodal capabilities in the <a href="https://docs.aws.amazon.com/bedrock/latest/APIReference/API_runtime_Converse.html?trk=4b29643c-e00f-4ab6-ab9c-b1fb47aa1708&amp;sc_channel=blog">Amazon Bedrock Converse API</a> when my daughter asked for help with her physics exam preparation.</p><p>Last weekend, my kitchen table was covered with practice exams full of complex diagrams, force vectors, and equations. My daughter was struggling to conceptualize how to approach these problems. That’s when I realized this was the perfect use case for the multimodal capabilities we’d just launched. I snapped photos of a particularly challenging problem sheet containing several graphs and mathematical notation, then used the Converse API to create a simple application that could analyze the images.
Together, we uploaded the physics exam materials and asked the model to explain the solution approach.</p><p><a href="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2025/03/28/problem2-1.png"><img class="aligncenter wp-image-94783" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2025/03/28/problem2-1.png" alt="Physics problem in French" width="800" height="567" /></a></p><p>What happened next impressed both of us—the model interpreted the diagrams, recognized the French language and the mathematical notation, and provided a step-by-step explanation of how to solve each problem. As we asked follow-up questions about specific concepts, the model maintained context across our entire conversation, creating a tutoring experience that felt remarkably natural.</p><pre class="lang-markdown"># Effet Doppler avec une Source Sonore en Rotation

## Analyse du problème

Ce problème concerne l'effet Doppler produit par une source sonore en rotation. Une source émettant un son à 1500 Hz tourne sur une table tournante dans le sens antihoraire, et nous devons déterminer comment les fréquences sont perçues par un microphone fixe.

## Concepts clés

L'effet Doppler se produit lorsqu'il y a un mouvement relatif entre une source sonore et un observateur. La fréquence perçue dépend de si la source se rapproche ou s'éloigne de l'observateur.

La formule de l'effet Doppler est:

- Pour une source qui se rapproche: $f_{observée} = f_{source} \cdot \frac{v_{son}}{v_{son} - v_{source}}$
- Pour une source qui s'éloigne: $f_{observée} = f_{source} \cdot \frac{v_{son}}{v_{son} + v_{source}}$

## Solution étape par étape

### Étape 1: Analyser le mouvement de la source

La source tourne sur un cercle et le microphone est fixe à 40 cm du centre de rotation. Aux différentes positions (a, b, c), la source a des vitesses radiales différentes par rapport au microphone:

- Position a: La source se déplace perpendiculairement à la ligne source-microphone (vitesse radiale nulle)
- Position b: La source s'éloigne du microphone (vitesse radiale positive)
- Position c: La source se rapproche du microphone (vitesse radiale négative)

### Étape 2: Déterminer l'effet sur les fréquences perçues

- À la position a: Pas d'effet Doppler car la vitesse radiale est nulle, donc $f_a = f_{source} = 1500$ Hz
- À la position b: La source s'éloigne, donc $f_b &lt; f_{source}$, donc $f_b &lt; 1500$ Hz
- À la position c: La source se rapproche, donc $f_c &gt; f_{source}$, donc $f_c &gt; 1500$ Hz

### Étape 3: Comparer les fréquences

D'après notre analyse:

- $f_a = 1500$ Hz
- $f_b &lt; 1500$ Hz
- $f_c &gt; 1500$ Hz

Donc: $f_c &gt; f_a &gt; f_b$

## Conclusion

La proposition correcte est: $f_c &gt; f_a &gt; f_b$

Cette réponse correspond à la troisième proposition dans l'énoncé. Lorsque la source tourne, la fréquence perçue est plus élevée quand la source se rapproche (position c), égale à la fréquence émise quand la source se déplace perpendiculairement (position a), et plus basse quand la source s'éloigne (position b).</pre><p>The model responds in the language of the question. After a thoughtful analysis, it concludes that the correct answer is f_c &gt; f_a &gt; f_b (and it is right!)</p><p>The beauty of this interaction was how seamlessly the Converse API handled the multimodal inputs. As a builder, I didn’t need to worry about the complexity of processing images alongside text—the API managed that complexity and returned structured responses that my simple application could present directly to my daughter.</p><p>Here is the code I wrote.
I used the Swift programming language, just to show that Python is not the only option you have 😇.</p><pre class="lang-swift">private let modelId = "us.mistral.pixtral-large-2502-v1:0"

// Define the system prompt that instructs the model how to respond
let systemPrompt = """
You are a math and physics tutor. Your task is to:
1. Read and understand the math or physics problem in the image
2. Provide a clear, step-by-step solution to the problem
3. Briefly explain any relevant concepts used in solving the problem
4. Be precise and accurate in your calculations
5. Use mathematical notation when appropriate
Format your response with clear section headings and numbered steps.
"""
let system: BedrockRuntimeClientTypes.SystemContentBlock = .text(systemPrompt)

// Create the user message with text prompt and image
let userPrompt = "Please solve this math or physics problem. Show all steps and explain the concepts involved."
let prompt: BedrockRuntimeClientTypes.ContentBlock = .text(userPrompt)
let image: BedrockRuntimeClientTypes.ContentBlock = .image(.init(format: .jpeg, source: .bytes(finalImageData)))

// Create the user message with both text and image content
let userMessage = BedrockRuntimeClientTypes.Message(
    content: [prompt, image],
    role: .user
)

// Initialize the messages array with the user message
var messages: [BedrockRuntimeClientTypes.Message] = []
messages.append(userMessage)

// Configure the inference parameters
let inferenceConfig: BedrockRuntimeClientTypes.InferenceConfiguration = .init(maxTokens: 4096, temperature: 0.0)

// Create the input for the Converse API with streaming
let input = ConverseStreamInput(inferenceConfig: inferenceConfig, messages: messages, modelId: modelId, system: [system])

// Make the streaming request
do {
    // Process the stream
    let response = try await bedrockClient.converseStream(input: input)
    guard let stream = response.stream else { return }

    // Iterate through the stream events
    for try await event in stream {
        switch event {
        case .messagestart:
            print("AI-assistant started to stream")
        case let .contentblockdelta(deltaEvent):
            // Handle text content as it arrives
            if case let .text(text) = deltaEvent.delta {
                DispatchQueue.main.async {
                    self.streamedResponse += text
                }
            }
        case .messagestop:
            print("Stream ended")
            // Create a complete assistant message from the streamed response
            let assistantMessage = BedrockRuntimeClientTypes.Message(
                content: [.text(self.streamedResponse)],
                role: .assistant
            )
            messages.append(assistantMessage)
        default:
            break
        }
    }
} catch {
    print("Error processing stream: \(error)")
}</pre><p>And the result in the app is stunning.</p><p><a href="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2025/03/28/2025-03-28_17-34-39.png"><img class="aligncenter size-full wp-image-94789" src="https://d2908q01vomqb2.cloudfront.net/da4b9237bacccdf19c0760cab7aec4a8359010b0/2025/03/28/2025-03-28_17-34-39.png" alt="iOS Physics problem resolver" width="447" height="892" /></a></p><p>By the time her exam rolled around, she felt confident and prepared—and I had a compelling real-world example of how our multimodal capabilities in Amazon Bedrock can create meaningful experiences for users.</p><p><strong>Get started today<br /></strong> The new model is available through these Regional API endpoints: US East (Ohio, N. Virginia), US West (Oregon), and Europe (Frankfurt, Ireland, Paris, Stockholm).
This Regional availability helps you meet data residency requirements while minimizing latency.</p><p>You can start using the model through either the <a href="https://console.aws.amazon.com">AWS Management Console</a> or programmatically through the <a href="https://aws.amazon.com/cli/">AWS Command Line Interface (AWS CLI)</a> and <a href="https://aws.amazon.com/developer/tools/?trk=4b29643c-e00f-4ab6-ab9c-b1fb47aa1708&amp;sc_channel=blog">AWS SDK</a> using the model ID <code>mistral.pixtral-large-2502-v1:0</code>.</p><p>This launch represents a significant step forward in making advanced multimodal AI accessible to developers and organizations of all sizes. By combining Mistral AI’s cutting-edge model with AWS serverless infrastructure, you can now focus on building innovative applications without worrying about the underlying complexity.</p><p>Visit the <a href="https://us-east-1.console.aws.amazon.com/bedrock/home?region=us-east-1#/overview?trk=4b29643c-e00f-4ab6-ab9c-b1fb47aa1708&amp;sc_channel=blog">Amazon Bedrock console</a> today to start experimenting with Pixtral Large 25.02 and discover how it can enhance your AI-powered applications.</p><a href="https://linktr.ee/sebsto">— seb</a><hr /><p>How is the News Blog doing? Take this <a href="https://amazonmr.au1.qualtrics.com/jfe/form/SV_eyD5tC5xNGCdCmi">1 minute survey</a>!</p><p><em>(This <a href="https://amazonmr.au1.qualtrics.com/jfe/form/SV_eyD5tC5xNGCdCmi">survey</a> is hosted by an external company. AWS handles your information as described in the <a href="https://aws.amazon.com/privacy/?trk=4b29643c-e00f-4ab6-ab9c-b1fb47aa1708&amp;sc_channel=blog">AWS Privacy Notice</a>.
AWS will own the data gathered via this survey and will not share the information collected with survey respondents.)</em></p></section>
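For readers who would rather use Python than the Swift shown in the post, here is a minimal sketch of an equivalent Converse request built as plain dictionaries for boto3's `converse()` call. The helper name `build_converse_request` and the shortened tutor prompt are my own illustrations, not from the post; the actual service call (commented out) requires AWS credentials and Bedrock model access:

```python
# Sketch: assembling a multimodal Converse API request for Pixtral Large.
# The "us." prefix selects the US cross-Region inference profile.
MODEL_ID = "us.mistral.pixtral-large-2502-v1:0"

def build_converse_request(image_bytes: bytes, question: str) -> dict:
    """Build the keyword arguments for bedrock-runtime's converse() call."""
    return {
        "modelId": MODEL_ID,
        "system": [{"text": "You are a math and physics tutor. Solve step by step."}],
        "messages": [
            {
                "role": "user",
                "content": [
                    {"text": question},
                    # Image bytes travel inline in the request body
                    {"image": {"format": "jpeg", "source": {"bytes": image_bytes}}},
                ],
            }
        ],
        "inferenceConfig": {"maxTokens": 4096, "temperature": 0.0},
    }

request = build_converse_request(b"<jpeg bytes here>", "Please solve this physics problem.")
print(request["modelId"])  # us.mistral.pixtral-large-2502-v1:0

# To send the request (needs AWS credentials and model access):
# import boto3
# client = boto3.client("bedrock-runtime", region_name="us-east-1")
# response = client.converse(**request)
# print(response["output"]["message"]["content"][0]["text"])
```

The same dictionary shape also works with `converse_stream()`, which is what the Swift example in the post uses to stream tokens as they arrive.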

Related tags: Pixtral Large, Amazon Bedrock, multimodal AI, Mistral AI