June 2025

Build and deploy AI inference workflows with new enhancements to the Amazon SageMaker Python SDK

Amazon SageMaker Inference has been a popular tool for deploying advanced machine learning (ML) and generative AI models at scale. As AI applications become increasingly complex, customers want to deploy multiple models in a coordinated group that collectively process inference requests for an application. In addition, with the evolution of generative AI applications, many use […]

Build and deploy AI inference workflows with new enhancements to the Amazon SageMaker Python SDK Read More »

Context extraction from image files in Amazon Q Business using LLMs

To effectively convey complex information, organizations increasingly rely on visual documentation through diagrams, charts, and technical illustrations. Although text documents are well-integrated into modern knowledge management systems, rich information contained in diagrams, charts, technical schematics, and visual documentation often remains inaccessible to search and AI assistants. This creates significant gaps in organizational knowledge bases, leading

Context extraction from image files in Amazon Q Business using LLMs Read More »

Build AWS architecture diagrams using Amazon Q CLI and MCP

Creating professional AWS architecture diagrams is a fundamental task for solutions architects, developers, and technical teams. These diagrams serve as essential communication tools for stakeholders, documentation of compliance requirements, and blueprints for implementation teams. However, traditional diagramming approaches present several challenges: Time-consuming process – Creating detailed architecture diagrams manually can take hours or even days

Build AWS architecture diagrams using Amazon Q CLI and MCP Read More »

AWS costs estimation using Amazon Q CLI and AWS Cost Analysis MCP

Managing and optimizing AWS infrastructure costs is a critical challenge for organizations of all sizes. Traditional cost analysis approaches often involve the following: Complex spreadsheets – Creating and maintaining detailed cost models, which requires significant effort Multiple tools – Switching between the AWS Pricing Calculator, AWS Cost Explorer, and third-party tools Specialized knowledge – Understanding

AWS costs estimation using Amazon Q CLI and AWS Cost Analysis MCP Read More »

Tailor responsible AI with new safeguard tiers in Amazon Bedrock Guardrails

Amazon Bedrock Guardrails provides configurable safeguards to help build trusted generative AI applications at scale. It provides organizations with integrated safety and privacy safeguards that work across multiple foundation models (FMs), including models available in Amazon Bedrock, as well as models hosted outside Amazon Bedrock from other model providers and cloud providers. With the standalone

Tailor responsible AI with new safeguard tiers in Amazon Bedrock Guardrails Read More »

Structured data response with Amazon Bedrock: Prompt Engineering and Tool Use

Generative AI is revolutionizing industries by streamlining operations and enabling innovation. While textual chat interactions with GenAI remain popular, real-world applications often depend on structured data for APIs, databases, data-driven workloads, and rich user interfaces. Structured data can also enhance conversational AI, enabling more reliable and actionable outputs. A key challenge is that LLMs (Large

Structured data response with Amazon Bedrock: Prompt Engineering and Tool Use Read More »

Using Amazon SageMaker AI Random Cut Forest for NASA’s Blue Origin spacecraft sensor data

The successful deorbit, descent, and landing of spacecraft on the Moon requires precise control and monitoring of vehicle dynamics. Anomaly detection provides a unique utility for identifying important states that might represent vehicle behaviors of interest. By producing unique vehicle behavior points, critical spacecraft system states can be identified to be more appropriately addressed and

Using Amazon SageMaker AI Random Cut Forest for NASA’s Blue Origin spacecraft sensor data Read More »

Build an intelligent multi-agent business expert using Amazon Bedrock

In this post, we demonstrate how to build a multi-agent system using multi-agent collaboration in Amazon Bedrock Agents to solve complex business questions in the biopharmaceutical industry. We show how specialized agents in research and development (R&D), legal, and finance domains can work together to provide comprehensive business insights by analyzing data from multiple sources.

Build an intelligent multi-agent business expert using Amazon Bedrock Read More »

Driving cost-efficiency and speed in claims data processing with Amazon Nova Micro and Amazon Nova Lite

Amazon operations span the globe, touching the lives of millions of customers, employees, and vendors every day. From the vast logistics network to the cutting-edge technology infrastructure, this scale is a testament to the company’s ability to innovate and serve its customers. With this scale comes a responsibility to manage risks and address claims—whether they

Driving cost-efficiency and speed in claims data processing with Amazon Nova Micro and Amazon Nova Lite Read More »

Power Your LLM Training and Evaluation with the New SageMaker AI Generative AI Tools

Today we are excited to introduce the Text Ranking and Question and Answer UI templates to SageMaker AI customers. The Text Ranking template enables human annotators to rank multiple responses from a large language model (LLM) based on custom criteria, such as relevance, clarity, or factual accuracy. This ranked feedback provides critical insights that help

Power Your LLM Training and Evaluation with the New SageMaker AI Generative AI Tools Read More »