Amazon today launched SageMaker Data Wrangler, a new AWS service designed to speed up data prep for machine learning and AI applications. Alongside it, the company took the wraps off of SageMaker Feature Store, a purpose-built product for naming, organizing, finding, and sharing features, or the individual independent variables that act like a input in a machine learning system. Beyond this, Amazon unveiled SageMaker Pipelines, which CEO Andy Jassy described as a CI/CD service for AI. And the company detailed DevOps Guru, QuickSight, and QuickSight Q, offerings that uses machine learning to identify operational issues, provide business intelligence, and find answers to questions in knowledge stores.
During a keynote at Amazon’s re:Invent conference, Jassy said that Data Wrangler has over 300 built-in conversion transformation types. The service recommends transformations based on data in a target dataset and applies these transformations to features, providing a preview of the transformations in real time. Dat Wrangler also checks to ensure that the data is “valid and balanced.” As for SageMaker Feature Store, Jassy said that the service, which is accessible from SageMaker Studio, acts as a storage component for features and can access features in either batches or subsets. SageMaker Pipelines, meanwhile, allows users to define, share, and reuse each step of an end-to-end machine learning workflow with preconfigured customizable workflow templates while logging each step in SageMaker Experiments.
DevOps Guru is a different beast altogether. Amazon says that when deployed in a cloud environment, it can identify missing or misconfigured alarms to warn of approaching resource limits and code and config changes that might cause outages. In addition, DevOps Guru spotlights things like under-provisioned compute capacity, database I/O overutilization, and memory leaks while recommending remediating actions.
Amazon QuickSight aims to provide scalable, embeddable business intelligence solutions tailored for the cloud. To that end, Amazon says it can scale to tens of thousands of users without any infrastructure management or capacity planning. QuickSight can be embedded into applications with dashboards and is available with pay-per-session pricing, automatically generating summaries of dashboards in plain language. A complimentary AWS service called QuickSight Q answers questions in natural language, drawing on available resources and using natural language processing to understand domain-specific business language and generate responses that reflect industry jargon.
Amazon didn’t miss the opportunity this morning to roll out updates across Amazon Connect, its omnichannel cloud contact center offering. New as of today is Real-Time Contact Lens, which identifies issues in real time to impact customer actions during calls. Amazon Connect Voice ID, which also works in real time, performs authentication using machine learning-powered voice analysis “without disrupting natural conversation.” And Connect Tasks ostensibly makes follow-up tasks easier for agents by enabling managers to automate some tasks entirely.
The slew of announcements come after Amazon debuted AWS Trainium, a chip custom-designed to deliver what the company describes as cost-effective machine learning model training in the cloud. Amazon claims that when Trainium launches in the second half of 2020, it’ll offer the most teraflops of any machine learning instance in the cloud, where a teraflop translates to a chip being able to process one trillion calculations a second.