Distributed training and efficient scaling with the Amazon SageMaker Model Parallel and Data Parallel Libraries

By Admin 16/04/2024

AWS Machine Learning Blog There has been tremendous progress in the field of distributed deep learning for large language models (LLMs), especially after the release of ChatGPT in December 2022. LLMs continue to grow in size with billions or even trillions of parameters, and they often won’t fit into a single accelerator device such as […]Continue reading

A secure approach to generative AI with AWS

By Admin 16/04/2024

AWS Machine Learning Blog Generative artificial intelligence (AI) is transforming the customer experience in industries across the globe. Customers are building generative AI applications using large language models (LLMs) and other foundation models (FMs), which enhance customer experiences, transform operations, improve employee productivity, and create new revenue channels. FMs and the applications built around them […]Continue reading

Cost-effective document classification using the Amazon Titan Multimodal Embeddings Model

By Admin 11/04/2024

AWS Machine Learning Blog Organizations across industries want to categorize and extract insights from high volumes of documents of different formats. Manually processing these documents to classify and extract information remains expensive, error prone, and difficult to scale. Advances in generative artificial intelligence (AI) have given rise to intelligent document processing (IDP) solutions that can […]Continue reading

AWS at NVIDIA GTC 2024: Accelerate innovation with generative AI on AWS

By Admin 11/04/2024

AWS Machine Learning Blog AWS was delighted to present to and connect with over 18,000 in-person and 267,000 virtual attendees at NVIDIA GTC, a global artificial intelligence (AI) conference that took place March 2024 in San Jose, California, returning to a hybrid, in-person experience for the first time since 2019. AWS has had a long-standing […]Continue reading

Build an active learning pipeline for automatic annotation of images with AWS services

By Admin 10/04/2024

AWS Machine Learning Blog This blog post is co-written with Caroline Chung from Veoneer. Veoneer is a global automotive electronics company and a world leader in automotive electronic safety systems. They offer best-in-class restraint control systems and have delivered over 1 billion electronic control units and crash sensors to car manufacturers globally. The company continues […]Continue reading

Knowledge Bases for Amazon Bedrock now supports custom prompts for the RetrieveAndGenerate API and configuration of the maximum number of retrieved results

By Admin 09/04/2024

AWS Machine Learning Blog With Knowledge Bases for Amazon Bedrock, you can securely connect foundation models (FMs) in Amazon Bedrock to your company data for Retrieval Augmented Generation (RAG). Access to additional data helps the model generate more relevant, context-specific, and accurate responses without retraining the FMs. In this post, we discuss two new features […]Continue reading

Knowledge Bases for Amazon Bedrock now supports metadata filtering to improve retrieval accuracy

By Admin 08/04/2024

AWS Machine Learning Blog At AWS re:Invent 2023, we announced the general availability of Knowledge Bases for Amazon Bedrock. With Knowledge Bases for Amazon Bedrock, you can securely connect foundation models (FMs) in Amazon Bedrock to your company data using a fully managed Retrieval Augmented Generation (RAG) model. For RAG-based applications, the accuracy of the […]Continue reading

Build knowledge-powered conversational applications using LlamaIndex and Llama 2-Chat

By Admin 08/04/2024

AWS Machine Learning Blog Unlocking accurate and insightful answers from vast amounts of text is an exciting capability enabled by large language models (LLMs). When building LLM applications, it is often necessary to connect and query external data sources to provide relevant context to the model. One popular approach is using Retrieval Augmented Generation (RAG) […]Continue reading

Boost inference performance for Mixtral and Llama 2 models with new Amazon SageMaker containers

By Admin 08/04/2024

AWS Machine Learning Blog In January 2024, Amazon SageMaker launched a new version (0.26.0) of Large Model Inference (LMI) Deep Learning Containers (DLCs). This version offers support for new models (including Mixture of Experts), performance and usability improvements across inference backends, as well as new generation details for increased control and prediction explainability (such as […]Continue reading

Use everyday language to search and retrieve data with Mixtral 8x7B on Amazon SageMaker JumpStart

By Admin 08/04/2024

AWS Machine Learning Blog With the widespread adoption of generative artificial intelligence (AI) solutions, organizations are trying to use these technologies to make their teams more productive. One exciting use case is enabling natural language interactions with relational databases. Rather than writing complex SQL queries, you can describe in plain language what data you want […]Continue reading