Enable faster training with Amazon SageMaker data parallel library

By Admin 05/12/2023

AWS Machine Learning Blog Large language model (LLM) training has become increasingly popular over the last year with the release of several publicly available models such as Llama2, Falcon, and StarCoder. Customers are now training LLMs of unprecedented size ranging from 1 billion to over 175 billion parameters. Training these LLMs requires significant compute resources […]

Boost inference performance for LLMs with new Amazon SageMaker containers

By Admin 28/11/2023

AWS Machine Learning Blog Today, Amazon SageMaker launches a new version (0.25.0) of Large Model Inference (LMI) Deep Learning Containers (DLCs) and adds support for NVIDIA’s TensorRT-LLM Library. With these upgrades, you can effortlessly access state-of-the-art tooling to optimize large language models (LLMs) on SageMaker and achieve price-performance benefits – Amazon SageMaker LMI TensorRT-LLM DLC […]

Use Amazon SageMaker Studio to build a RAG question answering solution with Llama 2, LangChain, and Pinecone for fast experimentation

By Admin 21/11/2023

AWS Machine Learning Blog Retrieval Augmented Generation (RAG) allows you to provide a large language model (LLM) with access to data from external knowledge sources such as repositories, databases, and APIs without the need to fine-tune it. When using generative AI for question answering, RAG enables LLMs to answer questions with the most relevant, up-to-date […]

Improve LLM responses in RAG use cases by interacting with the user

By Admin 13/11/2023

AWS Machine Learning Blog One of the most common applications of generative AI and large language models (LLMs) is answering questions based on a specific external knowledge corpus. Retrieval-Augmented Generation (RAG) is a popular technique for building question answering systems that use an external knowledge base. To learn more, refer to Build a powerful question […]

Build trust and safety for generative AI applications with Amazon Comprehend and LangChain

By Admin 11/11/2023

AWS Machine Learning Blog We are witnessing a rapid increase in the adoption of large language models (LLMs) that power generative AI applications across industries. LLMs are capable of a variety of tasks, such as generating creative content, answering inquiries via chatbots, generating code, and more. Organizations looking to use LLMs to power their applications […]

Harnessing the power of enterprise data with generative AI: Insights from Amazon Kendra, LangChain, and large language models

By Admin 07/11/2023

AWS Machine Learning Blog Large language models (LLMs), with their broad knowledge, can generate human-like text on almost any topic. However, their training on massive datasets also limits their usefulness for specialized tasks. Without continued learning, these models remain oblivious to new data and trends that emerge after their initial training. Furthermore, the cost to […]

Improve performance of Falcon models with Amazon SageMaker

By Admin 11/10/2023

AWS Machine Learning Blog What is the optimal framework and configuration for hosting large language models (LLMs) for text-generating generative AI applications? Despite the abundance of options for serving LLMs, this is a hard question to answer due to the size of the models, varying model architectures, performance requirements of applications, and more. The Amazon […]

Fast and cost-effective LLaMA 2 fine-tuning with AWS Trainium

By Admin 05/10/2023

AWS Machine Learning Blog Large language models (LLMs) have captured the imagination and attention of developers, scientists, technologists, entrepreneurs, and executives across several industries. These models can be used for question answering, summarization, translation, and more in applications such as conversational agents for customer support, content creation for marketing, and coding assistants. Recently, Meta released […]

FMOps/LLMOps: Operationalize generative AI and differences with MLOps

By Admin 01/09/2023

AWS Machine Learning Blog Nowadays, the majority of our customers are excited about large language models (LLMs) and thinking about how generative AI could transform their business. However, bringing such solutions and models into business-as-usual operations is not an easy task. In this post, we discuss how to operationalize generative AI applications using MLOps principles […]

AI helps robots manipulate objects with their whole bodies

By Admin 24/08/2023

MIT News – Artificial intelligence Imagine you want to carry a large, heavy box up a flight of stairs. You might spread your fingers out and lift that box with both hands, then hold it on top of your forearms and balance it against your chest, using your whole body to manipulate the box. Humans […]