Frugality meets Accuracy: Cost-efficient training of GPT NeoX and Pythia models with AWS Trainium

By Admin 12/12/2023

AWS Machine Learning Blog Large language models (or LLMs) have become a topic of daily conversations. Their quick adoption is evident by the amount of time required to reach a 100 million users, which has gone from “4.5yrs by facebook” to an all-time low of mere “2 months by ChatGPT.” A generative pre-trained transformer (GPT) […]Continue reading

Enable faster training with Amazon SageMaker data parallel library

By Admin 05/12/2023

AWS Machine Learning Blog Large language model (LLM) training has become increasingly popular over the last year with the release of several publicly available models such as Llama2, Falcon, and StarCoder. Customers are now training LLMs of unprecedented size ranging from 1 billion to over 175 billion parameters. Training these LLMs requires significant compute resources […]Continue reading

Use Amazon SageMaker Studio to build a RAG question answering solution with Llama 2, LangChain, and Pinecone for fast experimentation

By Admin 21/11/2023

AWS Machine Learning Blog Retrieval Augmented Generation (RAG) allows you to provide a large language model (LLM) with access to data from external knowledge sources such as repositories, databases, and APIs without the need to fine-tune it. When using generative AI for question answering, RAG enables LLMs to answer questions with the most relevant, up-to-date […]Continue reading

Improve LLM responses in RAG use cases by interacting with the user

By Admin 13/11/2023

AWS Machine Learning Blog One of the most common applications of generative AI and large language models (LLMs) is answering questions based on a specific external knowledge corpus. Retrieval-Augmented Generation (RAG) is a popular technique for building question answering systems that use an external knowledge base. To learn more, refer to Build a powerful question […]Continue reading

Build trust and safety for generative AI applications with Amazon Comprehend and LangChain

By Admin 11/11/2023

AWS Machine Learning Blog We are witnessing a rapid increase in the adoption of large language models (LLM) that power generative AI applications across industries. LLMs are capable of a variety of tasks, such as generating creative content, answering inquiries via chatbots, generating code, and more. Organizations looking to use LLMs to power their applications […]Continue reading

Harnessing the power of enterprise data with generative AI: Insights from Amazon Kendra, LangChain, and large language models

By Admin 07/11/2023

AWS Machine Learning Blog Large language models (LLMs) with their broad knowledge, can generate human-like text on almost any topic. However, their training on massive datasets also limits their usefulness for specialized tasks. Without continued learning, these models remain oblivious to new data and trends that emerge after their initial training. Furthermore, the cost to […]Continue reading

Dialogue-guided visual language processing with Amazon SageMaker JumpStart

By Admin 01/11/2023

AWS Machine Learning Blog Visual language processing (VLP) is at the forefront of generative AI, driving advancements in multimodal learning that encompasses language intelligence, vision understanding, and processing. Combined with large language models (LLM) and Contrastive Language-Image Pre-Training (CLIP) trained with a large quantity of multimodality data, visual language models (VLMs) are particularly adept at […]Continue reading

Improve performance of Falcon models with Amazon SageMaker

By Admin 11/10/2023

AWS Machine Learning Blog What is the optimal framework and configuration for hosting large language models (LLMs) for text-generating generative AI applications? Despite the abundance of options for serving LLMs, this is a hard question to answer due to the size of the models, varying model architectures, performance requirements of applications, and more. The Amazon […]Continue reading

Fast and cost-effective LLaMA 2 fine-tuning with AWS Trainium

By Admin 05/10/2023

AWS Machine Learning Blog Large language models (LLMs) have captured the imagination and attention of developers, scientists, technologists, entrepreneurs, and executives across several industries. These models can be used for question answering, summarization, translation, and more in applications such as conversational agents for customer support, content creation for marketing, and coding assistants. Recently, Meta released […]Continue reading

Helping computer vision and language models understand what they see

By Admin 13/09/2023

MIT News – Artificial intelligence Powerful machine-learning algorithms known as vision and language models, which learn to match text with images, have shown remarkable results when asked to generate captions or summarize videos. While these models excel at identifying objects, they often struggle to understand concepts, like object attributes or the arrangement of items in […]Continue reading