
Selecting the right text-based LLM for your use case

Maximize the business value of LLMs with these insights and strategies

Generative artificial intelligence (gen AI) applications powered by large language models (LLMs) have the potential to transform every industry. Staying up to speed with LLMs is challenging as the technology continues to evolve at lightning speed.

Read on to discover best practices that will help you navigate the
LLM landscape with confidence.

What can text-based LLMs do for you?

LLMs are trained on trillions of words across many natural language tasks. When supported by the right gen AI infrastructure, they can carry out functions in a conversational manner. These functions include:

- Engaging in interactive conversations
- Understanding, learning, and generating text
- Answering questions
- Summarizing dialogues and documents
- Providing suggestions

LLMs are powering applications across multiple industries, including healthcare and life sciences, media and entertainment, financial services, and more.

Key factors to consider for text-based LLMs

The list of available LLMs is growing fast as technology evolves. Understanding the factors that shape LLM options can help you with selection, gaining a competitive advantage, and increasing the business value of your gen AI investments.

Model capabilities
Assess the model's ability to perform specific tasks, such as text generation, translation, summarization, and code generation.

Model size and complexity
Larger models often offer better performance but require more computational resources. Evaluate your specific needs to determine the optimal size.

Model customization
Determine whether the model can be used directly for your tasks or whether it requires fine-tuning on your specific domain data to achieve optimal performance.

Cost and accessibility
Factor in the cost of using the model, including API fees or licensing costs. Consider open source options for more flexibility.

Ethical considerations
Be mindful of potential biases in the model's outputs and take steps to mitigate them. Ensure the model aligns with ethical guidelines and principles.
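One illustrative way to weigh these factors against each other is to turn the checklist above into a simple weighted scorecard. Everything below (the factor weights, candidate names, and 1-5 scores) is a hypothetical sketch for comparison purposes, not AWS guidance or real benchmark data.

```python
# Hypothetical weighted scorecard for comparing candidate LLMs across the
# five factors above. Weights and per-factor scores are illustrative
# placeholders; replace them with your own priorities and evaluations.
WEIGHTS = {
    "capabilities": 0.30,
    "size_complexity": 0.15,
    "customization": 0.20,
    "cost_accessibility": 0.20,
    "ethics": 0.15,
}

def score(candidate: dict) -> float:
    """Weighted sum of per-factor scores (each on a 1-5 scale)."""
    return round(sum(WEIGHTS[f] * candidate[f] for f in WEIGHTS), 2)

# Two made-up candidates: a large hosted model vs. a smaller open model.
large_hosted = {"capabilities": 5, "size_complexity": 2, "customization": 3,
                "cost_accessibility": 2, "ethics": 4}
small_open = {"capabilities": 3, "size_complexity": 5, "customization": 5,
              "cost_accessibility": 5, "ethics": 4}

print(score(large_hosted), score(small_open))  # 3.4 4.25
```

With these (made-up) numbers, the smaller open model wins on cost and customization despite weaker raw capabilities, which is exactly the kind of trade-off the factors above are meant to surface.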

Find the right text-based LLM for your use case

Beyond the factors that shape LLMs, selecting the right text-based LLM for your use case can increase the accuracy and quality of your outputs, improve performance, and drive greater cost-efficiency.

TASKS

- Text classification
- Text generation
- Q&A
- Translation
- Conversational AI
- Summarization
- Document understanding
- Software development
- Life sciences

Interested in other LLM use cases? Go beyond text-based LLMs with the broadest selection
of multi-modal options on Amazon SageMaker AI and Amazon Bedrock, including the new
Amazon Nova foundation models (FMs). Learn more about leveraging your customized
models across AI21 Labs, Anthropic, Luma AI, Meta, Mistral, Stability.AI, and more.

Learn more ›

Choose the right infrastructure to optimize performance and costs for LLMs
Build with specialized AI infrastructure that delivers the performance you need while reducing costs.

1. Rightsize your model
You may not need the largest model. Pick the right type and size of model depending on your use case.

2. Choose the optimal infrastructure
Explore purpose-built infrastructure solutions that are uniquely designed from the ground up to accelerate innovation, enhance security, and improve performance while lowering costs.
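The "rightsize your model" step can be sketched as a rule-of-thumb lookup from task type to a model size tier. The task-to-tier mapping below is a hypothetical starting point, not a benchmark-derived rule; validate any such mapping against your own evaluations.

```python
# Hypothetical rule-of-thumb mapping from task type to model size tier.
# Tiers and assignments are illustrative, not AWS recommendations.
SIZE_TIERS = {
    "text_classification": "small",   # compact models are often sufficient
    "summarization": "medium",
    "translation": "medium",
    "qa": "medium",
    "conversational_ai": "large",     # open-ended dialogue benefits from scale
    "code_generation": "large",
}

def rightsize(task: str) -> str:
    """Return a starting size tier for a task; default to medium."""
    return SIZE_TIERS.get(task, "medium")

print(rightsize("text_classification"))  # small
print(rightsize("code_generation"))      # large
```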

AWS offers infrastructure and services designed to help you get the most performance out of your LLMs while optimizing your costs:

ACCELERATED COMPUTING

From the highest-performance NVIDIA GPU-based Amazon Elastic Compute Cloud (Amazon EC2) to continued investments in our purpose-built machine learning (ML) accelerators AWS Trainium and AWS Inferentia, AWS delivers the best price performance for training and deploying gen AI models at scale.

AMAZON SAGEMAKER AI

Amazon SageMaker AI is a fully managed service that brings together a broad set of tools to enable high-performance, low-cost ML for any use case. With SageMaker AI, you can build, train, and deploy ML models at scale using tools such as notebooks, debuggers, profilers, pipelines, machine learning operations (MLOps), and more, all in one integrated development environment (IDE).
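Once a model is deployed behind a SageMaker endpoint, invoking it typically means sending a JSON request body to that endpoint. The sketch below only builds such a body; the request schema varies by model container, and the endpoint name shown in the comment is a hypothetical placeholder, so treat this as an assumed shape rather than a universal SageMaker contract.

```python
import json

def build_invoke_payload(prompt: str, max_tokens: int = 256) -> str:
    """Build a JSON request body in the shape many text-generation
    containers on SageMaker accept (the exact schema varies by model)."""
    return json.dumps({
        "inputs": prompt,
        "parameters": {"max_new_tokens": max_tokens, "temperature": 0.2},
    })

body = build_invoke_payload("Summarize: LLMs are trained on trillions of words.")

# With AWS credentials configured, this body could then be sent with boto3:
#   boto3.client("sagemaker-runtime").invoke_endpoint(
#       EndpointName="my-llm-endpoint",        # hypothetical endpoint name
#       ContentType="application/json",
#       Body=body,
#   )
print(body)
```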

NETWORKING

Purpose-built to meet the performance demands of gen AI, Amazon Web Services (AWS) provides high-throughput and low-latency networking that includes Elastic Fabric Adapter (EFA) and Amazon EC2 UltraClusters.

STORAGE

Accelerate compute workloads with Amazon FSx for Lustre, which provides sub-millisecond latencies, up to hundreds of gigabytes per second (GBps) of throughput, and millions of input/output operations per second (IOPS) while quickly accessing and processing your datasets on Amazon Simple Storage Service (Amazon S3).

SECURITY

Our accelerated computing Amazon EC2 instances and networking are built on a foundation of the AWS Nitro System, which has been validated by independent cybersecurity firm NCC Group. The level of security protection offered is so critical that we've added it to our AWS Service Terms to provide additional assurance to all of our customers.

AWS Trainium2–based Amazon EC2 Trn2 instances deliver up to 30% better price performance than current generation GPU-based EC2 instances. AWS Inferentia2–based Amazon EC2 Inf2 instances deliver up to 40% lower cost per inference than comparable EC2 instances.
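To see what the headline figures above could mean in practice, here is the back-of-envelope arithmetic; the baseline monthly cost is a made-up input for illustration, not AWS pricing, and "up to" means these are best-case reductions.

```python
# Illustrative math for a percentage cost reduction. The $10,000 baseline
# is a hypothetical figure, not actual AWS pricing.
def cost_after_reduction(baseline: float, reduction_pct: float) -> float:
    """Cost remaining after applying a percentage reduction."""
    return baseline * (1 - reduction_pct / 100)

# "Up to 40% lower cost per inference" on a $10,000/month baseline would
# imply, at best, a $6,000/month spend.
print(cost_after_reduction(10_000, 40))  # 6000.0
```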

Unleash the power of generative AI LLMs

Optimize performance and costs for LLM deployment

While LLMs hold the potential to transform your business and give it a competitive
edge, building, training, and deploying them requires an unprecedented level
of infrastructure resources. To succeed, you need an infrastructure strategy that
delivers the right processing power without compromising on cost or performance;
low-latency, high-throughput networking; storage solutions that help accelerate
and cost-optimize your compute; and a deep set of cloud services and partners.
Empower your organization with LLMs—start your journey with AWS today.

Get started with AWS AI infrastructure ›

© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
