
AI Tools + Prompt Engineering for Developers

What It Is (Definition)

LLM (Large Language Model)


A Large Language Model is a type of AI model trained on massive amounts of text data
to understand, generate, and interact with human language.

Think of an LLM as a very smart autocomplete system that doesn't just finish your
sentence — it can summarize articles, write code, answer questions, translate
languages, and more.

It’s called “large” because:

• It has billions (or even trillions) of parameters.

• It’s trained on huge datasets scraped from the internet (books, articles, code, etc.).

Examples:

• OpenAI’s GPT series (e.g., GPT-4)

• Google’s PaLM

• Meta’s LLaMA

• Anthropic’s Claude

How It Works (Mechanics)

At a high level, here’s how LLMs work:

1. Training

• The model is trained on tons of text using self-supervised learning.

• Goal: Predict the next word in a sentence.


Example:

"The capital of France is ___" → "Paris"

2. Architecture

• Most LLMs use a Transformer architecture (introduced in the 2017 “Attention Is All You Need” paper).

• Transformers allow the model to pay attention to different parts of a sentence at once — this is how they understand context so well.

3. Tokens, Not Words

• LLMs don’t see words directly — they see tokens (pieces of words).

• “ChatGPT” → might be split into [‘Chat’, ‘G’, ‘PT’]

4. Inference (Using the Model)

• After training, you can send a prompt (input) to the model.

• It returns a response based on its learned patterns.

Think of it as:
You give it a question (prompt), and it responds with a very educated guess based on everything it has seen during training.

Why It Matters (Real-World Relevance)

LLMs are changing the way developers build software:

Developers use LLMs to:

• Build chatbots, code assistants, and documentation generators

• Generate or autocomplete code snippets

• Parse unstructured text (like emails, logs, PDFs)

• Summarize customer feedback, legal docs, or health records

• Translate or localize app content

• Power AI agents in tools like LangChain, AutoGen, etc.

Software engineers today are expected to:

• Know how to use APIs like OpenAI or Hugging Face

• Understand prompt engineering

• Integrate LLMs in apps using frameworks like LangChain, FastAPI, or Flask
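
For example, a minimal sketch of calling the OpenAI API from Python (assuming the openai package v1+ and an OPENAI_API_KEY environment variable; the model name is just one option):

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[
        {"role": "system", "content": "You are a helpful coding assistant."},
        {"role": "user", "content": "Summarize what a Python decorator does in one sentence."},
    ],
)
print(response.choices[0].message.content)
```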


Theory of LLMs (Large Language Models)

A Large Language Model (LLM) is a deep neural network, typically based on the
Transformer architecture, trained on a massive corpus of text to model the probability
of a word (token) given a sequence of previous tokens.

In simpler terms:

It learns the structure and patterns of human language by predicting the next token in
a sequence.

At its core, an LLM is a probabilistic model:

P(token_t | token_1, token_2, ..., token_{t-1})

That means the model doesn't "understand" language like humans — it learns
statistical associations.

How It Works (Mechanics – Deep Dive)

Let’s break it into components:

1. Tokenization

Before any text can be processed, it's split into tokens (subword units):

• "ChatGPT is amazing!" → [Chat, G, PT, is, amazing, !]

• Tokens are mapped to numbers via a vocabulary (token-ID mapping).
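
A quick way to see tokenization in practice is OpenAI’s tiktoken library (a sketch; the exact splits depend on the encoding, and cl100k_base is just one common choice):

```python
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")

ids = enc.encode("ChatGPT is amazing!")
print(ids)                             # a list of integer token IDs
print([enc.decode([i]) for i in ids])  # the text piece each ID maps back to
```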

2. Embedding Layer

Each token is converted into a high-dimensional vector using embedding matrices.

• For example:
"Chat" → [0.22, -0.88, ..., 1.05] (say, 768 dimensions)

• These vectors are learned during training and represent semantic meaning.
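
A minimal sketch of an embedding lookup in PyTorch (the vocabulary size, dimension, and token IDs below are illustrative, not any specific model’s values):

```python
import torch

# Hypothetical sizes: a 50,257-token vocabulary, 768-dimensional vectors.
embedding = torch.nn.Embedding(num_embeddings=50257, embedding_dim=768)

token_ids = torch.tensor([3398, 318])  # made-up token IDs for illustration
vectors = embedding(token_ids)         # one learned vector per token
print(vectors.shape)                   # torch.Size([2, 768])
```
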
3. Transformer Architecture (The Brain)

Core Components:

• Self-Attention:
Allows each token to look at all other tokens in the sequence to understand
context.

• Multi-Head Attention:
Multiple attention mechanisms run in parallel to capture different aspects of
context.

• Positional Encoding:
Since Transformers don’t process text in order like RNNs, this adds info about
token position.

• Feedforward Layers:
After attention, these are regular neural network layers that refine the
representation.

Transformer Block Flow:

Input Tokens → Embeddings → [Multi-Head Attention → Feedforward → Norm & Residual] × N → Output logits

4. Training Objective (Next-Token Prediction)

The model is trained to minimize the loss between predicted tokens and actual tokens
in a sequence.

• Loss Function: Usually cross-entropy

• It learns through backpropagation and gradient descent

Example:

Input: "The sky is"

Target: "blue"

Model guess: "cloudy" → Loss is high

Model guess: "blue" → Loss is low
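
A toy version of this loss computation in PyTorch (the 5-token vocabulary and scores are made up for illustration):

```python
import torch
import torch.nn.functional as F

# Pretend vocabulary of 5 tokens, where ID 2 stands for "blue".
logits = torch.tensor([[1.0, 0.5, 3.0, 0.2, -1.0]])  # raw next-token scores
target = torch.tensor([2])                            # the actual next token

loss = F.cross_entropy(logits, target)
print(loss.item())  # low, because the model already favors token 2 ("blue")
```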


5. Generation & Sampling (Inference Time)

Once trained, we use the model for text generation using sampling methods:

• Greedy Search: Always pick the most probable token

• Top-k Sampling: Randomly pick from top-k likely tokens

• Temperature: Controls randomness in output

• Beam Search: Keeps multiple candidates during generation for more fluent
results
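
A rough sketch of how temperature and top-k sampling combine, written against a plain logits vector (illustrative, not any library’s actual implementation):

```python
import torch

def sample_next_token(logits: torch.Tensor, temperature: float = 1.0, top_k: int = 50) -> int:
    # Greedy search would simply be: int(torch.argmax(logits))
    logits = logits / temperature                     # <1.0 sharpens, >1.0 flattens
    values, indices = torch.topk(logits, min(top_k, logits.numel()))
    probs = torch.softmax(values, dim=-1)             # renormalize over the top-k
    choice = torch.multinomial(probs, num_samples=1)  # sample one candidate
    return int(indices[choice])
```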

Why It Matters (Real-World Relevance – Theoretical Lens)

Understanding theory gives you powerful leverage:

• Helps you write better prompts — knowing how tokenization & attention work improves precision.

• Helps debug model issues — hallucinations, length limits, context-window overflows.

• Lets you fine-tune or adapt models using transfer learning or LoRA.

• Helps you optimize for latency, cost, or performance by understanding model size, token limits, etc.

Examples & Use Cases (From Theory to Practice)

1. Context Length Matters (Attention is Limited)

• GPT-4-turbo supports ~128k tokens (~300 pages)

• If you input more than that, earlier tokens are truncated or ignored

• Fix: Use windowing or summarization techniques
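
A minimal windowing sketch using tiktoken (the chunk size and encoding name are assumptions; pick them to match your model):

```python
import tiktoken

def chunk_text(text: str, max_tokens: int = 4000, encoding: str = "cl100k_base") -> list[str]:
    # Encode once, then slice the token IDs into fixed-size windows.
    enc = tiktoken.get_encoding(encoding)
    ids = enc.encode(text)
    return [enc.decode(ids[i:i + max_tokens]) for i in range(0, len(ids), max_tokens)]
```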

2. Long Prompts = Longer Attention Computation

• Attention complexity is O(n²) — doubling the prompt length quadruples compute time!

• Tip: Keep prompts short when possible


Developer Tips

Best Practices

• Learn prompt engineering with knowledge of tokenization and embeddings

• Use token counters to avoid hitting limits (e.g., tiktoken)

• Choose the right sampling method for creative vs. factual tasks

Common Misconceptions

• “LLMs understand like humans” → No, they mimic patterns from data

• “Bigger = always better” → Not always; use the right size for your use case

• “Training from scratch is better than using OpenAI/HF” → Hugely expensive and impractical for 99% of developers

Mastering LLM Tools – Overview

Let’s go tool-by-tool and break them down across:

| Tool | Best For | Key Strengths | API / Interface |
|---|---|---|---|
| ChatGPT (OpenAI) | Code, analysis, tool use, API workflows | Plugins, function calling, GPTs, Vision | Web + API (chat + tools) |
| Claude (Anthropic) | Reasoning, summarization, long docs | Huge context (up to 200K+), low hallucinations | Web + Claude API |
| Gemini (Google) | Google ecosystem, images + code | Multimodal, Android integrations | Web + Vertex AI API |

ChatGPT – Power Use Guide

Best Use Cases:

• Code generation, debugging, test writing

• API or CLI assistant

• Function calling & workflows

• File analysis (w/ Pro)

Pro Features (GPT-4 Turbo):

• Custom GPTs with instructions & tools

• Vision: Upload images, diagrams, UI mockups

• Files: Upload codebases, PDFs, CSVs for analysis

• Tools: Code Interpreter, Browser, DALL·E

Dev API Features:

POST [Link]

• Models: gpt-3.5-turbo, gpt-4, gpt-4-turbo

• Function calling (JSON schema)

• Streaming + role-based conversations
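
A minimal function-calling sketch with the v1 Python SDK (the get_weather tool is a made-up example; OPENAI_API_KEY is assumed to be set):

```python
from openai import OpenAI

client = OpenAI()

tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",  # hypothetical function, for illustration only
        "description": "Get the current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

response = client.chat.completions.create(
    model="gpt-4-turbo",
    messages=[{"role": "user", "content": "What's the weather in Paris?"}],
    tools=tools,
)
# Instead of prose, the model returns a structured request to call get_weather(city="Paris").
print(response.choices[0].message.tool_calls)
```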


Prompting Tip:

"You're a senior Python engineer. Improve this FastAPI endpoint for security and
performance."

Claude 3 – Deep Reasoning & Document Expert

Best Use Cases:

• Long document summarization (200K+ tokens!)

• Writing-first tasks (letters, guides, explanations)

• Ingesting PDFs, markdown, CSVs with deep understanding

Claude 3 Models:

• Claude 3 Haiku (fastest)

• Claude 3 Sonnet (default)

• Claude 3 Opus (best reasoning)

Dev API Features:

POST [Link]

• Models: claude-3-opus-20240229, etc.

• File upload + message threading

• Tool use coming soon (early access)
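
A minimal sketch with Anthropic’s Python SDK (assumes the anthropic package and an ANTHROPIC_API_KEY environment variable; max_tokens is required by this API):

```python
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

message = client.messages.create(
    model="claude-3-opus-20240229",
    max_tokens=1024,
    messages=[{"role": "user", "content": "Summarize the key requirements in this product spec: ..."}],
)
print(message.content[0].text)
```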

Prompting Tip:

“Here’s a product spec. Summarize key requirements, risks, and edge cases.”

Gemini – Multimodal + Google Integration

Best Use Cases:

• Code generation with real-time web context

• Android app workflows (Firebase, Studio)

• Multimodal: Images + PDFs + Text in one prompt


Features:

• 1M token context window (on Gemini 1.5)

• Auto-detects uploaded content (PDF, image, code)

• Good for Google Sheets, Gmail, Calendar integration

Dev API (via Vertex AI):

from vertexai.language_models import ChatModel

• Models: gemini-1.5-pro, gemini-1.0-pro

• Supports tool calling + function execution

• Multi-part input support
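
A minimal sketch (note: SDK surfaces change; at the time of writing, Gemini models are usually reached via vertexai.generative_models, while the language_models import above targets the older PaLM chat models; the project ID and region are placeholders):

```python
import vertexai
from vertexai.generative_models import GenerativeModel

vertexai.init(project="my-project-id", location="us-central1")  # placeholders

model = GenerativeModel("gemini-1.5-pro")
response = model.generate_content("Write a Kotlin data class for a Patient with name, age, and id.")
print(response.text)
```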

Prompting Tip:

“You’re an Android Studio tutor. Write a Kotlin UI layout for this screen idea [upload
image].”

Comparison Cheat Sheet

| Feature/Need | ChatGPT | Claude | Gemini |
|---|---|---|---|
| Code generation | Excellent | Good | Good |
| Long doc support | Medium (128K) | Huge (200K+) | (1M in lab) |
| Reasoning | Great | Best | Good |
| Vision/Image support | Vision (for now) | | Multimodal |
| Google integration | | | Sheets, Gmail |
| Tool + function calling | Powerful | (coming) | In Gemini 1.5 |
| Cost efficiency | Turbo is cheap | Haiku fast | API still pricy |


Dev Tips

Use each for what they’re best at:

• Claude = long-form clarity + structured docs

• ChatGPT = building + code + tooling

• Gemini = multimodal + search + Google tools

Test APIs for automation:

• Auto-summarize PRs with Claude

• Generate test cases from schema with ChatGPT

• Create Google Calendar events from voice notes with Gemini

Don’t:

• Mix tools blindly (choose based on task)

• Expect perfect reasoning from free-tier models

• Ignore token limits or rate caps


Prompt Engineering for Developers

1. What It Is (Definition)

Prompt Engineering is the practice of crafting effective inputs (prompts) to large language models (LLMs) to get reliable, relevant, and accurate outputs.

For developers, it's like giving clear function parameters to a smart but fuzzy API.

Think of it as designing the frontend UX for your backend AI brain.

2. How It Works (Mechanics)

An LLM doesn’t “think” — it predicts the next token based on patterns in training data.
Your prompt becomes the full program context, so:

Key Concepts:

• System Prompt: Sets identity/personality (e.g., "You're a senior Python dev").

• User Prompt: The actual instruction or question.

• Few-shot Examples: Including samples to guide output.

• Chain-of-Thought: Ask the model to “think out loud” before answering.

• Output Formatting: Use markdown, JSON, or code fences to control style.

Prompt = Programming

Prompting is like scripting:

System: "You're an AI who writes clean Python tests."

User: "Write a pytest function to test this FastAPI endpoint: ..."
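
Put together, a message list that combines a system role, one few-shot example, and JSON output formatting might look like this (a sketch; the schema and wording are illustrative):

```python
messages = [
    # System prompt: identity plus an output-format constraint.
    {"role": "system",
     "content": 'You write clean Python tests. Reply only with JSON: {"test_name": str, "code": str}.'},
    # Few-shot example: one sample exchange demonstrating the expected format.
    {"role": "user",
     "content": "Write a pytest function for GET /health returning 200."},
    {"role": "assistant",
     "content": '{"test_name": "test_health_ok", "code": "def test_health_ok(client): assert client.get(\'/health\').status_code == 200"}'},
    # The actual request.
    {"role": "user",
     "content": "Write a pytest function to test POST /login returning 200 on success."},
]
```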

3. Why It Matters (Real-World Relevance)

Prompting is how developers turn LLMs into usable tools:

• From code generation to auto-documentation, bug detection, and chat agents

• No fine-tuning required — prompt engineering is a fast, cheap way to prototype

• Used by devs in Copilot, ChatGPT, RAG, LangChain, Agents, API wrappers

It’s the foundation of LLM-as-a-Tool development.


4. Examples & Use Cases

1. Code Generation Prompt

You're a senior Python engineer.

Write a FastAPI route to upload an image and store it in a local folder.

Use Pydantic for validation.

2. Bug Fixing Prompt

Here’s a Python function. It throws a `TypeError` on line 23.

Explain the bug and suggest a fix:

```python
def calculate_total(items: list[int]):
    return sum(item['price'] for item in items)
```

3. Unit Test Writing

Write a pytest function to test this endpoint:

POST /login with JSON {username, password}. Should return 200 on success.

4. Docstring Generator

Add PEP-257 docstrings to all the functions in this Python file:

```python
def add(x, y): return x + y
```

5. Regex Explainer

Explain this regex: `^(?=.*[A-Z])(?=.*\d)[A-Za-z\d]{8,}$`

Also, convert it into Python code with a usage example.
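
For reference, a direct Python translation of that regex might look like this (a sketch; the test strings are made up):

```python
import re

# At least one uppercase letter, at least one digit, only letters/digits, min length 8.
PASSWORD_RE = re.compile(r"^(?=.*[A-Z])(?=.*\d)[A-Za-z\d]{8,}$")

print(bool(PASSWORD_RE.match("Secret123")))  # True
print(bool(PASSWORD_RE.match("secret123")))  # False: no uppercase letter
```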

5. Developer Tips

Best Practices:

• Be explicit: Assume the model needs context. Add roles, constraints, formats.

• Use markdown/code blocks to structure inputs.

• Give examples (few-shot learning).


• Use step-by-step / chain-of-thought to improve accuracy.

• Use JSON output formatting for structured data.

Common Mistakes:

• Vague prompts like “help me with code”

• Asking for too many tasks in one prompt

• Ignoring context window (cut-off outputs)

• Relying on default chat behavior for precise tasks

Prompt Engineering Tools:

• PromptLayer – Track and version prompts

• LangSmith – Debug and monitor prompt chains

• OpenAI Playground – Experiment easily

6. Bonus Resources

Learn Prompting:

• [Link] – Free, dev-focused course

• OpenAI Cookbook: Prompting Guide

• FlowGPT – Prompt sharing and discovery

Experiment & Test:

• OpenAI Playground

• Prompt Engineering Guide

TL;DR Cheat Sheet for Devs

| Task | Prompt Format Idea |
|---|---|
| Role setup | “You are a senior engineer who...” |
| Structure | “Output JSON in this format: {name, type, summary}” |
| Testing | “Write a test case for...” |
| Tool usage | “Use Python’s re module to…” |
| Code chaining | “First analyze the code, then write improved version” |
| RAG flows | “Summarize this document. Then generate 3 follow-up questions” |


How to Write Effective Prompts: A Simple Structure

1. Context / Background

Provide a brief context or background info so the AI understands the setting or topic.

• Example: "I’m building a web app that tracks fitness routines..."

2. Clear Task / Question

State exactly what you want the AI to do.

• Example: "Explain how to implement user authentication in FastAPI."

3. Constraints / Requirements

Add any specific constraints or details you want included or avoided.

• Example: "Use OAuth2 and include code samples in Python."

4. Format / Output Style

Specify how you want the answer formatted.

• Example: "Give me a step-by-step guide with bullet points."

5. Optional: Examples / References

Provide examples or mention preferred styles for clarity.

• Example: "Use simple language like explaining to a beginner."

Template

[Context / Background]

[Clear Task / Question]

[Constraints / Requirements]

[Format / Output Style]

(Optional) [Examples / References]


Example Prompt Using This Structure

[Role/Persona]

Act as a highly experienced Full Stack Developer with expertise in Spring Boot (Java),
[Link], and JWT-based authentication. You are also skilled in designing scalable
RESTful APIs and follow best practices in clean code and software architecture.

[Task or Goal]

Your goal is to help me design a robust backend architecture for a Hospital Management
System (HMS) that includes user authentication, role-based access control (Admin,
Doctor, Patient), and appointment booking functionality. You also need to recommend
the proper folder structure, best practices, and database schema for scalability.

[Context or Background]

This is a final-year academic project I’m building with a team of 4 members. I am responsible for the backend and JWT-based authentication module. The system will have three types of users – Admin (as a User with role-based access), Doctors, and Patients. Patients can register, book appointments, and view records. Doctors can manage appointments and view patient details. Admin can manage doctor and patient records. We are using MySQL as the database. I want to follow a layered architecture with Controllers, Services, DTOs, Repositories, and Entities. Exception handling and security filters are mandatory.

[Instructions or Constraints]

- Provide detailed guidance on folder structure following industry standards.

- Use simple yet professional language suitable for an academic submission.

- Recommend proper class naming conventions and package names.

- Include code snippets (not the full project) to explain key parts like security
configuration, token filter, user authentication flow, and appointment booking API.

- Mention the use of annotations like `@PreAuthorize` for role-based access.

- Avoid overly advanced enterprise-level patterns that are not needed for a college project.

[Output Format]

Break down the response into the following clear sections:

1. Project Folder Structure (with explanation)

2. Entity and Database Design (with ER Diagram in text format)

3. Key Classes & Code Snippets (Controller, Service, SecurityConfig, Filter)

4. Authentication Flow Explanation (Step-by-step)

5. Best Practices to Follow

6. Conclusion & Next Steps
