Master sophisticated prompting strategies for maximum LLM performance
Master the art and science of prompt engineering—from core principles to advanced techniques like Chain-of-Thought, Self-Consistency, ReAct, and Automatic Prompt Engineering
Prompting is the primary interface for interacting with language models—the process of crafting inputs to guide the model towards generating a desired output. This involves structuring requests, providing relevant context, specifying the output format, and demonstrating expected response types.
Well-designed prompts can maximize the potential of language models, resulting in accurate, relevant, and creative responses. In contrast, poorly designed prompts can lead to ambiguous, irrelevant, or erroneous outputs. The objective of prompt engineering is to consistently elicit high-quality responses from language models.
This appendix details various prompting techniques that extend beyond basic interaction methods. It explores methodologies for structuring complex requests, enhancing the model's reasoning abilities, controlling output formats, and integrating external information. These techniques are applicable to building a range of applications, from simple chatbots to complex multi-agent systems.
Effective prompting rests on fundamental principles guiding communication with language models, applicable across various models and task complexities. Mastering these principles is essential for consistently generating useful and accurate responses.
Instructions should be unambiguous and precise. Language models interpret patterns; multiple interpretations may lead to unintended responses. Define the task, desired output format, and any limitations or requirements. Avoid vague language or assumptions.
While specificity is crucial, it should not compromise conciseness. Instructions should be direct. Unnecessary wording or complex sentence structures can confuse the model or obscure the primary instruction. Use simple, direct phrasing and active verbs.
Verb choice is a key prompting tool. Action verbs indicate the expected operation; effective choices include Summarize, Classify, Extract, Translate, Compare, Generate, and Rewrite.
Positive instructions are generally more effective than negative constraints. Specifying the desired action is preferred to outlining what not to do. Frame prompts to guide the model directly. Positive instructions align with human guidance preferences and reduce confusion.
Prompt engineering is an iterative process. Identifying the most effective prompt requires multiple attempts. Begin with a draft, test it, analyze the output, identify shortcomings, and refine the prompt. Documenting attempts is vital for learning and improvement.
Building on core principles, foundational techniques provide language models with varying levels of information or examples to direct their responses.
The most basic form of prompting, where the language model is provided with an instruction and input data without any examples. It relies entirely on the model's pre-training to understand the task.
When to use:
Zero-shot prompting is often sufficient for tasks that the model has likely encountered extensively during its training, such as simple question answering, text completion, or basic summarization.
Example:
Translate the following English sentence to French: 'Hello, how are you?'
One-shot prompting involves providing the language model with a single example of the input and the corresponding desired output prior to presenting the actual task. This serves as an initial demonstration to illustrate the pattern.
When to use:
One-shot prompting is useful when the desired output format or style is specific or less common. It gives the model a concrete instance to learn from.
Few-shot prompting enhances one-shot prompting by supplying several examples, typically three to five, of input-output pairs. This aims to demonstrate a clearer pattern of expected responses.
When to use:
Particularly effective for tasks where the desired output requires adhering to a specific format, style, or exhibiting nuanced variations. Excellent for classification, data extraction with specific schemas, or generating text in a particular style.
Best Practices:
Use diverse, high-quality examples that cover likely edge cases, keep the input-output formatting consistent across examples, and, for classification tasks, vary the order of the classes so the model does not simply copy the example order. A sketch of a few-shot prompt follows.
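As a minimal sketch of the pattern, a few-shot prompt can be assembled from labelled input-output pairs placed before the actual query. The task, labels, and example reviews below are illustrative, not from this appendix:

```python
# Minimal few-shot prompt builder for a sentiment-classification task.
# The task, labels, and examples are illustrative placeholders.
EXAMPLES = [
    ("The delivery was fast and the packaging was great.", "POSITIVE"),
    ("The product broke after two days of use.", "NEGATIVE"),
    ("It arrived on time and does what it says.", "NEUTRAL"),
]

def build_few_shot_prompt(review: str) -> str:
    lines = ["Classify the review as POSITIVE, NEGATIVE, or NEUTRAL.", ""]
    for text, label in EXAMPLES:
        lines.append(f"Review: {text}")
        lines.append(f"Sentiment: {label}")
        lines.append("")
    # The actual query follows the demonstrations, in the same format.
    lines.append(f"Review: {review}")
    lines.append("Sentiment:")
    return "\n".join(lines)

print(build_few_shot_prompt("Not bad, but I expected more for the price."))
```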
Beyond providing examples, the way you structure your prompt plays a critical role in guiding the language model. Structuring involves using different sections or elements to provide distinct types of information in a clear and organized manner.
A system prompt sets the overall context and purpose for a language model, defining its intended behavior for an interaction or session. This provides foundational guidelines for the model's responses, influencing tone, style, and general approach.
Example:
You are a helpful and harmless AI assistant. Respond to all queries in a polite and informative manner. Do not generate content that is harmful, biased, or inappropriate.
Role prompting assigns a specific character, persona, or identity to the language model. This involves instructing the model to adopt the knowledge, tone, and communication style associated with that role.
Example:
Act as a seasoned travel blogger. Write a short, engaging paragraph about the best hidden gem in Rome.
Delimiters such as triple backticks (```), XML tags (<instruction>, <context>), or markers (---) visually and programmatically separate sections of a prompt. This minimizes misinterpretation by the model.
Example:
<instruction> Summarize the following article, focusing on the main arguments. </instruction> <article> [Insert the full text of the article here] </article>
Unlike static system prompts, context engineering dynamically provides background information crucial for tasks and conversations. This ever-changing information helps models grasp nuances, recall past interactions, and integrate relevant details.
Context Layers:
Typical layers include the system instructions, the conversation history, documents or tool outputs retrieved for the current task, and implicit user state such as preferences or prior interactions.
The core principle is that even advanced models underperform with a limited or poorly constructed view of their operational environment. Context engineering reframes the task from merely answering a question to building a comprehensive operational picture for the agent.
Requesting structured output such as JSON, XML, CSV, or Markdown tables is crucial. By explicitly asking for the output in a particular format and potentially providing a schema, you guide the model to organize its response in a way that can be easily parsed.
Benefits:
Structured responses can be parsed programmatically, validated against a schema, and passed directly to downstream systems, which reduces ambiguity and brittle post-processing.
Pro Tip:
Use Pydantic (Python) or Zod (TypeScript) to validate structured output and create type-safe objects from LLM responses.
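A minimal sketch of that validation step, assuming Pydantic v2 and an illustrative invoice schema; the model would first be prompted to return JSON with exactly these fields:

```python
from pydantic import BaseModel, ValidationError

# Illustrative schema; prompt the model to return JSON with exactly these fields.
class Invoice(BaseModel):
    vendor: str
    total: float
    currency: str

# Raw text as it might come back from the model.
llm_output = '{"vendor": "Acme Corp", "total": 1299.50, "currency": "USD"}'

try:
    invoice = Invoice.model_validate_json(llm_output)  # parse and validate in one step
    print(invoice.vendor, invoice.total, invoice.currency)
except ValidationError as err:
    # A malformed or schema-violating response can trigger a retry or repair prompt.
    print("Invalid structured output:", err)
```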
Large language models excel at pattern recognition but often face challenges with tasks requiring complex, multi-step reasoning. These techniques enhance reasoning capabilities by encouraging models to reveal their internal thought processes.
Chain-of-Thought (CoT) prompting is a powerful method for improving reasoning abilities by explicitly prompting the model to generate intermediate reasoning steps before arriving at a final answer. Instead of just asking for the result, you instruct the model to "think step by step."
Zero-Shot CoT
Simply add "Let's think step by step" to your prompt without providing examples. Surprisingly effective for many tasks.
Few-Shot CoT
Provide examples showing both the step-by-step reasoning process and the final output. Often leads to better results on complex tasks.
Best Practices:
Ask for the reasoning first and the final answer last, on a clearly marked line so it can be extracted reliably, and prefer a low temperature (often 0) when the task has a single correct answer. A zero-shot CoT sketch follows.
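A minimal zero-shot CoT sketch; the arithmetic question is illustrative, and the prompt simply appends the reasoning trigger plus an answer marker that is easy to parse:

```python
# Illustrative question; zero-shot CoT only adds the reasoning trigger, no examples.
question = (
    "A cafe sold 58 coffees in the morning and 34 in the afternoon. "
    "Each coffee costs $4. How much revenue did the cafe make?"
)

prompt = (
    f"{question}\n\n"
    "Let's think step by step. "
    "After the reasoning, write the result on a final line starting with 'Answer:'."
)

print(prompt)  # send this to your model client of choice
```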
Self-consistency improves reliability by generating multiple diverse reasoning paths for the same problem and selecting the most consistent answer through majority vote.
Process:
Prompt the model several times at a non-zero temperature to produce diverse reasoning chains, extract the final answer from each chain, and return the answer that appears most often.
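A minimal sketch of the voting step; sample_llm is a hypothetical stub standing in for repeated calls to a real client at a non-zero temperature:

```python
import re
from collections import Counter

def sample_llm(prompt: str, temperature: float = 0.7) -> str:
    """Hypothetical stub: replace with one sampled response from your model client."""
    return "58 + 34 = 92 coffees, 92 * 4 = 368.\nAnswer: 368"

def extract_answer(response: str) -> str:
    match = re.search(r"Answer:\s*(.+)", response)
    return match.group(1).strip() if match else ""

def self_consistent_answer(prompt: str, n_samples: int = 5) -> str:
    # Sample several independent reasoning chains, then majority-vote on the answers.
    answers = [extract_answer(sample_llm(prompt)) for _ in range(n_samples)]
    votes = Counter(a for a in answers if a)
    return votes.most_common(1)[0][0] if votes else ""

print(self_consistent_answer("...the CoT prompt from the previous sketch..."))
```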
Step-back prompting enhances reasoning by first asking the model to consider a general principle or concept related to the task before addressing specific details. This allows the model to activate relevant background knowledge.
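A minimal two-call sketch, using a hypothetical call_llm stub: the first call elicits the general principles, the second reuses them as context for the specific question:

```python
def call_llm(prompt: str) -> str:
    """Hypothetical stub: replace with a real call to your model client."""
    return "(model response)"

question = "Why is it easier to balance on a bicycle when it is moving?"

# Step 1: step back to the general principles behind the question.
principles = call_llm(
    "What general physics principles are most relevant to this question? "
    f"List them briefly.\n\nQuestion: {question}"
)

# Step 2: answer the original question with those principles as context.
answer = call_llm(
    f"Relevant principles:\n{principles}\n\n"
    f"Using these principles, answer the question: {question}"
)
print(answer)
```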
Tree of Thoughts (ToT) is an advanced technique that extends Chain of Thought by enabling the model to explore multiple reasoning paths concurrently. It uses a tree structure where each node represents a "thought": a coherent language sequence acting as an intermediate step.
Particularly suited for complex problems requiring exploration, backtracking, or evaluation of multiple possibilities. More computationally demanding but can achieve superior results on tasks necessitating deliberate problem-solving.
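A heavily simplified sketch of the search idea, keeping only the best few branches at each depth; propose_thoughts and score_thought are hypothetical stand-ins for model calls, and real ToT implementations differ in how they expand and evaluate nodes:

```python
# Simplified Tree-of-Thoughts-style search: expand candidate thoughts, score each
# partial chain, keep the best few, and repeat. The two helpers below are
# hypothetical stand-ins for model calls.

def propose_thoughts(problem: str, partial: list[str], k: int = 3) -> list[str]:
    """Would ask the model for k possible next reasoning steps."""
    return [f"step {len(partial) + 1}, option {i + 1}" for i in range(k)]

def score_thought(problem: str, partial: list[str]) -> float:
    """Would ask the model to rate how promising this partial chain is."""
    return 1.0  # placeholder score

def tree_of_thoughts(problem: str, depth: int = 3, beam: int = 2) -> list[str]:
    frontier = [[]]  # each entry is one partial chain of thoughts
    for _ in range(depth):
        candidates = [
            path + [thought]
            for path in frontier
            for thought in propose_thoughts(problem, path)
        ]
        candidates.sort(key=lambda p: score_thought(problem, p), reverse=True)
        frontier = candidates[:beam]  # keep only the most promising branches
    return frontier[0]

print(tree_of_thoughts("Use 4, 7, 8, 8 and basic arithmetic to make 24"))
```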
Intelligent agents possess the capability to actively engage with their environment beyond generating text. This includes utilizing tools, executing external functions, and participating in iterative cycles of observation, reasoning, and action.
A crucial ability for agents is using external tools or calling functions to perform actions beyond internal capabilities. Modern language models often undergo fine-tuning for "function calling" or "tool use."
The model interprets descriptions of available tools, determines when tool use is necessary, identifies the appropriate tool, and formats the required arguments. The model generates a structured output (typically JSON) specifying the tool and parameters—it does not execute the tool directly.
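A minimal dispatch sketch: weather_tool is the kind of description you would hand to the model, and model_tool_call stands in for the structured JSON the model emits in response (the function, schema, and values are illustrative):

```python
import json

# Description of an available tool, as it would be shown to the model
# (JSON Schema-style parameter definitions are a common convention).
weather_tool = {
    "name": "get_weather",
    "description": "Get the current weather for a city.",
    "parameters": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}

def get_weather(city: str) -> str:
    return f"18°C and cloudy in {city}"  # placeholder implementation

# The model does not execute anything; it emits a structured call like this.
model_tool_call = '{"name": "get_weather", "arguments": {"city": "Rome"}}'

call = json.loads(model_tool_call)
if call["name"] == "get_weather":
    result = get_weather(**call["arguments"])
    # The tool result is then passed back to the model for the final reply.
    print(result)
```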
ReAct (Reason and Act) is a prompting paradigm that combines Chain-of-Thought-style reasoning with the ability to perform actions using tools in an interleaved manner. ReAct mimics how humans operate: we reason verbally and take actions to gather more information.
The ReAct Loop:
The model alternates between a Thought (reasoning about what to do next), an Action (a call to a tool with arguments), and an Observation (the tool's result fed back into the context), repeating until it can produce a final answer.
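A minimal sketch of that loop with a single lookup tool; call_llm is a hypothetical stub, the Thought/Action/Observation markers are a common convention, and real implementations add stricter parsing and stop conditions:

```python
def call_llm(transcript: str) -> str:
    """Hypothetical stub: would return the model's next Thought/Action or Final Answer."""
    return "Thought: I know this directly.\nFinal Answer: Rome"

def lookup(query: str) -> str:
    return f"(search results for {query!r})"  # placeholder tool

def react(question: str, max_steps: int = 5) -> str:
    transcript = f"Question: {question}\n"
    for _ in range(max_steps):
        step = call_llm(transcript)  # model produces a Thought and either an Action or an answer
        transcript += step + "\n"
        if "Final Answer:" in step:
            return step.split("Final Answer:", 1)[1].strip()
        if "Action: lookup[" in step:
            query = step.split("Action: lookup[", 1)[1].split("]", 1)[0]
            observation = lookup(query)
            transcript += f"Observation: {observation}\n"  # fed back on the next turn
    return "No answer within the step budget."

print(react("Which city is home to the Colosseum?"))
```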
Automatic Prompt Engineering (APE) explores using language models themselves to generate, evaluate, and refine prompts. This method aims to automate the prompt writing process, potentially enhancing model performance without extensive human effort.
A "meta-model" takes a task description and generates multiple candidate prompts. These are evaluated based on output quality, and the best-performing prompts are selected and potentially refined further.
Retrieval-Augmented Generation (RAG) is a powerful technique that enhances language models by giving them access to external, up-to-date, or domain-specific information during the prompting process. When a user asks a question, the system first retrieves relevant documents from a knowledge base.
This retrieved information is included in the prompt as context, allowing the model to generate a response grounded in external knowledge. This mitigates hallucination and provides access to information the model wasn't trained on.
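A minimal sketch with naive keyword retrieval over an in-memory document list; real systems typically use embeddings and a vector store, and call_llm is a hypothetical stub:

```python
def call_llm(prompt: str) -> str:
    """Hypothetical stub: replace with a real call to your model client."""
    return "(answer grounded in the retrieved context)"

DOCUMENTS = [
    "The refund window for standard orders is 30 days from delivery.",
    "Express shipping is available in the EU and the UK.",
    "Gift cards are non-refundable and never expire.",
]

def retrieve(query: str, k: int = 2) -> list[str]:
    # Naive keyword overlap; production systems use embeddings and a vector index.
    query_words = set(query.lower().split())
    scored = sorted(DOCUMENTS, key=lambda d: -len(query_words & set(d.lower().split())))
    return scored[:k]

def answer(question: str) -> str:
    context = "\n".join(f"- {doc}" for doc in retrieve(question))
    prompt = (
        "Answer using only the context below. If the answer is not in the context, say so.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )
    return call_llm(prompt)

print(answer("How long do customers have to request a refund?"))
```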
While role prompting assigns a persona to the model, the Persona Pattern involves describing the user or target audience for the model's output. This helps the model tailor its response in terms of language, complexity, tone, and information type.
Example:
You are explaining quantum physics. The target audience is a high school student with no prior knowledge. Explain it simply and use analogies they might understand.
Google's AI "Gems" represent a user-configurable feature within Gemini. Each "Gem" functions as a specialized instance of the core AI, tailored for specific, repeatable tasks through explicit instructions.
This allows creation of highly specialized AI agents for focused applications—a code interpreter referencing specific libraries, a data analyst generating summaries without speculation, or a translator adhering to a formal style guide.
A "meta" application where AI assists in optimizing instructions given to AI. Provide an LLM with an existing prompt, the task, and examples of current output, then prompt it to analyze and suggest improvements.
Benefits:
A second model pass can surface ambiguities the original author missed, propose concrete rewrites quickly, and make prompt review repeatable across a large prompt library.
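A minimal meta-prompt sketch; the task, existing prompt, and sample output are illustrative placeholders pasted into a template the reviewing model receives:

```python
# Illustrative inputs for a prompt-review request.
existing_prompt = "Summarize the ticket."
task = "Produce a two-sentence summary of a customer support ticket for a triage dashboard."
sample_output = "The customer is angry. Something about billing."

meta_prompt = f"""You are reviewing a prompt used in production.

Task the prompt should accomplish: {task}
Current prompt: {existing_prompt}
Example of its current output: {sample_output}

List the weaknesses of the current prompt, then propose an improved version."""

print(meta_prompt)  # send to the model and review its suggestions before adopting them
```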
Language models trained on large code datasets can be powerful assistants for developers. Various use cases exist:
Writing Code
Generate code snippets or functions based on functionality descriptions
Explaining Code
Provide code and ask for line-by-line or summary explanations
Translating Code
Convert code from one programming language to another
Debugging Code
Identify issues, suggest fixes, or provide refactoring suggestions
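As an illustration of the debugging use case above, a prompt can pair the broken snippet with a precise description of the failure and the desired fix (the snippet and wording are illustrative):

```python
buggy_snippet = """
def average(values):
    return sum(values) / len(values)  # raises ZeroDivisionError on an empty list
"""

prompt = (
    "The following Python function crashes for some inputs.\n"
    "Explain the bug, then provide a corrected version that returns None for empty input.\n\n"
    + buggy_snippet
)
print(prompt)  # send to the model; review the suggested fix before applying it
```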
The field is rapidly moving towards multimodal models that can process and generate information across different modalities (text, images, audio, video). Multimodal prompting involves using a combination of inputs to guide the model.
Example: Providing an image of a diagram and asking the model to explain the process shown, or providing an image and requesting a descriptive caption.
Becoming a skilled prompt engineer is an iterative process involving continuous learning and experimentation. Several best practices are especially valuable:
Provide Examples
One or few-shot examples are among the most effective ways to guide the model
Design with Simplicity
Keep prompts concise, clear, and easy to understand
Be Specific
Clearly define desired format, length, style, and content
Use Instructions Over Constraints
Tell the model what to do rather than what not to do
Experiment with Formats
Try different input formats and writing styles
Document Attempts
Track what works, what doesn't, and why
Save Prompts in Codebases
Store prompts in separate, well-organized files
Automated Testing
Implement tests to monitor prompt performance
Core Principles
Clarity, conciseness, action verbs
Basic Techniques
Zero-shot, one-shot, few-shot
Reasoning
CoT, self-consistency, ToT
Advanced
APE, RAG, ReAct, Gems
Prompt engineering is a discipline: It requires systematic experimentation, documentation, and iteration to consistently elicit high-quality responses
Structure matters: Use delimiters, system prompts, role definitions, and context engineering to provide clear operational frameworks
Reasoning techniques unlock potential: CoT, self-consistency, and ToT transform models from pattern matchers into logical reasoners
Action-oriented prompting enables agents: ReAct and function calling bridge the gap between digital thought and real-world interaction
Context is king: RAG and context engineering ground responses in factual, up-to-date information, reducing hallucinations
Structured output ensures reliability: JSON schemas and validation (Pydantic/Zod) turn free-form model text into validated, type-safe data
Advanced techniques scale capabilities: APE, Google Gems, and LLM-assisted prompt refinement automate optimization and enable specialized agents
Mastery requires practice: Effective prompting improves through experimentation, documentation, collaboration, and continuous learning