Prompt Templates and System Insights for Google's AI Co-Scientist
8 minutes
Scientific discovery is complex, often requiring innovative hypotheses and rigorous evaluation. Our earlier discussion on Google's AI co-scientist highlighted a breakthrough multi-agent system designed to augment human creativity in research.
In this post, we break down the prompt templates that empower the co-scientist's specialized agents. These detailed templates, as described in the original research paper, form the backbone of a system that continuously generates, debates, and refines hypotheses in a scientist-in-the-loop framework.
Background: Why an AI Co-Scientist?
Traditional research methods can be time-consuming, with experts manually sifting through literature to generate hypotheses. Google's AI co-scientist addresses this challenge by:
- Automating literature synthesis: It processes vast amounts of scientific literature to generate ideas.
- Facilitating collaborative debates: The system simulates discussions among domain experts.
- Iterative improvement: Through structured feedback loops, it refines hypotheses over multiple iterations.
This framework is built upon a multi-agent architecture where each agent has a specialized role, and the following sections provide a detailed look at the prompt templates that guide their operations.
Specialized Agent Prompts: Detailed Templates and Examples
The original paper describes a set of prompt templates for each specialized agent. Here, we break down these prompts and their purpose.
1. Generation Agent
Purpose
The Generation Agent initiates the hypothesis formulation process. It synthesizes insights from literature reviews and, in some cases, engages in simulated debates to produce novel, robust hypotheses.
Templates
- After Literature Review:

Prompt
You are an expert tasked with formulating a novel and robust hypothesis to address the following objective.
Describe the proposed hypothesis in detail, including specific entities, mechanisms, and anticipated outcomes.
This description is intended for an audience of domain experts.
You have conducted a thorough review of relevant literature and developed a logical framework for addressing the objective. The articles consulted, along with your analytical reasoning, are provided below.
Goal: {goal}
Criteria for a strong hypothesis: {preferences}
Existing hypothesis (if applicable): {source_hypothesis}
{instructions}
Literature review and analytical rationale (chronologically ordered, beginning with the most recent analysis): {articles_with_reasoning}
Proposed hypothesis (detailed description for domain experts):
This template helps make sure the hypothesis is based on the latest research and sound logic.
- After Scientific Debate:

Prompt
You are an expert participating in a collaborative discourse concerning the generation of a {idea_attributes} hypothesis. You will engage in a simulated discussion with other experts.
The overarching objective of this discourse is to collaboratively develop a novel and robust {idea_attributes} hypothesis.
Goal: {goal}
Criteria for a high-quality hypothesis: {preferences}
Instructions: {instructions}
Review Overview: {reviews_overview}
Procedure: Initial contribution (if initiating the discussion): Propose three distinct {idea_attributes} hypotheses.
Subsequent contributions (continuing the discussion):
- Pose clarifying questions if ambiguities or uncertainties arise.
- Critically evaluate the hypotheses proposed thus far, addressing the following aspects:
- Adherence to {idea_attributes} criteria.
- Utility and practicality.
- Level of detail and specificity.
- Identify any weaknesses or potential limitations.
- Propose concrete improvements and refinements to address identified weaknesses.
- Conclude your response with a refined iteration of the hypothesis.
General guidelines:
- Exhibit boldness and creativity in your contributions.
- Maintain a helpful and collaborative approach.
- Prioritize the generation of a high-quality {idea_attributes} hypothesis.
Termination condition: When sufficient discussion has transpired (typically 3-5 conversational turns, with a maximum of 10 turns) and all relevant questions and points have been thoroughly addressed and clarified, conclude the process by writing "HYPOTHESIS" (in all capital letters) followed by a concise and self-contained exposition of the finalized idea.
#BEGIN TRANSCRIPT# {transcript} #END TRANSCRIPT#
Your Turn:
This version encourages collaborative improvement, ensuring the hypothesis stands up to simulated peer review.
2. Reflection Agent
Purpose
The Reflection Agent acts as a peer reviewer. It examines the generated hypotheses by comparing them against observed data and existing literature, ensuring the hypothesis is both novel and testable.

Prompt
You are an expert in scientific hypothesis evaluation. Your task is to analyze the relationship between a provided hypothesis and observations from a scientific article.
Specifically, determine if the hypothesis provides a novel causal explanation for the observations, or if they contradict it.
Instructions:
- Observation extraction: list relevant observations from the article.
- Causal analysis (individual): for each observation: a. State if its cause is already established. b. Assess if the hypothesis could be a causal factor (hypothesis => observation). c. Start with: "would we see this observation if the hypothesis was true:". d. Explain if it's a novel explanation. If not, or if a better explanation exists, state: "not a missing piece."
- Causal analysis (summary): determine if the hypothesis offers a novel explanation for a subset of observations. Include reasoning. Start with: "would we see some of the observations if the hypothesis was true:".
- Disproof analysis: determine if any observations contradict the hypothesis. Start with: "does some observations disprove the hypothesis:".
- Conclusion: state: "hypothesis: already explained, other explanations more likely, missing piece, neutral, or disproved".
Scoring:
- Already explained: hypothesis consistent, but causes are known. No novel explanation.
- Other explanations more likely: hypothesis could explain, but better explanations exist.
- Missing piece: hypothesis offers a novel, plausible explanation.
- Neutral: hypothesis neither explains nor is contradicted.
- Disproved: observations contradict the hypothesis.
Important: if observations are expected regardless of the hypothesis, and don't disprove it, it's neutral.
Article: {article}
Hypothesis: {hypothesis}
Response: provide reasoning. end with: "hypothesis: already explained, other explanations more likely, missing piece, neutral, or disproved"
3. Ranking Agent
Purpose
The Ranking Agent compares multiple hypotheses to prioritize the most promising ideas. It uses an Elo-based tournament system to perform pairwise comparisons.
Templates
- Direct Comparison:

Prompt
You are an expert evaluator tasked with comparing two hypotheses.
Evaluate the two provided hypotheses and determine which one is superior based on the specified {idea_attributes}.
Provide a concise rationale for your selection, concluding with the phrase "better idea: 1 or 2".
Goal: {goal}
Evaluation criteria: {preferences}
Considerations: {notes}
Each hypothesis includes an independent review. These reviews may contain numerical scores. Disregard these scores in your comparative analysis, as they may not be directly comparable across reviews.
Hypothesis 1: {hypothesis_1}
Hypothesis 2: {hypothesis_2}
Review of hypothesis 1: {review_1}
Review of hypothesis 2: {review_2}
Reasoning and conclusion (end with "better hypothesis: 1 or 2"):
- Simulated Debate:

Prompt
You are an expert in comparative analysis, simulating a panel of domain experts engaged in a structured discussion to evaluate two competing hypotheses.
The objective is to rigorously determine which hypothesis is superior based on a predefined set of attributes and criteria.
The experts possess no pre-existing biases toward either hypothesis and are solely focused on identifying the optimal choice, given that only one can be implemented.
Goal: {goal}
Criteria for hypothesis superiority: {preferences}
Hypothesis 1: {hypothesis_1}
Hypothesis 2: {hypothesis_2}
Initial review of hypothesis 1: {review_1}
Initial review of hypothesis 2: {review_2}
Debate procedure: The discussion will unfold in a series of turns, typically ranging from 3 to 5, with a maximum of 10.
Turn 1: begin with a concise summary of both hypotheses and their respective initial reviews.
Subsequent turns:
- Pose clarifying questions to address any ambiguities or uncertainties.
- Critically evaluate each hypothesis in relation to the stated Goal and Criteria.
This evaluation should consider aspects such as:
- Potential for correctness/validity.
- Utility and practical applicability.
- Sufficiency of detail and specificity.
- Novelty and originality.
- Desirability for implementation.
- Identify and articulate any weaknesses, limitations, or potential flaws in either hypothesis.
Additional notes: {notes}
Termination and judgment: Once the discussion has reached a point of sufficient depth (typically 3-5 turns, up to 10 turns) and all relevant questions and concerns have been thoroughly addressed, provide a conclusive judgment. This judgment should succinctly state the rationale for the selection. Then, indicate the superior hypothesis by writing the phrase "better idea: ", followed by "1" (for hypothesis 1) or "2" (for hypothesis 2).
4. Evolution Agent
Purpose
The Evolution Agent refines hypotheses, enhancing their feasibility and practical implementation. It employs creative strategies to address weaknesses and combine strong elements from multiple hypotheses.
Templates
- Feasibility Improvement:

Prompt
You are an expert in scientific research and technological feasibility analysis.
Your task is to refine the provided conceptual idea, enhancing its practical implementability by leveraging contemporary technological capabilities. Ensure the revised concept retains its novelty, logical coherence, and specific articulation.
Goal: {goal}
Guidelines:
- Begin with an introductory overview of the relevant scientific domain.
- Provide a concise synopsis of recent pertinent research findings and related investigations, highlighting successful methodologies and established precedents.
- Articulate a reasoned argument for how current technological advancements can facilitate the realization of the proposed concept.
- CORE CONTRIBUTION: Develop a detailed, innovative, and technologically viable alternative to achieve the objective, emphasizing simplicity and practicality.
Evaluation Criteria: {preferences}
Original Conceptualization: {hypothesis}
Response:
- Out-of-the-Box Thinking:

Prompt
You are an expert researcher tasked with generating a novel, singular hypothesis inspired by analogous elements from provided concepts.
Goal: {goal}
Instructions:
- Provide a concise introduction to the relevant scientific domain.
- Summarize recent findings and pertinent research, highlighting successful approaches.
- Identify promising avenues for exploration that may yield innovative hypotheses.
- CORE HYPOTHESIS: Develop a detailed, original, and specific single hypothesis for achieving the stated goal, leveraging analogous principles from the provided ideas. This should not be a mere aggregation of existing methods or entities. Think out-of-the-box.
Criteria for a robust hypothesis: {preferences}
Inspiration may be drawn from the following concepts (utilize analogy and inspiration, not direct replication): {hypotheses}
Response:
5. Meta-Review Agent
Purpose
The Meta-Review Agent synthesizes feedback from all previous reviews and debates. It provides a high-level analysis to guide future iterations and consolidate the best ideas into a comprehensive research overview.

Prompt
You are an expert in scientific research and meta-analysis.
Synthesize a comprehensive meta-review of provided reviews pertaining to the following research goal:
Goal: {goal}
Preferences: {preferences}
Additional instructions: {instructions}
Provided reviews for meta-analysis: {reviews}
Instructions:
- Generate a structured meta-analysis report of the provided reviews.
- Focus on identifying recurring critique points and common issues raised by reviewers.
- The generated meta-analysis should provide actionable insights for researchers developing future proposals.
- Refrain from evaluating individual proposals or reviews; focus on producing a synthesized meta-analysis.
Response:
Conclusion
These detailed prompt templates from the original AI co-scientist paper exemplify how structured natural language instructions can guide each specialized agent to collaboratively generate, critique, and refine scientific hypotheses. By embedding these prompts into a multi-agent system, the AI co-scientist accelerates the hypothesis-generation process and ensures that the outputs are both novel and experimentally testable.
For those interested in a deeper technical understanding, we encourage you to refer to the original paper's Appendix A.2 and A.3, which provide a comprehensive view of the prompts and examples.
Valeriia Kuka
Valeriia Kuka, Head of Content at Learn Prompting, is passionate about making AI and ML accessible. Valeriia previously grew a 60K+ follower AI-focused social media account, earning reposts from Stanford NLP, Amazon Research, Hugging Face, and AI researchers. She has also worked with AI/ML newsletters and global communities with 100K+ members and authored clear and concise explainers and historical articles.