Jump to content

Deep research

From UBC Wiki
Cox & Mazumdar (2024). Definitions of artificial intelligence

Compiled by

Updated

See also

Introduction - What is AI "deep research"?

Deep research (sometimes called "deepsearch," "iterative deep research," or "research mode") refers to an advanced capability in modern large language models (LLMs) where the AI tool or system does not give an instant answer from its trained knowledge alone. Instead, it breaks a user’s question into sub-questions or research tasks.

Other features characteristic of deep research tools include:

  • autonomously searching the live web (and/or specialized databases in real time;
  • reading and analyzing dozens or hundreds of sources (academic papers, news, reports, primary documents, etc.).
  • "reason"ing over conflicting information, evaluating source credibility, and synthesizing its findings;
  • iterating and pouring over information, and searches, to dig deeper until it reaches a comprehensive, evidence-based answer;
  • delivering a heavily cited transparent report (however, it must be said not always accurate; see hallucinations in glossary).

Examples of this capability today (December 2025):

  • OpenAI’s “Deep Research” mode in ChatGPT (rolled out late 2024/early 2025)
  • Anthropic’s Claude with extended web search + reasoning loops
  • Perplexity’s “Pro Search” and similar features
  • Grok’s deep-dive research mode (when enabled)
  • Google’s Gemini Advanced “Deep Research” reports

(See OpenAI Deep Research, Google’s Gemini Deep Research and Perplexity’s Deep Research (renamed Perplexity Labs).

Some features of deep research

  • Multi-step reasoning models: breaks down complex questions into manageable parts, iteratively refining queries and validating conclusions;
  • Source synthesis: integrates information from multiple sources to identify patterns, contradictions, and gaps;
  • Advanced reasoning: using AI models with capabilities to PhD-level analysis to interpret data and generate high-level insights;
  • Citation and transparency: aims to provide source citations to ensure credibility and verifiability.

AI-powered deep research systems, such as those developed by OpenAI, Google’s Gemini, or Perplexity, automate these processes, significantly reducing research time (e.g., from days to minutes) while maintaining accuracy. These systems can take 5–30 minutes per query (not seconds) because they are literally performing human-level literature review and investigative work.

Examples of deep, reasoning models

  • OpenAI's O3 and O4 Mini:: generative pre-trained transformer (GPT) models as successors to OpenAI o1 for ChatGPT; designed for step-by-step problem-solving; effective in technical domains such as science and programming; can utilize external tools for enhanced functionality; designed to devote additional deliberation time when addressing questions that require step-by-step logical reasoning. See Wikipedia entry.
  • Google's Gemini 2.5: can process various data types; self-fact-checking capabilities; generates applications and games, etc.
  • Claude 4.1 Opus: maintains context over long conversations; excels in open-ended reasoning and provides nuanced responses.
  • Grok xAI released Grok 4 and 4 Heavy; xAI claims its model outperforms rival models in benchmark tests; generates controversial responses, including conspiracy theories and antisemitism.
  • DeepSeek‑R1: designed to tackle challenging queries that require thorough analysis and structured solutions; used in complex coding challenges or detailed logical puzzles.

OpenAI's deep research model

In 2025, Nature published a story entitled, OpenAI’s "deep research" tool: is it useful for scientists? reporting that OpenAI's deep research model produces pages-long reports that are well-cited and might help perform literature reviews. As it turned out, the pay-for-access platform called ‘OAI deep research’, synthesizes information from hundreds of websites into cited reports, and follows a similar path as Google, called ‘Deep Research’, released in December 2024. Both act as personal assistants doing the equivalent of hours of work in tens of minutes.

While scientists were impressed with OpenAI's deep research tool, others were less enthusiastic. “If a human did this I would be like: ‘This needs a lot of work’,” Kyle Kabasares, a data scientist at the Bay Area Environmental Research Institute, said....

See librarians' points of view

See Aaron Tay's August 8th blogpost on Academic Deep Research.

  • "...AI tools capable of deep research synthesis presents both challenges and opportunities for academic libraries and librarians. These technologies have the potential to significantly alter established research workflows, bringing the tools of multisource synthesis research to tasks and questions that previously did not merit the work."
  • Deep research, in the context of artificial intelligence (AI), refers to AI models' ability to conduct extensive online research, synthesize information from various sources, and generate comprehensive reports, with citations. DR is a form of web browsing where AI acts as a researcher, exploring the web, analyzing information, and producing detailed analyses.
  • Also: Deep research is agentic AI integrated into ChatGPT by OpenAI which generates cited reports on a user-specified topic by autonomously browsing the web for 5 to 30 minutes.

More on deep research

Deep research in medicine increasingly relies on artificial intelligence (AI) and semantic searching technologies to navigate the vast and rapidly growing body of biomedical literature. Traditional keyword-based searching often misses relevant studies because it cannot account for synonyms, abbreviations, or the nuanced ways clinical concepts are expressed. Semantic search, powered by AI and natural language processing (NLP), addresses this gap by interpreting the intent and contextual meaning of search queries, linking related terms such as heart attack, myocardial infarction, and acute coronary syndrome. This capability allows researchers and clinicians to uncover connections across diverse sources, identify hidden patterns, and integrate findings more efficiently. As a result, semantic searching enhances deep research by enabling more comprehensive evidence discovery, supporting systematic reviews, precision medicine, and informed clinical decision-making.

How do I know it's deep research?

Deep research is prominently located in some interfaces, and a default option for emerging models. Academic libraries have been a target for new AI technologies since the worldwide emergence of GenAI. “Deep Research”, “Research Plan”, and “Research Report”, are deliberate terms, and language appealing to academics. The word “deep” (with “research”) pops up everywhere in AI: DeepSeek, DeepMind (Google), ChatGPT Deep Research (OpenAI) – DeepPiXEL, DeepNeural AI, DeepX, etc.

Any (all?) deep search using AI returns a mix of credible and not credible sources from the web including articles, websites, and blogs. It's not the random sources that must be evaluated that is the issue, but that you cede control to the bots, limiting your autonomy. For all their flaws, search engines allow you to explore, which involves exploring the scope and constraints of your research question, discovering new language, and following unique pathways and ideas. Deep research creates a research plan, but it's not a feasible plan for academics. It’s a list of how a bot will search; in contrast, creating a research plan is key to learning, an exploration of your question and not a search for answers.

Established pathways for research allow for the possibility that you won't find anything, or that your research question will change and evolve.

Impact on biomedicine

According to Wang et al (2025) scientific research can only be compared and reproduced by strictly following fixed methods, which are relatively easy for AI to imitate and learn. Basically, current AI can speed up the retrieval and summarization of literature on one hand, and facilitate the comparison and discussion of results on the other. For dry lab research, especially text-based ones, such as systematic review, meta-analysis, bibliometric study, AI may replace them efficiently. With the dawn of a new era, the future development and face of academia is full of curiosity. Undoubtedly, the crucial part of research will continue to be creative originality and critical thinking.

Current AI products that combine deep thinking and deep search with real-time web and database access, focus on peer-reviewed research, offering custom AI workflows. Emerging platforms in the biomedical space are Elicit.com and Undermind.ai.

Examples of GenAI "Deep Research" features

References

Disclaimer

  • Note: Please use your critical reading skills while reading entries. No warranties, implied or actual, are granted for any health or medical search or AI information obtained while using these pages. Check with your librarian for more contextual, accurate information.