AI for Legal Research
The market has consolidated around four products. Stanford's peer-reviewed benchmark found leading legal RAG tools still hallucinate on as many as one in three queries.
The Four Products That Dominate Legal Research AI
Each uses retrieval-augmented generation (RAG): retrieve relevant cases, then synthesize an answer with citations.
Westlaw AI / CoCounsel
Westlaw AI-Assisted Research, branded CoCounsel after the 2023 Casetext acquisition. Deeply integrated with the Westlaw research workflow.
Lexis+ AI
Lexis+ research platform with GenAI summarization, drafting, and Q&A grounded in Lexis content.
Harvey
Startup built on OpenAI infrastructure, widely deployed at AmLaw firms for drafting, research, and matter workflows.
Vincent AI
vLex's AI research product, with strong international content coverage in addition to U.S. cases.
Adoption Is Outrunning Governance
Thomson Reuters Institute 2025 Generative AI in Professional Services Report
The Stanford Hallucination Finding the Industry Does Not Want to Talk About
Stanford RegLab / HAI's "Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools" (May 2024 preprint; peer-reviewed in Journal of Empirical Legal Studies, 2025) tested the major products against a legal-query benchmark.
Lexis+ AI and Westlaw AI-Assisted Research / Ask Practical Law AI hallucinated between 17% and 33% of the time, despite vendors' 'hallucination-free' marketing claims. Lexis+ AI answered 65% of queries accurately; Westlaw AI-Assisted Research was accurate only 42% of the time and hallucinated nearly twice as often as the other tools tested.
The companion paper Hallucinating Law (Stanford, January 2024) found general-purpose LLMs (GPT-3.5, PaLM-2, Llama-2) hallucinated on 58–82% of legal queries — context that explains why RAG-based tools are an improvement, but does not justify 'no hallucination' marketing.
Why RAG Does Not Eliminate Hallucinations
Retrieval grounds the answer in real sources, but the generation step can still invent. Four failure patterns recur:
Right Case, Wrong Holding
Cites a real case but mis-states its holding.
Why it matters: Hardest to catch — citation passes superficial verification, but the proposition is invented.
Real Case, Unsupported Proposition
Cites a real case for a proposition the case does not support.
Why it matters: Requires reading the actual opinion, not just confirming the citation exists.
Synthesized Rule From Multiple Cases
Combines holdings from multiple cases into a synthesized rule that no single case supports.
Why it matters: Looks like sophisticated legal reasoning; functions like a fabricated authority.
Outdated Statutory Version
Cites the right statute but quotes or applies an outdated version rather than the current effective text.
Why it matters: Especially dangerous on amended or recently revised codes.
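The first two patterns above show why citation-existence checks are not enough. A minimal illustrative sketch (the case, citation, and holdings are entirely made up) of the gap between verifying that a cited case exists and verifying that it supports the claimed proposition:

```python
# Illustrative sketch with hypothetical data: a citation can pass a
# superficial existence check while the attributed holding is invented.

# A tiny mock reporter: citation -> actual holding text (fictional case).
REPORTER = {
    "Smith v. Jones, 123 F.3d 456": (
        "summary judgment requires no genuine dispute of material fact"
    ),
}

def citation_exists(citation: str) -> bool:
    """Superficial check: does the cited case exist at all?"""
    return citation in REPORTER

def proposition_supported(citation: str, proposition: str) -> bool:
    """Deeper check: does the opinion actually state the claimed rule?
    (A real workflow means reading the opinion, not string matching.)"""
    holding = REPORTER.get(citation, "")
    return proposition.lower() in holding.lower()

# An AI answer citing a real case for a holding the case never stated
# ("right case, wrong holding"):
citation = "Smith v. Jones, 123 F.3d 456"
claimed = "punitive damages are unavailable in contract actions"

print(citation_exists(citation))                 # True: passes superficial check
print(proposition_supported(citation, claimed))  # False: the holding is invented
```

The asymmetry between the two checks is the point: the first is what a hurried reviewer does; the second is what Rule 3.3 candor actually requires.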
What a Safe Legal Research Workflow Looks Like
Five-layer defense that satisfies Rule 1.1 competence and Rule 3.3 candor under ABA Op. 512
Use enterprise-grade legal research AI
Westlaw AI, Lexis+ AI, Harvey, vLex — not general-purpose LLMs. The hallucination rate is materially lower (though not zero).
Verify every citation
Pull the case. Read the holding. Confirm the proposition the AI attributed to the case is actually in the case.
Verify the citation form
Check that pinpoint citations are accurate and the case is still good law.
Run secondary-source corroboration
For a novel proposition, if the AI is the only source asserting it, treat it as untrusted until corroborated by a secondary source.
Document the verification workflow
Rule 5.1 / 5.3 supervision requires a record of the verification; Rule 1.1 competence requires that you be able to show how you used the tool.
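The documentation layer above can be made concrete as a per-query verification record. A minimal sketch, with an assumed schema (all field names and the `complete()` rule are illustrative, not a prescribed standard):

```python
# Illustrative sketch (hypothetical schema): a minimal verification log
# entry supporting the Rule 5.1 / 5.3 documentation step.
from dataclasses import dataclass, field
from datetime import date

@dataclass
class CitationCheck:
    citation: str
    opinion_read: bool       # holding confirmed by reading the case
    pinpoint_verified: bool  # page/paragraph cite accurate
    good_law: bool           # citator check (e.g., not overruled)

@dataclass
class VerificationRecord:
    matter: str
    tool: str                # e.g., "Westlaw AI", "Lexis+ AI"
    query: str
    checks: list = field(default_factory=list)
    corroborated: bool = False  # secondary-source corroboration done
    reviewer: str = ""
    reviewed_on: date = field(default_factory=date.today)

    def complete(self) -> bool:
        """All citations fully verified and corroboration recorded."""
        return bool(self.checks) and self.corroborated and all(
            c.opinion_read and c.pinpoint_verified and c.good_law
            for c in self.checks
        )

rec = VerificationRecord(
    matter="2025-014", tool="Lexis+ AI",
    query="standard for preliminary injunction, 9th Cir.",
    reviewer="A. Associate",
)
rec.checks.append(CitationCheck("Winter v. NRDC, 555 U.S. 7", True, True, True))
rec.corroborated = True
print(rec.complete())  # True
```

A record like this gives a supervising partner something auditable: which tool ran the query, which citations were pulled and read, and who signed off.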
Tool Selection Criteria
How firms compare research AI products before broad deployment
Hallucination Rate
Stanford's benchmarks are public; firms can also run internal benchmarks on their typical query types.
Vendor Terms
No training on firm inputs, customer-controlled retention, and a BAA (business associate agreement) where the firm has health-related clients.
Integration
Does the tool fit existing research workflows, or does it require attorneys to context-switch?
Citation Discipline
Does the tool reliably pinpoint-cite, or does it generate citation-style strings that need separate verification?
Cost vs. Traditional Research
Per-attorney pricing varies widely; ROI depends on actual usage and substitution effects.
Confidentiality Terms
For firms handling sensitive client information, contractual terms must support Rule 1.6 without per-matter consent.
AI for Legal Research — FAQ
Do Westlaw AI and Lexis+ AI hallucinate?
Yes. Stanford RegLab's peer-reviewed 2024 study found Lexis+ AI and Westlaw AI-Assisted Research hallucinated between 17% and 33% of the time on a benchmark of legal queries, despite vendors' 'hallucination-free' marketing claims. Lexis+ AI was accurate on 65% of queries; Westlaw AI-Assisted Research on only 42%.
Is legal-specific AI safer than ChatGPT for legal research?
Materially safer, but not safe. Stanford's earlier companion study found general-purpose LLMs (GPT-3.5, PaLM-2, Llama-2) hallucinated on 58–82% of legal queries — far worse than legal-specific RAG tools. But neither is at a level that permits skipping citation verification.
How fast is GenAI adoption growing in law firms?
Thomson Reuters Institute's 2025 survey found 26% of legal organizations were actively using GenAI in 2025, up from 14% in 2024 — nearly doubled in one year. 78% of law-firm respondents expect GenAI to be central to their workflow within five years.
Why doesn't RAG eliminate hallucinations?
Retrieval-augmented generation grounds the AI in real legal sources, but the generation step still synthesizes prose that may misstate holdings, cite cases for propositions they do not support, combine holdings from multiple cases into invented rules, or use outdated statutory versions. RAG reduces but does not eliminate hallucination risk.
What does ABA Op. 512 require for legal research AI?
Lawyers must gain a reasonable understanding of the tool (Rule 1.1), verify output before filing (Rule 3.3), supervise associates using the tool (Rule 5.1) and nonlawyer staff using it (Rule 5.3), and bill honestly for AI-assisted research time (Rule 1.5).
Related Resources
Continue across the silo or bridge to a core hub
AI Hallucinations in Legal Practice
Five sanctioning orders and the two-layer prevention workflow
ABA Formal Opinion 512
Rule 1.1 competence and Rule 3.3 candor as applied to research AI
Attorney-Client Privilege and AI
When research queries containing client information cross the Rule 1.6 line
Multi-Model AI Access
Enterprise tooling with the contractual terms research AI demands
Shadow AI Hub
Why blocking consumer LLMs alone does not solve the research-AI problem
Govern Your Firm's Legal Research AI Use
The free Shadow AI Risk Check audits your tool selection, your verification workflow, and your Rule 1.1 / 3.3 documentation.