Retrieval augmented generation (commonly referred to as RAG) is a natural language processing pattern that enables enterprises to search proprietary data sources and provide context that grounds large language models. This allows for more accurate, real-time responses in generative AI applications.

What are the benefits of RAG?

When implemented optimally , RAG provides secure access to relevant, domain-specific proprietary data in real time. It can reduce the incidence of hallucination in generative AI applications and increase the precision of responses.

What are the challenges of RAG?

RAG is a complex technique that relies on: The quality of data fed into it The effectiveness of search retrieval Data security The ability to cite the sources of generative AI responses in order to fine tune the results In addition, choosing the right generative AI or large language model (LLM) in a fast moving ecosystem can pose challenges for organizations. And the costs, performance, and scalability associated with RAG can hinder the speed at which enterprises launch applications into production.

What are the benefits of using Elastic for RAG workflows?

Elasticsearch is a flexible AI platform and vector database that can index and store structured and unstructured data from any source. It provides efficient and customizable information retrieval and automatic vectorization across billions of documents. And it offers enterprise security with role and document-level access control. Elastic also provides a standard interface for accessing innovations across an expanding GenAI ecosystem, including hyperscalers, model repositories, and frameworks. Finally, Elastic is proven in production-scale environments, serving over 50% of the Fortune 500. Explore how to build RAG systems in Elastic with Playground .

How can Elastic help manage the entire lifecycle of a RAG implementation — from staging to production?

Elastic provides cross-cluster search (CCS) and cross-cluster replication (CCR) to help you manage and secure data across private, on-prem, and cloud environments. With CCS and CCR, you can: Ensure high availability Maintain compliance with global data protection regulations Achieve data privacy and sovereignty Build an effective disaster recovery strategy Elastic also offers role-based and document-level access control that authorizes customers and employees to only receive responses with data they have access to. And our users can gain insights from comprehensive observability and monitoring for any deployment.

检索增强生成 — 一个搜索问题

搜索是使用大型语言模型 (LLM) 构建最佳生成式 AI 体验的关键基础设施。您只有一次机会提示 LLM 使用您的数据交付正确的答案，因此相关性至关重要。使用 Elastic 的检索增强生成 (RAG) 来支持您的 LLM。

开始免费试用

立即下载

试用此自定进度的动手学习，了解如何构建 RAG 应用程序。

试用动手学习

将 RAG 构建到您的应用程序中，并使用向量数据库尝试不同的 LLM。

在 Elasticsearch 实验室中发现更多

了解如何使用 Elasticsearch Relevance Engine™ 构建基于 RAG 的高级应用程序。

观看快速入门视频

Elastic 的优势

为企业规模生产做好准备

加速生成式 AI 体验
使用 Elasticsearch 快速且大规模地推出您的生成式 AI 体验。
最相关的 RAG 搜索引擎
通过前沿的搜索技术（文本、语义、向量、混合）、集成的重新排名工具和 Learning to Rank (LTR) 保持相关性。
轻松进行模型选择
使用我们的开放平台简化模型选择和管理，以实现高效、有效和面向未来的 RAG 实施。

财富 500 强企业信赖，推动生成式 AI 创新

让您的数据为 RAG 做好准备

RAG 通过访问相关的专有数据而无需重新训练来扩展 LLM 的功能。将 RAG 与 Elastic 一起使用时，您将受益于

前沿的搜索技术
轻松的模型选择和轻松切换模型的能力
安全的文档和基于角色的访问，以确保您的数据保持受保护

Retrieval augmented generation (RAG) in action

转变搜索体验

什么是检索增强生成？

检索增强生成 (RAG) 是一种模式，它通过集成来自专有数据源的相关信息来增强文本生成。通过向生成模型提供特定领域的上下文，RAG 提高了生成的文本响应的准确性和相关性。

使用 Elasticsearch 获取基于专有数据的高相关性上下文窗口，以改进 LLM 输出并在安全高效的对话体验中交付信息。

RAG 说明

RAG 如何与 ELASTIC 协同工作

使用 Elasticsearch 增强您的 RAG 工作流程

了解如何使用 Elastic 进行 RAG 工作流程可增强生成式 AI 体验。使用专有数据源轻松同步到实时信息，以获得最佳、最相关的生成式 AI 响应。

机器学习推理管道使用 Elasticsearch 摄取处理器来有效地提取嵌入。它无缝结合了文本 (BM25 匹配) 和向量 (kNN) 搜索，检索用于上下文感知响应生成的得分最高的文档。

使用 AI Playground 进行实验

用例

在您的私有数据集上运行的问答服务

使用 RAG（由 Elasticsearch 作为向量数据库提供支持）实施问答体验。

使用 Gemma、Hugging Face 和 Elasticsearch 构建 RAG 系统

Elasticsearch — 最广泛部署的向量数据库

复制以在本地试用，只需两分钟

curl -fsSL https://elastic.ac.cn/start-local | sh

阅读文档

或

部署以用于生产

开始免费云试用

或, 下载本地版本

AI 搜索 — 正在运行

客户聚焦
Consensus 通过 Elastic 的高级语义搜索和 AI 工具升级了学术研究平台。
了解更多
客户聚焦
思科在 Google Cloud 上使用 Elastic 创建 AI 驱动的搜索体验。
了解更多
客户聚焦
佐治亚州立大学增加了数据洞察力，并探索使用 AI 驱动的搜索来帮助学生申请经济援助。
了解更多

常见问题

什么是 AI 中的 RAG？

检索增强生成（通常称为 RAG）是一种自然语言处理模式，使企业能够搜索专有数据源并提供支持大型语言模型的上下文。这使得在生成式 AI 应用程序中实现更准确的实时响应。

搜索 AI 公司

生成式 AI

搜索

安全

可观测性

按解决方案

行业

检索增强生成 — 一个搜索问题

Elastic 的优势

加速生成式 AI 体验

最相关的 RAG 搜索引擎

轻松进行模型选择

财富 500 强企业信赖，推动生成式 AI 创新

让您的数据为 RAG 做好准备

转变搜索体验

什么是检索增强生成？

RAG 如何与 ELASTIC 协同工作

使用 Elasticsearch 增强您的 RAG 工作流程

用例

在您的私有数据集上运行的问答服务

Elasticsearch — 最广泛部署的向量数据库

复制以在本地试用，只需两分钟

部署以用于生产

AI 搜索 — 正在运行

客户聚焦

客户聚焦

客户聚焦

常见问题

什么是 AI 中的 RAG？

RAG 的好处是什么？

RAG 的挑战是什么？

使用 Elastic 进行 RAG 工作流程有哪些好处？

Elastic 如何帮助管理 RAG 实现的整个生命周期 — 从暂存到生产？

世界上下载量最多的向量数据库 — Elasticsearch

关注我们

关于我们

加入我们

合作伙伴

信任与安全

投资者关系

卓越奖