Creating a RAG pipeline for Search with OpenSearch

Automatically generate code to set up and use an OCI Generative AI connector with Search with OpenSearch for an end-to-end a Retrieval-Augmented Generation (RAG) pipeline.

Before you start, review and implement the prerequisites for using an OCI Generative AI connector with Search with OpenSearch.

You might need to set up more of the network in order for the application to invoke the function that talks to the OpenSearch cluster. The network requires one of the following options:

For a public subnet, an internet gateway, and two rules in the security list: a stateful egress rule, and an ingress rule allowing TCP traffic to use port 9200.
For a private subnet, a service gateway, and a stateful egress rule in the security list.

For information on how to set up these options, see Creating the VCN and Subnets to Use with OCI Functions, if they don't exist already.

The application needs to be run from a VM instance within the same subnet as the cluster, or you can set up port forwarding to run the code locally. See Task 5: Query the OCI Search Service for examples of these options.

From the Console

Open the navigation menu and click Databases. Under OpenSearch, click Clusters.
Under List scope, select the compartment that contains the cluster.
In the Clusters list, click the name of the cluster that you want to create the RAG pipeline for.
On the cluster details page, click Create RAG pipeline.
On the Configure RAG pipeline page, in General Information, enter a name, description, and tag for the pipeline, and then specify one or more context fields.
Context fields specify the text that gets translated into embeddings for the index. Values specified here must match fields that exist in the index.
To use the functionality to automatically generate the code for the Generative AI connector from the Console, the cluster's password must be stored using a secret with the OCI Vault service. If the password is already stored as a Vault secret, specify the username in Cluster Vault credentials, and then select the vault, vault secret, and secret version for the cluster.
If the password isn't stored as a vault secret, select Create a vault and secret, and perform the following tasks:
1. Create a vault.
2. After the vault is active, create a key for the vault.
3. For the vault, create a secret with the following specifics:
  - Select the key that you created in the previous step.
  - Manually enter the password for the OpenSearch cluster with the following format:
    - Secret Type Template: Plain-Text
    - Secret Contents: <OpenSearch-password>
In Model group, enter a name and description for the model group. If you enter the name of an existing model group, the generated code uses the model group ID for the existing model group, otherwise a new model group is created.
Select Next.
On the Configure Generative AI connector page, in Generative AI connector, enter a name and description for the connector.
Select the action, and then select the model to use for the connector.
The fields in Gen AI model parameters section are populated with default values, based on the model you selected in the previous step. You can change the parameter values. You can only enter valid parameters, and allowed parameter values are based on the model you select. A parameter value that works for one model might not work if you select a different model.
In Gen AI register model, enter a name and description for the Generative AI model.
Select Next.

The Generate code page contains code you use to create the RAG pipeline, based on the options you specified on the previous pages in the in the Create RAG pipeline workflow. For standalone code that you can copy or download, select Java or Python from the Language dropdown. You can then copy or download the code in the first text area into an application.

If you select Kibana, the code generated can't be run as a standalone application. Instead, the generated code contains is split into sequential steps that you copy to run from the cluster's OpenSearch Dashboard.

The second text area contains template code showing how to perform queries after the pipeline is created.

Oracle Cloud Infrastructure Documentation

Creating a RAG pipeline for Search with OpenSearch

From the Console