Building a RAG Chatbot with Spring Using Yellowbrick as a Vector Store

Overview

If you are a Java developer, you have probably used libraries from the Spring ecosystem. Java developers love Spring because it simplifies the development of complex applications. Its comprehensive ecosystem includes many specialized frameworks that reduce the complexity of integrating third-party APIs, databases, messaging systems, and more.

Spring AI is the latest project within the Spring ecosystem, designed to make it easy for Java developers to integrate AI capabilities into their applications. The framework abstracts the complexities of connecting to various AI-provider APIs by using familiar Spring abstractions.

Spring AI supports common tasks, such as implementing chatbots and virtual assistants, content generation and summarization, sentiment analysis, customer feedback processing, image and text recognition, and decision support.

In this tutorial, we will explain how to create a chatbot by using Spring AI. First, we will build a simple chatbot by using OpenAI, then enhance it with Retrieval-Augmented Generation (RAG) for improved contextual accuracy.

The RAG technique improves accuracy by augmenting the chatbot's knowledge with external data, such as a product manual, beyond the model's training data.

The Process

The process begins when the model receives a query, such as a user question. Use a pre-trained language model to encode this query into a vector (a mathematical representation).

Next, the system searches for relevant information within an external knowledge base by using the encoded query. The retrieved documents are then appended to the original query, providing additional context.

A vector database is ideal for storing these mathematical representations. While Yellowbrick does not natively support vector data types, it can store vector data efficiently for retrieval.

Spring abstracts the logic for interfacing with the vector store, allowing developers to switch databases without significant code changes. We will explain this by first using pgVector (a PostgreSQL extension) and then switching to Yellowbrick.

Create the Chatbot

All code for the following steps is available on GitHub.

Prerequisites

JDK 17 or later
OpenAI developer key

Getting Started

Please perform the following steps:

Visit start.spring.io and create a new project. Select Java 17 and Maven. Add dependencies for Spring Web and OpenAI. Decompress the generated project and import it into your IDE.
Add your OpenAI key to application.properties:

properties

spring.ai.openai.api-key=<YOUR_OPENAI_KEY>
spring.main.web-application-type=none

Update your Spring application class:

java

@Bean
public CommandLineRunner runner(ChatClient.Builder builder) {
    return args -> {
        ChatClient chatClient = builder.build();
        String response = chatClient.prompt("Tell me a joke").call().content();
        System.out.println(response);
    };
}

Run the application:

bash

./mvnw spring-boot:run

Add pgVector as a Vector Store

Start a pgVector Docker container:

bash

docker run -it --rm -p 5432:5432 -e POSTGRES_USER=postgres -e POSTGRES_PASSWORD=postgres pgvector/pgvector:pg17

Add the pgVector dependency to pom.xml:

xml

<dependency>
  <groupId>org.springframework.ai</groupId>
  <artifactId>spring-ai-pgvector-store-spring-boot-starter</artifactId>
</dependency>

To read PDF files, add:

xml

<dependency>
  <groupId>org.springframework.ai</groupId>
  <artifactId>spring-ai-pdf-document-reader</artifactId>
</dependency>

Configure the database connection in application.properties:

properties

spring.application.name=springai
spring.datasource.url=jdbc:postgresql://localhost:5432/postgres
spring.datasource.username=postgres
spring.datasource.password=postgres
spring.ai.vectorstore.pgvector.initialize-schema=true
spring.ai.vectorstore.pgvector.max-document-batch-size=1000

Add Content to the Vector Store

We will use an Employee Handbook PDF for this example. Place it in src/main/resources/data.

Create a DataLoaderService class:

java

@Service
public class DataLoaderService {

  @Value("classpath:/data/Employee_Handbook.pdf")
  private Resource pdfResource;

  @Autowired
  private VectorStore vectorStore;

  @PostConstruct
  public void load() {
    PagePdfDocumentReader pdfReader = new PagePdfDocumentReader(pdfResource,
        PdfDocumentReaderConfig.builder()
            .withPageExtractedTextFormatter(ExtractedTextFormatter.builder()
                .withNumberOfBottomTextLinesToDelete(3)
                .withNumberOfTopPagesToSkipBeforeDelete(1)
                .build())
            .withPagesPerDocument(1)
            .build());

    var tokenTextSplitter = new TokenTextSplitter();
    vectorStore.accept(tokenTextSplitter.apply(pdfReader.get()));
  }
}

Query with Context from the Vector Store

Prompts

The prompt ensures contextually correct responses. It consists of the following:

Context: Text snippets from the knowledge base.
Query: The user's input.

Code

Please perform the following steps:

Create a ChatService:

java

@Service
public class ChatService {

  @Autowired
  @Qualifier("openAiChatModel")
  private ChatModel chatClient;

  @Autowired
  private VectorStore vectorStore;

  private final String PROMPT_BLUEPRINT = """
    Answer the query using the provided context:
    {context}
    Query:
    {query}
    If no context is available, respond with:
    "I'm sorry, I don't have the information you're looking for."
  """;

  public String chat(String query) {
    return chatClient.call(createPrompt(query, searchData(query)));
  }

  private String createPrompt(String query, List<Document> context) {
    PromptTemplate template = new PromptTemplate(PROMPT_BLUEPRINT);
    template.add("query", query);
    template.add("context", context);
    return template.render();
  }

  public List<Document> searchData(String query) {
    return vectorStore.similaritySearch(query);
  }
}

Add a controller:

java

@RestController
class AIController {

  private final ChatService chatService;

  AIController(ChatService chatService) {
    this.chatService = chatService;
  }

  @GetMapping("/chat")
  public Map<String, String> chat(@RequestParam String query) {
    return Map.of("answer", chatService.chat(query));
  }
}

java

@Bean
ChatClient chatClient(ChatClient.Builder builder) {
  return builder.build();
}

Remove spring.main.web-application-type=none from application.properties, then run the application.
Test with HTTPie, curl, or Postman:

bash

http -v GET localhost:8080/chat?query='What is the vacation policy'

The following Json statement shows the expected response:

json

{
  "answer": "Vacations are provided as a benefit..."
}

Switch to Yellowbrick as a Vector Store

Steps

Please perform the following steps:

Clone the repository:

bash

git clone https://github.com/YellowbrickData/yellowbrick-learn

Navigate to the vector store directory:

bash

cd yellowbrick-learn/samples/rags/vectorstore

Build the artifact:

bash

./mvnw install

Replace the pgVector dependency in pom.xml with:

xml

<dependency>
  <groupId>com.yellowbrick.spring</groupId>
  <artifactId>vectorstore</artifactId>
  <version>0.0.1-SNAPSHOT</version>
</dependency>

Update application.properties:

properties

spring.datasource.url=jdbc:postgresql://myinstance.aws.yellowbrickcloud.com:5432/<username>
spring.datasource.username=<username>
spring.datasource.password=<password>

Restart the application. Queries will now run on Yellowbrick.

Bonus: Switch to Amazon Bedrock

Steps

Please perform the following steps:

Enable the Titan model in your AWS account.
Replace the OpenAI dependency with Bedrock in pom.xml:

xml

<dependency>
  <groupId>org.springframework.ai</groupId>
  <artifactId>spring-ai-bedrock-ai-spring-boot-starter</artifactId>
</dependency>

Update application.properties:

properties

spring.ai.bedrock.titan.chat.enabled=true
spring.ai.bedrock.titan.embedding.enabled=true
spring.ai.bedrock.titan.embedding.model=amazon.titan-embed-text-v2:0
spring.ai.bedrock.aws.region=us-east-1
spring.ai.bedrock.aws.secret-key=<your_secret_key>
spring.ai.bedrock.aws.access-key=<your_access_key>

Rerun the application to see similar results.

Bulk Loading Tables

Bulk Load Examples

Running a Bulk Load

Loading Tables from Parquet Files

ybload Command

Load Data with SQL

Loading Data from Object Storage

Loading from Amazon S3

Loading from Azure Blob Storage

Loading Tables with Spark

Setting up and Running a Spark Job

Setting Up the ybrelay Service

Trickle Loading Data via JDBC

Unloading Data to Object Storage

Unloading Data to Parquet Files

ybunload Command

Installing ybtools

Setting Up a Database Connection

Configuring SSL/TLS for Tools and Drivers

Secure Connections for ODBC/JDBC Clients and ybsql

Appliance

Appliance: Disk Encryption

Setting Up Encrypted Drives

Remote Diagnostics

System Alerts

Creating an Alert Endpoint

Using the System Management Console

ybcli Reference

ybcli: config

Cloud

Configuring

Vanity DNS

Yellowbrick Manager

Installing

CLI Install Instructions

Public Install Instructions

Private Install Instructions

Self-Managed Install Instructions

Permissions

Kubernetes Guides

Databases

Backup & Restore

Overview

ybbackup Commands

ybbackupctl Commands

ybrestore Commands

Database Replication

Managing Replication

Setting Up Replication

Encrypting Sensitive Data

LDAP Integration

LDAP Authentication

Synchronizing Users and Groups

Metering

System Views

sys.lock

Workload Management

How WLM Works

Compatibility Parameters

Data Processing and Formatting

Feature Enablement

General

Tuning

Yellowbrick Row Store (YRS) Alerting Parameters

ybsql \copy Command

ybsql Properties and Variables

SQL Commands

CREATE EXTERNAL FORMAT

CREATE EXTERNAL TABLE

CREATE TABLE

GRANT

Plan Hinting

SELECT

GROUP BY Clause

Subqueries

SQL Data Types

Data Type Casting

DECIMAL

JSON

JSONB

Building a RAG Chatbot with Spring Using Yellowbrick as a Vector Store

Overview

The Process

Create the Chatbot

Prerequisites

Getting Started

Add pgVector as a Vector Store

Add Content to the Vector Store

Query with Context from the Vector Store

Prompts

Code

Switch to Yellowbrick as a Vector Store

Steps

Bonus: Switch to Amazon Bedrock

Steps

References