Querying Structured Data (SQL)

Open in Colab

In the previous tutorial, we explored how agentic RAG can handle complex queries on structured data in the form of tables using pandas. Now, we’ll see how we can do the same for SQL databases.

Consider a scenario similar to the previous tutorial where we have evaluation results for an LLM application. However, instead of a CSV file, this data is now stored in a SQLite database. Users might still ask questions like “What’s the average score for a specific use case?” or “Which configuration has the lowest latency?”, but now we’ll answer these using SQL queries instead of pandas operations.

In this tutorial, we’ll cover:

Setting up a SQLite database
Creating a function to execute SQL queries
Building an agent for querying SQL databases
Running the agent with various types of queries

By implementing these techniques, we’ll expand our agentic RAG system to handle structured data in SQL databases, complementing our previous work with tabular data in pandas.

Let’s get started by setting up our environment and creating our SQLite database.

Setup

To get started, first we need to install the cohere library and create a Cohere client.

PYTHON

1 ! pip install cohere pandas -qq

PYTHON

1 import json
2 import os
3 import cohere
4 import sqlite3
5 import pandas as pd
6 
7 co = cohere.ClientV2(
8     "COHERE_API_KEY"
9 )  # Get your free API key: https://dashboard.cohere.com/api-keys

Creating a SQLite database

Next, we’ll create a SQLite database to store our evaluation results. SQLite is a lightweight, serverless database engine that’s perfect for small to medium-sized applications. Here’s what we’re going to do:

Create a new SQLite database file named evaluation_results.db.
Create a table called evaluation_results with columns for usecase, run, score, temperature, tokens, and latency.
Insert sample data into the table to simulate our evaluation results.

Important: the data can be found here. Make sure to have the file in the same directory as this notebook for the imports to work correctly.

PYTHON

1 # Create a connection to a new SQLite database (or connect to an existing one)
2 conn = sqlite3.connect("evaluation_results.db")
3 cursor = conn.cursor()
4 
5 # Execute the CREATE TABLE command
6 cursor.execute(
7     """
8 CREATE TABLE evaluation_results (
9     usecase TEXT,
10     run TEXT,
11     score FLOAT,
12     temperature FLOAT,
13     tokens INTEGER,
14     latency FLOAT
15 )
16 """
17 )
18 
19 # Execute the INSERT commands
20 data = [
21     ("extract_names", "A", 0.5, 0.3, 103, 1.12),
22     ("draft_email", "A", 0.6, 0.3, 252, 2.5),
23     ("summarize_article", "A", 0.8, 0.3, 350, 4.2),
24     ("extract_names", "B", 0.2, 0.3, 101, 2.85),
25     ("draft_email", "B", 0.4, 0.3, 230, 3.2),
26     ("summarize_article", "B", 0.6, 0.3, 370, 4.2),
27     ("extract_names", "C", 0.7, 0.3, 101, 2.22),
28     ("draft_email", "C", 0.5, 0.3, 221, 2.5),
29     ("summarize_article", "C", 0.1, 0.3, 361, 3.9),
30     ("extract_names", "D", 0.7, 0.5, 120, 3.2),
31     ("draft_email", "D", 0.8, 0.5, 280, 3.4),
32     ("summarize_article", "D", 0.9, 0.5, 342, 4.8),
33 ]
34 
35 cursor.executemany(
36     "INSERT INTO evaluation_results VALUES (?,?,?,?,?,?)", data
37 )
38 
39 # Commit the changes and close the connection
40 conn.commit()
41 conn.close()

Creating a function to query a SQL database

Next, we’ll define a function called sql_table_query that allows us to execute SQL queries on our evaluation_results database.

This function will enable us to retrieve and analyze data from our evaluation_results table, allowing for dynamic querying based on our specific needs.

PYTHON

1 def sql_table_query(query: str) -> dict:
2     """
3     Execute an SQL query on the evaluation_results table and return the result as a dictionary.
4 
5     Args:
6     query (str): SQL query to execute on the evaluation_results table
7 
8     Returns:
9     dict: Result of the SQL query
10     """
11     try:
12         # Connect to the SQLite database
13         conn = sqlite3.connect("evaluation_results.db")
14 
15         # Execute the query and fetch the results into a DataFrame
16         df = pd.read_sql_query(query, conn)
17 
18         # Close the connection
19         conn.close()
20 
21         # Convert DataFrame to dictionary
22         result_dict = df.to_dict(orient="records")
23 
24         return result_dict
25 
26     except sqlite3.Error as e:
27         print(f"An error occurred: {e}")
28         return str(e)
29     except Exception as e:
30         print(f"An unexpected error occurred: {e}")
31         return str(e)
32 
33 
34 functions_map = {"sql_table_query": sql_table_query}

We can test the function by running a simple query:

PYTHON

1 result = sql_table_query(
2     "SELECT * FROM evaluation_results WHERE usecase = 'extract_names'"
3 )
4 print(result)

1 [{'usecase': 'extract_names', 'run': 'A', 'score': 0.5, 'temperature': 0.3, 'tokens': 103, 'latency': 1.12}, {'usecase': 'extract_names', 'run': 'B', 'score': 0.2, 'temperature': 0.3, 'tokens': 101, 'latency': 2.85}, {'usecase': 'extract_names', 'run': 'C', 'score': 0.7, 'temperature': 0.3, 'tokens': 101, 'latency': 2.22}, {'usecase': 'extract_names', 'run': 'D', 'score': 0.7, 'temperature': 0.5, 'tokens': 120, 'latency': 3.2}]

Setting up a tool to interact with the database

Next, we’ll create a tool that will allow the agent to interact with the SQLite database containing our evaluation results.

PYTHON

1 sql_table_query_tool = {
2     "type": "function",
3     "function": {
4         "name": "sql_table_query",
5         "description": "Execute an SQL query on the evaluation_results table in the SQLite database. The table has columns 'usecase', 'run', 'score', 'temperature', 'tokens', and 'latency'.",
6         "parameters": {
7             "type": "object",
8             "properties": {
9                 "query": {
10                     "type": "string",
11                     "description": "SQL query to execute on the evaluation_results table",
12                 }
13             },
14             "required": ["query"],
15         },
16     },
17 }
18 
19 tools = [sql_table_query_tool]

Building an agent for querying SQL data

Next, let’s create a run_agent function to run the agentic RAG workflow, just as we did in Part 1.

The only change we are making here is to make the system message more specific and describe the database schema to the agent.

PYTHON

1 system_message = """## Task and Context
2 You are an assistant who helps developers analyze LLM application evaluation results from a SQLite database. The database contains a table named 'evaluation_results' with the following schema:
3 
4 - usecase (TEXT): The type of task being evaluated
5 - run (TEXT): The identifier for a specific evaluation run
6 - score (REAL): The performance score of the run
7 - temperature (REAL): The temperature setting used for the LLM
8 - tokens (INTEGER): The number of tokens used in the run
9 - latency (REAL): The time taken for the run in seconds
10 
11 You can use SQL queries to analyze this data and provide insights to the developers."""

PYTHON

1 model = "command-a-03-2025"
2 
3 
4 def run_agent(query, messages=None):
5     if messages is None:
6         messages = []
7 
8     if "system" not in {m.get("role") for m in messages}:
9         messages.append({"role": "system", "content": system_message})
10 
11     # Step 1: get user message
12     print(f"Question:\n{query}")
13     print("=" * 50)
14 
15     messages.append({"role": "user", "content": query})
16 
17     # Step 2: Generate tool calls (if any)
18     response = co.chat(
19         model=model, messages=messages, tools=tools, temperature=0.3
20     )
21 
22     while response.message.tool_calls:
23 
24         print("Tool plan:")
25         print(response.message.tool_plan, "\n")
26         print("Tool calls:")
27         for tc in response.message.tool_calls:
28             # print(f"Tool name: {tc.function.name} | Parameters: {tc.function.arguments}")
29             if tc.function.name == "analyze_evaluation_results":
30                 print(f"Tool name: {tc.function.name}")
31                 tool_call_prettified = print(
32                     "\n".join(
33                         f"  {line}"
34                         for line_num, line in enumerate(
35                             json.loads(tc.function.arguments)[
36                                 "code"
37                             ].splitlines()
38                         )
39                     )
40                 )
41                 print(tool_call_prettified)
42             else:
43                 print(
44                     f"Tool name: {tc.function.name} | Parameters: {tc.function.arguments}"
45                 )
46         print("=" * 50)
47 
48         messages.append(
49             {
50                 "role": "assistant",
51                 "tool_calls": response.message.tool_calls,
52                 "tool_plan": response.message.tool_plan,
53             }
54         )
55 
56         # Step 3: Get tool results
57         for tc in response.message.tool_calls:
58             tool_result = functions_map[tc.function.name](
59                 **json.loads(tc.function.arguments)
60             )
61             tool_content = [
62                 (
63                     {
64                         "type": "document",
65                         "document": {"data": json.dumps(tool_result)},
66                     }
67                 )
68             ]
69 
70             messages.append(
71                 {
72                     "role": "tool",
73                     "tool_call_id": tc.id,
74                     "content": tool_content,
75                 }
76             )
77 
78         # Step 4: Generate response and citations
79         response = co.chat(
80             model=model,
81             messages=messages,
82             tools=tools,
83             temperature=0.3,
84         )
85 
86     messages.append(
87         {
88             "role": "assistant",
89             "content": response.message.content[0].text,
90         }
91     )
92 
93     # Print final response
94     print("Response:")
95     print(response.message.content[0].text)
96     print("=" * 50)
97 
98     # Print citations (if any)
99     verbose_source = (
100         False  # Change to True to display the contents of a source
101     )
102     if response.message.citations:
103         print("CITATIONS:\n")
104         for citation in response.message.citations:
105             print(
106                 f"Start: {citation.start}| End:{citation.end}| Text:'{citation.text}' "
107             )
108             print("Sources:")
109             for idx, source in enumerate(citation.sources):
110                 print(f"{idx+1}. {source.id}")
111                 if verbose_source:
112                     print(f"{source.tool_output}")
113             print("\n")
114 
115     return messages

Running the agent

Let’s now ask the agent the same set of questions we asked in the previous chapter. While the previous chapter translates the questions into pandas Python code, this time the agent will be using SQL queries.

PYTHON

1 messages = run_agent("What's the average evaluation score in run A")
2 # Answer: 0.63

1 Question:
2 What's the average evaluation score in run A
3 ==================================================
4 Tool plan:
5 I will query the connected SQL database to find the average evaluation score in run A. 
6 
7 Tool calls:
8 Tool name: sql_table_query | Parameters: {"query":"SELECT AVG(score) AS average_score\r\nFROM evaluation_results\r\nWHERE run = 'A';"}
9 ==================================================
10 Response:
11 The average evaluation score in run A is 0.63.
12 ==================================================
13 CITATIONS:
14 
15 Start: 41| End:46| Text:'0.63.' 
16 Sources:
17 1. sql_table_query_97h16txpbeqs:0

PYTHON

1 messages = run_agent(
2     "What's the latency of the highest-scoring run for the summarize_article use case?"
3 )
4 # Answer: 4.8

1 Question:
2 What's the latency of the highest-scoring run for the summarize_article use case?
3 ==================================================
4 Tool plan:
5 I will query the connected SQL database to find the latency of the highest-scoring run for the summarize_article use case.
6 
7 I will filter the data for the summarize_article use case and order the results by score in descending order. I will then return the latency of the first result. 
8 
9 Tool calls:
10 Tool name: sql_table_query | Parameters: {"query":"SELECT latency\r\nFROM evaluation_results\r\nWHERE usecase = 'summarize_article'\r\nORDER BY score DESC\r\nLIMIT 1;"}
11 ==================================================
12 Response:
13 The latency of the highest-scoring run for the summarize_article use case is 4.8.
14 ==================================================
15 CITATIONS:
16 
17 Start: 77| End:81| Text:'4.8.' 
18 Sources:
19 1. sql_table_query_ekswkn14ra34:0

PYTHON

1 messages = run_agent(
2     "Which use case uses the least amount of tokens on average? Show the comparison of all use cases in a markdown table."
3 )
4 # Answer: extract_names (106.25), draft_email (245.75), summarize_article (355.75)

1 Question:
2 Which use case uses the least amount of tokens on average? Show the comparison of all use cases in a markdown table.
3 ==================================================
4 Tool plan:
5 I will query the connected SQL database to find the average number of tokens used for each use case. I will then present this information in a markdown table. 
6 
7 Tool calls:
8 Tool name: sql_table_query | Parameters: {"query":"SELECT usecase, AVG(tokens) AS avg_tokens\nFROM evaluation_results\nGROUP BY usecase\nORDER BY avg_tokens ASC;"}
9 ==================================================
10 Response:
11 Here is a markdown table showing the average number of tokens used for each use case:
12 
13 | Use Case | Average Tokens |
14 |---|---|
15 | extract_names | 106.25 |
16 | draft_email | 245.75 |
17 | summarize_article | 355.75 |
18 
19 The use case that uses the least amount of tokens on average is **extract_names**.
20 ==================================================
21 CITATIONS:
22 
23 Start: 129| End:142| Text:'extract_names' 
24 Sources:
25 1. sql_table_query_50yjx2cecqx1:0
26 
27 
28 Start: 145| End:151| Text:'106.25' 
29 Sources:
30 1. sql_table_query_50yjx2cecqx1:0
31 
32 
33 Start: 156| End:167| Text:'draft_email' 
34 Sources:
35 1. sql_table_query_50yjx2cecqx1:0
36 
37 
38 Start: 170| End:176| Text:'245.75' 
39 Sources:
40 1. sql_table_query_50yjx2cecqx1:0
41 
42 
43 Start: 181| End:198| Text:'summarize_article' 
44 Sources:
45 1. sql_table_query_50yjx2cecqx1:0
46 
47 
48 Start: 201| End:207| Text:'355.75' 
49 Sources:
50 1. sql_table_query_50yjx2cecqx1:0
51 
52 
53 Start: 277| End:290| Text:'extract_names' 
54 Sources:
55 1. sql_table_query_50yjx2cecqx1:0

Summary

In this tutorial, we learned about:

How to set up a SQLite database for structured data
How to create a function to execute SQL queries
How to build an agent for querying the database
How to run the agent

By implementing these techniques, we’ve further expanded our agentic RAG system to handle structured data in the form of SQL databases. This allows for more powerful and flexible querying capabilities, especially when dealing with large datasets or complex relationships between data.

This tutorial completes our exploration of structured data handling in the agentic RAG system, covering both tabular data (using pandas) and relational databases (using SQL). These capabilities significantly enhance the system’s ability to work with diverse data formats and structures.