AI assistant for Stanford Pediatric Primary Care

# ChatPPC


ChatPPC is a tool that helps staff at Gardner Packard Children's Health Center navigate patient care resources. It is built with Next.js, the Vercel AI SDK, and LangChain, and uses Supabase as a vector database for retrieval-augmented generation (RAG).
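In a RAG setup like this one, each document chunk is stored as an embedding vector; at query time the user's question is embedded and the closest chunks are retrieved by vector similarity. A toy sketch of that ranking step in plain JavaScript (3-dimensional vectors and made-up texts stand in for real 1536-dimensional OpenAI embeddings, which are compared inside Postgres via pgvector):

```javascript
// Toy illustration of similarity-based retrieval: rank stored vectors by
// cosine similarity to a query vector. Vectors and texts are invented for
// illustration; real embeddings come from OpenAI's embedding API.
function cosine(a, b) {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / Math.sqrt(na * nb);
}

const docs = [
  { text: "immunization schedule", vec: [0.9, 0.1, 0.0] },
  { text: "clinic parking info",   vec: [0.0, 0.2, 0.9] },
];
const query = [1.0, 0.0, 0.1]; // embedding of the user's question

// Sort documents by descending similarity to the query.
const ranked = [...docs].sort((a, b) => cosine(query, b.vec) - cosine(query, a.vec));
console.log(ranked[0].text); // "immunization schedule"
```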

## Local Development

### Prerequisites

- Node.js 18+
- Docker Desktop (for local Supabase development)
- OpenAI API key (for document ingestion and embeddings)

### Setup for Development

1. Install the Supabase CLI:

   ```bash
   yarn global add supabase
   ```

2. Clone the repository:

   ```bash
   git clone https://github.com/StanfordBDHG/ChatPPC
   cd ChatPPC
   ```

3. Install dependencies:

   ```bash
   yarn install
   ```

4. Initialize Supabase in your project:

   ```bash
   supabase init
   ```

5. Start the Supabase emulator:

   ```bash
   supabase start
   ```

   If this step succeeded, you should see a message that begins with:

   ```
   supabase local development setup is running.
   ```

   Note the API URL and service_role key printed below this message; you will use them in the next step.

6. Create a `.env.local` file in the root directory with these variables:

   ```
   OPENAI_API_KEY=your_openai_api_key
   SUPABASE_URL={API URL}
   SUPABASE_PRIVATE_KEY={service_role key}
   ```

7. Apply database migrations:

   ```bash
   supabase migration up
   ```

8. Run the development server:

   ```bash
   yarn run dev
   ```

9. Open http://localhost:3000 to view the ChatPPC application. You can also access Supabase Studio at http://localhost:54323 to view and manage your local database.

> [!TIP]
> At this point, you can follow the instructions below in the Document Ingestion and Vector Search Optimization sections to add documents and optimize search performance.

Project Structure

├── scripts/                   # Executable Node.js scripts
│   ├── ingest.mjs             # Document ingestion script
│   └── optimize.mjs           # Vector search optimization script
├── tests/                     # All test files
│   ├── ingest.test.mjs        # Ingestion functionality tests
│   ├── optimize.test.mjs      # Optimization script tests
│   └── database.test.mjs      # Database connectivity tests
├── supabase/                  # Database-related files
│   ├── migrations/            # Database schema changes
│   ├── scripts/               # SQL utility scripts
│   │   ├── optimize-vector-search.sql
│   │   └── verify-indexes.sql
│   └── seed.sql              # Initial data seeding
├── app/                      # Next.js application pages
├── components/               # React components
└── docs/                     # Documentation files for ingestion

## Testing

The project includes a comprehensive test suite covering document ingestion, vector search optimization, and end-to-end workflows.

### Running Tests

```bash
# Run all tests (unit + database)
yarn test

# Run only unit tests (fast, no database required)
yarn test:unit

# Run database tests (requires Supabase setup)
yarn test:database

# Run complete test suite including app tests
yarn test:all
```

### Test Categories

#### Unit Tests

- **Ingestion Tests** (`yarn test:ingest`): Document processing, hash generation, file handling
- **Optimization Tests** (`yarn test:optimize`): Vector index setup, SQL validation, script functionality

#### Database Tests

- **Database Connectivity** (`yarn test:database`): Supabase connection and vector search functionality
- **Function Validation**: Tests the `match_documents` function with various parameters
- **Performance Testing**: Vector search speed and result accuracy
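The `match_documents` function can also be exercised by hand from the Supabase Studio SQL editor. A hypothetical smoke test, assuming the standard LangChain/Supabase template signature (`query_embedding vector`, `match_count int`, `filter jsonb`) and its `documents` table:

```sql
-- Hypothetical manual check: reuse a stored embedding as the query vector
-- and ask for the 5 most similar chunks. Column names assume the LangChain
-- Supabase template; adjust if the project's schema differs.
select id, metadata, similarity
from match_documents(
  (select embedding from documents limit 1),
  5,
  '{}'::jsonb
);
```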

### Test Requirements

- **Unit tests**: No external dependencies (always runnable)
- **Database tests**: Require `SUPABASE_URL` and `SUPABASE_PRIVATE_KEY` environment variables
- **All tests**: Node.js 18+ and project dependencies installed

## Quick Start Workflow

Once you have the development environment set up, follow this workflow:

1. **Ingest documents**: `yarn ingest docs` (add your `.md` files to the `docs` folder first)
2. **Optimize search** (optional): `yarn optimize` (creates database indexes for better performance with larger numbers of documents)
3. **Test everything**: `yarn test` (runs the comprehensive test suite)
4. **Start development**: `yarn dev` (application ready at http://localhost:3000)

## Document Ingestion

The project includes an ingestion script that processes markdown files and stores them in your Supabase vector database for AI retrieval.

### Preparing Documents for Ingestion

Add your markdown files to the `docs` directory. Each document should be a properly formatted markdown (`.md`) file.

### Running the Ingestion Script

To ingest documents from the `docs` folder, run:

```bash
yarn ingest docs
```

The script will:

- Scan the specified directory for markdown (`.md`) files
- Split the content into chunks with appropriate overlap
- Generate embeddings using OpenAI
- Store the embeddings in your Supabase vector database
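The chunking-with-overlap step can be pictured with a simplified stand-in. The real `scripts/ingest.mjs` uses LangChain's text splitters before sending chunks to OpenAI for embedding; the chunk sizes below are illustrative values, not the project's configuration:

```javascript
// Simplified sketch of fixed-size chunking with overlap. Overlapping chunk
// boundaries keep context that would otherwise be cut mid-sentence, so a
// query can match text that straddles two chunks.
function chunkText(text, chunkSize = 100, chunkOverlap = 20) {
  const chunks = [];
  let start = 0;
  while (start < text.length) {
    chunks.push(text.slice(start, start + chunkSize));
    if (start + chunkSize >= text.length) break; // last chunk reached
    start += chunkSize - chunkOverlap;           // step back by the overlap
  }
  return chunks;
}

const chunks = chunkText("x".repeat(250));
console.log(chunks.length); // 3 chunks for 250 characters
```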

## Vector Search Optimization

> [!NOTE]
> This section describes optional optimization techniques that may help if you encounter slow queries after ingesting larger numbers of documents.

After running document ingestion, you can create vector indexes with:

```bash
yarn optimize
```

This script will create and verify:

- An HNSW index on embeddings for fast vector similarity search
- A GIN index on metadata for efficient filtering
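Under the hood, these indexes correspond to DDL along the following lines. This is an illustrative sketch assuming the LangChain template's `documents` table (`embedding vector`, `metadata jsonb`); the authoritative statements live in `supabase/scripts/optimize-vector-search.sql`:

```sql
-- Illustrative only: the exact operator class and index parameters used by
-- `yarn optimize` may differ.
create index if not exists documents_embedding_hnsw_idx
  on documents using hnsw (embedding vector_cosine_ops);

create index if not exists documents_metadata_gin_idx
  on documents using gin (metadata);
```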

## Admin Dashboard

To access the admin dashboard for viewing conversation analytics and managing documents:

1. Navigate to the Supabase dashboard and add a new user under Authentication with an email and password. Currently only admins have individual user accounts, while regular users access the app without an account; any user created in Supabase Authentication is therefore automatically considered an admin.
2. Navigate to `/admin` or click the 📄 icon in the top right of the navbar, then sign in with your admin credentials.
