Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
11 changes: 11 additions & 0 deletions serverless/load-balancing/overview.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -20,6 +20,17 @@ When you're ready to get started, follow this tutorial to learn how to [build an

Or, if you're ready for a more advanced use case, you can jump straight into [building a vLLM load balancer](/serverless/load-balancing/vllm-worker).

You can also watch this video for an brief overview of the concepts explained on this page:

<iframe
className="w-full aspect-video rounded-xl"
src="https://www.youtube.com/embed/-ENnPpjtMZ8"
title="No more Queues? Runpod Serverless Load Balancing Walkthrough - Fundamentals and Concepts"
frameBorder="0"
allow="fullscreen; accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture"
allowFullScreen
></iframe>

## Key features

- **Direct HTTP access**: Connect directly to worker HTTP servers, bypassing queue infrastructure for lower latency.
Expand Down