Benchmarks

This page documents performance benchmarks for Korvet with Redis Enterprise as the storage backend.

Optimal Configuration Benchmark

This benchmark demonstrates the best throughput configuration for Korvet with Redis Enterprise.

Test Environment

  • Redis Enterprise: 16 shards, running locally

  • Korvet: Single instance (macOS, Apple Silicon)

  • Kafka Tools: kafka-producer-perf-test from Apache Kafka

  • Topic Configuration: 16 partitions (1× the number of shards)

  • Record Size: 1 KB (1024 bytes)

  • Total Messages: 8,000,000 (1,000,000 per producer)

Configuration

Parameter                   Value
Producers                   8
Batch Size                  1000 messages (1.07 MB)
Redis Connection Pool Size  8
Acks                        1
Compression                 none
Linger                      0ms

Performance Results

Metric                           Value
Aggregate Throughput             380,952 records/sec
Throughput (MB/sec)              372.02 MB/sec
Total Messages                   8,000,000
Duration                         21 seconds
Average Latency (range)          249-411 ms
95th Percentile Latency (range)  657-2136 ms

Korvet Resource Usage

Metric           Value
Process CPU      2.35%
System CPU       20.51%
JVM Memory Used  244.13 MB

Redis Enterprise Metrics

Metric                     Value
Total CPU (all 16 shards)  79%
Per-Shard CPU              2-8%
Data per Shard             112-172 MB
Data Distribution          Even across all shards

Key Findings

  • High throughput with low CPU usage: Achieved 372 MB/sec with only 2.35% Korvet CPU usage

  • Excellent scalability headroom: Both Korvet and Redis Enterprise operating well below capacity

  • Even load distribution: Data and CPU load distributed evenly across all 16 Redis shards

  • Optimal batch size: 1000 messages per batch provided the best balance of throughput and latency

Running This Benchmark

To reproduce this benchmark, use the provided benchmark script from the korvet-dist repository:

git clone https://github.com/redis-field-engineering/korvet-dist.git
cd korvet-dist/samples/benchmark/scripts
./run-comprehensive-benchmark.sh

The script will:

  1. Start Korvet with the specified Redis pool size

  2. Create a topic with 16 partitions

  3. Run 8 concurrent producers, each sending 1,000,000 messages

  4. Collect metrics from Korvet (via actuator) and Redis Enterprise (via API)

  5. Generate a detailed report with throughput, latency, and resource usage

Results are saved to /tmp/korvet-benchmark-<timestamp>/.
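
Under the hood, each producer run corresponds to a kafka-producer-perf-test invocation along these lines. This is a sketch of the documented settings, not the script's exact command; the bootstrap address is an assumption for a locally running Korvet instance:

```shell
# One of the 8 concurrent producers: 1,000,000 records of 1 KB each,
# acks=1, no compression, no lingering (as in the Configuration table).
kafka-producer-perf-test \
  --topic benchmark-test \
  --num-records 1000000 \
  --record-size 1024 \
  --throughput -1 \
  --producer-props \
    bootstrap.servers=localhost:9092 \
    acks=1 \
    linger.ms=0 \
    compression.type=none
```

Note that the script expresses the batch as 1000 messages; in standard producer configs that maps to a byte-denominated batch.size of roughly 1 MB for 1 KB records.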

Single Shard Benchmark

This benchmark demonstrates Korvet performance with a single Redis shard, providing a baseline for comparison with multi-shard configurations.

Test Environment

  • Redis Enterprise: 1 shard (~1 GB maxmemory), running locally

  • Korvet: Single instance (macOS, Apple Silicon)

  • Kafka Tools: kafka-producer-perf-test from Apache Kafka

  • Topic Configuration: 1 partition (matching the single shard)

  • Record Size: 1 KB (1024 bytes)

Configuration

Parameter                   Value
Producers                   1 (baseline) / 8 (concurrent)
Batch Size                  1000 messages (1.07 MB)
Redis Connection Pool Size  16
Acks                        1
Compression                 none
Linger                      0ms

Performance Results

Single Producer (Baseline)

Metric               Value
Throughput           151,860 records/sec
Throughput (MB/sec)  148.30 MB/sec
Total Messages       200,000
Average Latency      160 ms
P99 Latency          329 ms

8 Concurrent Producers

Metric                   Value
Aggregate Throughput     168,641 records/sec
Throughput (MB/sec)      164.68 MB/sec
Total Messages           674,564
Duration                 4.58 seconds
Average Latency (range)  495-724 ms
Memory Used              776.47 MB

Comparison: 16 Shards vs 1 Shard

Metric                16 Shards  1 Shard  Ratio
Database Memory       ~16 GB     ~1 GB    16×
Topic Partitions      16         1        16×
Throughput (rec/s)    380,952    168,641  2.26×
Throughput (MB/s)     372.02     164.68   2.26×
Per-shard throughput  23,809     168,641  0.14×
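
The derived columns follow directly from the raw numbers; a quick shell check (awk handles the non-integer division):

```shell
# Per-shard throughput in the 16-shard setup (aggregate / shard count)
echo $(( 380952 / 16 ))
# Aggregate speedup of 16 shards over 1 shard
awk 'BEGIN { printf "%.2f\n", 380952 / 168641 }'
```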

Key Findings

  • Single shard achieves ~44% of 16-shard aggregate throughput: 168,641 vs 380,952 records/sec

  • Higher per-shard efficiency with fewer shards: A single shard processes 168,641 rec/s vs 23,809 rec/s per shard in the 16-shard setup

  • Memory efficiency: ~1.15 KB per message in Redis Streams (776 MB for 674,564 messages)

  • Single producer baseline: 151,860 rec/s provides a clean baseline without concurrency overhead
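
The ~1.15 KB/message figure is simply memory used divided by message count (decimal megabytes assumed):

```shell
# 776.47 MB used for 674,564 messages → bytes per message
echo $(( 776470000 / 674564 ))
```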

Cold Storage Archival Benchmark

This benchmark measures the throughput of archiving messages from Redis Streams to Delta Lake on S3.

Test Environment

  • EC2 Instance: c5.2xlarge (8 vCPU, 16GB RAM) in us-west-1

  • S3 Bucket: Same region (us-west-1) for optimal network performance

  • Redis: Docker container on same instance

  • Message Size: ~100 bytes (binary payload)

  • Compression: ZSTD (Parquet default)

Single Stream Results

Archiving from a single Redis Stream to S3:

Messages   Archive Time  Throughput    Parquet Files    Delta Commits
1,000,000  31.3s         31,970 msg/s  100 @ 186ms avg  11 @ 509ms avg

Multi-Stream Results (4 Partitions)

Archiving from 4 Redis Streams in parallel to S3:

Messages   Archive Time  Throughput     Parquet Files    Delta Commits
1,000,000  12.5s         80,239 msg/s   100 @ 212ms avg  16 @ 385ms avg
4,000,000  34.7s         115,347 msg/s  400 @ 192ms avg  44 @ 486ms avg

Scaling Summary

Configuration            Throughput  vs Single Stream
1 stream                 32k msg/s   baseline
4 streams (1M messages)  80k msg/s   2.5×
4 streams (4M messages)  115k msg/s  3.6×
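
The scaling factors in the table can be reproduced from the measured throughputs:

```shell
awk 'BEGIN {
  printf "4 streams (1M): %.1fx\n", 80239 / 31970
  printf "4 streams (4M): %.1fx\n", 115347 / 31970
}'
```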

Key Findings

  • Single stream peaks at ~32k msg/s: Bottleneck is S3 PUT latency for Parquet files

  • Near-linear scaling with streams: 4 streams achieves 115k msg/s (3.6× single stream)

  • Parquet writes average ~190ms: Same-region S3 provides consistent low latency

  • Delta commits average ~480ms: Includes S3 metadata operations for transaction log

  • Excellent compression: ZSTD achieves ~50:1 compression ratio (~2 bytes/message stored)

  • Same-region S3 is critical: Cross-region throughput drops by ~50%

Storage Efficiency

Metric              Value
Messages archived   4,000,000
S3 objects created  444 (400 Parquet + 44 Delta logs)
Total S3 storage    ~8 MB
Bytes per message   ~2 bytes (after ZSTD compression)
Compression ratio   ~50:1
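
These figures are mutually consistent: ~100-byte payloads stored in ~2 bytes each gives the ~50:1 ratio. A quick check:

```shell
RAW=$(( 4000000 * 100 ))       # ~100 B/message raw payload
STORED=$(( 8 * 1000 * 1000 ))  # ~8 MB total on S3 (decimal MB)
echo "compression ratio: $(( RAW / STORED )):1"
echo "bytes per message: $(( STORED / 4000000 ))"
```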

Archival Configuration

The archival service was configured with:

storage:
  enabled: true
  path: s3a://your-bucket/korvet
  s3:
    region: us-west-1

# Tuning parameters (per stream)
read-worker-count: 4      # One per stream
commit-worker-count: 4    # One per stream
redis-batch-size: 10000   # Messages per Redis XREADGROUP
max-batches-per-commit: 10
files-per-delta-commit: 10

Configuration Recommendations

For optimal throughput:

  • Batch size: Use 1000 messages per batch for best balance of throughput and latency

  • Producers: 8 concurrent producers provide excellent throughput with manageable latency

  • Redis pool size: Match pool size to number of producers (8) for optimal connection utilization

  • Partitions: Use 1-2× the number of Redis shards (16 partitions for 16 shards)

  • Redis shards: Match the number of shards to available CPU cores

  • Rebalance delay: Configure korvet.server.rebalance-delay appropriately (default 10s) to allow all consumers to join before rebalancing

  • Replication: Disable replication for write-heavy workloads (if durability requirements allow)
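
Since Korvet is a Spring Boot application (the benchmark script reads its metrics from the actuator), korvet.server.rebalance-delay can be set like any other Spring property. A sketch, assuming a standard jar launch (the jar name is a placeholder):

```shell
# Raise the rebalance delay to 30s for a slow-joining consumer fleet
java -Dkorvet.server.rebalance-delay=30s -jar korvet.jar
```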

Running Your Own Benchmarks

Using the Benchmark Script

The korvet-dist repository contains a script to run benchmarks with various configurations.

git clone https://github.com/redis-field-engineering/korvet-dist.git
cd korvet-dist/samples/benchmark/scripts
./run-comprehensive-benchmark.sh

Configuration Options

Edit the script to customize benchmark parameters:

# Test parameters
TOPIC="benchmark-test"
PARTITIONS=16
RECORD_SIZE=1024
NUM_RECORDS=1000000

# Parameter arrays
PRODUCERS=(8)           # Number of concurrent producers
BATCH_SIZES=(1000)      # Messages per batch
POOL_SIZES=(8)          # Redis connection pool size

What the Script Does

  1. Starts Korvet with the specified Redis pool size

  2. Flushes Redis to ensure clean state

  3. Creates topic with specified number of partitions

  4. Runs producers using kafka-producer-perf-test

  5. Collects metrics:

    • Korvet CPU and memory (via Spring Boot Actuator at port 8080)

    • Redis Enterprise CPU and memory (via REST API at port 9443)

    • Producer throughput and latency

  6. Generates report with detailed results
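
The same metrics can be pulled by hand. A sketch: the actuator paths are standard Micrometer metric names, the Redis Enterprise path assumes the stock REST API, and credentials/hostnames are placeholders:

```shell
# Korvet process CPU and JVM memory via Spring Boot Actuator (port 8080)
curl -s localhost:8080/actuator/metrics/process.cpu.usage
curl -s localhost:8080/actuator/metrics/jvm.memory.used

# Redis Enterprise cluster stats via the REST API (port 9443, self-signed TLS)
curl -sk -u "user@example.com:password" \
  https://localhost:9443/v1/cluster/stats/last
```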

Output

Results are saved to /tmp/korvet-benchmark-<timestamp>/:

  • SUMMARY.txt: Summary table of all test results

  • producers-<N>_batch-<B>msg_pool-<P>.txt: Detailed results for each test

Example summary output:

Producers  Batch(msg)   Pool       Total Msgs      Duration(s)  Throughput(rec/s)  Throughput(MB/s)
8          1000         8          8000000         21           380952             372.02

Running Cold Storage Benchmarks

To run cold storage benchmarks against S3:

# Set S3 bucket and region
export S3_BUCKET=your-bucket-name
export AWS_REGION=us-west-1

# Run single-stream benchmark (100k messages default)
./gradlew :korvet-storage:test \
  --tests "StreamArchivalServiceS3Benchmark.benchmarkS3"

# Run with more messages
./gradlew :korvet-storage:test \
  --tests "StreamArchivalServiceS3Benchmark.benchmarkS3" \
  -Dtest.message.count=1000000

# Run multi-stream benchmark (4 streams)
./gradlew :korvet-storage:test \
  --tests "StreamArchivalServiceS3Benchmark.benchmarkS3MultiStream" \
  -Dtest.message.count=1000000

For best results, run on an EC2 instance in the same region as your S3 bucket.