Redis feature store with Lettuce
Build a Redis-backed online feature store in Java with Lettuce
This guide shows you how to build a small Redis-backed online feature store in
Java with Lettuce, the
async-by-default Netty-based Redis client. The demo runs on top of the JDK's
com.sun.net.httpserver.HttpServer so you can bulk-load a batch of users
with a key-level TTL, run a streaming worker that overwrites real-time
features with per-field TTL, retrieve any subset of features for one user
under 2 ms, and pipeline HMGET across a hundred users for batch scoring.
The Jedis walkthrough
covers the same flow with a synchronous, pool-borrowing client. This page
focuses on what's different in Lettuce — the multiplexed connection, the
RedisAsyncCommands surface, and the auto-flush pipelining model — rather
than re-explaining the shared concepts.
Overview
Each entity (here, a user) is one Redis
Hash at a deterministic key —
fs:user:{id}. The hash holds every feature for that entity as one field per
feature: batch-materialized aggregates (refreshed once a day) alongside
streaming-updated signals (refreshed every few seconds). One
HMGET returns whichever subset the model
needs in one network round trip.
Two TTL layers solve the mixed staleness problem without an application-side cleaner:
- A key-level
EXPIREaligned with the batch materialization cycle (24 hours in the demo). If the batch refresher fails, the whole entity disappears at the next cycle and inference sees a missing entity — which the model handler can detect and fall back on — rather than silently outdated values. - A per-field
HEXPIRE(Redis 7.4+) on each streaming feature gives that field its own shorter expiry, independent of the rest of the hash. If the streaming pipeline stops updating a feature, the field self-cleans while the batch fields stay populated.
That gives you:
- A single round trip for retrieval — any subset of features for one entity
in one
HMGET. - Sub-millisecond hot path. The Redis-side work is microseconds; in practice the bottleneck is the network round trip plus the model's own feature-prep.
- Pipelined batch scoring — one round trip for
Nusers at once. - Independent freshness per feature, expressed as a server-side TTL rather than as application logic.
- Self-cleanup on pipeline failure: a stalled batch refresher lets entities expire on schedule, and a stalled streaming worker lets each affected field expire on its own timer.
How Lettuce differs from Jedis
The big mental-model difference for someone arriving from Jedis:
- One shared, multiplexed connection. A
StatefulRedisConnection<K, V>is thread-safe and serves the whole process. There's noJedisPool-style per-call borrow — every handler in the HTTP thread pool and the streaming worker share the same connection, and Netty handles the serialization onto the underlying socket. - Async-by-default API. Every method on
RedisAsyncCommands<K, V>returns aRedisFuture<T>(which is aCompletionStage<T>and aFuture<T>). For synchronous code paths the helper blocks with.get(); for reactive pipelines you'd compose with.thenApply()/.thenCompose()or use the.reactive()API directly. - Pipelining via connection-level auto-flush. Lettuce doesn't have a
pipelined()-style builder. Instead, you toggleconn.setAutoFlushCommands(false)on the connection, queue commands as normal async calls (each returns its ownRedisFuture), callconn.flushCommands()to ship the batch, and toggle auto-flush back on.LettuceFutures.awaitAll(...)waits for all the futures to resolve.
In short: reach for Lettuce when you need async/reactive composition or you're already in a reactive stack (Spring WebFlux, Project Reactor); reach for Jedis when blocking commands are common or you want a simple sync API with explicit per-call connection lifetime. The Lettuce and Jedis client guides cover the deeper selection criteria.
In this example, the batch features describe a user's longer-term shape and
are bulk-loaded by BuildFeatures.java. The streaming features describe
what the user is doing right now and are written by StreamingWorker.java
on a daemon thread. The inference handlers of the demo server read any
subset of those features through FeatureStore.java's helper class. All
four sources share one StatefulRedisConnection opened in DemoServer.java.
The feature-store helper
The FeatureStore class wraps the read/write paths
(source):
import io.lettuce.core.RedisClient;
import io.lettuce.core.api.StatefulRedisConnection;
RedisClient client = RedisClient.create("redis://localhost:6379");
try (StatefulRedisConnection<String, String> conn = client.connect()) {
FeatureStore store = new FeatureStore(conn,
"fs:user:",
24L * 60L * 60L, // whole-entity TTL aligned with the daily batch cycle
5L * 60L // per-field TTL on each streaming feature
);
// Batch materialization: one HSET + EXPIRE per user, all pipelined
// through a single connection-level flush.
Map<String, Map<String, Object>> rows = Map.of(
"u0001", Map.of(
"country_iso", "US", "risk_segment", "low",
"tx_count_7d", 14, "avg_amount_30d", 92.40,
"account_age_days", 612, "chargeback_count_180d", 0));
store.bulkLoad(rows);
// Streaming write: HSET + HEXPIRE on just the fields that changed.
store.updateStreaming("u0001", Map.of(
"last_login_ts", System.currentTimeMillis(),
"last_device_id", "ios-9f02",
"tx_count_5m", 3,
"failed_logins_15m", 0,
"session_country", "US"));
// Inference read: HMGET of whatever the model needs.
Map<String, String> features = store.getFeatures("u0001", List.of(
"risk_segment", "tx_count_7d", "avg_amount_30d",
"tx_count_5m", "failed_logins_15m"));
// Batch scoring: pipelined HMGET across many users.
Map<String, Map<String, String>> batch = store.batchGetFeatures(
List.of("u0001", "u0002", "u0003"),
List.of("risk_segment", "tx_count_5m", "failed_logins_15m"));
} finally {
client.shutdown();
}
Data model
Each user is one Redis Hash. Every value is stored as a string — Redis hash
fields are bytes on the wire, so the helper encodes booleans as "true" /
"false" (encodeValue(Object) in FeatureStore.java) and renders
everything else with Object.toString(). The model server is responsible
for parsing back to the right type, the same way it would when reading any
serialized feature store.
fs:user:u0001 TTL = 86400 s (key-level)
country_iso=US <no field TTL>
risk_segment=low <no field TTL>
account_age_days=612 <no field TTL>
tx_count_7d=14 <no field TTL>
avg_amount_30d=92.40 <no field TTL>
chargeback_count_180d=0 <no field TTL>
last_login_ts=1716998413541 TTL = 300 s (per field, HEXPIRE)
last_device_id=ios-9f02 TTL = 300 s (per field, HEXPIRE)
tx_count_5m=3 TTL = 300 s (per field, HEXPIRE)
failed_logins_15m=0 TTL = 300 s (per field, HEXPIRE)
session_country=US TTL = 300 s (per field, HEXPIRE)
Bulk-loading batch features
bulkLoad queues one HSET and one EXPIRE per user with auto-flush
disabled, flushes once, and waits for every RedisFuture to resolve.
public int bulkLoad(Map<String, Map<String, Object>> rows, long ttlSeconds) {
if (rows.isEmpty()) return 0;
List<RedisFuture<?>> futures = new ArrayList<>(rows.size() * 2);
conn.setAutoFlushCommands(false);
try {
for (Map.Entry<String, Map<String, Object>> e : rows.entrySet()) {
String key = keyFor(e.getKey());
Map<String, String> encoded = encode(e.getValue());
futures.add(async.hset(key, encoded));
futures.add(async.expire(key, ttlSeconds));
}
conn.flushCommands();
} finally {
conn.setAutoFlushCommands(true);
}
if (!LettuceFutures.awaitAll(BATCH_TIMEOUT, futures.toArray(new RedisFuture[0]))) {
throw new IllegalStateException("bulkLoad: timed out after " + BATCH_TIMEOUT);
}
...
}
The two important things to notice:
setAutoFlushCommands(false)is on the connection, not the async commands. It affects every call going through thatStatefulRedisConnectionuntil it's flipped back. Thefinallyblock restores auto-flush even if a queue step throws — failing to do so would silently break every subsequent command in the JVM.LettuceFutures.awaitAllblocks with a timeout. With auto-flush off, queued commands can sit in the local pipeline buffer indefinitely if something below the flush goes wrong. The timeout givesbulkLoada clean failure mode rather than hanging forever.
In production, the equivalent of this script runs as an offline pipeline (a
Spark or Feast materialize job) that reads from the warehouse and writes
into Redis. The
Feast RedisOnlineStore
provider does exactly this under the hood; the in-house
Redis Feature Form integration
covers the materialize + serve path end-to-end.
Streaming writes with per-field TTL
updateStreaming is the linchpin of the mixed-staleness story:
public void updateStreaming(String entityId, Map<String, Object> fields, long ttlSeconds) {
if (fields.isEmpty()) return;
String key = keyFor(entityId);
Map<String, String> encoded = encode(fields);
String[] names = encoded.keySet().toArray(new String[0]);
RedisFuture<Long> hsetFut;
RedisFuture<List<Long>> hexpireFut;
conn.setAutoFlushCommands(false);
try {
hsetFut = async.hset(key, encoded);
hexpireFut = async.hexpire(key, ttlSeconds, names);
conn.flushCommands();
} finally {
conn.setAutoFlushCommands(true);
}
awaitOne(hsetFut);
List<Long> codes = awaitOne(hexpireFut);
for (Long code : codes) {
if (code == null || code != 1L) {
throw new IllegalStateException(
"HEXPIRE did not set every field TTL for " + key + ": " + codes);
}
}
...
}
HEXPIRE sets the TTL on individual
hash fields, not on the whole key. The two commands are queued under one
flush so Redis runs them in pipeline order: the HSET first creates or
overwrites the fields, then HEXPIRE attaches a TTL to each of those same
fields. HEXPIRE returns one status code per field — 1 if the TTL was
set, 2 if the expiry was 0 or in the past (so Redis deleted the field
instead), 0 if an NX | XX | GT | LT conditional flag was set and not
met (we never use one here), -2 if the field doesn't exist on the key.
The helper throws if any code is anything other than 1, so the "every
streaming write renews its TTL" invariant fails loudly rather than silently
leaving a streaming field with no expiry attached.
If a streaming pipeline stops, the streaming fields drop out one by one as
their per-field TTLs elapse. HTTL lets
the model side inspect the remaining TTL on any field, which is useful both
for debugging and as a freshness signal in the model itself.
HEXPIRE requires Redis 7.4 or later.
HEXPIREand the field-level TTL commands were added in Redis 7.4. Lettuce 6.4 was the first release with the bindings; the demo'spom.xmlpins 7.5.2.RELEASE.
Inference reads with HMGET
getFeatures is one HMGET:
public Map<String, String> getFeatures(String entityId, List<String> fieldNames) {
String key = keyFor(entityId);
Map<String, String> out = new LinkedHashMap<>();
if (fieldNames == null) {
Map<String, String> all = awaitOne(async.hgetall(key));
if (all != null) out.putAll(all);
return out;
}
if (fieldNames.isEmpty()) return out;
List<KeyValue<String, String>> values = awaitOne(
async.hmget(key, fieldNames.toArray(new String[0])));
for (KeyValue<String, String> kv : values) {
if (kv != null && kv.hasValue()) {
out.put(kv.getKey(), kv.getValue());
}
}
return out;
}
Lettuce's hmget returns List<KeyValue<K, V>> rather than a parallel
List<V> like Jedis. KeyValue is Lettuce's Optional-like wrapper:
kv.hasValue() tells you whether Redis returned a value or a nil for that
field, and kv.getValue() unwraps it. The helper drops hasValue()==false
entries so the caller's Map<String, String> only contains fields that
actually exist on the hash.
Batch scoring with pipelined HMGET
The same connection-level flush pattern carries over to batch reads:
public Map<String, Map<String, String>> batchGetFeatures(
List<String> entityIds, List<String> fieldNames) {
if (entityIds.isEmpty() || fieldNames.isEmpty()) {
return Collections.emptyMap();
}
String[] names = fieldNames.toArray(new String[0]);
List<RedisFuture<List<KeyValue<String, String>>>> futures =
new ArrayList<>(entityIds.size());
conn.setAutoFlushCommands(false);
try {
for (String id : entityIds) {
futures.add(async.hmget(keyFor(id), names));
}
conn.flushCommands();
} finally {
conn.setAutoFlushCommands(true);
}
Map<String, Map<String, String>> out = new LinkedHashMap<>();
for (int i = 0; i < entityIds.size(); i++) {
List<KeyValue<String, String>> values = awaitOne(futures.get(i));
Map<String, String> row = new LinkedHashMap<>();
for (KeyValue<String, String> kv : values) {
if (kv != null && kv.hasValue()) row.put(kv.getKey(), kv.getValue());
}
out.put(entityIds.get(i), row);
}
return out;
}
One round trip for the whole batch. The first call after server startup includes a few milliseconds of Netty event-loop and connection warm-up; steady-state, the demo returns a 100-user batch in 2-5 ms against a local Redis.
A Redis Cluster is different: a single auto-flush batch is bound to one
shard, because all the queued commands ship through one connection to one
node. For batch reads on a cluster, use
RedisClusterClient — its
StatefulRedisClusterConnection exposes getConnection(slot) for
per-shard auto-flush batching, and the high-level RedisAdvancedClusterAsyncCommands
fans out non-pipelined calls per shard automatically.
A hash tag like fs:user:{vip}:u0001 forces a known set of keys onto the
same shard so one auto-flush batch can cover all of them in a single round
trip.
The streaming worker
StreamingWorker.java is the demo's stand-in for whatever Flink, Kafka
Streams, or bespoke service computes the real-time features
(source).
It runs as a daemon Thread next to the demo server so the UI can start,
pause, and resume it; in production this code would live in the streaming
layer.
The lifecycle (start / stop / pause / resume / waitForIdle) is identical to
the Jedis demo — the worker thread itself doesn't care which client it's
talking to, only that FeatureStore.updateStreaming pipelines the
HSET + HEXPIRE in order within one flush. The Lettuce helper achieves that through
the connection-level flush described above.
Pausing the worker is what shows off the mixed-staleness behavior: leave
it paused for longer than streamingTtlSeconds and the streaming fields
disappear from every user's hash one by one, while the batch fields remain
under the longer key-level EXPIRE. The demo's Pause / resume button
lets you see this happen in real time.
pause() only blocks future ticks from running. A reset that's about to
DEL every key also needs to wait out an already-running tick, which is
what waitForIdle() is for. The demo's Reset handler calls
worker.pause() and worker.waitForIdle() before it issues the DEL
sweep, so a mid-flight tick can't recreate a user under a streaming-only
hash with no key-level TTL.
The batch builder
BuildFeatures.java is the demo's nightly materializer
(source).
It generates synthetic feature rows and calls store.bulkLoad once. The
synthesis itself is not the point — in a real deployment the equivalent
code reads from the offline store (Snowflake, BigQuery, Iceberg) and writes
the resulting hashes into Redis.
Run the builder on its own (independently of the demo server) to populate Redis from the command line:
mvn exec:java -Dexec.mainClass=BuildFeatures -Dexec.args="--count 500 --ttl-seconds 3600"
That writes 500 users at fs:user:* with a one-hour key-level TTL, which
is how a typical operator would pre-seed a feature store from the command
line when debugging.
The interactive demo
DemoServer.java runs the JDK HttpServer on port 8089 with a fixed
thread pool. The HTML page lets you:
- Bulk-load any number of users (default 200) with a configurable key-level TTL.
- See the store state: user count, batch / streaming TTLs, cumulative read/write counters.
- See the streaming worker status and pause or resume it.
- Run an inference read for any user with a chosen feature subset, and see the value, the per-field TTL, and the read latency.
- Run batch scoring with a pipelined
HMGETacrossNusers. - Inspect any user's full hash with field-level TTLs and the key-level TTL.
The server holds one FeatureStore, one StreamingWorker, one
RedisClient, and one StatefulRedisConnection for the lifetime of the
process. Every HTTP handler and the streaming worker share that single
connection — Lettuce multiplexes the commands across them automatically.
Endpoints:
| Endpoint | What it does |
|---|---|
GET /state |
User count, TTL config, stats counters, worker status. |
POST /bulk-load |
Auto-flush batched HSET + EXPIRE over N synthetic users with a chosen TTL. |
POST /worker/toggle |
Pause / resume the streaming worker. |
POST /read |
HMGET a chosen feature subset for one user; report latency and per-field TTLs. |
POST /batch-read |
Pipeline HMGET across N users; report total latency and per-entity field counts. |
GET /inspect |
HGETALL + HTTL for one user; full hash view with per-field TTLs. |
POST /reset |
Drop every user under the key prefix (used by the demo's reset button). |
Prerequisites
- Redis 7.4 or later.
HEXPIREandHTTLwere added in Redis 7.4; the demo relies on per-field TTL for the mixed-staleness story. - Java 17 or later. The demo uses switch expressions with arrow labels
(
case "..." -> ...), records, and text blocks. - Lettuce 6.4 or later. The demo's
pom.xmlpins 7.5.2.RELEASE. Field-level TTL bindings (hexpire,httl,hpersist) ship from Lettuce 6.4.
If your Redis server is running elsewhere, start the demo with
--redis-uri redis://host:port.
Running the demo
Get the source files
The demo lives in a small Maven project under
feature-store/java-lettuce.
Clone the repo or copy the directory:
git clone https://github.com/redis/docs.git
cd docs/content/develop/use-cases/feature-store/java-lettuce
mvn package
Start the demo server
From the project directory:
mvn exec:java -Dexec.mainClass=DemoServer
You should see:
Dropping any existing users under 'fs:user:*' for a clean demo run (pass --no-reset to keep them).
Redis feature-store demo server listening on http://127.0.0.1:8089
Using Redis at redis://localhost:6379 with key prefix 'fs:user:' (batch TTL 86400s, streaming TTL 300s)
Materialized 200 user(s); streaming worker running.
Open http://127.0.0.1:8089. The first inference read after startup is a few milliseconds slower than the rest because Lettuce / Netty are warming up the event loop and the underlying socket; subsequent reads settle into 1-2 ms on a local Redis.
Useful things to try:
- Pick a user and click Read features with a mixed batch/streaming subset — you'll see batch fields with no per-field TTL (covered by the key-level TTL) and streaming fields with a positive per-field TTL.
- Click Pipeline HMGET with
count=100to see the latency of a 100-user batch read. - Click Pause / resume on the streaming worker and leave it paused for
~5 minutes (or restart the server with
--streaming-ttl-seconds 30to make it visible in seconds). Re-run Read features on any user and watch the streaming fields disappear while the batch fields stay. - Click Inspect on a user to see the full hash with field-level TTLs.
- Click Reset to drop every user and start over.
The server is read/write against your local Redis. The default key prefix
is fs:user:. Pass --no-reset to keep existing data across restarts, or
--redis-uri to point at a different Redis.
Production usage
The guidance below focuses on the production concerns that are specific to
running a feature store on Redis. For the generic Lettuce production
checklist — ClientResources tuning, AUTH/ACL, retry policy,
sentinel/cluster failover — see the
Lettuce client guide. For TLS
specifically, follow the
connect-with-TLS recipe.
The feature-store demo runs against localhost with the defaults; a real
deployment should harden the client first.
Pick the batch TTL to outlast a failed refresher
The whole-entity EXPIRE is your safety net against silent staleness from
a broken batch pipeline. Set it longer than your worst-case batch outage
so a single missed run doesn't take the feature store offline, but short
enough that a sustained outage causes loud failures (missing entities)
rather than quiet ones (yesterday's features being scored as today's). The
standard choice is one cycle of "expected refresh interval × 2" — for a
daily batch, 48 hours; for a 6-hour batch, 12 hours.
The same logic applies to the per-field streaming TTL: a few times the expected update interval so a slow-but-alive streaming worker doesn't churn features needlessly, but short enough that a stalled worker causes visible freshness failures.
Don't share auto-flush state across unrelated code paths
conn.setAutoFlushCommands(false) flips a connection-level toggle that
affects every call going through that connection until it's flipped
back. If two threads run pipelined writes concurrently against the same
connection, they will fight over the flag — one thread's flushCommands()
will ship the other thread's still-being-queued commands, or its
restore-to-true will flush the other thread's queue prematurely. Worse,
a single non-pipelined read on that same connection will be silently
queued (and never flushed) while the flag is off.
The demo handles this by opening two connections from the same
RedisClient:
- The shared read connection stays in default auto-flush=true mode.
Every HTTP handler and the streaming worker use it for the
non-pipelined commands (
HMGET,HTTL,TTL,SCAN,DEL,HGETALL). - The dedicated pipeline connection is reserved for
bulkLoad,updateStreaming, andbatchGetFeatures. These all acquire a singlepipelineLockinside theFeatureStoreinstance before they touch the auto-flush flag, so concurrent batches block each other instead of corrupting the state. With one lock and one connection, you get at most one in-flight batch at a time on the pipeline side; the read connection is unaffected.
For batch concurrency beyond what one connection sustains, scale this
pattern to a small
BoundedAsyncPool<StatefulRedisConnection<K, V>>
of pipeline connections and lease one per batch.
Pipeline batch reads across shards
On a single Redis instance, an auto-flush batched HMGET across N users
is one round trip. A Redis Cluster is different: a single auto-flush batch
is bound to one shard, because all queued commands ship to one node. For
batch reads on a cluster, use
RedisClusterClient and one
of:
- Fan-out via
RedisAdvancedClusterAsyncCommands— the cluster client routes eachhmGetto the right shard transparently. Easier to write, slightly more overhead per call. - Bucket keys by slot with
SlotHash.getSlot(key)and open one connection per affected shard; auto-flush-batch each bucket separately. More code, but one round trip per shard.
For a small number of frequently-queried users (a top-N customer list, for
example), a hash tag like fs:user:{vip}:u0001 forces a known set of keys
onto the same shard so one batch can cover them all.
Make HEXPIRE part of every streaming write
The single biggest correctness lever in this design is that the streaming
write applies HEXPIRE every time. If a streaming worker writes a field
without renewing its TTL, the field carries whatever expiry was there
before — possibly none, possibly stale — and the mixed-staleness invariant
breaks. Keep the HSET and HEXPIRE under the same flush boundary (or,
even safer, in the same
Lua script if you
don't trust the call site).
Avoid HGETALL on the request path
HGETALL reads every field on the hash, including ones the model doesn't
need. With dozens of features per entity, that is wasted serialization
work on the server and wasted bandwidth on the wire. Always specify the
field list explicitly with hmget in the model server.
The exception is debugging and feature-set discovery, where you genuinely
want the full hash. The demo's "Inspect" button uses hgetall for exactly
this reason.
Inspect the store directly with redis-cli
When testing or troubleshooting, the cli tells you everything:
# How many users currently in the store
redis-cli --scan --pattern 'fs:user:*' | wc -l
# One user's full hash and key-level TTL
redis-cli HGETALL fs:user:u0001
redis-cli TTL fs:user:u0001
# Per-field TTL on the streaming fields
redis-cli HTTL fs:user:u0001 FIELDS 5 \
last_login_ts last_device_id tx_count_5m failed_logins_15m session_country
# Sample HMGET as the model would issue it
redis-cli HMGET fs:user:u0001 risk_segment tx_count_7d avg_amount_30d tx_count_5m
A streaming field that returns -2 from HTTL doesn't exist on the hash
(either it was never written, or it expired); -1 means the field has no
TTL set (and is therefore covered only by the key-level EXPIRE); any
positive value is the remaining TTL in seconds.
Learn more
This example uses the following Redis commands:
HSETto write a feature or a whole feature row in one call.HMGETto retrieve any subset of features for one entity in one round trip.HGETALLfor debugging and feature-set discovery.HEXPIREandHTTLfor per-field TTL on streaming features (Redis 7.4+).EXPIREandTTLfor the whole-entity TTL aligned with the batch materialization cycle.- Pipelined
HMGETacross many entities for batch scoring with one network round trip via Lettuce's connection-level auto-flush.
See the Lettuce documentation for the full client reference, and the Hashes overview for the deeper conceptual model.