ScyllaDB Brings Massive-Scale Vector Search to Real-Time AI

January 20, 2026 at 09:00 AM EST

SUNNYVALE, Calif., Jan. 20, 2026 (GLOBE NEWSWIRE) -- ScyllaDB today announced the general availability of ScyllaDB Vector Search, a high-performance vector search capability that supports the industry’s largest models at a low total cost of ownership (TCO).

ScyllaDB is commonly used for real-time AI workloads such as latency-sensitive machine learning, predictive analytics, and fraud detection. It is trusted by high-growth companies such as Tripadvisor, ShareChat, and Freshworks to power large-scale latency-sensitive feature stores. As ScyllaDB’s customers began adopting vector search, many found standalone vector databases to be overly complex and costly at scale. In response, ScyllaDB added Vector Search to its ScyllaDB Cloud offering.

ScyllaDB Vector Search is built on ScyllaDB’s shard-per-core architecture with a Rust-based extension that leverages the USearch approximate-nearest-neighbor (ANN) search library. The architecture separates storage and indexing responsibilities while keeping the system unified from the user’s perspective.

The ScyllaDB nodes store both the structured attributes and the vector embeddings in the same distributed table.
The dedicated Vector Store service consumes updates from ScyllaDB via Change Data Capture (CDC) and builds ANN indexes in memory.
Queries are issued to the database, then internally routed to the Vector Store.

This design allows each layer to scale independently, optimizing for its own workload characteristics and eliminating resource interference.
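The data flow described above can be illustrated with a toy, single-process sketch. Everything here is a hypothetical stand-in rather than ScyllaDB's API: `Table`, `VectorStore`, and `query` are names invented for illustration, the change log is a plain Python list instead of CDC, and the index does an exact brute-force scan where the real Vector Store uses USearch's approximate-nearest-neighbor index.

```python
from dataclasses import dataclass, field
import math


@dataclass
class Table:
    """Hypothetical stand-in for a table holding attributes and embeddings together."""
    rows: dict = field(default_factory=dict)
    change_log: list = field(default_factory=list)  # simplified stand-in for CDC

    def upsert(self, key, attrs, embedding):
        # Store the structured attributes and the vector in the same row,
        # and record the change so an index service can pick it up later.
        self.rows[key] = {"attrs": attrs, "embedding": embedding}
        self.change_log.append((key, embedding))


class VectorStore:
    """Hypothetical stand-in for the separate indexing service."""

    def __init__(self):
        self.index = {}   # in-memory index: key -> embedding
        self.offset = 0   # position of the next unread change-log entry

    def consume(self, table):
        # Apply only the changes that have not been seen yet.
        for key, embedding in table.change_log[self.offset:]:
            self.index[key] = embedding
        self.offset = len(table.change_log)

    def nearest(self, vector, k):
        # Exact brute-force scan by Euclidean distance (assumption: a real
        # deployment would use an approximate index instead).
        ranked = sorted(self.index, key=lambda key: math.dist(self.index[key], vector))
        return ranked[:k]


def query(table, store, vector, k=1):
    """Database-side entry point: route the search, then attach row attributes."""
    store.consume(table)
    return [(key, table.rows[key]["attrs"]) for key in store.nearest(vector, k)]


table = Table()
store = VectorStore()
table.upsert("a", {"label": "cat"}, [0.0, 0.0])
table.upsert("b", {"label": "dog"}, [1.0, 1.0])
table.upsert("c", {"label": "fox"}, [5.0, 5.0])

result = query(table, store, [0.9, 1.2], k=2)
print(result)
```

The caller only ever talks to `query`, which mirrors the point that the system stays unified from the user's perspective even though storage and indexing are separate components.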

The combination of ScyllaDB’s hardware-optimized shard-per-core architecture, built for high operations-per-second query throughput, and USearch’s C++ implementation with 10x performance gains over FAISS is well suited to massive real-time AI workloads. Together, they enable industry-leading time-to-first-token.

In recent 1-billion-vector benchmarks, ScyllaDB Vector Search achieved sub-2 ms P99 latency at up to roughly 250,000 queries per second on large-scale similarity search workloads. These tests used the publicly available yandex-deep 1b dataset, which contains 1 billion vectors, on a 3 + 3 node setup that reflects realistic production deployments.
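For readers less familiar with the two metrics quoted here, the following minimal sketch shows one common way to compute them. It assumes the nearest-rank percentile definition and uses made-up sample values; the helper names `p99` and `throughput_qps` are hypothetical, and nothing here reproduces the benchmark's actual methodology or data.

```python
def p99(latencies_ms):
    """Nearest-rank 99th percentile: the value that 99% of samples do not exceed."""
    ordered = sorted(latencies_ms)
    # ceil(0.99 * n) computed with integers to avoid floating-point rounding.
    rank = -(-99 * len(ordered) // 100)
    return ordered[rank - 1]


def throughput_qps(num_queries, wall_seconds):
    """Queries completed per second of wall-clock time."""
    return num_queries / wall_seconds


# 100 hypothetical latency samples in milliseconds (illustrative only).
samples = [1.0] * 98 + [1.8, 9.0]

tail_latency = p99(samples)
qps = throughput_qps(500_000, 2.0)
print(tail_latency, qps)
```

A P99 figure ignores the slowest 1% of requests, which is why the single 9.0 ms outlier in the sample data does not affect it.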

"ScyllaDB supports the most scalable vector search deployments at monstrous speed," said Dor Laor, co-founder and CEO at ScyllaDB. "Based on publicly available benchmarks, ScyllaDB currently demonstrates the fastest vector search performance at billion vector scale. It also offers excellent TCO across all model sizes. That means teams can support and scale their largest AI inference workloads without the traditional performance-cost tradeoffs."

About ScyllaDB
ScyllaDB is a specialty database for workloads that require predictable performance at scale. It’s adopted by organizations that require ultra-low latency, even at millions of features or operations per second, billions of embeddings, or petabytes of storage. ScyllaDB’s shard-per-core architecture taps the full power of modern infrastructure, translating to fewer nodes, less admin, and lower costs. More than 400 companies, including Disney+, Discord, Tripadvisor, Expedia, Zillow, Starbucks, and Comcast, use ScyllaDB for their toughest database challenges. For more information: https://www.scylladb.com/

Media Contact
Wayne Ariola
wayne.ariola@scylladb.com
