Graph Federation · Virtual DBMS · Knowledge Graph Architecture

Neo4j Virtual Graph vs
Virtuoso Virtual DBMS

A comparative analysis of two graph federation architectures — Neo4j's zero-copy warehouse integration versus Virtuoso's native multi-model virtual database engine. Examines federation model, query compilation strategy, multi-model support, identifier systems, maturity timelines, and the fundamental philosophical divide between adding graph queries to warehouses versus building a platform where the graph is a first-class Web citizen.

Overview

Neo4j Virtual Graph vs Virtuoso: A Comparative Analysis

In May 2026, Neo4j introduced Virtual Graph — a zero-copy capability that compiles Cypher into SQL for in-place execution on Snowflake and Databricks. This represents a significant architectural shift for Neo4j: moving from a native-graph-first philosophy toward warehouse federation. Virtuoso has operated on the inverse principle since the 1990s: a virtual DBMS engine where SQL, SPARQL, openCypher, GQL, and GraphQL all compile through a common query processor against heterogeneous data sources — relational tables, RDF graphs, external SPARQL endpoints, and web services — using HTTP-based hyperlinks as standardized entity identifiers. This analysis compares the two architectures across federation model, query compilation, multi-model support, identifier strategy, maturity, and ecosystem positioning.

2026
Virtual Graph Launch
1990s
Virtuoso Virtual DBMS
5
Query Languages
2
Platforms Compared
8
Comparison Dimensions
25+ yrs
Production Heritage
Platforms

Platforms Compared

Neo4j Virtual Graph

Announced May 2026 (private preview). A zero-copy graph virtualization layer that compiles Cypher into SQL for in-place execution on Snowflake and Databricks. Uses AI to generate graph data models from warehouse schemas, with a deterministic query translator and dedicated graph compute layer for traversals and algorithms.

Query languages: Cypher. Data sources: Snowflake, Databricks (JDBC/SQL planned). Latency: seconds-to-minutes (warehouse-grade). Identifier model: internal node IDs + foreign keys + AI-inferred relationships.

Announcement blog post →

Virtuoso Universal Server

Production-deployed since the 1990s. A multi-model, multi-purpose platform that loosely couples SQL, SPARQL, openCypher, GQL, and GraphQL to underlying data via a virtual DBMS engine. Provides declarative RDF Views over relational data, cross-endpoint SPARQL federation, and HTTP URI-based entity identifiers.

Query languages: SQL, SPARQL, openCypher, GQL, GraphQL. Data sources: any ODBC/JDBC source, SPARQL endpoints, RDF graphs, WebDAV. Latency: milliseconds (native) to seconds (federated). Identifier model: HTTP IRIs as universal super-keys (Linked Data principles).

Virtuoso home page →

Comparison

Architectural Comparison

DimensionNeo4j Virtual GraphVirtuoso Universal Server
Federation Model Bolt-on warehouse federation via Cypher-to-SQL compilation. Announced May 2026. Native virtual DBMS engine with multi-model query compilation. In production since the 1990s.
Query Languages Cypher only. GQL parity on roadmap. SQL, SPARQL, openCypher, GQL, GraphQL — loosely coupled. Language and storage are architecturally decoupled.
Query Compilation Deterministic Cypher→SQL compiler targeting Snowflake/Databricks SQL. Not LLM-driven. Deterministic SPARQL/Cypher/GQL→SQL compiler targeting native SQL engine with cost-based optimizer tuned over decades. SPARQL-FED for cross-endpoint queries.
Data Sources Snowflake, Databricks at launch. JDBC/SQL sources on roadmap. Any ODBC/JDBC source, SPARQL endpoints, RDF graphs, WebDAV repositories, web services. Broader surface area reflecting decades of evolution.
Identifier System Internal node IDs + foreign keys. AI infers relationships where foreign keys are absent. HTTP IRIs as universal super-keys (Linked Data principles). Hyperlinks enable cross-system entity resolution without AI inference.
Inference & Reasoning Not native to Virtual Graph. Relies on warehouse schema (foreign keys) + AI model inference for relationship discovery. Native RDFS/OWL subclass and subproperty entailment. Custom ontologies (shared or domain-specific) provide inference rules that enrich virtualized queries across heterogeneous sources. SPASQL embeds SPARQL patterns — including inference — directly into SQL, bringing reasoning to existing SQL applications unobtrusively.
Latency Profile Seconds-to-minutes via Virtual Graph (warehouse-grade). Milliseconds via native AuraDB for operational workloads. Milliseconds (native graph) to seconds (federated). One engine handles the full spectrum — no forced trade-off between OLTP and OLAP.
Maturity Private preview (May 2026). Production readiness and GA date unstated. 25+ years in production. RDF Views production-stable since ~2006. Powers DBpedia, enterprise Knowledge Graphs globally.
Philosophy Graph-native company adding virtual capabilities outward to warehouses — adding virtual to graph. Virtual Graph complements native AuraDB. Virtual-first multi-model platform that added graph capabilities on top of its relational engine — added graph to virtual. Open source + commercial.
Features

Feature Matrix

CapabilityNeo4j Virtual GraphVirtuoso
Zero-copy graph over SQL ✓ Cypher→SQL (Snowflake, Databricks) ✓ SPARQL→SQL (any JDBC/ODBC, since ~2006)
Native graph storage ✓ Via AuraDB (separate product) ✓ Built-in column-store RDF engine
AI-assisted graph model generation ✓ Built-in AI inspects warehouse schema, infers nodes/relationships/properties. Visual editor for corrections before committing. Snowflake/Databricks only at launch. ✓ Natural language, agent-driven via MCP. Once the OPAL (OpenLink AI) layer is initialized, an AI agent inspects any ODBC, JDBC, or HTTP-accessible data source through MCP tools, proposes entity and relationship mappings conversationally, and generates RDF Views (Quad Map Patterns) on demand. The mapping is auditable DDL — not a black box. Works across any backend, not limited to specific warehouses. The AI is not baked into the product UI; it's invoked through an open protocol by any agent.
Cross-endpoint federation ✗ Not available (Aura+VG composite on roadmap) ✓ SPARQL-FED (W3C standard). SQL ATTACH for cross-DB joins
Graph algorithms library ✓ GDS: PageRank, community detection, pathfinding ✓ SPARQL-BI handles graph analytics using common graph algorithms. SPARQL property paths for transitive closure and reachability. SPASQL embeds graph pattern matching in SQL analytics queries. Weighted degree centrality and other measures computable within the engine
GraphRAG ◐ Test value without ETL; native primitives on roadmap ✓ Vector/hybrid search + SPARQL + Wikidata/DBpedia federation
LLM entity grounding ✗ Not addressed in announcement ✓ HTTP IRIs as built-in dereferenceable citation anchors
W3C standards ✗ None (Cypher contributed to ISO GQL) ✓ SPARQL, RDF, Linked Data, RDFS/OWL entailment
ISO GQL support ◐ Cypher contributed to GQL; parity on roadmap ✓ GQL preview demonstrated (May 2026)
SPARQL inside SQL (SPASQL) ✗ No SPARQL support ✓ SPASQL embeds SPARQL sub-queries, property paths, and federation directly in SQL statements — brings graph pattern matching and inference to existing SQL users and applications without requiring them to learn a new query language
Ontology-driven reasoning ✗ Not available. AI-inferred schema mapping only. ✓ RDFS/OWL subclass/subproperty entailment. Shared and custom ontologies drive inference across virtualized data. Reasoning enriches queries transparently — users get inferred relationships without writing explicit join logic.
LOD Cloud infrastructure ✗ No role in LOD Cloud infrastructure ✓ Powers the majority of live SPARQL endpoints behind the massive Linked Open Data (LOD) Cloud. Every one of those endpoints is also SQL-accessible via SPASQL — SPARQL inside SQL brings LOD Cloud data to the entire SQL ecosystem without requiring users to learn a new query language
MCP tooling for AI Agents ✗ Not available as MCP tools ✓ All core Virtuoso capabilities exposed as MCP tools — SQL, SPARQL, SPASQL, SPARQL-FED, GraphQL, RDF Views generation, entity discovery, WebDAV, and license management — enabling AI agents and skills to operate directly against databases, knowledge bases, filesystems, and APIs
Multi-model (relational + graph) ✗ Graph-only; requires separate warehouse for SQL ✓ Single engine: relational + RDF + property graph
ACID transactions on graph ✓ Via native AuraDB (not Virtual Graph) ✓ On native RDF storage
Open source ✗ Enterprise/commercial; Virtual Graph pricing undisclosed ✓ Full virtualization in open source edition
Self-hosted deployment ◐ Self-managed Enterprise only; Virtual Graph Aura-only at launch ✓ On-premises, cloud, or hybrid
Real-time graph queries ✗ Via Virtual Graph (warehouse latency: seconds–minutes) ✓ Millisecond SPARQL on native storage
Guidance

Use Case Guidance

Use CaseRecommendationRationale
Graph queries on Snowflake/Databricks data without ETL Either Neo4j VG offers native Snowflake/Databricks integration with AI-generated models — minutes to start if you're already in those ecosystems. Virtuoso has offered the same zero-copy pattern via standard ODBC and JDBC connectivity to Snowflake, Databricks, or any other warehouse for decades — no proprietary coupling to specific vendors. Choose based on whether you prefer native warehouse API integration (Neo4j) or standards-based connectivity applicable to all warehouses uniformly (Virtuoso).
Graph queries on Oracle, SQL Server, PostgreSQL, MySQL, or any ODBC/JDBC source Virtuoso RDF Views support any ODBC/JDBC backend — production-stable for ~20 years. Neo4j VG is Snowflake/Databricks only at launch.
Multi-language access: SQL + SPARQL + Cypher + GQL on same data Virtuoso Five query languages loosely coupled to one engine. Neo4j VG supports Cypher only.
Cross-organizational knowledge graph linking internal data with Wikidata/DBpedia Virtuoso SPARQL-FED enables a single query to span internal RDF Views + live Wikidata + live DBpedia. HTTP IRIs provide universal entity identification across systems.
LLM-backed GraphRAG with verifiable entity citations Virtuoso HTTP IRIs serve as built-in citation anchors. LLM outputs link to dereferenceable entities. Federation pulls context from public knowledge graphs alongside internal data.
Packaged graph algorithms with polished developer UX Neo4j GDS library provides PageRank, community detection, pathfinding with strong UX and a unified API. Virtuoso's SPARQL-BI handles common graph algorithms and the results are queryable via SPARQL and SQL, but it lacks the packaged library experience of GDS. If polished algorithm UX is the primary decision driver, Neo4j leads here.
Real-time fraud detection, identity resolution, or agentic decisioning (millisecond latency) Either Both platforms deliver millisecond graph performance on native storage. Neo4j AuraDB uses its native property graph engine. Virtuoso uses its native column-store RDF engine with SPARQL and SQL — open standards rather than vendor-specific query languages. The difference is the query surface: Cypher (Neo4j) vs SPARQL + SQL (Virtuoso). If your fraud/identity workload benefits from linking to external identity graphs (Wikidata, DBpedia, organizational LDAP), Virtuoso's federation and HTTP IRI model add real-time entity resolution across systems.
Open source graph virtualization with no license fees Virtuoso Full RDF Views + SPARQL + federation included in open source edition. Neo4j VG pricing undisclosed; Enterprise licensing required for production.
Standards-first architecture (W3C, ISO) for regulated environments Virtuoso Built on W3C SPARQL/RDF/Linked Data standards. ISO GQL support. Multi-decade standards participation.
AI-assisted graph model generation from warehouse schema Either Neo4j bundles AI into the Virtual Graph UI — inspects tables, infers entities/relationships, visual editor. Virtuoso achieves the same via MCP tools + any AI agent through the OPAL layer — natural language inspection of any ODBC/JDBC/HTTP source, conversational mapping, and RDF Views generation. The outcome is equivalent; the difference is built-in proprietary AI (Neo4j) vs open-protocol agent-driven AI (Virtuoso).
Existing Neo4j/Cypher investment + data in Snowflake/Databricks Neo4j Familiar Cypher syntax, Aura console, existing toolchain. Low friction to add graph reasoning to warehouse data already in Snowflake/Databricks.
Maximum data source breadth and deployment flexibility Virtuoso Any ODBC/JDBC + SPARQL + RDF + WebDAV. On-prem, cloud, hybrid. No vendor lock-in. Open source base with commercial support optional.
Self-sovereign identity at Internet and Web scale Virtuoso HTTP IRIs + WebID + PKCS#12 + mTLS provide a complete SSI stack. Every entity — user, agent, dataset, API — has a dereferenceable, cryptographically verifiable identity. Graph virtualization maps identities across systems via owl:sameAs. Authentication (mTLS), authorization (WebID-ACL), identification (HTTP IRIs), and storage (WebDAV/RDF) are loosely coupled — each layer can evolve independently without breaking the others.
Loose coupling of AI Agents and Agent Skills with Data Spaces Virtuoso Virtuoso exposes databases, knowledge bases, filesystems, and APIs as a unified data space via MCP tools (SQL, SPARQL, SPASQL, SPARQL-FED, GraphQL). AI agents use these tools to read, query, and reason across heterogeneous sources through a single protocol. Agent skills (kg-generator, rdf-infographic, data-twingler, acp-client, mpp-stripe-client) compose these tools into higher-order workflows — all backed by the same virtual DBMS engine.
RDF-based AI harnesses and behavioral contracts Virtuoso Agent behavioral contracts, session memory, and skill rules expressed as RDF (Turtle) — self-describing, queryable, and resolvable. An agent's standing instructions, entity resolution rules, and validation gates are structured data, not flat prose. This document is a live showcase: its companion RDF knowledge graph is hosted on Virtuoso, resolved via URIBurner, and queried through the embedded SPARQL workbench — demonstrating the very virtualization, federation, and linked data capabilities it describes.
Findings

Key Findings

The same architectural pattern — two decades apart

Neo4j Virtual Graph's Cypher-to-SQL compilation over external warehouses is architecturally identical to Virtuoso's SPARQL-to-SQL compilation over RDF Views — a pattern Virtuoso has delivered in production since ~2006. What Neo4j announced as a private preview in May 2026, Virtuoso has been doing for twenty years.

~20 yrs
Virtuoso Head Start
5
Virtuoso Query Languages
1
Neo4j VG Language (Cypher)
Any
Virtuoso Data Sources
2
Neo4j VG Data Sources
SPARQL-FED
Virtuoso Cross-Endpoint

Where Neo4j Leads

Packaged graph algorithm library (GDS) with strong developer UX. AI-assisted graph model generation from warehouse schemas. Brand recognition and community in the property graph space. Clear narrative for GraphRAG and agentic AI workloads. A natural fit for organizations already invested in the Neo4j ecosystem with data in Snowflake or Databricks.

Where Virtuoso Leads

Production maturity: ~20 years of virtualization vs private preview. Data source breadth: any ODBC/JDBC source vs two warehouses. Query language breadth: five languages loosely coupled. Cross-endpoint federation via SPARQL-FED with live Wikidata/DBpedia integration. HTTP IRIs as universal keys enabling genuine Linked Data. Full virtualization included in open source.

The Philosophical Divide

Neo4j Virtual Graph adds graph queries to your existing warehouse — graph reasoning as a feature. Virtuoso treats the graph as a first-class Web citizen — entities identified by HTTP IRIs, data virtualized through declarative mappings, and query languages as interchangeable interfaces to a unified engine. One is a graph capability add-on; the other is a platform built on the architectural principle that data, information, and knowledge are distinct layers.

Knowledge Graph

KG Explorer

Interactive visualization of the comparative analysis knowledge graph. Click nodes to explore via URIBurner.

— nodes / — links
Click outside to release zoom
Groups:
main
endpoint
catalog
dataset
org
country
product
section
faq
glossary
howto
class
property
instance

⚙ Advanced Settings

-250
70px
Toggle predicates
FAQ

Frequently Asked Questions

Neo4j Virtual Graph is a zero-copy graph federation capability announced in May 2026 (private preview). It compiles Cypher queries into SQL for in-place execution on Snowflake and Databricks, enabling graph reasoning without data relocation. It uses AI to generate graph data models from warehouse schemas, automatically detecting entities, relationships, and properties.
Virtuoso's virtual DBMS is a multi-model database engine that has supported data virtualization since the 1990s. It allows SQL, SPARQL, openCypher, GQL, and GraphQL queries to execute against heterogeneous data sources — relational tables, RDF graphs, external SPARQL endpoints, and web services — through a common query processor without data duplication.
Both compile graph queries deterministically to SQL — this is not LLM-driven in either case. Neo4j compiles Cypher to Snowflake or Databricks SQL. Virtuoso compiles SPARQL/Cypher/GQL to its native SQL engine, with SPARQL-FED enabling cross-endpoint federation. The key difference: Neo4j targets external warehouse engines; Virtuoso uses its own engine as the compilation target while also federating to external sources.
Virtuoso RDF Views (mid-2000s) transforms relational database tables into RDF Knowledge Graphs through declarative Quad Map Patterns — without copying data. SPARQL queries against these virtual RDF graphs are compiled to SQL and executed in-place on the underlying relational engine. This is architecturally identical to Neo4j Virtual Graph's Cypher-to-SQL compilation, but predates it by approximately 20 years.
Virtuoso uses HTTP-based hyperlinks (URIs) as standardized entity identifiers following Linked Data principles. These serve as super keys for entity relationship graphs, enabling cross-system entity resolution without AI inference. Neo4j Virtual Graph relies on foreign keys declared in warehouse schemas, supplemented by AI-inferred relationships where foreign keys are absent.
Neo4j Virtual Graph supports Cypher only. Virtuoso supports five query languages: SQL, SPARQL, openCypher, GQL (ISO standard), and GraphQL — all against the same underlying data through its multi-model query processor. GQL support in Virtuoso was demonstrated publicly in May 2026.
Virtuoso's virtual DBMS has been in production for over 25 years, with RDF Views available since the mid-2000s. Neo4j Virtual Graph entered private preview in May 2026. The architectural concepts are similar — deterministic graph-to-SQL compilation — but the production heritage differs by decades.
Use Virtual Graph when data already lives in Snowflake or Databricks and you want graph reasoning without ETL. Use native Neo4j (AuraDB) for millisecond-latency graph traversals, real-time agentic workloads, ACID transactions, and continuously updated operational graphs. The platforms are complementary, not substitutes.
Neo4j Virtual Graph launches with Snowflake and Databricks, with JDBC/SQL sources planned. Virtuoso's virtual DBMS supports any JDBC/ODBC source, SPARQL endpoints, RDF document stores, WebDAV repositories, and web services — a broader surface area reflecting its longer evolution as a virtual engine.
Neo4j is a graph-native company that built AuraDB first and is now adding federation outward to warehouses. Virtuoso was built as a virtual-first multi-model platform that added graph capabilities (RDF, SPARQL, Cypher, GQL) on top of its relational engine. One is adding virtual to graph; the other added graph to virtual.
Yes — Virtuoso RDF Views compiles SPARQL to SQL against any JDBC/ODBC relational source, providing zero-copy graph virtualization that predates Neo4j Virtual Graph by ~20 years. Virtuoso also adds SPARQL-FED federation across external endpoints, something Neo4j Virtual Graph does not currently offer.
Virtuoso is built on W3C standards: SPARQL (2008), RDF (1999), Linked Data principles, and ISO GQL (2024). Neo4j's Cypher began as a proprietary language (later open-sourced via openCypher) and contributed to the ISO GQL standard. Both participate in GQL standardization, but Virtuoso's standards alignment is foundational while Neo4j's is evolutionary.
Glossary

Key Terms & Concepts

Cypher
Neo4j's declarative graph query language, inspired by ASCII art patterns for representing graph traversals. Open-sourced as openCypher in 2015. Contributed to the ISO GQL standard. Supported by Neo4j Virtual Graph for warehouse queries.
Query Federation
The ability to execute a single query across multiple data sources. SPARQL-FED enables a Virtuoso SPARQL query to span local graphs, remote endpoints, and RDF Views simultaneously. Neo4j Virtual Graph is adding federation to warehouses; Virtuoso has federated across SPARQL endpoints for over a decade.
GQL (Graph Query Language)
ISO/IEC 39075:2024 — the first international standard graph query language. Both Neo4j (via Cypher heritage) and Virtuoso (direct implementation) support GQL, making it the convergence point for graph query standardization.
GraphRAG
Graph Retrieval-Augmented Generation — using knowledge graphs to provide structured context to LLMs. Both Neo4j (via Graph Tools) and Virtuoso (via SPARQL + URIBurner) support GraphRAG patterns, but with different graph models: labeled property graphs vs RDF knowledge graphs.
Linked Data Principles
A set of best practices for publishing structured data on the web using HTTP URIs as entity identifiers, RDF as the data model, and hyperlinks to connect entities across systems. Virtuoso is built on Linked Data principles; Neo4j uses internal node IDs and foreign keys.
Multi-Model Database
A database system supporting multiple data models (relational, RDF graph, property graph, document) and query languages within a single engine. Virtuoso is a multi-model database; Neo4j is primarily a graph database complementing its native engine with virtual warehouse access.
RDF Views
Virtuoso feature that declaratively maps relational database schemas to RDF ontologies via Quad Map Patterns. SPARQL queries against these virtual RDF graphs are compiled to SQL. Functionally equivalent to Neo4j Virtual Graph's schema-to-graph mapping, but predating it by ~20 years.
SPARQL
W3C standard query language for RDF graphs (2008). Supported natively by Virtuoso for querying Knowledge Graphs, RDF Views over relational data, and federated queries across remote endpoints (SPARQL-FED).
Virtual DBMS
A database management system where the query engine operates as a virtual layer over heterogeneous data sources. Virtuoso pioneered this architecture in the 1990s, long before the modern 'data virtualization' and 'query federation' movements.
Zero-Copy Architecture
An architectural pattern where data remains in its original storage system while external query engines operate on it virtually. Neither Neo4j Virtual Graph nor Virtuoso RDF Views physically relocate data — both compile queries to SQL for in-place execution.
Guide

How to Evaluate Graph Federation Architectures

1

Identify your data residency requirements

Determine whether your data can move or must stay in place due to governance, scale, or organizational constraints. If data must stay in a warehouse, both Neo4j Virtual Graph and Virtuoso RDF Views are viable zero-copy options.

2

Map your query language needs

List the query languages your teams and tools require. If only Cypher is needed and data is in Snowflake/Databricks, Neo4j Virtual Graph is a natural fit. If you need SQL, SPARQL, GQL, or GraphQL alongside graph queries, Virtuoso's multi-model architecture provides unified access.

3

Assess your identifier strategy

Evaluate how entities are identified across your data landscape. If you use HTTP URIs and Linked Data principles, Virtuoso's native URI-based graph model is aligned. If you rely on warehouse foreign keys, Neo4j's AI-inferred relationship discovery may be more pragmatic.

4

Evaluate federation breadth

Catalog your data sources beyond warehouses — SPARQL endpoints, existing RDF graphs, linked open data, WebDAV repositories. If your federation needs extend beyond SQL warehouses, Virtuoso's broader data source support (JDBC/ODBC, SPARQL, RDF) may be required.

5

Consider latency and workload profiles

For millisecond-latency operational graph workloads, both platforms recommend native graph storage (Neo4j AuraDB or Virtuoso's native RDF store). For analytical graph queries with seconds-to-minutes latency tolerance, either virtual approach works.

6

Review standards alignment

Check whether your architecture requires ISO standards compliance (GQL, SPARQL) or W3C standards (RDF, Linked Data). Virtuoso's multi-decade standards participation provides a standards-first foundation; Neo4j's Cypher-to-GQL evolution is converging on the same standards.

7

Run a proof-of-concept on your own data

Both Neo4j and Virtuoso offer evaluation paths. Test the same analytical graph query against your actual warehouse data using each platform's virtual/federation capability. Measure query performance, model accuracy, and integration effort before committing.

Workbench

SPARQL Workbench

SELECT queries use text/x-html+tr. DESCRIBE and CONSTRUCT use text/x-html-nice-turtle.

Named graph: https://linkeddata.uriburner.com/DAV/demos/daas/neo4j-virtual-graph-virtuoso-comparison-deepseek_v4pro-1.ttl

Explore Knowledge Graph using SPARQL