The Missing Layer in Your AI Stack

Get the Forrester report to see what’s changing.

ArangoDB 3.4 Introduces Native Search Engine and Full GeoJSON Support

Over 40 New Features Added to Simplify Application Development

San Francisco, CA, United States, December 6th, 2018 – ArangoDB, the leading open source native multi-model database, today announced the GA release of ArangoDB 3.4 – a transactional database solution which enables developers to efficiently interact with multiple data models by using just one technology and one query language. Major new enhancements in ArangoDB 3.4 include ArangoSearch, a feature which transforms ArangoDB, when combined with traversals or joins in AQL, from a data retrieval to an information retrieval solution; and full GeoJSON Support enabled by a Google S2 Geo Index library integration.

ArangoSearch, the result of four years of research and development, combines Boolean and generalized ranking retrieval models (e.g. vector space model). Providing a rich set of information retrieval capabilities, ArangoSearch consists of 2 components – a search engine and an integration layer. The former is responsible for managing the index, querying and scoring, whereas the latter provides search capabilities for the end user in a convenient way. ArangoSearch can be combined with all three data models in ArangoDB. If used in conjunction with graph database capabilities, search results could be used, for example, to enhance fraud protection, individualize recommendations or simplify precision medicine.

Search uses a special kind of materialized view to enable full-text search on multiple collections at once. Within the view definition one can specify entire collections or individual fields that should be covered by an inverted index using one or several general text analyzers. In search queries expressed with AQL, you can rank the results using multiple scorers (TFIDF and BM25) even combined. Users can now perform relevance-based matching, phrase and prefix matching, search with complex Boolean expressions, query time relevance tuning and combine complex traversals, geo-queries, and other access patterns with information retrieval techniques.

ArangoDB 3.4 includes full support for GeoJSON, an open standard format designed for representing simple geographical features, along with their non-spatial attributes. The support encompasses all geo primitives, including multi-polygons or multi-line strings. In 3.4 there has been a distinct engineering focus on increasing query and filtering functionality and optimizing performance. To this end, 3.4 also includes a Google S2 Geometry Library integration which complements ArangoDB’s RocksDB storage engine. Additionally, users can directly visualize results in OpenStreetMap which is integrated into the Query Editor of ArangoDBs WebUI.

Other notable enhancements in ArangoDB 3.4 include:

  • Query Profiler: to provide developers with more insight into complex queries, it is now possible to execute the query with special instrumentation code enabled resulting in a printed query plan with detailed execution statistics. It is now much easier to profile your queries and get insights into how much time was spent where.
  • Cluster Management: enhancements include faster cluster startup, synchronization and query execution. To increase the reliability and predictability of the ArangoDB cluster, internal protocols and request handling have been significantly overhauled to improve cluster-wide query execution, an example being Distributed Collect.
  • Streaming Cursors: at times the overall query performance is not a major priority, but rather how fast a user can obtain first results. Based on community feedback, 3.4 includes integrated streaming cursors which provides first results as they become available on the server.
  • RocksDB is now the default Storage Engine: previous versions of ArangoDB used MMfiles as the default storage engine. With 3.4, this has changed to RocksDB. This provides numerous advantages to the user including optimized binary storage format, optional caching, reduced replication catch-up time, an exclusive collection access option, and enhanced WAL sync control.

A full list of all the new features is available here.

Claudius Weinberger, CEO of ArangoDB, said: “Improved usability and enhanced application performance are at the heart of every release we deliver. We are constantly reviewing the functionality of our native multi-model solution to ensure it competes, and in many cases outperforms, single-model alternatives. In 3.4 we have introduced ArangoSearch and extended the geo-spatial search capabilities of our database, which is a huge step forward for our technology. Our commitment to innovation is reflected in the quantity of new features available in this release.”


About Arango 

Arango provides a trusted data foundation for Contextual AI — transforming enterprise data into a System of Context that truly represents the business, so LLMs can deliver better outcomes with unlimited scale and cost efficiency.

The Arango AI Data Platform gives developers a single, integrated environment to build and scale AI-powered applications without the complexity of stitching together multiple databases and tools. At its core is a massively scalable multi-model database that unifies graph, vector, document, and key-value data with full-text, geospatial, and vector search — creating the System of Context, the bridge between enterprise data and LLMs.

The Arango AI Suite includes automated data pipelines, multimodal data ingestion, AIOps and MLOps, LLM integrations, Graph Analytics, agentic frameworks for context-aware Hybrid/GraphRAG, GraphML, natural-language support, and GPU acceleration — enabling repeatable ROI and faster innovation.

Trusted by NVIDIA, HPE, the London Stock Exchange, the U.S. Air Force, NIH, and Articul8, Arango powers enterprise AI with context, confidence, and scale.

We are a proud member of the NVIDIA Inception Program and the AWS ISV Accelerate Program. Learn more at arango.ai, LinkedIn, YouTube, and G2.
marketing@arangodb.com