Overview

Introduction

Rockset is a serverless search and analytics engine for building analytics dashboards and APIs on terabytes of data. It turns data in databases and streams into highly performant SQL tables without requiring schemas, structural transformations or pipelines.

With Rockset as a backend, you can power:

  • Full SQL analytics over data in one or more data sources
  • Real-time dashboards with ad-hoc queries and drilldowns
  • Application features such as search, filtering and ranking

[Figure: Rockset Overview Diagram]

Why Rockset?

Rockset can be used as a backend to power operational dashboards and real-time APIs. The following properties make it well suited for this:

  • Serverless: No need to provision capacity or manage servers. Operational overhead is minimal.
  • Continuous Ingest: With read-only permissions to cloud data sources, Rockset can ingest data from them and stay in sync with insertions, deletions and updates in the source automatically.
  • Smart Schemas: Rockset does not require a schema ahead of time because the data itself is the source of truth. The schema evolves automatically to fit new fields and types.
  • Full SQL support: Aggregations, filtering, windowing and joins work over all types of fields, including nested objects and arrays. This enables the full expressiveness and flexibility of SQL queries over data in supported data sources, even if they don’t natively support SQL.
  • Converged Indexing: Multiple indexes are constructed automatically as each record is ingested. This helps queries run fast enough for interactive and real-time usage.
  • Security: All data at rest is encrypted using AES-256. Data is inserted and queried over SSL.
  • REST API: All functionality is available via REST and via the SDKs and clients built on top of REST. This includes JDBC support, which allows existing tools and libraries to interface directly with Rockset.
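
To make the Smart Schemas property above more concrete, here is a toy Python sketch of a schema that grows as documents arrive. It is an illustration only and not Rockset's implementation; the function name `observe` and the dotted field-path format are assumptions made for this example.

```python
from collections import defaultdict


def observe(schema, doc, prefix=""):
    """Record the type of every field in `doc`, descending into nested objects."""
    for key, value in doc.items():
        path = f"{prefix}{key}"
        # A field may hold different types in different documents,
        # so each path maps to a set of observed type names.
        schema[path].add(type(value).__name__)
        if isinstance(value, dict):
            observe(schema, value, prefix=f"{path}.")
    return schema


schema = defaultdict(set)
observe(schema, {"user": {"id": 1}, "tags": ["a", "b"]})
# A later document adds a new field and uses a different type for user.id.
observe(schema, {"user": {"id": "u-2", "email": "x@example.com"}})
```

After both calls, `schema["user.id"]` holds both `int` and `str`, and `user.email` appears without any schema having been declared up front.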

The next section outlines the ways to load data into Rockset. The sections after it cover how to query that data and how to access Rockset programmatically via the API.

Loading Data

Data Sources

Rockset manages data ingestion and can continuously sync data from databases, lakes and streams without any ETL pipelines. The data itself can be in different formats such as JSON, XML, CSV and Apache Parquet. Follow the links below to learn more about how to set up an integration with the supported data sources.

If your data is not in any of these sources, or you want to stream events directly into Rockset, you can use the Write API. Using one of the supported client SDKs or the REST API, you can stream records to Rockset in real time to be indexed and turned into fast SQL tables.
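
As a rough sketch of what a Write API call over REST might look like, the Python below builds (but does not send) a request that adds documents to a collection. The host name, the URL path layout, the `ApiKey` authorization header format, the `{"data": [...]}` body shape, and the workspace and collection names are all assumptions for illustration; check them against the API reference for your account before use.

```python
import json
import urllib.request

# Assumed regional API server; yours may differ.
API_SERVER = "https://api.rs2.usw2.rockset.com"


def build_write_request(api_key, workspace, collection, docs):
    """Build (but do not send) a POST request that adds `docs` to a collection."""
    # Assumed path layout for the documents endpoint.
    url = f"{API_SERVER}/v1/orgs/self/ws/{workspace}/collections/{collection}/docs"
    body = json.dumps({"data": docs}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Authorization": f"ApiKey {api_key}",  # assumed auth header format
            "Content-Type": "application/json",
        },
    )


# Hypothetical workspace ("commons") and collection ("events").
request = build_write_request(
    "YOUR_API_KEY",
    "commons",
    "events",
    [{"event": "click", "user": {"id": 42}}],
)
# To actually send it: urllib.request.urlopen(request)
```

Note that the document is sent as plain JSON with a nested `user` object and no schema declaration.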

If you would like to request support for a new data source, please drop us a note at support@rockset.com.

Querying Data

Visualization Tools

You can connect the following visualization and dashboarding tools to Rockset. These tools are fully supported and can natively query Rockset using SQL to build real-time visualizations.

IDEs

Rockset supports JetBrains DataGrip if you want to connect to Rockset and compose SQL queries in an IDE. Follow the link below for documentation.

API Access

Rockset provides client libraries, built on top of the REST API, that can be used to insert, update and query data programmatically from custom application code.
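
For a sense of what the client libraries wrap, here is a minimal Python sketch that builds (but does not send) a SQL query request against the REST API directly. The host name, the `/v1/orgs/self/queries` path, the `{"sql": {"query": ...}}` body shape, the `ApiKey` header format, and the `commons.events` collection are assumptions for illustration and should be verified against the API reference.

```python
import json
import urllib.request

# Assumed regional API server; yours may differ.
API_SERVER = "https://api.rs2.usw2.rockset.com"


def build_query_request(api_key, sql):
    """Build (but do not send) a POST request that runs a SQL query."""
    url = f"{API_SERVER}/v1/orgs/self/queries"  # assumed query endpoint
    body = json.dumps({"sql": {"query": sql}}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Authorization": f"ApiKey {api_key}",  # assumed auth header format
            "Content-Type": "application/json",
        },
    )


# Hypothetical collection `commons.events`; counts documents per event type.
request = build_query_request(
    "YOUR_API_KEY",
    "SELECT e.event, COUNT(*) AS n FROM commons.events e GROUP BY e.event",
)
# To actually send it: urllib.request.urlopen(request)
```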

We are working on enabling new client libraries. If you would like to get early access to them, contact support@rockset.com.