OpenTSDB has a thriving community that has contributed and requested a number of new features.
While 3.0 is still a ways off, we’ll be pushing some of the new features into a new branch of the repo. Some are in progress and other features are planned. If you have any features that you want to see, let us know.
Distributed Queries - Based on the great work of Turn on Splicer, we have a distributed query layer that splits queries among multiple TSDs for greater throughput.
Query Caching - Improve queries with time-sharded caching of results.
Improved Expressions - Perform group by, downsampling and arithmetic modifications in any order. Potentially support UDFs as well.
Anomaly Processing/Forecasting - Integrate with modeling libraries (such as EGADS) for deeper time series analysis.
Rollup/Pre-Aggregates - Support for storing and querying time-based rolled up data and/or pre-aggregated values.
Distributed Percentile - Store histograms (or sketches) for calculating proper percentiles over multiple sources.
Expressions - Query time computations using time series data. For example, dividing one metric by another.
Graphite Style Functions - Additional filtering and mutation of data at query time using Graphite style functions.
Calendar Based Downsampling - The ability to align downsampled data on Gregorian calendar boundaries.
Bigtable Support - Run TSDB in the cloud using Google’s hosted Bigtable service.
Cassandra Support - Support for running OpenTSDB on legacy Cassandra clusters.
Write Filters - Block or allow time series or UID assignments based on plugins or whitelists.
New Aggregators - None for returning raw data. First and Last to return the first or last data points during downsampling.
Meta Data Cache Plugin - A new API for caching meta data to improve query performance.
Startup Plugins - APIs to help with service discovery on TSD startup.
Example Java API usage classes.
Appends - Support writing all data points for an hour in a single column. This saves the need for TSD compactions and reduces network traffic at query time.
Salting - Enables greater distribution of writes for high cardinality metrics as well as asynchronous scanning for improved query speed. (Non backwards compatible)
Random Metric UIDs - Enables better distribution of writes when creating new metrics
Storage Exception Plugin - Enables various handling of data points when HBase is unavailable
Secure AsyncHBase - Access HBase clusters requiring Kerberos or simple authentication along with optional encryption.
Fill Policy - Enable emitting NaNs or Nulls via the JSON query endpoint when data points are “missing”
Count and Percentiles - New aggregator functions
More Stats - Gives greater insight into query performance via the query stats endpoint and new stats for threads, region clients and the JVM
Annotations - Scan for multiple annotations only via the /api/annotations endpoint
Query Filters - New filters for flexibility including case (in)sensitive literals, wildcards and regular expressions.
Override Tag Widths - You can now override tag widths in the config instead of having to recompile the code.
Compaction Tuning - New parameters allow for tuning the TSD compaction process.
Delete Data And UIDs - Allow for deleting data at query time as well as removing UIDs from the system.
Synchronous Writing - The HTTP Put API now supports synchronous writing to make sure data is flushed to HBase.
Query Stats - Query details are now logged that include timing statistics. A new endpoint also shows running and completed queries.
Downsampling - Timestamps are now aligned on modulus boundaries, reducing the need for interpolation across series.
Last Data Point API - Query for the last data point for specific time series within a certain time window
Duplicates - Handle duplicate data points at query time or during FSCK
FSCK - An updated FSCK utility that iterates over the main data table, finding and fixing errors
Read/Write Modes - Block assigning UIDs on individual TSDs for backup clusters
UID Cache - Preload portions of the UID table on startup to improve writes
Lock-less UID Assignment - Drastically improves write speed when storing new metrics, tag names, or values
RESTful API - Provides access to all of OpenTSDB’s features as well as offering new options, defaulting to JSON
Cross Origin Resource Sharing - For the API so you can make AJAX calls easily
Store Data Via HTTP - Write data points over HTTP as an alternative to Telnet
Configuration File - A key/value file shared by the TSD and command line tools
Pluggable Serializers - Enable different inputs and outputs for the API
Annotations - Record meta data about specific time series or data points
Meta Data - Record meta data for each time series, metric, tag name, or tag value
Trees - Flatten metric and tag combinations into a single name for navigation or usage with different tools
Search Plugins - Send meta data to search engines to delve into your data and figure out what’s in your database
Real-Time Publishing Plugin - Send data to external systems as it arrives at your TSD
Ingest Plugins - Accept data points in different formats
Millisecond Resolution - Optionally store data with millisecond precision
Variable Length Encoding - Use less storage space for smaller integer values
Non-Interpolating Aggregation Functions - For situations where you require raw data
Rate Counter Calculations - Handle roll-over and anomaly suppression
Additional Statistics - Including the number of UIDs assigned and available
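As an illustration of the Downsampling entry above, here is a minimal Python sketch of what aligning timestamps on modulus boundaries means. It is not OpenTSDB's implementation: the function names `align_timestamp` and `downsample_avg`, the millisecond units, and the choice of averaging are assumptions made for the example. The idea is that each timestamp is snapped to the start of its interval by subtracting `timestamp % interval`, so two series sampled at different offsets end up with the same bucket timestamps.

```python
from collections import defaultdict


def align_timestamp(ts_ms, interval_ms):
    """Snap a timestamp to the start of its interval (hypothetical helper)."""
    return ts_ms - (ts_ms % interval_ms)


def downsample_avg(points, interval_ms):
    """Average (timestamp_ms, value) pairs into modulus-aligned buckets.

    Illustrative only; averaging is just one possible downsampling function.
    """
    buckets = defaultdict(list)
    for ts_ms, value in points:
        buckets[align_timestamp(ts_ms, interval_ms)].append(value)
    return [(ts, sum(vals) / len(vals)) for ts, vals in sorted(buckets.items())]


# Two series reported at different offsets within each minute.
series_a = [(1000, 1.0), (31000, 3.0), (61000, 5.0)]
series_b = [(15000, 10.0), (45000, 20.0), (75000, 30.0)]

down_a = downsample_avg(series_a, 60000)
down_b = downsample_avg(series_b, 60000)
```

With a 60 second interval, both downsampled series carry the timestamps 0 and 60000, so they can be aggregated point by point without interpolating between mismatched timestamps.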
Thank you to everyone who has contributed to 2.4. Help us out by sharing your ideas and code on GitHub.