Built-in Plugins¶
The plugin
feature allows users to manipulate and customize data collected by collectors without changing the
collectors. Plugins have the same capabilities as collectors and therefore can collect data on their own as well.
Furthermore, multiple plugins can be put in a pipeline to perform more complex operations.
Harvest architecture defines two types of plugins:
Built-in generic - Statically compiled, generic plugins. "Generic" means the plugin is collector-agnostic. These plugins are provided in this package and listed in the right sidebar.
Built-in custom - These plugins are statically compiled, collector-specific plugins.
Their source code should reside inside the plugins/
subdirectory of the collector package
(e.g. (cmd/collectors/rest/plugins/svm/svm.go
)[https://github.com/NetApp/harvest/blob/main/cmd/collectors/rest/plugins/svm/svm.go]).
Custom plugins have access to all the parameters of their parent collector and
should therefore be treated with great care.
This documentation gives an overview of builtin plugins. For other plugins, see their respective documentation. For writing your own plugin, see Developer's documentation.
Note: the rules are executed in the same order as you've added them.
Aggregator¶
Aggregator creates a new collection of metrics (Matrix) by summarizing and/or averaging metric values from an existing Matrix for a given label. For example, if the collected metrics are for volumes, you can create an aggregation for nodes or svms.
Rule syntax¶
simplest case:
plugins:
Aggregator:
- LABEL
# will aggregate a new Matrix based on target label LABEL
If you want to specify which labels should be included in the new instances, you can add those space-seperated
after LABEL
:
- LABEL LABEL1,LABEL2
# same, but LABEL1 and LABEL2 will be copied into the new instances
# (default is to only copy LABEL and any global labels (such as cluster and datacenter)
Or include all labels:
- LABEL ...
# copy all labels of the original instance
By default, aggregated metrics will be prefixed with LABEL
. For example if the object of the original Matrix
is volume
(meaning metrics are prefixed with volume_
) and LABEL
is aggr
, then the metric volume_read_ops
will
become aggr_volume_read_ops
, etc. You can override this by providing the <>OBJ
using the following syntax:
- LABEL<>OBJ
# use OBJ as the object of the new matrix, e.g. if the original object is "volume" and you
# want to leave metric names unchanged, use "volume"
Finally, sometimes you only want to aggregate instances with a specific label value. You can use <VALUE>
for that (
optionally follow by OBJ
):
- LABEL<VALUE>
# aggregate all instances if LABEL has value VALUE
- LABEL<`VALUE`>
# same, but VALUE is regular expression
- LABEL<LABELX=`VALUE`>
# same, but check against "LABELX" (instead of "LABEL")
Examples:
plugins:
Aggregator:
# will aggregate metrics of the aggregate. The labels "node" and "type" are included in the new instances
- aggr node type
# aggregate instances if label "type" has value "flexgroup"
# include all original labels
- type<flexgroup> ...
# aggregate all instances if value of "volume" ends with underscore and 4 digits
- volume<`_\d{4}$`>
Aggregation rules¶
The plugin tries to intelligently aggregate metrics based on a few rules:
- Sum - the default rule, if no other rules apply
- Average - if any of the following is true:
- metric name has suffix
_percent
or_percentage
- metric name has prefix
average_
oravg_
- metric has property (
metric.GetProperty()
)percent
oraverage
- metric name has suffix
- Weighted Average - applied if metric has property
average
and suffix_latency
and if there is a matching_ops
metric. (This is currently only matching to ZapiPerf metrics, which use the Property field of metrics.) - Ignore - metrics created by some plugins, such as value_to_num by LabelAgent
Max¶
Max creates a new collection of metrics (Matrix) by calculating max of metric values from an existing Matrix for a given label. For example, if the collected metrics are for disks, you can create max at the node or aggregate level. Refer Max Examples for more details.
Max Rule syntax¶
simplest case:
plugins:
Max:
- LABEL
# create a new Matrix of max values on target label LABEL
If you want to specify which labels should be included in the new instances, you can add those space-seperated
after LABEL
:
- LABEL LABEL1,LABEL2
# similar to the above example, but LABEL1 and LABEL2 will be copied into the new instances
# (default is to only copy LABEL and all global labels (such as cluster and datacenter)
Or include all labels:
- LABEL ...
# copy all labels of the original instance
By default, metrics will be prefixed with LABEL
. For example if the object of the original Matrix is volume
(meaning
metrics are prefixed with volume_
) and LABEL
is aggr
, then the metric volume_read_ops
will
become aggr_volume_read_ops
. You can override this using the <>OBJ
pattern shown below:
- LABEL<>OBJ
# use OBJ as the object of the new matrix, e.g. if the original object is "volume" and you
# want to leave metric names unchanged, use "volume"
Finally, sometimes you only want to generate instances with a specific label value. You can use <VALUE>
for that (
optionally followed by OBJ
):
- LABEL<VALUE>
# aggregate all instances if LABEL has value VALUE
- LABEL<`VALUE`>
# same, but VALUE is regular expression
- LABEL<LABELX=`VALUE`>
# same, but check against "LABELX" (instead of "LABEL")
Max Examples¶
plugins:
Max:
# will create max of each aggregate metric. All metrics will be prefixed with aggr_disk_max. All labels are included in the new instances
- aggr<>aggr_disk_max ...
# calculate max instances if label "disk" has value "1.1.0". Prefix with disk_max
# include all original labels
- disk<1.1.0>disk_max ...
# max of all instances if value of "volume" ends with underscore and 4 digits
- volume<`_\d{4}$`>
LabelAgent¶
LabelAgent are used to manipulate instance labels based on rules. You can define multiple rules, here is an example of what you could add to the yaml file of a collector:
plugins:
LabelAgent:
# our rules:
split: node `/` ,aggr,plex,disk
replace_regex: node node `^(node)_(\d+)_.*$` `Node-$2`
Note: Labels for creating new label should use name defined in right side of =>. If not present then left side of => is used.
split¶
Rule syntax:
split:
- LABEL `SEP` LABEL1,LABEL2,LABEL3
# source label - separator - comma-seperated target labels
Splits the value of a given label by separator SEP
and creates new labels if their number matches to the number of
target labels defined in rule. To discard a subvalue, just add a redundant ,
in the names of the target labels.
Example:
split:
- node `/` ,aggr,plex,disk
# will split the value of "node" using separator "/"
# will expect 4 values: first will be discarded, remaining
# three will be stored as labels "aggr", "plex" and "disk"
split_regex¶
Does the same as split
but uses a regular expression instead of a separator.
Rule syntax:
split_regex:
- LABEL `REGEX` LABEL1,LABEL2,LABEL3
Example:
split_regex:
- node `.*_(ag\d+)_(p\d+)_(d\d+)` aggr,plex,disk
# will look for "_ag", "_p", "_d", each followed by one
# or more numbers, if there is a match, the submatches
# will be stored as "aggr", "plex" and "disk"
split_pairs¶
Rule syntax:
split_pairs:
- LABEL `SEP1` `SEP2`
# source label - pair separator - key-value separator
Extracts key-value pairs from the value of source label LABEL
. Note that you need to add these keys in the export
options, otherwise they will not be exported.
Example:
split_pairs:
- comment ` ` `:`
# will split pairs using a single space and split key-values using colon
# e.g. if comment="owner:jack contact:some@email", the result will be
# two new labels: owner="jack" and contact="some@email"
join¶
Join multiple label values using separator SEP
and create a new label.
Rule syntax:
join:
- LABEL `SEP` LABEL1,LABEL2,LABEL3
# target label - separator - comma-seperated source labels
Example:
join:
- plex_long `_` aggr,plex
# will look for the values of labels "aggr" and "plex",
# if they are set, a new "plex_long" label will be added
# by joining their values with "_"
replace¶
Substitute substring OLD
with NEW
in label SOURCE
and store in TARGET
. Note that target and source labels can be
the same.
Rule syntax:
replace:
- SOURCE TARGET `OLD` `NEW`
# source label - target label - substring to replace - replace with
Example:
replace:
- node node_short `node_` ``
# this rule will just remove "node_" from all values of label
# "node". E.g. if label is "node_jamaica1", it will rewrite it
# as "jamaica1"
replace_regex¶
Same as replace
, but will use a regular expression instead of OLD
. Note you can use $n
to specify n
th submatch
in NEW
.
Rule syntax:
replace_regex:
- SOURCE TARGET `REGEX` `NEW`
# source label - target label - substring to replace - replace with
Example:
replace_regex:
- node node `^(node)_(\d+)_.*$` `Node-$2`
# if there is a match, will capitalize "Node" and remove suffixes.
# E.g. if label is "node_10_dc2", it will rewrite it as
# will rewrite it as "Node-10"
exclude_equals¶
Exclude each instance, if the value of LABEL
is exactly VALUE
. Exclude means that metrics for this instance will not
be exported.
Rule syntax:
exclude_equals:
- LABEL `VALUE`
# label name - label value
Example:
exclude_equals:
- vol_type `flexgroup_constituent`
# all instances, which have label "vol_type" with value
# "flexgroup_constituent" will not be exported
exclude_contains¶
Same as exclude_equals
, but all labels that contain VALUE
will be excluded
Rule syntax:
exclude_contains:
- LABEL `VALUE`
# label name - label value
Example:
exclude_contains:
- vol_type `flexgroup_`
# all instances, which have label "vol_type" which contain
# "flexgroup_" will not be exported
exclude_regex¶
Same as exclude_equals
, but will use a regular expression and all matching instances will be excluded.
Rule syntax:
exclude_regex:
- LABEL `REGEX`
# label name - regular expression
Example:
exclude_regex:
- vol_type `^flex`
# all instances, which have label "vol_type" which starts with
# "flex" will not be exported
include_equals¶
Include each instance, if the value of LABEL
is exactly VALUE
. Include means that metrics for this instance will be
exported and instances that do not match will not be exported.
Rule syntax:
include_equals:
- LABEL `VALUE`
# label name - label value
Example:
include_equals:
- vol_type `flexgroup_constituent`
# all instances, which have label "vol_type" with value
# "flexgroup_constituent" will be exported
include_contains¶
Same as include_equals
, but all labels that contain VALUE
will be included
Rule syntax:
include_contains:
- LABEL `VALUE`
# label name - label value
Example:
include_contains:
- vol_type `flexgroup_`
# all instances, which have label "vol_type" which contain
# "flexgroup_" will be exported
include_regex¶
Same as include_equals
, but a regular expression will be used for inclusion. Similar to the other includes, all
matching instances will be included and all non-matching will not be exported.
Rule syntax:
include_regex:
- LABEL `REGEX`
# label name - regular expression
Example:
include_regex:
- vol_type `^flex`
# all instances, which have label "vol_type" which starts with
# "flex" will be exported
value_mapping¶
value_mapping was deprecated in 21.11 and removed in 22.02. Use value_to_num mapping instead.
value_to_num¶
Map values of a given label to a numeric metric (of type uint8
).
This rule maps values of a given label to a numeric metric (of type unit8
). Healthy is mapped to 1 and all non-healthy
values are mapped to 0.
This is handy to manipulate the data in the DB or Grafana (e.g. change color based on status or create alert).
Note that you don't define the numeric values yourself, instead, you only provide the possible (expected) values, the plugin will map each value to its index in the rule.
Rule syntax:
value_to_num:
- METRIC LABEL ZAPI_VALUE REST_VALUE `N`
# map values of LABEL to 1 if it is ZAPI_VALUE or REST_VALUE
# otherwise, value of METRIC is set to N
The default value N
is optional, if no default value is given and the label value does not match any of the given
values, the metric value will not be set.
Examples:
value_to_num:
- status state up online `0`
# a new metric will be created with the name "status"
# if an instance has label "state" with value "up", the metric value will be 1,
# if it's "online", the value will be set to 1,
# if it's any other value, it will be set to the specified default, 0
value_to_num:
- status state up online `4`
# metric value will be set to 1 if "state" is "up", otherwise to **4**
value_to_num:
- status outage - - `0` #ok_value is empty value.
# metric value will be set to 1 if "outage" is empty, if it's any other value, it will be set to the default, 0
# '-' is a special symbol in this mapping, and it will be converted to blank while processing.
value_to_num_regex¶
Same as value_to_num, but will use a regular expression. All matches are mapped to 1 and non-matches are mapped to 0.
This is handy to manipulate the data in the DB or Grafana (e.g. change color based on status or create alert).
Note that you don't define the numeric values, instead, you provide the expected values and the plugin will map each value to its index in the rule.
Rule syntax:
value_to_num_regex:
- METRIC LABEL ZAPI_REGEX REST_REGEX `N`
# map values of LABEL to 1 if it matches ZAPI_REGEX or REST_REGEX
# otherwise, value of METRIC is set to N
The default value N
is optional, if no default value is given and the label value does not match any of the given
values, the metric value will not be set.
Examples:
value_to_num_regex:
- certificateuser methods .*cert.*$ .*certificate.*$ `0`
# a new metric will be created with the name "certificateuser"
# if an instance has label "methods" with value contains "cert", the metric value will be 1,
# if value contains "certificate", the value will be set to 1,
# if value doesn't contain "cert" and "certificate", it will be set to the specified default, 0
value_to_num_regex:
- status state ^up$ ^ok$ `4`
# metric value will be set to 1 if label "state" matches regex, otherwise set to **4**
MetricAgent¶
MetricAgent are used to manipulate metrics based on rules. You can define multiple rules, here is an example of what you could add to the yaml file of a collector:
plugins:
MetricAgent:
compute_metric:
- snapshot_maxfiles_possible ADD snapshot.max_files_available snapshot.max_files_used
- raid_disk_count ADD block_storage.primary.disk_count block_storage.hybrid_cache.disk_count
Note: Metric names used to create new metrics can come from the left or right side of the rename operator (=>
)
Note: The metric agent currently does not work for histogram or array metrics.
compute_metric¶
This rule creates a new metric (of type float64) using the provided scalar or an existing metric value combined with a mathematical operation.
You can provide a numeric value or a metric name with an operation. The plugin will use the provided number or fetch the value of a given metric, perform the requested mathematical operation, and store the result in new custom metric.
Currently, we support these operations: ADD SUBTRACT MULTIPLY DIVIDE PERCENT
Rule syntax:
compute_metric:
- METRIC OPERATION METRIC1 METRIC2 METRIC3
# target new metric - mathematical operation - input metric names
# apply OPERATION on metric values of METRIC1, METRIC2 and METRIC3 and set result in METRIC
# METRIC1, METRIC2, METRIC3 can be a scalar or an existing metric name.
Examples:
compute_metric:
- space_total ADD space_available space_used
# a new metric will be created with the name "space_total"
# if an instance has metric "space_available" with value "1000", and "space_used" with value "400",
# the result value will be "1400" and set to metric "space_total".
compute_metric:
- disk_count ADD primary.disk_count secondary.disk_count hybrid.disk_count
# value of metric "disk_count" would be addition of all the given disk_counts metric values.
# disk_count = primary.disk_count + secondary.disk_count + hybrid.disk_count
compute_metric:
- files_available SUBTRACT files files_used
# value of metric "files_available" would be subtraction of the metric value of files_used from metric value of files.
# files_available = files - files_used
compute_metric:
- total_bytes MULTIPLY bytes_per_sector sector_count
# value of metric "total_bytes" would be multiplication of metric value of bytes_per_sector and metric value of sector_count.
# total_bytes = bytes_per_sector * sector_count
compute_metric:
- uptime MULTIPLY stats.power_on_hours 60 60
# value of metric "uptime" would be multiplication of metric value of stats.power_on_hours and scalar value of 60 * 60.
# total_bytes = bytes_per_sector * sector_count
compute_metric:
- transmission_rate DIVIDE transfer.bytes_transferred transfer.total_duration
# value of metric "transmission_rate" would be division of metric value of transfer.bytes_transferred by metric value of transfer.total_duration.
# transmission_rate = transfer.bytes_transferred / transfer.total_duration
compute_metric:
- inode_used_percent PERCENT inode_files_used inode_files_total
# a new metric named "inode_used_percent" will be created by dividing the metric "inode_files_used" by
# "inode_files_total" and multiplying the result by 100.
# inode_used_percent = inode_files_used / inode_files_total * 100
ChangeLog¶
The ChangeLog plugin is a feature of Harvest, designed to detect and track changes related to the creation, modification, and deletion of an object. By default, it supports volume, svm, and node objects. Its functionality can be extended to track changes in other objects by making relevant changes in the template.
Please note that the ChangeLog plugin requires the uuid
label, which is unique, to be collected by the template. Without the uuid
label, the plugin will not function.
The ChangeLog feature only detects changes when Harvest is up and running. It does not detect changes that occur when Harvest is down. Additionally, the plugin does not detect changes in metric values by default, but it can be configured to do so.
Enabling the Plugin¶
The plugin can be enabled in the templates under the plugins section.
For volume, svm, and node objects, you can enable the plugin with the following configuration:
plugins:
- ChangeLog
For other objects, you need to specify the labels to track in the plugin configuration. These labels should be relevant to the object you want to track. If these labels are not specified in the template, the plugin will not be able to track changes for the object.
Here's an example of how to enable the plugin for an aggregate object:
plugins:
- ChangeLog:
track:
- aggr
- node
- state
In the above configuration, the plugin will track changes in the aggr
, node
, and state
labels for the aggregate object.
Default Tracking for svm, node, volume¶
By default, the plugin tracks changes in the following labels for svm, node, and volume objects:
- svm: svm, state, type, anti_ransomware_state
- node: node, location, healthy
- volume: node, volume, svm, style, type, aggr, state, status
Other objects are not tracked by default.
These default settings can be overwritten as needed in the relevant templates. For instance, if you want to track junction_path
label and size_total
metric for Volume, you can overwrite this in the volume template.
plugins:
- ChangeLog:
- track:
- node
- volume
- svm
- style
- type
- aggr
- state
- status
- junction_path
- size_total
Change Types and Metrics¶
The ChangeLog plugin publishes a metric with various labels providing detailed information about the change when an object is created, modified, or deleted.
Object Creation¶
When a new object is created, the ChangeLog plugin will publish a metric with the following labels:
Label | Description |
---|---|
object | name of the ONTAP object that was changed |
op | type of change that was made |
metric value | timestamp when Harvest captured the change. 1698735558 in the example below |
Example of metric shape for object creation:
change_log{aggr="umeng_aff300_aggr2", cluster="umeng-aff300-01-02", datacenter="u2", index="0", instance="localhost:12993", job="prometheus", node="umeng-aff300-01", object="volume", op="create", style="flexvol", svm="harvest", volume="harvest_demo"} 1698735558
Object Modification¶
When an existing object is modified, the ChangeLog plugin will publish a metric with the following labels:
Label | Description |
---|---|
object |
Name of the ONTAP object that was changed |
op |
Type of change that was made |
track |
Property of the object which was modified |
new_value |
New value of the object after the change (only available for label changes and not for metric changes) |
old_value |
Previous value of the object before the change (only available for label changes and not for metric changes) |
metric value |
Timestamp when Harvest captured the change. 1698735677 in the example below |
category |
Type of the change, indicating whether it is a metric or a label change |
Example of metric shape for object modification for label:
change_log{aggr="umeng_aff300_aggr2", category="label", cluster="umeng-aff300-01-02", datacenter="u2", index="1", instance="localhost:12993", job="prometheus", new_value="offline", node="umeng-aff300-01", object="volume", old_value="online", op="update", style="flexvol", svm="harvest", track="state", volume="harvest_demo"} 1698735677
Example of metric shape for metric value change:
change_log{aggr="umeng_aff300_aggr2", category="metric", cluster="umeng-aff300-01-02", datacenter="u2", index="3", instance="localhost:12993", job="prometheus", node="umeng-aff300-01", object="volume", op="metric_change", track="volume_size_total", svm="harvest", volume="harvest_demo"} 1698735800
Object Deletion¶
When an object is deleted, the ChangeLog plugin will publish a metric with the following labels:
Label | Description |
---|---|
object | name of the ONTAP object that was changed |
op | type of change that was made |
metric value | timestamp when Harvest captured the change. 1698735708 in the example below |
Example of metric shape for object deletion:
change_log{aggr="umeng_aff300_aggr2", cluster="umeng-aff300-01-02", datacenter="u2", index="2", instance="localhost:12993", job="prometheus", node="umeng-aff300-01", object="volume", op="delete", style="flexvol", svm="harvest", volume="harvest_demo"} 1698735708
Viewing the Metrics¶
You can view the metrics published by the ChangeLog plugin in the ChangeLog Monitor
dashboard in Grafana
. This dashboard provides a visual representation of the changes tracked by the plugin for volume, svm, and node objects.
VolumeTopClients¶
The VolumeTopClients
plugin is used to track a volume's top clients and top files in terms of read and write IOPS, as well as read and write throughput. This plugin is available only through the RestPerf Collector in ONTAP version 9.13.1 and later.
Enabling the Plugin¶
Top Clients and Files collection is disabled by default. To enable Top Clients and Files tracking in Harvest, follow these steps:
- Ensure you are using ONTAP version 9.12 or later.
- Enable the Top Clients collection in the RestPerf Collector Volume template via the
VolumeTopClients
plugin.
For detailed steps on how to enable the plugin, refer to the discussion here.
Configuration Parameters¶
max_volumes
¶
The max_volumes
parameter specifies the maximum number of top volumes to track. By default, this value is set to 5, but it can be configured up to a maximum of 50.
The plugin will select the top volumes based on the descending order of read IOPS, write IOPS, read throughput, and write throughput in each performance poll. This means that during each performance poll, the plugin will:
- Collect the read IOPS, write IOPS, read throughput, and write throughput for all volumes.
- Sort the volumes in descending order based on their metric values.
- Select the top volumes as specified by
max_volumes
. - Collect top clients and top files metrics for these volumes.
objects
¶
The objects
parameter allows you to specify which metrics to collect. By default, both client and file data will be collected. You can customize this by including or excluding specific objects:
- objects:
- client # collect read/write operations and throughput metrics for the top clients.
- file # collect read/write operations and throughput metrics for the top files
If the objects
parameter is not defined, both client and file data will be collected by default.
Viewing the Metrics¶
You can view the metrics published by the VolumeTopClients
plugin in the Volume
dashboard under the Clients
and Files
row in Grafana.