logstash-filter-transaction_time 1.0.4 → 1.0.5
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- checksums.yaml +4 -4
- data/README.md +116 -1
- data/lib/logstash/filters/transaction_time.rb +28 -14
- data/logstash-filter-transaction_time.gemspec +1 -1
- data/spec/filters/transaction_time_spec.rb +51 -0
- metadata +2 -2
checksums.yaml
CHANGED
@@ -1,7 +1,7 @@
|
|
1
1
|
---
|
2
2
|
SHA1:
|
3
|
-
metadata.gz:
|
4
|
-
data.tar.gz:
|
3
|
+
metadata.gz: 074c50d41cd992c872b665bb63497ceee820cf5a
|
4
|
+
data.tar.gz: eeab3271345ab338bddcce72683b744a04a6cf9c
|
5
5
|
SHA512:
|
6
|
-
metadata.gz:
|
7
|
-
data.tar.gz:
|
6
|
+
metadata.gz: f244fe40c2e89b9a7a9f7e95441b7ce6b5a9fe2006d3d1fc6ab77d892b095969888552aa40003a776fda376b244a2f8dd9980a9e5b5a1423c92574e3f2354537
|
7
|
+
data.tar.gz: e9c27b093beac560791750525774e74feac4fded750ac4c3a804efc9d99e69a7473a90f707129122a2585146970091a4f3970074af1fdcdf50b12d121c6fae06
|
data/README.md
CHANGED
@@ -8,7 +8,122 @@ But instead of defining a start and an end for a transaction - only the unique i
|
|
8
8
|
Per default the transaction time is stored together with the unique identifier in a new event, which may be stored in the same or another index.
|
9
9
|
The information from the first, last, oldest or newest event may be attached with the new transaction_time event.
|
10
10
|
|
11
|
-
|
11
|
+
# Usage
|
12
|
+
|
13
|
+
The TransactionTime filter measures the time between two events in a transaction
|
14
|
+
|
15
|
+
This filter is supposed to be used instead of logstash-filters-elapsed
|
16
|
+
when you know that the order of a transaction cannot be guaranteed.
|
17
|
+
Which is most likely the case if you are using multiple workers and
|
18
|
+
a big amount of events are entering the pipeline in a rapid manner.
|
19
|
+
|
20
|
+
## The configuration:
|
21
|
+
```ruby
|
22
|
+
filter {
|
23
|
+
transaction_time {
|
24
|
+
uid_field => "Transaction-unique field"
|
25
|
+
ignore_uid => []
|
26
|
+
timeout => seconds
|
27
|
+
timestamp_tag => "name of timestamp"
|
28
|
+
replace_timestamp => ['keep', 'oldest', 'newest']
|
29
|
+
filter_tag => "transaction tag"
|
30
|
+
attach_event => ['first','last','oldest','newest','none']
|
31
|
+
release_expired => [true,false]
|
32
|
+
store_data_oldest => []
|
33
|
+
store_data_newest => []
|
34
|
+
periodic_flush => [true,false]
|
35
|
+
}
|
36
|
+
}
|
37
|
+
```
|
38
|
+
- `uid_field`
|
39
|
+
The only required parameter is "uid_field" which is used to identify
|
40
|
+
the events in a transaction. A transaction is concidered complete
|
41
|
+
when two events with the same UID has been captured.
|
42
|
+
It is when a transaction completes that the transaction time is calculated.
|
43
|
+
|
44
|
+
- `ignore_uid`
|
45
|
+
The ignore_uid field takes an array of strings. These strings represent specific UIDs
|
46
|
+
that should be ignored. This can be useful for ignoring parsing errors.
|
47
|
+
Example:
|
48
|
+
```ruby
|
49
|
+
ignore_uid => ["%{[transactionUID][0]}", ""]
|
50
|
+
```
|
51
|
+
Will ignore events having empty string or "%{[transactionUID][0]}" in the uid_field.
|
52
|
+
|
53
|
+
- `timeout`
|
54
|
+
The timeout parameter determines the maximum length of a transaction.
|
55
|
+
It is set to 300 (5 minutes) by default.
|
56
|
+
The transaction will not be recorded if timeout duration is exceeded.
|
57
|
+
The value of this parameter will have an impact on the memory footprint of the plugin.
|
58
|
+
|
59
|
+
- `timestamp_tag`
|
60
|
+
The timestamp_tag parameter may be used to select a specific field in the events to use
|
61
|
+
when calculating the transaction time. The default field is @timestamp.
|
62
|
+
|
63
|
+
- `replace_timestamp`
|
64
|
+
The new event created when a transaction completes may set its own timestamp
|
65
|
+
to when it completes (default) or it may use the timestamp of one of the events in the transaction.
|
66
|
+
The parameter replace_timestamp is used to specify this behaviour.
|
67
|
+
|
68
|
+
- `filter_tag`
|
69
|
+
Since this plugin exclusivly calculates the time between events in a transaction,
|
70
|
+
it may be wise to filter out the events that are infact not transactions.
|
71
|
+
This will help reduce both the memory footprint and processing time of this plugin,
|
72
|
+
especially if the pipeline receives a lot of non-transactional events.
|
73
|
+
You could use grok and/or mutate to apply this filter like this:
|
74
|
+
```ruby
|
75
|
+
filter {
|
76
|
+
grok{
|
77
|
+
match => { "message" => "(?<message_type>.*)\t(?<msgbody>.*)\t+UID:%{UUID:uid}" }
|
78
|
+
}
|
79
|
+
if [message_type] in ["MaterialIdentified","Recipe","Result"."ReleaseMaterial"]{
|
80
|
+
mutate {
|
81
|
+
add_tag => "Transaction"
|
82
|
+
}
|
83
|
+
}
|
84
|
+
transaction_time {
|
85
|
+
uid_field => "UID"
|
86
|
+
filter_tag => "Transaction"
|
87
|
+
}
|
88
|
+
}
|
89
|
+
```
|
90
|
+
In the example, grok is used to identify the message_type and then the tag "transaction" is added for a specific set of messages. This tag is then used in the transaction_time as filter_tag. Only the messages with this tag will be evaluated.
|
91
|
+
> **Note**: Do not use reserved name "_TransactionTime_" which is added to all events created by this plugin
|
92
|
+
|
93
|
+
- `attach_event`
|
94
|
+
The attach_event parameter can be used to append information from one of the events to the
|
95
|
+
new transaction_time event. The default is to not attach anything.
|
96
|
+
The memory footprint is kept to a minimum by using the default value.
|
97
|
+
|
98
|
+
- `release_expired`
|
99
|
+
The release_expired parameter determines if the first event in an expired transactions
|
100
|
+
should be released or not. Defaults to true
|
101
|
+
|
102
|
+
- `store_data_oldest/store_data_newest`
|
103
|
+
The parameters store_data_oldest and store_data_newest are both used in order to attach
|
104
|
+
specific fields from oldest respectively newest event. An example of this could be:
|
105
|
+
```ruby
|
106
|
+
store_data_oldest => ["@timestamp", "work_unit", "work_center", "message_type"]
|
107
|
+
store_data_newest => ["@timestamp", "work_unit", "work_center", "message_type"]
|
108
|
+
```
|
109
|
+
Which will result in the genereated transaction event inluding the specified fields from oldest and newest events in a hashmap named oldest/newest under the hash named "transaction_data"
|
110
|
+
Example of output data:
|
111
|
+
```
|
112
|
+
"transaction_data" => {
|
113
|
+
"oldest" => {
|
114
|
+
"message_type" => "MaterialIdentified",
|
115
|
+
"@timestamp" => 2018-10-31T07:36:23.072Z,
|
116
|
+
"work_unit" => "WT000743",
|
117
|
+
"work_center" => "WR000046"
|
118
|
+
},
|
119
|
+
"newest" => {
|
120
|
+
"message_type" => "Recipe",
|
121
|
+
"@timestamp" => 2018-10-31T07:36:28.188Z,
|
122
|
+
"work_unit" => "WT000743",
|
123
|
+
"work_center" => "WR000046"
|
124
|
+
}
|
125
|
+
}
|
126
|
+
```
|
12
127
|
|
13
128
|
# Logstash Plugin
|
14
129
|
|
@@ -14,6 +14,7 @@ require "logstash/namespace"
|
|
14
14
|
# filter {
|
15
15
|
# transaction_time {
|
16
16
|
# uid_field => "Transaction-unique field"
|
17
|
+
# ignore_uid => []
|
17
18
|
# timeout => seconds
|
18
19
|
# timestamp_tag => "name of timestamp"
|
19
20
|
# replace_timestamp => ['keep', 'oldest', 'newest']
|
@@ -31,6 +32,12 @@ require "logstash/namespace"
|
|
31
32
|
# the events in a transaction. A transaction is concidered complete
|
32
33
|
# when two events with the same UID has been captured.
|
33
34
|
# It is when a transaction completes that the transaction time is calculated.
|
35
|
+
#
|
36
|
+
# The ignore_uid field takes an array of strings. These strings represent specific UIDs
|
37
|
+
# that should be ignored. This can be useful for ignoring parsing errors.
|
38
|
+
# Example:
|
39
|
+
# ignore_uid => ["%{[transactionUID][0]}", ""]
|
40
|
+
# Will ignore events having empty string or "%{[transactionUID][0]}" in the uid_field.
|
34
41
|
#
|
35
42
|
# The timeout parameter determines the maximum length of a transaction.
|
36
43
|
# It is set to 300 (5 minutes) by default.
|
@@ -119,6 +126,8 @@ class LogStash::Filters::TransactionTime < LogStash::Filters::Base
|
|
119
126
|
|
120
127
|
# The name of the UID-field used to identify transaction-pairs
|
121
128
|
config :uid_field, :validate => :string, :required => true
|
129
|
+
# Array of UIDs to ignore (useful for ignoring parse-errors).
|
130
|
+
config :ignore_uids, :validate => :array, :default => []
|
122
131
|
# The amount of time (in seconds) before a transaction is dropped. Defaults to 5 minutes
|
123
132
|
config :timeout, :validate => :number, :default => 300
|
124
133
|
# What tag to use as timestamp when calculating the elapsed transaction time. Defaults to @timestamp
|
@@ -177,17 +186,18 @@ class LogStash::Filters::TransactionTime < LogStash::Filters::Base
|
|
177
186
|
(event.get("tags").nil? || !event.get("tags").include?(TRANSACTION_TIME_EXPIRED_TAG)) &&
|
178
187
|
(@filter_tag.nil? || (!event.get("tags").nil? && event.get("tags").include?(@filter_tag))))
|
179
188
|
|
180
|
-
|
181
|
-
|
182
|
-
|
183
|
-
|
184
|
-
|
185
|
-
|
186
|
-
|
187
|
-
|
188
|
-
|
189
|
-
|
190
|
-
|
189
|
+
if not @ignore_uids.include?(uid)
|
190
|
+
@mutex.synchronize do
|
191
|
+
if(!@transactions.has_key?(uid))
|
192
|
+
@transactions[uid] = LogStash::Filters::TransactionTime::Transaction.new(event, uid, @storeEvent)
|
193
|
+
|
194
|
+
else #End of transaction
|
195
|
+
@transactions[uid].addSecond(event,@storeEvent)
|
196
|
+
transaction_event = new_transactiontime_event(@transactions[uid], @attachData)
|
197
|
+
filter_matched(transaction_event)
|
198
|
+
yield transaction_event if block_given?
|
199
|
+
@transactions.delete(uid)
|
200
|
+
end
|
191
201
|
end
|
192
202
|
end
|
193
203
|
end
|
@@ -207,10 +217,14 @@ class LogStash::Filters::TransactionTime < LogStash::Filters::Base
|
|
207
217
|
expired_elements = remove_expired_elements()
|
208
218
|
end
|
209
219
|
|
210
|
-
|
211
|
-
|
220
|
+
if @release_expired
|
221
|
+
expired_elements.each do |element|
|
222
|
+
filter_matched(element)
|
223
|
+
end
|
224
|
+
#print("Exp" + options.to_s + expired_elements.to_s)
|
225
|
+
return expired_elements
|
212
226
|
end
|
213
|
-
return
|
227
|
+
return []
|
214
228
|
#yield expired_elements if block_given?
|
215
229
|
#return create_expired_events_from(expired_elements)
|
216
230
|
end
|
@@ -1,6 +1,6 @@
|
|
1
1
|
Gem::Specification.new do |s|
|
2
2
|
s.name = 'logstash-filter-transaction_time'
|
3
|
-
s.version = '1.0.
|
3
|
+
s.version = '1.0.5'
|
4
4
|
s.licenses = ['Apache-2.0','Apache License (2.0)']
|
5
5
|
s.summary = 'Writes the time difference between two events in a transaction to a new event'
|
6
6
|
s.description = 'This gem is a Logstash plugin required to be installed on top of the Logstash core pipeline using $LS_HOME/bin/logstash-plugin install gemname. This gem is not a stand-alone program. Source-code and documentation available at github: https://github.com/AddinITAB/logstash-filter-transaction_time'
|
@@ -175,6 +175,26 @@ describe LogStash::Filters::TransactionTime do
|
|
175
175
|
insist { @filter.transactions.size } == 1
|
176
176
|
end
|
177
177
|
end
|
178
|
+
describe "Setup release_expired = false" do
|
179
|
+
it "never releases any expired events when flush is called" do
|
180
|
+
config = {"release_expired" => false}
|
181
|
+
@config.merge!(config)
|
182
|
+
|
183
|
+
@filter.filter(event("message" => "Log message", UID_FIELD => uid, "@timestamp" => "2018-04-22T09:46:22.000+0100"))
|
184
|
+
insist { @filter.transactions.size } == 1
|
185
|
+
@filter.filter(event("message" => "Log message", UID_FIELD => uid2, "@timestamp" => "2018-04-22T09:46:22.000+0100"))
|
186
|
+
insist { @filter.transactions.size } == 2
|
187
|
+
|
188
|
+
#Looks like flush doesn't have config-scope. Setting release_expired hard instead of by config. Will it work like intended when using only config?
|
189
|
+
@filter.release_expired = false
|
190
|
+
((TIMEOUT/5)+1).times do
|
191
|
+
flushRes = @filter.flush({"from" => "test" })
|
192
|
+
insist { (flushRes.any?) } == false
|
193
|
+
#insist { @filter.flush().nil? }
|
194
|
+
end
|
195
|
+
insist { @filter.transactions.size } == 0
|
196
|
+
end
|
197
|
+
end
|
178
198
|
end
|
179
199
|
|
180
200
|
context "Testing Timestamp Override." do
|
@@ -185,6 +205,7 @@ describe LogStash::Filters::TransactionTime do
|
|
185
205
|
config = {"replace_timestamp" => 'oldest'}
|
186
206
|
@config.merge!(config)
|
187
207
|
|
208
|
+
|
188
209
|
@filter = LogStash::Filters::TransactionTime.new(@config)
|
189
210
|
@filter.register
|
190
211
|
|
@@ -377,4 +398,34 @@ describe LogStash::Filters::TransactionTime do
|
|
377
398
|
end
|
378
399
|
end
|
379
400
|
end
|
401
|
+
context "Testing ignore_uids." do
|
402
|
+
nokUid = "Erroneous UID"
|
403
|
+
uid = "9ACCA7B7-D0E9-4E52-A023-9D588E5BE42C"
|
404
|
+
describe "Config ignore_uids set" do
|
405
|
+
it "will not accept events with specified uid as transactions" do
|
406
|
+
config = {"ignore_uids" => ["Erroneous UID"]}
|
407
|
+
@config.merge!(config)
|
408
|
+
|
409
|
+
@filter = LogStash::Filters::TransactionTime.new(@config)
|
410
|
+
@filter.register
|
411
|
+
|
412
|
+
@filter.filter(event("message" => "first", UID_FIELD => nokUid, "@timestamp" => "2018-04-22T09:46:22.000+0100"))
|
413
|
+
@filter.filter(event("message" => "last", UID_FIELD => nokUid, "@timestamp" => "2018-04-22T09:46:22.100+0100")) do | new_event |
|
414
|
+
insist { new_event } == nil
|
415
|
+
end
|
416
|
+
end
|
417
|
+
it "will accept other events as transactions" do
|
418
|
+
config = {"ignore_uids" => ["Erroneous UID"]}
|
419
|
+
@config.merge!(config)
|
420
|
+
|
421
|
+
@filter = LogStash::Filters::TransactionTime.new(@config)
|
422
|
+
@filter.register
|
423
|
+
|
424
|
+
@filter.filter(event("message" => "first", UID_FIELD => uid, "@timestamp" => "2018-04-22T09:46:22.000+0100"))
|
425
|
+
@filter.filter(event("message" => "last", UID_FIELD => uid, "@timestamp" => "2018-04-22T09:46:22.100+0100")) do | new_event |
|
426
|
+
insist { new_event } != nil
|
427
|
+
end
|
428
|
+
end
|
429
|
+
end
|
430
|
+
end
|
380
431
|
end
|
metadata
CHANGED
@@ -1,14 +1,14 @@
|
|
1
1
|
--- !ruby/object:Gem::Specification
|
2
2
|
name: logstash-filter-transaction_time
|
3
3
|
version: !ruby/object:Gem::Version
|
4
|
-
version: 1.0.
|
4
|
+
version: 1.0.5
|
5
5
|
platform: ruby
|
6
6
|
authors:
|
7
7
|
- Tommy Welleby
|
8
8
|
autorequire:
|
9
9
|
bindir: bin
|
10
10
|
cert_chain: []
|
11
|
-
date: 2018-
|
11
|
+
date: 2018-11-02 00:00:00.000000000 Z
|
12
12
|
dependencies:
|
13
13
|
- !ruby/object:Gem::Dependency
|
14
14
|
requirement: !ruby/object:Gem::Requirement
|