logstash-integration-kafka 10.0.0-java

Sign up to get free protection for your applications and to get access to all the features.
@@ -0,0 +1,7 @@
1
+ ---
2
+ SHA1:
3
+ metadata.gz: d8bd0d1d4de277970d19245471ee43ab8832205e
4
+ data.tar.gz: 70c3fe4143e971f663d82fada824525097e65c37
5
+ SHA512:
6
+ metadata.gz: fe2c06f811bb179846f4e606af383c2ff2dbead0684b274be12dbaaa021a0bbb5591bb7a903113f98844ed74badebe7550ace29f9fbe90b791a20c5a2265b5e4
7
+ data.tar.gz: c41661c9edaf931bee9eaa2af266f3aa8b2fe6c1a3fbcb6e9e76da6885d3ec854837a824ee2ac67e8da72b950bc238178a0c35161d1ac95947b3b87d7211e82e
@@ -0,0 +1,6 @@
1
+ ## 10.0.0
2
+ - Initial release of the Kafka Integration Plugin, which combines
3
+ previously-separate Kafka plugins and shared dependencies into a single
4
+ codebase; independent changelogs for previous versions can be found:
5
+ - [Kafka Input Plugin @9.1.0](https://github.com/logstash-plugins/logstash-input-rabbitmq/blob/v9.1.0/CHANGELOG.md)
6
+ - [Kafka Output Plugin @8.1.0](https://github.com/logstash-plugins/logstash-output-rabbitmq/blob/v8.1.0/CHANGELOG.md)
@@ -0,0 +1,18 @@
1
+ The following is a list of people who have contributed ideas, code, bug
2
+ reports, or in general have helped logstash along its way.
3
+
4
+ Contributors:
5
+ * Joseph Lawson (joekiller)
6
+ * Pere Urbón (purbon)
7
+ * Pier-Hugues Pellerin (ph)
8
+ * Richard Pijnenburg (electrical)
9
+ * Suyog Rao (suyograo)
10
+ * Tal Levy (talevy)
11
+ * João Duarte (jsvd)
12
+ * Kurt Hurtado (kurtado)
13
+ * Ry Biesemeyer (yaauie)
14
+
15
+ Note: If you've sent us patches, bug reports, or otherwise contributed to
16
+ Logstash, and you aren't on the list above and want to be, please let us know
17
+ and we'll make sure you're here. Contributions from folks like you are what make
18
+ open source awesome.
@@ -0,0 +1,97 @@
1
+ # logsstash-integration-kafka
2
+
3
+ Apache Kafka integration for Logstash, including Input and Output plugins.
4
+
5
+ # Dependencies
6
+
7
+ * Apache Kafka version 0.8.1.1
8
+ * jruby-kafka library
9
+
10
+ # Plugins
11
+
12
+
13
+ ## logstash-input-kafka
14
+
15
+ Apache Kafka input for Logstash. This input will consume messages from a Kafka topic using the high level consumer API exposed by Kafka.
16
+
17
+ For more information about Kafka, refer to this [documentation](http://kafka.apache.org/documentation.html)
18
+
19
+ Information about high level consumer API can be found [here](http://kafka.apache.org/documentation.html#highlevelconsumerapi)
20
+
21
+ ### Logstash Configuration
22
+
23
+ See http://kafka.apache.org/documentation.html#consumerconfigs for details about the Kafka consumer options.
24
+
25
+ input {
26
+ kafka {
27
+ topic_id => ... # string (optional), default: nil, The topic to consume messages from. Can be a java regular expression for whitelist of topics.
28
+ white_list => ... # string (optional), default: nil, Blacklist of topics to exclude from consumption.
29
+ black_list => ... # string (optional), default: nil, Whitelist of topics to include for consumption.
30
+ zk_connect => ... # string (optional), default: "localhost:2181", Specifies the ZooKeeper connection string in the form hostname:port
31
+ group_id => ... # string (optional), default: "logstash", A string that uniquely identifies the group of consumer processes
32
+ reset_beginning => ... # boolean (optional), default: false, Specify whether to jump to beginning of the queue when there is no initial offset in ZK
33
+ auto_offset_reset => ... # string (optional), one of [ "largest", "smallest"] default => 'largest', Where consumer should start if group does not already have an established offset or offset is invalid
34
+ consumer_threads => ... # number (optional), default: 1, Number of threads to read from the partitions
35
+ queue_size => ... # number (optional), default: 20, Internal Logstash queue size used to hold events in memory
36
+ rebalance_max_retries => ... # number (optional), default: 4
37
+ rebalance_backoff_ms => ... # number (optional), default: 2000
38
+ consumer_timeout_ms => ... # number (optional), default: -1
39
+ consumer_restart_on_error => ... # boolean (optional), default: true
40
+ consumer_restart_sleep_ms => ... # number (optional), default: 0
41
+ decorate_events => ... # boolean (optional), default: false, Option to add Kafka metadata like topic, message size to the event
42
+ consumer_id => ... # string (optional), default: nil
43
+ fetch_message_max_bytes => ... # number (optional), default: 1048576
44
+ }
45
+ }
46
+
47
+ The default codec is json
48
+
49
+ ## logstash-output-kafka
50
+
51
+ Apache Kafka output for Logstash. This output will produce messages to a Kafka topic using the producer API exposed by Kafka.
52
+
53
+ For more information about Kafka, refer to this [documentation](http://kafka.apache.org/documentation.html)
54
+
55
+ Information about producer API can be found [here](http://kafka.apache.org/documentation.html#apidesign)
56
+
57
+ ### Logstash Configuration
58
+
59
+ See http://kafka.apache.org/documentation.html#producerconfigs for details about the Kafka producer options.
60
+
61
+ output {
62
+ kafka {
63
+ topic_id => ... # string (required), The topic to produce the messages to
64
+ broker_list => ... # string (optional), default: "localhost:9092", This is for bootstrapping and the producer will only use it for getting metadata
65
+ compression_codec => ... # string (optional), one of ["none", "gzip", "snappy"], default: "none"
66
+ compressed_topics => ... # string (optional), default: "", This parameter allows you to set whether compression should be turned on for particular
67
+ request_required_acks => ... # number (optional), one of [-1, 0, 1], default: 0, This value controls when a produce request is considered completed
68
+ serializer_class => ... # string, (optional) default: "kafka.serializer.StringEncoder", The serializer class for messages. The default encoder takes a byte[] and returns the same byte[]
69
+ partitioner_class => ... # string (optional) default: "kafka.producer.DefaultPartitioner"
70
+ request_timeout_ms => ... # number (optional) default: 10000
71
+ producer_type => ... # string (optional), one of ["sync", "async"] default => 'sync'
72
+ key_serializer_class => ... # string (optional) default: kafka.serializer.StringEncoder
73
+ message_send_max_retries => ... # number (optional) default: 3
74
+ retry_backoff_ms => ... # number (optional) default: 100
75
+ topic_metadata_refresh_interval_ms => ... # number (optional) default: 600 * 1000
76
+ queue_buffering_max_ms => ... # number (optional) default: 5000
77
+ queue_buffering_max_messages => ... # number (optional) default: 10000
78
+ queue_enqueue_timeout_ms => ... # number (optional) default: -1
79
+ batch_num_messages => ... # number (optional) default: 200
80
+ send_buffer_bytes => ... # number (optional) default: 100 * 1024
81
+ client_id => ... # string (optional) default: ""
82
+ partition_key_format => ... # string (optional) default: nil, Provides a way to specify a partition key as a string
83
+ }
84
+ }
85
+
86
+ The default codec is json for outputs. If you select a codec of plain, logstash will encode your messages with not only the message
87
+ but also with a timestamp and hostname. If you do not want anything but your message passing through, you should make
88
+ the output configuration something like:
89
+
90
+ output {
91
+ kafka {
92
+ codec => plain {
93
+ format => "%{message}"
94
+ }
95
+ topic_id => "my_topic_id"
96
+ }
97
+ }
data/Gemfile ADDED
@@ -0,0 +1,11 @@
1
+ source 'https://rubygems.org'
2
+
3
+ gemspec
4
+
5
+ logstash_path = ENV["LOGSTASH_PATH"] || "../../logstash"
6
+ use_logstash_source = ENV["LOGSTASH_SOURCE"] && ENV["LOGSTASH_SOURCE"].to_s == "1"
7
+
8
+ if Dir.exist?(logstash_path) && use_logstash_source
9
+ gem 'logstash-core', :path => "#{logstash_path}/logstash-core"
10
+ gem 'logstash-core-plugin-api', :path => "#{logstash_path}/logstash-core-plugin-api"
11
+ end
data/LICENSE ADDED
@@ -0,0 +1,13 @@
1
+ Copyright (c) 2012-2018 Elasticsearch <http://www.elastic.co>
2
+
3
+ Licensed under the Apache License, Version 2.0 (the "License");
4
+ you may not use this file except in compliance with the License.
5
+ You may obtain a copy of the License at
6
+
7
+ http://www.apache.org/licenses/LICENSE-2.0
8
+
9
+ Unless required by applicable law or agreed to in writing, software
10
+ distributed under the License is distributed on an "AS IS" BASIS,
11
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
12
+ See the License for the specific language governing permissions and
13
+ limitations under the License.
@@ -0,0 +1,5 @@
1
+ Elasticsearch
2
+ Copyright 2012-2019 Elastic NV
3
+
4
+ This product includes software developed by The Apache Software
5
+ Foundation (http://www.apache.org/).
@@ -0,0 +1,118 @@
1
+ # Logstash Plugin
2
+
3
+ [![Travis Build Status](https://travis-ci.org/logstash-plugins/logstash-integration-kafka.svg)](https://travis-ci.org/logstash-plugins/logstash-integration-kafka)
4
+
5
+ This is a plugin for [Logstash](https://github.com/elastic/logstash).
6
+
7
+ It is fully free and fully open source. The license is Apache 2.0, meaning you are pretty much free to use it however you want in whatever way.
8
+
9
+ ## Logging
10
+
11
+ Kafka logs do not respect the Log4J2 root logger level and defaults to INFO, for other levels, you must explicitly set the log level in your Logstash deployment's `log4j2.properties` file, e.g.:
12
+ ```
13
+ logger.kafka.name=org.apache.kafka
14
+ logger.kafka.appenderRef.console.ref=console
15
+ logger.kafka.level=debug
16
+ ```
17
+
18
+ ## Documentation
19
+
20
+ https://www.elastic.co/guide/en/logstash/current/plugins-inputs-kafka.html
21
+
22
+ Logstash provides infrastructure to automatically generate documentation for this plugin. We use the asciidoc format to write documentation so any comments in the source code will be first converted into asciidoc and then into html. All plugin documentation are placed under one [central location](http://www.elastic.co/guide/en/logstash/current/).
23
+
24
+ - For formatting code or config example, you can use the asciidoc `[source,ruby]` directive
25
+ - For more asciidoc formatting tips, see the excellent reference here https://github.com/elastic/docs#asciidoc-guide
26
+
27
+ ## Need Help?
28
+
29
+ Need help? Try #logstash on freenode IRC or the https://discuss.elastic.co/c/logstash discussion forum.
30
+
31
+ ## Developing
32
+
33
+ ### 1. Plugin Developement and Testing
34
+
35
+ #### Code
36
+ - To get started, you'll need JRuby with the Bundler gem installed.
37
+
38
+ - Create a new plugin or clone and existing from the GitHub [logstash-plugins](https://github.com/logstash-plugins) organization. We also provide [example plugins](https://github.com/logstash-plugins?query=example).
39
+
40
+ - Install dependencies
41
+
42
+ ```sh
43
+ bundle install
44
+ rake install_jars
45
+ ```
46
+
47
+ #### Test
48
+
49
+ - Update your dependencies
50
+
51
+ ```sh
52
+ bundle install
53
+ rake install_jars
54
+ ```
55
+
56
+ - Run unit tests
57
+
58
+ ```sh
59
+ bundle exec rspec
60
+ ```
61
+
62
+ - Run integration tests
63
+
64
+ you'll need to have docker available within your test environment before
65
+ running the integration tests. The tests depend on a specific Kafka image
66
+ found in Docker Hub called `spotify/kafka`. You will need internet connectivity
67
+ to pull in this image if it does not already exist locally.
68
+
69
+ ```sh
70
+ bundle exec rspec --tag integration
71
+ ```
72
+
73
+ ### 2. Running your unpublished Plugin in Logstash
74
+
75
+ #### 2.1 Run in a local Logstash clone
76
+
77
+ - Edit Logstash `Gemfile` and add the local plugin path, for example:
78
+ ```ruby
79
+ gem "logstash-output-kafka", :path => "/your/local/logstash-output-kafka"
80
+ ```
81
+ - Install plugin
82
+ ```sh
83
+ # Logstash 2.3 and higher
84
+ bin/logstash-plugin install --no-verify
85
+
86
+ # Prior to Logstash 2.3
87
+ bin/plugin install --no-verify
88
+
89
+ ```
90
+ - Run Logstash with your plugin
91
+ ```sh
92
+ bin/logstash -e 'output { kafka { topic_id => "kafka_topic" }}'
93
+ ```
94
+ At this point any modifications to the plugin code will be applied to this local Logstash setup. After modifying the plugin, simply rerun Logstash.
95
+
96
+ #### 2.2 Run in an installed Logstash
97
+
98
+ You can use the same **2.1** method to run your plugin in an installed Logstash by editing its `Gemfile` and pointing the `:path` to your local plugin development directory or you can build the gem and install it using:
99
+
100
+ - Build your plugin gem
101
+ ```sh
102
+ gem build logstash-output-kafka.gemspec
103
+ ```
104
+ - Install the plugin from the Logstash home
105
+ ```sh
106
+ bin/plugin install /your/local/plugin/logstash-output-kafka.gem
107
+ ```
108
+ - Start Logstash and proceed to test the plugin
109
+
110
+ ## Contributing
111
+
112
+ All contributions are welcome: ideas, patches, documentation, bug reports, complaints, and even something you drew up on a napkin.
113
+
114
+ Programming is not a required skill. Whatever you've seen about open source and maintainers or community members saying "send patches or die" - you will not see that here.
115
+
116
+ It is more important to the community that you are able to contribute.
117
+
118
+ For more information about contributing, see the [CONTRIBUTING](https://github.com/elastic/logstash/blob/master/CONTRIBUTING.md) file.
@@ -0,0 +1,28 @@
1
+ :plugin: kafka
2
+ :type: integration
3
+ :no_codec:
4
+
5
+ ///////////////////////////////////////////
6
+ START - GENERATED VARIABLES, DO NOT EDIT!
7
+ ///////////////////////////////////////////
8
+ :version: %VERSION%
9
+ :release_date: %RELEASE_DATE%
10
+ :changelog_url: %CHANGELOG_URL%
11
+ :include_path: ../../../../logstash/docs/include
12
+ ///////////////////////////////////////////
13
+ END - GENERATED VARIABLES, DO NOT EDIT!
14
+ ///////////////////////////////////////////
15
+
16
+ [id="plugins-{type}s-{plugin}"]
17
+
18
+ === Kafka Integration Plugin
19
+
20
+ include::{include_path}/plugin_header.asciidoc[]
21
+
22
+ ==== Description
23
+
24
+ The Kafka Integration Plugin provides integrated plugins for working with the https://kafka.apache.org/[Kafka] distributed streaming platform.
25
+
26
+ This plugin uses Kafka Client 2.1.0. For broker compatibility, see the official https://cwiki.apache.org/confluence/display/KAFKA/Compatibility+Matrix[Kafka compatibility reference]. If the linked compatibility wiki is not up-to-date, please contact Kafka support/community to confirm compatibility.
27
+
28
+ :no_codec!:
@@ -0,0 +1,582 @@
1
+ :plugin: kafka
2
+ :type: input
3
+ :default_codec: plain
4
+
5
+ ///////////////////////////////////////////
6
+ START - GENERATED VARIABLES, DO NOT EDIT!
7
+ ///////////////////////////////////////////
8
+ :version: %VERSION%
9
+ :release_date: %RELEASE_DATE%
10
+ :changelog_url: %CHANGELOG_URL%
11
+ :include_path: ../../../../logstash/docs/include
12
+ ///////////////////////////////////////////
13
+ END - GENERATED VARIABLES, DO NOT EDIT!
14
+ ///////////////////////////////////////////
15
+
16
+ [id="plugins-{type}s-{plugin}"]
17
+
18
+ === Kafka input plugin
19
+
20
+ include::{include_path}/plugin_header.asciidoc[]
21
+
22
+ ==== Description
23
+
24
+ This input will read events from a Kafka topic.
25
+
26
+ This plugin uses Kafka Client 2.1.0. For broker compatibility, see the official https://cwiki.apache.org/confluence/display/KAFKA/Compatibility+Matrix[Kafka compatibility reference]. If the linked compatibility wiki is not up-to-date, please contact Kafka support/community to confirm compatibility.
27
+
28
+ If you require features not yet available in this plugin (including client version upgrades), please file an issue with details about what you need.
29
+
30
+ This input supports connecting to Kafka over:
31
+
32
+ * SSL (requires plugin version 3.0.0 or later)
33
+ * Kerberos SASL (requires plugin version 5.1.0 or later)
34
+
35
+ By default security is disabled but can be turned on as needed.
36
+
37
+ The Logstash Kafka consumer handles group management and uses the default offset management
38
+ strategy using Kafka topics.
39
+
40
+ Logstash instances by default form a single logical group to subscribe to Kafka topics
41
+ Each Logstash Kafka consumer can run multiple threads to increase read throughput. Alternatively,
42
+ you could run multiple Logstash instances with the same `group_id` to spread the load across
43
+ physical machines. Messages in a topic will be distributed to all Logstash instances with
44
+ the same `group_id`.
45
+
46
+ Ideally you should have as many threads as the number of partitions for a perfect balance --
47
+ more threads than partitions means that some threads will be idle
48
+
49
+ For more information see http://kafka.apache.org/documentation.html#theconsumer
50
+
51
+ Kafka consumer configuration: http://kafka.apache.org/documentation.html#consumerconfigs
52
+
53
+ ==== Metadata fields
54
+
55
+ The following metadata from Kafka broker are added under the `[@metadata]` field:
56
+
57
+ * `[@metadata][kafka][topic]`: Original Kafka topic from where the message was consumed.
58
+ * `[@metadata][kafka][consumer_group]`: Consumer group
59
+ * `[@metadata][kafka][partition]`: Partition info for this message.
60
+ * `[@metadata][kafka][offset]`: Original record offset for this message.
61
+ * `[@metadata][kafka][key]`: Record key, if any.
62
+ * `[@metadata][kafka][timestamp]`: Timestamp in the Record. Depending on your broker configuration, this can be either when the record was created (default) or when it was received by the broker. See more about property log.message.timestamp.type at https://kafka.apache.org/10/documentation.html#brokerconfigs
63
+
64
+ Metadata is only added to the event if the `decorate_events` option is set to true (it defaults to false).
65
+
66
+ Please note that `@metadata` fields are not part of any of your events at output time. If you need these information to be
67
+ inserted into your original event, you'll have to use the `mutate` filter to manually copy the required fields into your `event`.
68
+
69
+ [id="plugins-{type}s-{plugin}-options"]
70
+ ==== Kafka Input Configuration Options
71
+
72
+ This plugin supports these configuration options plus the <<plugins-{type}s-{plugin}-common-options>> described later.
73
+
74
+ NOTE: Some of these options map to a Kafka option. See the
75
+ https://kafka.apache.org/documentation for more details.
76
+
77
+ [cols="<,<,<",options="header",]
78
+ |=======================================================================
79
+ |Setting |Input type|Required
80
+ | <<plugins-{type}s-{plugin}-auto_commit_interval_ms>> |<<string,string>>|No
81
+ | <<plugins-{type}s-{plugin}-auto_offset_reset>> |<<string,string>>|No
82
+ | <<plugins-{type}s-{plugin}-bootstrap_servers>> |<<string,string>>|No
83
+ | <<plugins-{type}s-{plugin}-check_crcs>> |<<string,string>>|No
84
+ | <<plugins-{type}s-{plugin}-client_id>> |<<string,string>>|No
85
+ | <<plugins-{type}s-{plugin}-connections_max_idle_ms>> |<<string,string>>|No
86
+ | <<plugins-{type}s-{plugin}-consumer_threads>> |<<number,number>>|No
87
+ | <<plugins-{type}s-{plugin}-decorate_events>> |<<boolean,boolean>>|No
88
+ | <<plugins-{type}s-{plugin}-enable_auto_commit>> |<<string,string>>|No
89
+ | <<plugins-{type}s-{plugin}-exclude_internal_topics>> |<<string,string>>|No
90
+ | <<plugins-{type}s-{plugin}-fetch_max_bytes>> |<<string,string>>|No
91
+ | <<plugins-{type}s-{plugin}-fetch_max_wait_ms>> |<<string,string>>|No
92
+ | <<plugins-{type}s-{plugin}-fetch_min_bytes>> |<<string,string>>|No
93
+ | <<plugins-{type}s-{plugin}-group_id>> |<<string,string>>|No
94
+ | <<plugins-{type}s-{plugin}-heartbeat_interval_ms>> |<<string,string>>|No
95
+ | <<plugins-{type}s-{plugin}-jaas_path>> |a valid filesystem path|No
96
+ | <<plugins-{type}s-{plugin}-kerberos_config>> |a valid filesystem path|No
97
+ | <<plugins-{type}s-{plugin}-key_deserializer_class>> |<<string,string>>|No
98
+ | <<plugins-{type}s-{plugin}-max_partition_fetch_bytes>> |<<string,string>>|No
99
+ | <<plugins-{type}s-{plugin}-max_poll_interval_ms>> |<<string,string>>|No
100
+ | <<plugins-{type}s-{plugin}-max_poll_records>> |<<string,string>>|No
101
+ | <<plugins-{type}s-{plugin}-metadata_max_age_ms>> |<<string,string>>|No
102
+ | <<plugins-{type}s-{plugin}-partition_assignment_strategy>> |<<string,string>>|No
103
+ | <<plugins-{type}s-{plugin}-poll_timeout_ms>> |<<number,number>>|No
104
+ | <<plugins-{type}s-{plugin}-receive_buffer_bytes>> |<<string,string>>|No
105
+ | <<plugins-{type}s-{plugin}-reconnect_backoff_ms>> |<<string,string>>|No
106
+ | <<plugins-{type}s-{plugin}-request_timeout_ms>> |<<string,string>>|No
107
+ | <<plugins-{type}s-{plugin}-retry_backoff_ms>> |<<string,string>>|No
108
+ | <<plugins-{type}s-{plugin}-sasl_jaas_config>> |<<string,string>>|No
109
+ | <<plugins-{type}s-{plugin}-sasl_kerberos_service_name>> |<<string,string>>|No
110
+ | <<plugins-{type}s-{plugin}-sasl_mechanism>> |<<string,string>>|No
111
+ | <<plugins-{type}s-{plugin}-security_protocol>> |<<string,string>>, one of `["PLAINTEXT", "SSL", "SASL_PLAINTEXT", "SASL_SSL"]`|No
112
+ | <<plugins-{type}s-{plugin}-send_buffer_bytes>> |<<string,string>>|No
113
+ | <<plugins-{type}s-{plugin}-session_timeout_ms>> |<<string,string>>|No
114
+ | <<plugins-{type}s-{plugin}-ssl_endpoint_identification_algorithm>> |<<string,string>>|No
115
+ | <<plugins-{type}s-{plugin}-ssl_key_password>> |<<password,password>>|No
116
+ | <<plugins-{type}s-{plugin}-ssl_keystore_location>> |a valid filesystem path|No
117
+ | <<plugins-{type}s-{plugin}-ssl_keystore_password>> |<<password,password>>|No
118
+ | <<plugins-{type}s-{plugin}-ssl_keystore_type>> |<<string,string>>|No
119
+ | <<plugins-{type}s-{plugin}-ssl_truststore_location>> |a valid filesystem path|No
120
+ | <<plugins-{type}s-{plugin}-ssl_truststore_password>> |<<password,password>>|No
121
+ | <<plugins-{type}s-{plugin}-ssl_truststore_type>> |<<string,string>>|No
122
+ | <<plugins-{type}s-{plugin}-topics>> |<<array,array>>|No
123
+ | <<plugins-{type}s-{plugin}-topics_pattern>> |<<string,string>>|No
124
+ | <<plugins-{type}s-{plugin}-value_deserializer_class>> |<<string,string>>|No
125
+ |=======================================================================
126
+
127
+ Also see <<plugins-{type}s-{plugin}-common-options>> for a list of options supported by all
128
+ input plugins.
129
+
130
+ &nbsp;
131
+
132
+ [id="plugins-{type}s-{plugin}-auto_commit_interval_ms"]
133
+ ===== `auto_commit_interval_ms`
134
+
135
+ * Value type is <<string,string>>
136
+ * Default value is `"5000"`
137
+
138
+ The frequency in milliseconds that the consumer offsets are committed to Kafka.
139
+
140
+ [id="plugins-{type}s-{plugin}-auto_offset_reset"]
141
+ ===== `auto_offset_reset`
142
+
143
+ * Value type is <<string,string>>
144
+ * There is no default value for this setting.
145
+
146
+ What to do when there is no initial offset in Kafka or if an offset is out of range:
147
+
148
+ * earliest: automatically reset the offset to the earliest offset
149
+ * latest: automatically reset the offset to the latest offset
150
+ * none: throw exception to the consumer if no previous offset is found for the consumer's group
151
+ * anything else: throw exception to the consumer.
152
+
153
+ [id="plugins-{type}s-{plugin}-bootstrap_servers"]
154
+ ===== `bootstrap_servers`
155
+
156
+ * Value type is <<string,string>>
157
+ * Default value is `"localhost:9092"`
158
+
159
+ A list of URLs of Kafka instances to use for establishing the initial connection to the cluster.
160
+ This list should be in the form of `host1:port1,host2:port2` These urls are just used
161
+ for the initial connection to discover the full cluster membership (which may change dynamically)
162
+ so this list need not contain the full set of servers (you may want more than one, though, in
163
+ case a server is down).
164
+
165
+ [id="plugins-{type}s-{plugin}-check_crcs"]
166
+ ===== `check_crcs`
167
+
168
+ * Value type is <<string,string>>
169
+ * There is no default value for this setting.
170
+
171
+ Automatically check the CRC32 of the records consumed. This ensures no on-the-wire or on-disk
172
+ corruption to the messages occurred. This check adds some overhead, so it may be
173
+ disabled in cases seeking extreme performance.
174
+
175
+ [id="plugins-{type}s-{plugin}-client_id"]
176
+ ===== `client_id`
177
+
178
+ * Value type is <<string,string>>
179
+ * Default value is `"logstash"`
180
+
181
+ The id string to pass to the server when making requests. The purpose of this
182
+ is to be able to track the source of requests beyond just ip/port by allowing
183
+ a logical application name to be included.
184
+
185
+ [id="plugins-{type}s-{plugin}-connections_max_idle_ms"]
186
+ ===== `connections_max_idle_ms`
187
+
188
+ * Value type is <<string,string>>
189
+ * There is no default value for this setting.
190
+
191
+ Close idle connections after the number of milliseconds specified by this config.
192
+
193
+ [id="plugins-{type}s-{plugin}-consumer_threads"]
194
+ ===== `consumer_threads`
195
+
196
+ * Value type is <<number,number>>
197
+ * Default value is `1`
198
+
199
+ Ideally you should have as many threads as the number of partitions for a perfect
200
+ balance — more threads than partitions means that some threads will be idle
201
+
202
+ [id="plugins-{type}s-{plugin}-decorate_events"]
203
+ ===== `decorate_events`
204
+
205
+ * Value type is <<boolean,boolean>>
206
+ * Default value is `false`
207
+
208
+ Option to add Kafka metadata like topic, message size to the event.
209
+ This will add a field named `kafka` to the logstash event containing the following attributes:
210
+
211
+ * `topic`: The topic this message is associated with
212
+ * `consumer_group`: The consumer group used to read in this event
213
+ * `partition`: The partition this message is associated with
214
+ * `offset`: The offset from the partition this message is associated with
215
+ * `key`: A ByteBuffer containing the message key
216
+
217
+ [id="plugins-{type}s-{plugin}-enable_auto_commit"]
218
+ ===== `enable_auto_commit`
219
+
220
+ * Value type is <<string,string>>
221
+ * Default value is `"true"`
222
+
223
+ If true, periodically commit to Kafka the offsets of messages already returned by the consumer.
224
+ This committed offset will be used when the process fails as the position from
225
+ which the consumption will begin.
226
+
227
+ [id="plugins-{type}s-{plugin}-exclude_internal_topics"]
228
+ ===== `exclude_internal_topics`
229
+
230
+ * Value type is <<string,string>>
231
+ * There is no default value for this setting.
232
+
233
+ Whether records from internal topics (such as offsets) should be exposed to the consumer.
234
+ If set to true the only way to receive records from an internal topic is subscribing to it.
235
+
236
+ [id="plugins-{type}s-{plugin}-fetch_max_bytes"]
237
+ ===== `fetch_max_bytes`
238
+
239
+ * Value type is <<string,string>>
240
+ * There is no default value for this setting.
241
+
242
+ The maximum amount of data the server should return for a fetch request. This is not an
243
+ absolute maximum, if the first message in the first non-empty partition of the fetch is larger
244
+ than this value, the message will still be returned to ensure that the consumer can make progress.
245
+
246
+ [id="plugins-{type}s-{plugin}-fetch_max_wait_ms"]
247
+ ===== `fetch_max_wait_ms`
248
+
249
+ * Value type is <<string,string>>
250
+ * There is no default value for this setting.
251
+
252
+ The maximum amount of time the server will block before answering the fetch request if
253
+ there isn't sufficient data to immediately satisfy `fetch_min_bytes`. This
254
+ should be less than or equal to the timeout used in `poll_timeout_ms`
255
+
256
+ [id="plugins-{type}s-{plugin}-fetch_min_bytes"]
257
+ ===== `fetch_min_bytes`
258
+
259
+ * Value type is <<string,string>>
260
+ * There is no default value for this setting.
261
+
262
+ The minimum amount of data the server should return for a fetch request. If insufficient
263
+ data is available the request will wait for that much data to accumulate
264
+ before answering the request.
265
+
266
+ [id="plugins-{type}s-{plugin}-group_id"]
267
+ ===== `group_id`
268
+
269
+ * Value type is <<string,string>>
270
+ * Default value is `"logstash"`
271
+
272
+ The identifier of the group this consumer belongs to. Consumer group is a single logical subscriber
273
+ that happens to be made up of multiple processors. Messages in a topic will be distributed to all
274
+ Logstash instances with the same `group_id`
275
+
276
+ [id="plugins-{type}s-{plugin}-heartbeat_interval_ms"]
277
+ ===== `heartbeat_interval_ms`
278
+
279
+ * Value type is <<string,string>>
280
+ * There is no default value for this setting.
281
+
282
+ The expected time between heartbeats to the consumer coordinator. Heartbeats are used to ensure
283
+ that the consumer's session stays active and to facilitate rebalancing when new
284
+ consumers join or leave the group. The value must be set lower than
285
+ `session.timeout.ms`, but typically should be set no higher than 1/3 of that value.
286
+ It can be adjusted even lower to control the expected time for normal rebalances.
287
+
288
+ [id="plugins-{type}s-{plugin}-jaas_path"]
289
+ ===== `jaas_path`
290
+
291
+ * Value type is <<path,path>>
292
+ * There is no default value for this setting.
293
+
294
+ The Java Authentication and Authorization Service (JAAS) API supplies user authentication and authorization
295
+ services for Kafka. This setting provides the path to the JAAS file. Sample JAAS file for Kafka client:
296
+ [source,java]
297
+ ----------------------------------
298
+ KafkaClient {
299
+ com.sun.security.auth.module.Krb5LoginModule required
300
+ useTicketCache=true
301
+ renewTicket=true
302
+ serviceName="kafka";
303
+ };
304
+ ----------------------------------
305
+
306
+ Please note that specifying `jaas_path` and `kerberos_config` in the config file will add these
307
+ to the global JVM system properties. This means if you have multiple Kafka inputs, all of them would be sharing the same
308
+ `jaas_path` and `kerberos_config`. If this is not desirable, you would have to run separate instances of Logstash on
309
+ different JVM instances.
310
+
311
+ [id="plugins-{type}s-{plugin}-kerberos_config"]
312
+ ===== `kerberos_config`
313
+
314
+ * Value type is <<path,path>>
315
+ * There is no default value for this setting.
316
+
317
+ Optional path to kerberos config file. This is krb5.conf style as detailed in https://web.mit.edu/kerberos/krb5-1.12/doc/admin/conf_files/krb5_conf.html
318
+
319
+ [id="plugins-{type}s-{plugin}-key_deserializer_class"]
320
+ ===== `key_deserializer_class`
321
+
322
+ * Value type is <<string,string>>
323
+ * Default value is `"org.apache.kafka.common.serialization.StringDeserializer"`
324
+
325
+ Java Class used to deserialize the record's key
326
+
327
+ [id="plugins-{type}s-{plugin}-max_partition_fetch_bytes"]
328
+ ===== `max_partition_fetch_bytes`
329
+
330
+ * Value type is <<string,string>>
331
+ * There is no default value for this setting.
332
+
333
+ The maximum amount of data per-partition the server will return. The maximum total memory used for a
334
+ request will be `#partitions * max.partition.fetch.bytes`. This size must be at least
335
+ as large as the maximum message size the server allows or else it is possible for the producer to
336
+ send messages larger than the consumer can fetch. If that happens, the consumer can get stuck trying
337
+ to fetch a large message on a certain partition.
338
+
339
+ [id="plugins-{type}s-{plugin}-max_poll_interval_ms"]
340
+ ===== `max_poll_interval_ms`
341
+
342
+ * Value type is <<string,string>>
343
+ * There is no default value for this setting.
344
+
345
+ The maximum delay between invocations of poll() when using consumer group management. This places
346
+ an upper bound on the amount of time that the consumer can be idle before fetching more records.
347
+ If poll() is not called before expiration of this timeout, then the consumer is considered failed and
348
+ the group will rebalance in order to reassign the partitions to another member.
349
+ The value of the configuration `request_timeout_ms` must always be larger than max_poll_interval_ms
350
+
351
+ [id="plugins-{type}s-{plugin}-max_poll_records"]
352
+ ===== `max_poll_records`
353
+
354
+ * Value type is <<string,string>>
355
+ * There is no default value for this setting.
356
+
357
+ The maximum number of records returned in a single call to poll().
358
+
359
+ [id="plugins-{type}s-{plugin}-metadata_max_age_ms"]
360
+ ===== `metadata_max_age_ms`
361
+
362
+ * Value type is <<string,string>>
363
+ * There is no default value for this setting.
364
+
365
+ The period of time in milliseconds after which we force a refresh of metadata even if
366
+ we haven't seen any partition leadership changes to proactively discover any new brokers or partitions
367
+
368
+ [id="plugins-{type}s-{plugin}-partition_assignment_strategy"]
369
+ ===== `partition_assignment_strategy`
370
+
371
+ * Value type is <<string,string>>
372
+ * There is no default value for this setting.
373
+
374
+ The class name of the partition assignment strategy that the client uses to
375
+ distribute partition ownership amongst consumer instances. Maps to
376
+ the Kafka `partition.assignment.strategy` setting, which defaults to
377
+ `org.apache.kafka.clients.consumer.RangeAssignor`.
378
+
379
+ [id="plugins-{type}s-{plugin}-poll_timeout_ms"]
380
+ ===== `poll_timeout_ms`
381
+
382
+ * Value type is <<number,number>>
383
+ * Default value is `100`
384
+
385
+ Time kafka consumer will wait to receive new messages from topics
386
+
387
+ [id="plugins-{type}s-{plugin}-receive_buffer_bytes"]
388
+ ===== `receive_buffer_bytes`
389
+
390
+ * Value type is <<string,string>>
391
+ * There is no default value for this setting.
392
+
393
+ The size of the TCP receive buffer (SO_RCVBUF) to use when reading data.
394
+
395
+ [id="plugins-{type}s-{plugin}-reconnect_backoff_ms"]
396
+ ===== `reconnect_backoff_ms`
397
+
398
+ * Value type is <<string,string>>
399
+ * There is no default value for this setting.
400
+
401
+ The amount of time to wait before attempting to reconnect to a given host.
402
+ This avoids repeatedly connecting to a host in a tight loop.
403
+ This backoff applies to all requests sent by the consumer to the broker.
404
+
405
+ [id="plugins-{type}s-{plugin}-request_timeout_ms"]
406
+ ===== `request_timeout_ms`
407
+
408
+ * Value type is <<string,string>>
409
+ * There is no default value for this setting.
410
+
411
+ The configuration controls the maximum amount of time the client will wait
412
+ for the response of a request. If the response is not received before the timeout
413
+ elapses the client will resend the request if necessary or fail the request if
414
+ retries are exhausted.
415
+
416
+ [id="plugins-{type}s-{plugin}-retry_backoff_ms"]
417
+ ===== `retry_backoff_ms`
418
+
419
+ * Value type is <<string,string>>
420
+ * There is no default value for this setting.
421
+
422
+ The amount of time to wait before attempting to retry a failed fetch request
423
+ to a given topic partition. This avoids repeated fetching-and-failing in a tight loop.
424
+
425
+ [id="plugins-{type}s-{plugin}-sasl_jaas_config"]
426
+ ===== `sasl_jaas_config`
427
+
428
+ * Value type is <<string,string>>
429
+ * There is no default value for this setting.
430
+
431
+ JAAS configuration setting local to this plugin instance, as opposed to settings using config file configured using `jaas_path`, which are shared across the JVM. This allows each plugin instance to have its own configuration.
432
+
433
+ If both `sasl_jaas_config` and `jaas_path` configurations are set, the setting here takes precedence.
434
+
435
+ Example (setting for Azure Event Hub):
436
+ [source,ruby]
437
+ input {
438
+ kafka {
439
+ sasl_jaas_config => "org.apache.kafka.common.security.plain.PlainLoginModule required username='auser' password='apassword';"
440
+ }
441
+ }
442
+
443
+ [id="plugins-{type}s-{plugin}-sasl_kerberos_service_name"]
444
+ ===== `sasl_kerberos_service_name`
445
+
446
+ * Value type is <<string,string>>
447
+ * There is no default value for this setting.
448
+
449
+ The Kerberos principal name that Kafka broker runs as.
450
+ This can be defined either in Kafka's JAAS config or in Kafka's config.
451
+
452
+ [id="plugins-{type}s-{plugin}-sasl_mechanism"]
453
+ ===== `sasl_mechanism`
454
+
455
+ * Value type is <<string,string>>
456
+ * Default value is `"GSSAPI"`
457
+
458
+ http://kafka.apache.org/documentation.html#security_sasl[SASL mechanism] used for client connections.
459
+ This may be any mechanism for which a security provider is available.
460
+ GSSAPI is the default mechanism.
461
+
462
+ [id="plugins-{type}s-{plugin}-security_protocol"]
463
+ ===== `security_protocol`
464
+
465
+ * Value can be any of: `PLAINTEXT`, `SSL`, `SASL_PLAINTEXT`, `SASL_SSL`
466
+ * Default value is `"PLAINTEXT"`
467
+
468
+ Security protocol to use, which can be either of PLAINTEXT,SSL,SASL_PLAINTEXT,SASL_SSL
469
+
470
+ [id="plugins-{type}s-{plugin}-send_buffer_bytes"]
471
+ ===== `send_buffer_bytes`
472
+
473
+ * Value type is <<string,string>>
474
+ * There is no default value for this setting.
475
+
476
+ The size of the TCP send buffer (SO_SNDBUF) to use when sending data
477
+
478
+ [id="plugins-{type}s-{plugin}-session_timeout_ms"]
479
+ ===== `session_timeout_ms`
480
+
481
+ * Value type is <<string,string>>
482
+ * There is no default value for this setting.
483
+
484
+ The timeout after which, if the `poll_timeout_ms` is not invoked, the consumer is marked dead
485
+ and a rebalance operation is triggered for the group identified by `group_id`
486
+
487
+ [id="plugins-{type}s-{plugin}-ssl_endpoint_identification_algorithm"]
488
+ ===== `ssl_endpoint_identification_algorithm`
489
+
490
+ * Value type is <<string,string>>
491
+ * Default value is `"https"`
492
+
493
+ The endpoint identification algorithm, defaults to `"https"`. Set to empty string `""` to disable endpoint verification
494
+
495
+
496
+ [id="plugins-{type}s-{plugin}-ssl_key_password"]
497
+ ===== `ssl_key_password`
498
+
499
+ * Value type is <<password,password>>
500
+ * There is no default value for this setting.
501
+
502
+ The password of the private key in the key store file.
503
+
504
+ [id="plugins-{type}s-{plugin}-ssl_keystore_location"]
505
+ ===== `ssl_keystore_location`
506
+
507
+ * Value type is <<path,path>>
508
+ * There is no default value for this setting.
509
+
510
+ If client authentication is required, this setting stores the keystore path.
511
+
512
+ [id="plugins-{type}s-{plugin}-ssl_keystore_password"]
513
+ ===== `ssl_keystore_password`
514
+
515
+ * Value type is <<password,password>>
516
+ * There is no default value for this setting.
517
+
518
+ If client authentication is required, this setting stores the keystore password
519
+
520
+ [id="plugins-{type}s-{plugin}-ssl_keystore_type"]
521
+ ===== `ssl_keystore_type`
522
+
523
+ * Value type is <<string,string>>
524
+ * There is no default value for this setting.
525
+
526
+ The keystore type.
527
+
528
+ [id="plugins-{type}s-{plugin}-ssl_truststore_location"]
529
+ ===== `ssl_truststore_location`
530
+
531
+ * Value type is <<path,path>>
532
+ * There is no default value for this setting.
533
+
534
+ The JKS truststore path to validate the Kafka broker's certificate.
535
+
536
+ [id="plugins-{type}s-{plugin}-ssl_truststore_password"]
537
+ ===== `ssl_truststore_password`
538
+
539
+ * Value type is <<password,password>>
540
+ * There is no default value for this setting.
541
+
542
+ The truststore password
543
+
544
+ [id="plugins-{type}s-{plugin}-ssl_truststore_type"]
545
+ ===== `ssl_truststore_type`
546
+
547
+ * Value type is <<string,string>>
548
+ * There is no default value for this setting.
549
+
550
+ The truststore type.
551
+
552
+ [id="plugins-{type}s-{plugin}-topics"]
553
+ ===== `topics`
554
+
555
+ * Value type is <<array,array>>
556
+ * Default value is `["logstash"]`
557
+
558
+ A list of topics to subscribe to, defaults to ["logstash"].
559
+
560
+ [id="plugins-{type}s-{plugin}-topics_pattern"]
561
+ ===== `topics_pattern`
562
+
563
+ * Value type is <<string,string>>
564
+ * There is no default value for this setting.
565
+
566
+ A topic regex pattern to subscribe to.
567
+ The topics configuration will be ignored when using this configuration.
568
+
569
+ [id="plugins-{type}s-{plugin}-value_deserializer_class"]
570
+ ===== `value_deserializer_class`
571
+
572
+ * Value type is <<string,string>>
573
+ * Default value is `"org.apache.kafka.common.serialization.StringDeserializer"`
574
+
575
+ Java Class used to deserialize the record's value
576
+
577
+
578
+
579
+ [id="plugins-{type}s-{plugin}-common-options"]
580
+ include::{include_path}/{type}.asciidoc[]
581
+
582
+ :default_codec!: