-
Notifications
You must be signed in to change notification settings - Fork 16
/
Copy pathrabbitmqent.html.md.erb
316 lines (241 loc) · 19.1 KB
/
rabbitmqent.html.md.erb
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
---
title: RabbitMQ
owner: Services
---
## <a id='integrating-your-service'></a> Integrating the Service With Your App
After the [creation](../devguide/services/managing-services.html#create) of the service and the [binding](../devguide/services/application-binding.html#bind) of the service to the application, the environment variable [VCAP_SERVICES](../devguide/deploy-apps/environment-variable.html#VCAP-SERVICES) is created. Information about the credentials are stored in this variable as shown here:
```json
{
"rabbitmqent": [
{
"credentials": {
"hostname": "rabbitmq.service.consul",
"hostnames": [
"rabbitmq-node-0.service.consul",
"rabbitmq-node-1.service.consul",
"rabbitmq-node-2.service.consul"
],
"http_api_uri": "http://i0zRgICRqzDB5fll:[email protected]:15672/api",
"http_api_uris": [
"http://i0zRgICRqzDB5fll:[email protected]:15672/api",
"http://i0zRgICRqzDB5fll:[email protected]:15672/api",
"http://i0zRgICRqzDB5fll:[email protected]:15672/api"
],
"password": "rvUgjRjnMa6mejiW",
"protocols": {
"amqp": {
"host": "rabbitmq.service.consul",
"hosts": [
"rabbitmq-node-0.service.consul",
"rabbitmq-node-1.service.consul",
"rabbitmq-node-2.service.consul"
],
"password": "rvUgjRjnMa6mejiW",
"port": 5672,
"ssl": false,
"uri": "amqp://i0zRgICRqzDB5fll:[email protected]/bd0c989a-8a16-4641-8bcc-70a99c7c7264",
"uris": [
"amqp://i0zRgICRqzDB5fll:[email protected]/bd0c989a-8a16-4641-8bcc-70a99c7c7264",
"amqp://i0zRgICRqzDB5fll:[email protected]/bd0c989a-8a16-4641-8bcc-70a99c7c7264",
"amqp://i0zRgICRqzDB5fll:[email protected]/bd0c989a-8a16-4641-8bcc-70a99c7c7264"
],
"username": "i0zRgICRqzDB5fll",
"vhost": "bd0c989a-8a16-4641-8bcc-70a99c7c7264"
},
"management": {
"host": "rabbitmq.service.consul",
"hosts": [
"rabbitmq-node-0.service.consul",
"rabbitmq-node-1.service.consul",
"rabbitmq-node-2.service.consul"
],
"password": "rvUgjRjnMa6mejiW",
"path": "/api",
"port": 15672,
"ssl": false,
"uri": "http://i0zRgICRqzDB5fll:[email protected]:15672/api",
"uris": [
"http://i0zRgICRqzDB5fll:[email protected]:15672/api",
"http://i0zRgICRqzDB5fll:[email protected]:15672/api",
"http://i0zRgICRqzDB5fll:[email protected]:15672/api"
],
"username": "i0zRgICRqzDB5fll"
},
"mqtt": {
"host": "rabbitmq.service.consul",
"hosts": [
"rabbitmq-node-0.service.consul",
"rabbitmq-node-1.service.consul",
"rabbitmq-node-2.service.consul"
],
"password": "rvUgjRjnMa6mejiW",
"port": 1883,
"ssl": false,
"uri": "mqtt://bd0c989a-8a16-4641-8bcc-70a99c7c7264%3Ai0zRgICRqzDB5fll:[email protected]",
"uris": [
"mqtt://bd0c989a-8a16-4641-8bcc-70a99c7c7264%3Ai0zRgICRqzDB5fll:[email protected]",
"mqtt://bd0c989a-8a16-4641-8bcc-70a99c7c7264%3Ai0zRgICRqzDB5fll:[email protected]",
"mqtt://bd0c989a-8a16-4641-8bcc-70a99c7c7264%3Ai0zRgICRqzDB5fll:[email protected]"
],
"username": "bd0c989a-8a16-4641-8bcc-70a99c7c7264:i0zRgICRqzDB5fll"
},
"stomp": {
"host": "rabbitmq.service.consul",
"hosts": [
"rabbitmq.service.consul",
"rabbitmq-node-0.service.consul",
"rabbitmq-node-1.service.consul",
"rabbitmq-node-2.service.consul"
],
"password": "rvUgjRjnMa6mejiW",
"port": 61613,
"ssl": false,
"uri": "stomp://i0zRgICRqzDB5fll:[email protected]",
"uris": [
"stomp://i0zRgICRqzDB5fll:[email protected]",
"stomp://i0zRgICRqzDB5fll:[email protected]",
"stomp://i0zRgICRqzDB5fll:[email protected]"
],
"username": "i0zRgICRqzDB5fll",
"vhost": "bd0c989a-8a16-4641-8bcc-70a99c7c7264"
}
},
"ssl": false,
"uri": "amqp://i0zRgICRqzDB5fll:[email protected]/bd0c989a-8a16-4641-8bcc-70a99c7c7264",
"uris": [
"amqp://i0zRgICRqzDB5fll:[email protected]/bd0c989a-8a16-4641-8bcc-70a99c7c7264",
"amqp://i0zRgICRqzDB5fll:[email protected]/bd0c989a-8a16-4641-8bcc-70a99c7c7264",
"amqp://i0zRgICRqzDB5fll:[email protected]/bd0c989a-8a16-4641-8bcc-70a99c7c7264"
],
"username": "i0zRgICRqzDB5fll",
"vhost": "bd0c989a-8a16-4641-8bcc-70a99c7c7264"
},
"label": "rabbitmqent",
"name": "testrabbit",
"plan": "usage",
"tags": []
}
]
}
```
### Offered protocols and plugins
RabbitMQ provides the following protocols which are all exposed in `VCAP_SERVICES`:
- AMQP
- MQTT
- STOMP
- HTTP (API and management interface)
The application should **only** use the `uri` or `hostname` property of the desired protocol within `VCAP_SERVICES` to achieve proper loadbalancing.
These properties contain `rabbitmq.service.consul` so the connection is established to a HAProxy that automatically distributes requests to all available RabbitMQ nodes.
The alternative properties `uris` and `hostnames` should only be used if single nodes need to be targeted explicitly.
This can be used to achieve slightly better performance by connecting publishers and subscribers to the queue master node.
However, this comes with a serious disadvantage, since the connections fail in case the single node goes down.
The full list of exposed plugins:
- `rabbitmq_consistent_hash_exchange`
- `rabbitmq_event_exchange`
- `rabbitmq_management`
- `rabbitmq_mqtt`
- `rabbitmq_prometheus`
- `rabbitmq_shovel`
- `rabbitmq_shovel_management`
- `rabbitmq_stomp`
### Advanced configuration
In most cases, connecting your applications to the loadbalanced service is completely sufficient.
For some high end setups, getting the last drop of performance and scalability from the cluster is an advanced task and might include strategies which need more logic on the client side:
#### Non-HA queues
By default, every queue created on RabbitMQ is created on one node (the queue master) and is replicated to other nodes to achieve high availability.
In certain cases, high availability is sacrificed to avoid the overhead of replication to achieve higher performance. For this, just create the queue with the prefix `appcloud-non-ha` and it will not be replicated to other nodes.
#### Consistent Hash exchange
For a different approach to scale your queues on the cluster, the [Consistent hash exchange plugin](https://github.com/rabbitmq/rabbitmq-consistent-hash-exchange) is enabled.
## Limitations
The RabbitMQ service enforces some limitations to ensure the cluster's performance.
### Per-Queue Message TTL of 14 days
To protect the cluster's disk from filling up with old messages that were never picked up, the [per-queue message TTL](https://www.rabbitmq.com/ttl.html#per-queue-message-ttl) is set to **14 days**. Messages that are in the queue for longer than 14 days are automatically discarded.
### Operator Policies
The RabbitMQ cluster implements [operator policies](https://www.rabbitmq.com/parameters.html#operator-policies) to set certain limits on queues and messages. The following operator policies are defined:
- `max-length: 100’000`: If a queue holds 100'000 messages, the most recently published messages will be discarded.
- `max-length-bytes: 100MB`: If a queue holds more than 100MB of message data, the oldest messages are discarded from the head of the queue.
### Connection Limits
To avoid high load caused by too many open connections, a limit on service instance level is implemeted. A single service instance can hold a maximum of 500 connections. Once this limit is reached, all further connections will be terminated.
## <a id='managing'></a> Managing your RabbitMQ instances
To get access to the RabbitMQ management console, you can use the [`cf ssh` command](../devguide/deploy-apps/ssh-services.html). Here's how it works:
1. You'll need to use an app as a host for SSH-ing. Any running app will do. If you don't have one yet, push a default app through the web console
1. Create a service key for your RabbitMQ instance with `cf create-service-key <service-name> manage` (replace the term in `<>` with your service's name)
1. Look at the service key's credentials with `cf service-key <service-name> manage`
1. SSH into your app with `cf ssh -L 13000:<credentials.hostname>:<credentials.management_port> <app-name>` (replace the values in `<>` with the actual values from the service key and with your app's name)
1. Open your browser and visit <http://localhost:13000> and use `<credentials.username>` and `<credentials.password>` from your service key as the credentials
1. When you're done, simply close the terminal window with the open SSH connection
## <a id='sample-application'></a> Sample Application
Cloud Foundry: [RabbitMQ Example](https://github.com/cloudfoundry-samples/rabbitmq-cloudfoundry-samples)
## <a id='best-pracitces'></a> Best Practices
### Queues
#### Keep queues short
RabbitMQ queues work fast and reliably when the they're empty / have only a few messages queued up. RabbitMQ is not designed to hold message over a long period of time and it should always be the goal to keep the queues empty. Not only does this minimise the risk of message loss, it also increases the overall stability of the cluster and its performance.
#### Know when to use lazy queues
Lazy queues are queues that save their messages to disk to achieve some persistency. This has a negative impact on the performance of the specific queue but more predictable performance overall when as it minimises memory usage. Small and fast queues should (in general) not be declared as lazy queues. For large queues that don't require a fast throughput, lazy queues are advised. Lazy queues are enabled by default on the Application Cloud RabbitMQ service offering.
#### Auto-delete queues
Queues will remain forever on the cluster by default (even when they're not active). This will put unnecessary strain on the cluster. To free up resources and don't run into limits, queues should be declared with the auto-delete flag set to true. With this, the queues will be deleted as soon as there are no more active connections using them.
### Connections
#### General
Connections in RabbitMQ use up a lot of memory. In order to keep the cluster healthy and fast, it is important that there are as few open connections as possible. This can be achieved by keeping connections open for client applications. A good practise is to keep the connection open throughout the lifetime of the client application.
#### Auto reconnect
Connections between client applications and the RabbitMQ cluster are stable for the most part. However, it's possible that a network outage can occur or the cluster has an unexpected downtime. In such a case, the client application usually needs to reconnect to the cluster and reopen connections and channels. To automate this process, most RabbitMQ Client APIs support the auto-reconnect feature (see the Java Client API for an example). It is advised to enable the auto-reconnect feature on the client applications.
### Channels
#### General
Channels share the same as restrictions as connections - they can use up quite a lot of memory if opened and closed frequently. Although less memory is used when creating a channel, it's still recommended that channels are kept open for a long time.
#### Threads
Channels are not thread safe and should not be shared between different threads.
#### Prefetch
RabbitMQ allows the for the control over how many messages are sent to the consumer at the same time. Per default, the prefetch value is sent to unlimited, meaning the server tries to send as many messages as possible at the same time. Setting a proper prefetch value can help to achieve the desired performance of the service. If the prefetch count is set too low, the server while idle and wait for more the client to send more message. On the other hand, if the prefetch count is too high the client might take a long time to consume the messages.
Setting the correct prefetch value depends on the setup of the client application. As such, there is not one prefetch value that is best for all cases. As a general rule the prefetch count should be higher if there is only one / few consumers that consume messages quickly. If the client application feature many consumers, a lower prefetch value should be set. More information can be found here: https://www.rabbitmq.com/consumer-prefetch.html.
### Messages
#### Message size
Large messages can take up a lot of resources and should generally be avoided. RabbitMQ is designed for high message throughput and as such it is better to keep the messages sent over RabbitMQ small. Splitting up large payloads into smaller message should be considered when designing the client application.
#### Message ack's and confirms
RabbitMQ features acknowledgments (ack's) for receiving message and publisher confirms (confirms) for sending messages. These mechanisms can be used to ensure that a message has been send / received correctly. Ack's and confirms are generally a good idea to implement as they increase the reliability and consistency of the message flow.
Note: Ack's and confirms have a small performance impact.
## <a id='migration-to-quorum-queues'></a> Migration to Quorum Queues
### Quorum Queues
Quorum Queues are a superior replacement for Classic Mirrored Queues that were introduced in RabbitMQ version 3.8. And there are two complementary reasons why you would need to migrate:
- Classic Mirrored Queues were deprecated in 3.9 and will be completely removed in 4.0
- they are more reliable and predictable, faster for most workloads and require less maintenance
Quorum Queues are better in all regards, but they are not 100%-compatible feature-wise with Mirrored Queues.
### How to Make Use of Quorum Queues
Declare a queue by setting the `x-queue-type` argument to `quorum`. This will declare a quorum queue with three replicas.
After declaring a quorum queue, you can bind it to any exchange just as with other queue types.
### How to Migrate to Quorum Queues from Classic Mirrored Queues
Directly changing queue types is not possible. Instead, there are two options when moving from classic mirrored queues to quorum queues:
**Option 1: Create a new queue with the type quorum and move the publisher, then the consumer.**
- **Declare a Quorum Queue:** Set the `x-queue-type` queue argument to `quorum` . The queue argument should be provided by the client and set when declaring the queue.
- **Move the publisher and empty the queue:** Empty the queue by moving the publisher to the new queue and allowing the consumer to consume messages from the old queue until it's empty.
- **Move the consumer:** Finally, switch the consumer to the new queue.
**Option 2: Move messages from the old queue to the new queue by using a shovel.**
- **Set up a shovel:** Create a shovel from the RabbitMQ management interface. Click on the Admin tab and select Shovel Management from the dropdown menu.
- **Declare quorum queues:** Set the x-queue-type queue argument to quorum . The queue argument should be provided by the client and set when declaring the queue.
- **Move messages:** Use the created shovel to move messages from one queue to the other.
### Compatibility considerations
The RabbitMQ documentation has a dedicated page on [Quorum Queues](https://www.rabbitmq.com/docs/quorum-queues). Specifically, in this document there is a
[feature matrix](https://www.rabbitmq.com/docs/quorum-queues#feature-matrix) which provides a list with all differences between Mirrored Classic Queues and Mirrored Queues.
These differences can require a different amount of work for a successful migration. Some of [them](https://www.rabbitmq.com/blog/2023/03/02/quorum-queues-migration#trivial-changes) can be trivial
to change, while [others](https://www.rabbitmq.com/blog/2023/03/02/quorum-queues-migration#breaking-changes) can require changes in the way an application interacts with RabbitMQ.
<%= image_tag("./images/rmq_queues_feature_matrix.png") %>
#### Breaking changes
**Priority queues**
Classic mirrored queues actually create a separate queue for every priority behind the scenes. For migration it’s necessary for the applications to explicitly handle creation of those queues, and also publishing/consuming to and from them.
**Overflow dead lettering**
Overflow mode `reject-publish-dlx` is not supported by quorum queues. The code needs to be updated to use publisher confirms and to do dead lettering by itself.
**Global QoS for consumers**
Global QoS for consumers is not supported for quorum queues. A decision needs to be made about how necessary results can be achieved using alternative means, e.g. by using a lower per-consumer QoS that can give approximately the same results (given the known application load pattern).
**`x-cancel-on-ha-failover` for consumers**
Mirrored queues consumers can be automatically cancelled when a queue leader fails over. This can cause a loss of information about which messages were sent to which consumer, and redelivery of such messages.
Quorum queues are less exposed to such behaviour - the only case when it still can happen is when a whole node goes down. For other leader changes (e.g. caused by rebalancing), there will be no redeliveries.
And redeliveries can also happen for inflight messages when the consumer is cancelled or the channel is closed. So application needs to be prepared for redeliveries anyway, without specifically asking for such information.
#### Trivial Changes
**Lazy queues**
Classic queue can optionally operate in lazy mode, but for quorum queues this is the only way of operation.
**Non-durable queues**
Non-durable queues will be deleted on a node/cluster boot.
Non-durable queues concept is also going away in the future releases: the only option for ephemeral queues will be exclusive queues. This affects only durability of queue definitions, messages can still be marked transient.
For such queues a decision has to be made one way or another: is this queue content important enough to get availability guarantees of quorum queues, or it's better to downgrade it to a classic (but durable) queue.
**Exclusive queues**
Exclusive queues are not mirrored even if the policy says so. But attempt to declare an exclusive quorum queue will result in an error.
This is clearly one of the cases where migration is not needed, but care must be taken as to avoid exclusive queue declarations with an explicit `x-queue-type: quorum` argument.