@hiiretail/gcp-infra-cli 0.83.0 → 0.83.2
@@ -6,27 +6,27 @@
 
 **Incident end**: <!-- The date and time when the incident was resolved, for example: 2022-06-14 14.53 CET -->
 
-**Problem statement**: <!-- Describe
+**Problem statement**: <!-- Describe what the problem was. It is important to keep this short, one or two sentances are enough. Example: An Out of Memory error was thrown -->
 
 **Impacted customer(s)**: <!-- What customer(s) that were affected -->
 
-**Impact to customer**: <!-- Describe how the customer, and end customers, was affected by the incident -->
+**Impact to customer**: <!-- Describe how the customer, and end customers, was affected by the incident. Where they completely unable to use the system? Or was there some feature that wasn't working? -->
 
 **Ticket information**: <!-- Add link(s) to any Jira issues -->
 
-**Services involved**: <!-- List the
+**Services involved**: <!-- List the Hii Retail-service(s) that were involved in the incident, for example checkout-poslog -->
 
 ## Sequence of Events
 
 <!--
-Describe the events that caused the incident, starting from
+Describe the events that caused the incident, starting from when the issues started up until the incident was resolved. If no alert was triggered, the first event would be when the issues first started.
 
 Example:
 
-2022-06-14 14.36 - Alert X was triggered
-2022-06-14 14.36 - Team started working on the incident
-2022-06-14 14.49 - A fix was pushed and deployed
-2022-06-14 14.53 - Incident was resolved
+- 2022-06-14 14.36 - Alert X was triggered
+- 2022-06-14 14.36 - Team started working on the incident
+- 2022-06-14 14.49 - A fix was pushed and deployed
+- 2022-06-14 14.53 - Incident was resolved
 -->
 
 ## Five Whys
@@ -54,22 +54,25 @@ Problem: The vehicle won't start
 ## Summary
 
 <!--
-
+This section should be written last, when all of the other bullets in the RCA has been written. The summary should include:
+
+* What the problem statement was, what the root cause of it was and a short summary of the highest prioritized action points.
 -->
 
 ## Action items
 
 <!--
-A table that describes the different actions that was the outcome of the analysis
-
+A table that describes the different actions that was the outcome of the analysis and who is the owner of the task. The table should be in order of priority.
+
+When creating Jira issues, add the label "RCA" to them.
 
 Example:
-| Description | Owner |
+| Description | Owner | Jira issue |
 |-------------|-------|------|--------|
-| Create alert for high CPU Usage | Bob the Builder |
-| | | |
+| Create alert for high CPU Usage | Bob the Builder | HII-1234 |
+| | | |
 -->
 
-| Description | Owner |
-
-| | |
+| Description | Owner | Jira issue |
+|-------------|-------|------------|
+| | | |
@@ -2,6 +2,20 @@
 
 ## General
 
+### Terms and abbreviations
+
+<!--
+Write any terms and/or abbreviations that might occur in the text below. Remember that the reader of this Runbook might be new to Hii Retail and how this system interacts with others.
+
+Example:
+
+GCP - Google Cloud Platform. Extenda Retail's choice of Cloud Platform for Hii Retail.
+CE - Checkout Engine. A system where the thin POS clients connects to and perform their daily sales operations.
+
+-->
+
+### Purpose
+
 <!-- Describe, in short, what the system does and the interactions with external and/or third party systems.
 
 Example:
@@ -83,6 +97,12 @@ Add examples on how to use the Just-In-Time Access system (https://jit-access.re
 
 ## Contact & Escalation Matrix
 
+### Slack
+
+<!-- What is the public channel to contact the team responsible for the system? For example: For general questions, reach us on Slack in #our-channel -->
+
+### Contact & Escalation Matrix
+
 <!-- If the team is unable to resolve or need to escalate an incident, who is the first to contact?
 
 | # | Name | Role | E-Mail | Phone number |