Communication is key. This is true in all aspects of life, and it especially applies to managing critical incidents in your company. Managing communication effectively with your customers can ensure good-will is maintained and they continue to use your product; alternately, failing to keep your customers informed can result in a loss of business and angry customers. Building and maintaining good communication channels within your company and with your customers is key to ensuring your product continues to be patronised. Done properly, when (inevitable) outages occur, the impact (both technologically and emotionally) is limited.
In a technical sense, proper communication channels help technical teams who may be unaware of how each other operate can work together efficiently to action and resolve problems quickly. In an emotional sense, business teams, management, and customers will be happy to know that their data and technology is in the hands of people who know what theyâre doing. Levels of comfort among these stakeholders are enhanced by being included in the process, having their concerns proactively acknowledged, and, being treated as equals by technical teams.
Why audience-appropriate communication is important
Many âtechiesâ enjoy communicating using complex, domain-specific language when managing their services, and this usually never poses a problem in the day-to-day of their jobs. However, when a service-impacting incident is underway, itâs not just you and your team members who are fixing it: everyone is. In the same way spectators at a football game cheer on their home team, your managers and customers are there to provide you with the support you need to resolve issues. But when you speak to them in complex language that is difficult to understand, you unintentionally gate-keep which prevents those same people from assisting you because they just donât understand what youâre saying. In the same way, product managers and marketing have a tendency to âsanitiseâ public communication to customers; by the time information makes its way into their inboxes itâs functionally useless.
Consider the following: âour internet service provider has published incorrect routing information, which means that everything on our internal network does not know how to reach the internet. We can fix it by temporarily overriding the incorrect routing information, but our ISP will need to correct their configurationâ. This explanation uses very clear, common phrases to explain the issue: internet service provider, internal network, internet, route. It contains no overtly technical information, and it also provides methods for resolution. Ironically, I have very rarely seen an example of clear communication from most technical staff. Usually, a manager comes along with a question like âwhy is the network down?â, and the following answers are given:
- âThe router is broken, Iâm fixing it.â
- âOne of our vendors misconfigured their BGP and has taken down our network.â
- âWeâre receiving a strange route from our extranet.â
The first response lacks any clarifying information and doesnât provide any additional context to the question. The last two cases are so technical that without domain-specific knowledge, anyone who is not on your team including those people in other technical teams will not understand it. This example was adapted from the service outage review conducted by Cloudflare for their outage on the 17th of July 2020. Iâm a customer with Cloudflare and the way they clearly communicated their understanding of the issue, steps they took to resolve, and post-mortem of the issue gave me the confidence to continue being their customer. If they had responded with âthe service is out, weâre looking into itâ, I would have moved to a better provider. This is usually what happens when these simplistic, pointless updates are given because people lose trust in a service provider to actually do their job.
Hot tip: customers already know the service is out, they donât need reassurance that the outage is occurring, they need assurance the service is being fixed.
These pointless updates are usually caused by technical teams who provide little-to-no-context to product managers and external communications. In turn, these teams do the same for customers. Conversely, it is possible to be âtoo communicativeâ, whereby you notify customers of outages to individual infrastructure that is redundant or will not impact the customer in a material way.
There are a few keys to communicating with your customers that will ensure that they remain customers:
- Honesty: if you are honest with customers, they will trust you.
- Clarity: if you are clear with your customers, they will understand you.
- Teamwork: if your company works together during an outage, your customers will be happier. You can only work together with appropriate communication.
- Timeliness: Customers need information so that they can resolve downstream issues with their services, the quicker you get information to them, the faster you allow them to limit the impact to their customers.