Infrastructure Operations and Site Reliability Teams at Ping use quite a few tools internally to keep tabs on how our services are performing, but we also believe in a high level of transparency. Displaying as much information as possible in real time is one of the ways we keep our customers informed with up-to-minute status of all our services.
If you're a Ping customer, a prospect, or just someone interested in how we do things, here are a few sites to check out.
Our main status site for all hosted services. Here you'll find up-to-minute status posted directly from the Site Reliability Engineering Team. The site is hosted by the folks at statuspage.io and is not hosted on our own infrastructure, so it will be up even if everything else is down. We hook into monitoring API's from our production systems to show customers uptime and response time statistics from the main segments of our services, and post incident updates as they happen. Customers can also see post mortem reports written by our engineering teams that go into further technical detail on each major outage. At the top of the page, you'll also find a "Subscribe to Updates" button that gives you lots of options for keeping tabs on us, including SMS, Email, Twitter, or RSS feeds that you can hook directly into your own monitoring dashboards. We also have a complete Incident Archive hosted on our site.
I'm not a huge believer in raw uptime SLAs or statistics, but a lot of people still are. Our stats site gives a great historical breakdown of each service, showing when it was down and for how long. It's not just a blanket percentage of uptime. The site is hosted by Pingdom, which we use for external service uptime monitoring. I really like using Pingdom's services for this, as they are a neutral third-party. Ping simply sets up the endpoints for monitoring, but has no control over reported statistics. Pingdom reports what it sees, and presents that to our customers. We've exposed all uptime data since we started using the service, and it's available to everyone.
Ping Identity also publishes an up-to-date statement that covers our security practices, datacenter architecture, monitoring, crypto standards, industry standard certifications and more. If you have any questions about operations at Ping Identity, this is a great place to start.
All of our automated monitoring and status systems also report up to our Twitter feed. Following @pingoneops will give you immediate notification of any events we're tracking on the production systems. Twitter also provides the ability for users to sign up for free SMS notifications whenever an event is posted to the @pingoneops feed. If you're like me, your brain is tuned into late-night SMS messages via PagerDuty, so this option works out nicely.