Uptime Monitoring

Does anyone use any particular tools for monitoring uptime and other related events for their SOAP API and/or REST services? We would love to hear how others handle this company website related issue (assuming it's not left to a whole other third party).

Thank you,

  • When I was at the National Theatre we added New Relic to our servers, this was mainly for tracking how the new website and our new api were doing but was added to the Tessitura APi servers so we could see the whole path.

    It may be more than you need, but the information gleaned from it was often invaluable. The best example of this was when we upgraded to V11 and immediately saw that the SYOS screen calls jump to over 1s, and when Tessitura released the fix (which was quick) we could immediately see the effect on the Test site.

    Didn't get a chance to play around with it as much as I wanted to but was good for general stats, helped our Sys Admins off recognising which servers in a cluster were having issues and gave more in depth stats for some of the development issues we had. Only down side was that their .Net integration was still being developed when we were using it, but that is almost 2 years ago so they may have resolved those issues now.

    Mark

  • Our slightly outdated, but functional monitoring setup uses Nagios to retrieve the SOAP API Diagnostic endpoint and the Services Dashboard, and basically just checks that neither of them say "FAILED" or whatever the non-passing output is. Pretty simple, but it works.