Hi all,
We have a site that has been running SQL SE for a while now. They are currently running version 2.92.00 and will be upgraded to v2.93.01 soon. However, this particular issue spans multiple previous versions so is not limited to 2.92.00.
This site works unusual hours and they start work very early in the morning, usually between 2am and 4am. Sometimes, when they come in at that time, the first person who tries to log into Opera receives the message 'the Pegasus opera service is unreachable'. We've now given them a batch file which restarts the service on the server, and once this has run they can log in and work normally. The issue doesn't always occur at this time, however; there is no pattern to the days or times, and it can happen during the day as well.
They were getting this issue regularly until we installed an upgrade (v2.91.02) that was meant to fix it. However, the issue has not completely gone away.
I've scoured the SAM logs and SQL logs and can't find anything helpful. The Windows event viewer just seems to log when the service is restarted.
They do have another linked service (PODFather), which polls the Opera service every couple of minutes to check whether any data needs importing or exporting. The event viewer reports an issue under this service when it can't 'ping' the Opera service. The Opera service still shows as running when this happens, but it appears to have stopped responding.
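To pin down exactly when the service stops responding, one option is a small probe run from a scheduled task every minute or so, with its output appended to a log file. Below is a minimal Python sketch of that idea. The host and port are placeholders (I don't know which port the Opera service listens on), and the function names are made up for illustration. Note that a hung service may still accept TCP connections, so this is only a first check and not a full health test.

```python
import datetime
import socket


def service_responds(host, port, timeout=5.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable hosts.
        return False


def log_line(ok, when=None):
    """Format a timestamped status line for the probe log."""
    when = when or datetime.datetime.now()
    status = "OK" if ok else "UNREACHABLE"
    return f"{when:%Y-%m-%d %H:%M:%S} {status}"


if __name__ == "__main__":
    HOST = "127.0.0.1"  # placeholder: the server running the Opera service
    PORT = 12345        # placeholder: the real port is not known to me
    print(log_line(service_responds(HOST, PORT)))
```

Comparing the timestamps of the first UNREACHABLE lines against the SQL and Windows logs might show what else was happening on the server at that moment.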
Has anyone else come across anything similar, or does anyone have any ideas about where to start looking to get this resolved? The issue now occurs about once a week on average, sometimes a bit more often and sometimes less. Their IT company want the issue resolved so that the batch files to restart the service can be removed from the users' desktops (they say this is a security risk).
Any pointers anyone can give would be gratefully received.
Thanks,
Alex