Don't want to reinvent the wheel so I am wondering if any logging system already supports something like what I am proposing to do.
Background: I am working on an extremely large system where tens of thousands of users access the servers at any given time. There are a lot of surrounding infrastructures involved so you can imagine what it looks like to investigate rare occurring bugs just by reading the logs in such an ecosystem.
Our system uses log4j.
The main root of the problem was that the original implementors were rather 'economical', to put it mildly, when encountering an unknown error. Sure, the DEBUG level would be helpful in most cases, but of course, production logs are set on ERROR level only :(
Now I am trying to make a better world for our children and want to expand the logging system to be more 'forgiving' about the log level.
What I have in mind goes like this:
Since the surrounding DEBUG level logs (N before and M after the ERROR log) might contain important data, why not lower the log level of the system around the ERROR.
My idea:
There are some issues to work out but in general it could work.
Any ideas?
Cheers