"Profiling in production with function call traces" - https://yosefk.com/blog/profiling-in-production-with-function-call-traces.html
super interesting and opinionated blog post! It argues for an ideally very cheap, but always on tracing profiler. the profiler keeps a ring buffer of a bunch of function entries and exit and saves it out on user-triggered events. can be used to produce a trace from a core dump, or to debug tail latency ('this event was extra slow? let's dump the ring buffer').
I want to try or reimplement this!