Tight long-term tail decay of (clipped) SGD in non-convex optimization